Type error when using a parser in Python

Question

I have the following HTML parser:

from HTMLParser import HTMLParser

class MLStripper(HTMLParser):
    def __init__(self):
        self.reset()
        self.fed = []

    def handle_data(self, d):
        self.fed.append(d)

    def get_data(self):
        return ''.join(self.fed)

def strip_tags(html):
    s = MLStripper()
    s.feed(html)
    return s.get_data()

I would like to use this on the following pandas DataFrame:

 df = pd.DataFrame([['<br> test </br>', 1]], columns=('body', 'ticketID'))

I assumed it would work like this:

 for row in df.iterrows():
     input = row['body']
     print(strip_tags(input))

But this raises a TypeError. Any thoughts on where this goes wrong?

@Frits Please be more generous, use 4 spaces for indentation. 1 space is too low.
– Mohammad Yusuf, Commented Jan 25, 2017 at 13:53

Community · Accepted Answer · 2020-06-20 09:12:55Z

From the pandas docs:

DataFrame.iterrows()

Iterate over DataFrame rows as (index, Series) pairs.

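So each iteration yields the index along with the row, and the two have to be unpacked. With a single loop variable, row is the whole (index, Series) tuple, and indexing a tuple with 'body' is what raises the TypeError.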
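To see where the error comes from, it can help to look at a single item from iterrows() directly. The following is a minimal sketch, assuming pandas is installed; the names first and error_message are only illustrative and do not appear in the question.

```python
import pandas as pd

df = pd.DataFrame([['<br> test </br>', 1]], columns=['body', 'ticketID'])

# Take one item from the iterator without unpacking it.
first = next(df.iterrows())
print(type(first))  # a tuple holding (index, Series)

# Indexing that tuple with a string is what fails.
try:
    first['body']
except TypeError as exc:
    error_message = str(exc)
    print(error_message)

# Unpacking gives the index label and the row as a Series.
index, row = first
print(index, row['body'])
```

Once the tuple is unpacked, row is a Series and can be indexed by column name as the question intended.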

Working Code:

for index, row in df.iterrows():
    input = row['body']
    print(strip_tags(input))
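For completeness, here is a self-contained sketch of the whole thing, assuming Python 3 (where the parser lives in html.parser rather than HTMLParser) and an installed pandas. The body_text column name and the use of Series.apply are additions for illustration, not part of the original question or answer.

```python
from html.parser import HTMLParser  # Python 3 module name

import pandas as pd


class MLStripper(HTMLParser):
    def __init__(self):
        super().__init__()  # lets HTMLParser initialise its own state
        self.fed = []

    def handle_data(self, d):
        # Collect only the text between tags.
        self.fed.append(d)

    def get_data(self):
        return ''.join(self.fed)


def strip_tags(html):
    s = MLStripper()
    s.feed(html)
    s.close()  # flush any text still buffered by the parser
    return s.get_data()


df = pd.DataFrame([['<br> test </br>', 1]], columns=['body', 'ticketID'])

# Unpack each (index, Series) pair, as in the answer.
for index, row in df.iterrows():
    print(strip_tags(row['body']))

# Alternative without an explicit loop: apply the function to the column.
# 'body_text' is a hypothetical column name chosen for this example.
df['body_text'] = df['body'].apply(strip_tags)
```

If only one column needs processing, applying the function to that column avoids iterating over rows at all, which is usually the simpler option.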

answered Jan 25, 2017 at 19:25 by Stephen Rauch
edited Jun 20, 2020 at 9:12
