I'm a beginner in coding and a friend of mine told me to use BeautifulSoup instead of htmlparser. After running into some problems I got a tip to use lxml instead of BeaytifulSoup because it's 10x better.
I'm hoping someone can give me a hint how to scrape the text I'm looking for.
What I want is to find a table with the following rows and data:
<tr>
<td><a href="website1.com">website1</a></td>
<td>info1</td>
<td>info2</td>
<td><a href="spam1.com">spam1</a></td>
</tr>
<tr>
<td><a href="website2.com">website2</a></td>
<td>info1</td>
<td>info2</td>
<td><a href="spam2.com">spam2</a></td>
</tr>
How do I scrape the website with info 1 and 2, without spam, with lxml and get the following results?
[['url' 'info1', 'info2'], ['url', 'info1', 'info2']]