What is the preferred way to extract elements from a HTML page in Java?
My HTML is has many of the following rows:
<tr class="item-odd">
<td class="data"><a href="http://.....">TITLE</a></td>
<td><div class="cost">$1.99</div></td>
</tr>
The class alternates item-odd
and item-even.
I need to extract:
- Url
- Title
- price
Is regular expressions the way to go?