I get some URL from a XML feed. Now the question is how do I get a specific data from each page represented by those URLs. For example if I have a URL: www.abc.com in the feed data and on that page there is a table like this:
<table>
<body>
<tr>
<td class="snip">
<span class="summary">
abc ... abc & xyz ...
<br>
.......
<br>
</span>
<span>......</span>
</td>
</tr>
</body>
</table>
Now the question is how do I get the content of the span which has the class "summary" and which is the child of the having a class name "snip". We do have to decode\remove the encoded html contained by the span.
Any regex based soln? Any idea how to do it from server side?