I am trying to use XPath
to extract some HTML tags and data and for that I need to use XML::LibXML
module.
I tried installing it from CPAN shell but it doesn't install.
I followed the instructions from CPAN site about the installation, that we need to install libxml2
, iconv
and zlib
wrappers before installing XML::LibXML
and it didn't work out.
Also, if there is any other simpler module that gets my task done, please let me know.
The task at hand:
I am searching for a specific <dd>
tag on a html page which is really big ( around 5000 - 10000) <dd>
and <dt>
tags. So, I am writing a script which matches the content within <dd>
tag and fetches the content within the corresponding (next) <dt>
tag.
I wish i could i have been a little more clearer. Any help is greatly appreciated.