I'm looking for a good C++ library for web scraping.
It has to be C/C++ and nothing else, so please do not direct me to "Options for HTML scraping" or to other SO questions/answers where C++ is not even mentioned.

+5  A: 
  • libcurl to download the HTML file
  • libtidy to convert it to well-formed XML (XHTML)
  • libxml to parse/navigate the XML
redmoskito
Shouldn't your second link go to http://tidy.sourceforge.net/ instead?
therefromhere