I've been a big fan of the site, and frankly this is the first time I've come across a problem that Stack Overflow didn't already have an answer to.
I need to grab some content from an HTML page (valid XHTML). I fetch the page using curl and store it in memory. I played with the idea of using regular expressions via the PCRE library, but I simply couldn't find any examples of using it from C. Then I moved on to look at HTML parsers, and again there isn't a good selection. All I could find was a sparsely documented module for libxml called HTMLparser.
Are there any alternatives? If not, are there any examples for what I've found already?
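
For context, "store it in memory" here usually means a write callback that appends each received chunk to a growable buffer. The following is only a minimal sketch of that step, assuming the callback shape libcurl documents for `CURLOPT_WRITEFUNCTION`. The names `page_buffer` and `append_chunk` are hypothetical, and the libcurl calls themselves are left out so the block builds with nothing but the standard library.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical growable buffer that holds the fetched page. */
struct page_buffer {
    char  *data;  /* NUL-terminated contents, or NULL while empty */
    size_t len;   /* bytes stored, not counting the trailing NUL */
};

/*
 * Has the same parameter list as a libcurl write callback:
 * ptr points at size * nmemb bytes, userdata is the pointer the
 * caller registered. Returns the number of bytes consumed; a
 * smaller value tells the caller that something went wrong.
 */
static size_t append_chunk(char *ptr, size_t size, size_t nmemb, void *userdata)
{
    struct page_buffer *buf = userdata;
    size_t chunk = size * nmemb;

    if (chunk == 0)
        return 0;  /* nothing to copy */

    /* Grow the buffer by the chunk size plus room for the NUL. */
    char *grown = realloc(buf->data, buf->len + chunk + 1);
    if (grown == NULL)
        return 0;  /* out of memory: report nothing consumed */

    memcpy(grown + buf->len, ptr, chunk);
    buf->data = grown;
    buf->len += chunk;
    buf->data[buf->len] = '\0';  /* keep it usable as a C string */
    return chunk;
}
```

With libcurl, the callback would presumably be registered through `curl_easy_setopt` with `CURLOPT_WRITEFUNCTION`, and the address of a zero-initialized `struct page_buffer` passed through `CURLOPT_WRITEDATA`. After the transfer, `data` and `len` are what would be handed to whichever parser ends up being used, and `data` is released with `free`.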