Hello all,
I have a C# WPF application that needs to consume data that is exposed on a webpage as a HTML table.
After getting inspiration from this url I tried using Linq to Xml to parse the Html document, but this only works if the HTML document is extremely well formed (and doesn't have any comments or HTML entities inside it). I have managed to get a working solution using this technique, but it is far from ideal.
I am after a solution that is intended for parsing HTML. I have hacked "solutions" before, but they are brittle. I am after a robust way of parsing/manipulating the document. I'd ideally like something that makes the task as easy as it would be from Javascript/JQuery.
Does anyone know of a good .Net library or utility for parsing/manipulating HTML?
Thanks for any help you can offer.