ansaurus

Question

How to read xpath values from many HTML files in .Net?

Answer 1

+2 A:

I think you should look into the HTML Agility Pack. It is an HTML parser rather than an XML parser, and is better for this task. If there is anything that doesn't agree with the XML being parsed then the parser will throw and exception. Using an HTML parser gives you a bit more leeway with the input files.

Example showing how to do something with all HREF (link) attributes:

 HtmlDocument doc = new HtmlDocument();
 doc.Load("file.htm");
 foreach(HtmlNode link in doc.DocumentElement.SelectNodes("//a[@href"])
 {
    HtmlAttribute att = link["href"];
    att.Value = FixLink(att);
 }

I'm not near a compiler but the example you want is something like:

string title = doc.DocumentNode.SelectSingleNode("//title").InnerText;

BrianLy 2010-07-27 01:58:46

works like a charm. thanks a bunch

el chief 2010-07-27 03:33:01

ansaurus

tags:

views:

answers:

How to read xpath values from many HTML files in .Net?

related questions