ansaurus

Question

How to parse HTML and return an array of values in C# using Regex.Split

Answer 1

+6 A:

Do not parse HTML using regular expressions.

Instead, you should use the HTML Agility Pack.

For example:

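using System.Collections.Generic;
using System.Linq;
using HtmlAgilityPack;

HtmlDocument doc = new HtmlDocument();
doc.LoadHtml(str); // str holds the HTML markup to parse

IEnumerable<string> cells = doc.DocumentNode.Descendants("td").Select(td => td.InnerText);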

SLaks 2010-09-27 20:37:14

Answer 2

+1 A:

You really should not use regex to parse HTML. HTML is not a regular language, so a regex can't interpret it properly. You should use a parser.

C# has HTML parsers for this, such as the HTML Agility Pack.
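
A minimal sketch of where the regex approach breaks down, assuming nothing more than two nested <div> tags. The class and method names (NestedTagDemo, FirstDivBody) are made up for this illustration, and the pattern is just a typical attempt, not anything from the question:

```csharp
using System;
using System.Text.RegularExpressions;

public static class NestedTagDemo
{
    // Hypothetical helper: returns whatever a lazy regex captures
    // as the "body" of the first <div> in the input.
    public static string FirstDivBody(string html)
    {
        Match m = Regex.Match(html, "<div>(.*?)</div>", RegexOptions.Singleline);
        return m.Success ? m.Groups[1].Value : string.Empty;
    }

    public static void Main()
    {
        string html = "<div>outer <div>inner</div> tail</div>";

        // The lazy match stops at the first closing tag, which belongs
        // to the inner <div>, so the captured body is cut short.
        Console.WriteLine(FirstDivBody(html)); // prints: outer <div>inner
    }
}
```

Making the pattern greedy only moves the problem: it then runs to the last closing tag in the input, which is wrong as soon as there are sibling elements. A parser tracks the nesting for you.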

JoshD 2010-09-27 20:38:14
