Basically, I want to extract the keywords (words or tokens) that are present in a webpage after removing the stopwords. If anybody knows how to do this, please help. Code in C# would be appreciated.

A: 

Use an HTML parsing library like the HTML Agility Pack.

Once you load an HTML document with it, you can query it with XPath syntax; it exposes the HTML in much the same way as an XmlDocument.
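As a rough sketch of how that could look for the question above, assuming the HtmlAgilityPack NuGet package is referenced: the class name `KeywordExtractor`, the method `ExtractKeywords`, the tiny stopword list, and the choice to lowercase and de-duplicate tokens are all illustrative assumptions, not part of the library.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Text.RegularExpressions;
using HtmlAgilityPack; // third-party NuGet package: HtmlAgilityPack

// Hypothetical helper, named here for illustration only.
public static class KeywordExtractor
{
    // Tiny illustrative stopword list (an assumption); use a fuller list in practice.
    private static readonly HashSet<string> StopWords = new HashSet<string>(
        new[] { "a", "an", "and", "are", "in", "is", "of", "on", "or", "the", "to" },
        StringComparer.OrdinalIgnoreCase);

    public static List<string> ExtractKeywords(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        // Drop script and style elements so their contents are not treated as page text.
        // SelectNodes returns null (not an empty collection) when nothing matches.
        var hidden = doc.DocumentNode.SelectNodes("//script|//style");
        if (hidden != null)
        {
            foreach (var node in hidden.ToList())
                node.Remove();
        }

        // Collect every remaining text node with an XPath query.
        var textNodes = doc.DocumentNode.SelectNodes("//text()");
        if (textNodes == null)
            return new List<string>();

        // Decode entities such as &amp; and join the pieces into one string.
        var text = string.Join(" ",
            textNodes.Select(n => HtmlEntity.DeEntitize(n.InnerText)));

        // Split into runs of letters/digits, lowercase, drop stopwords and duplicates.
        return Regex.Matches(text.ToLowerInvariant(), @"[\p{L}\p{Nd}]+")
            .Cast<Match>()
            .Select(m => m.Value)
            .Where(w => !StopWords.Contains(w))
            .Distinct()
            .ToList();
    }
}
```

To run it against a live page, the HTML string could come from `HtmlWeb.Load(url)` or any HTTP client. If word frequencies matter, replacing `Distinct()` with a `GroupBy` count would be a natural next step.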

Oded