I would like to spell check html files programmatically using C#? The text only without the tags.
Any recommendations? Please mention ones that strip out the tags.
I would like to spell check html files programmatically using C#? The text only without the tags.
Any recommendations? Please mention ones that strip out the tags.
I suggest you you strip all tags first then spell check the resulting text. That's what is done inweb based wysiwyg html editors.
To strip html tags:http://en.csharp-online.net/Strip_all_HTML_tags
To extract the text from HTML use the HTML Agility pack. There has also been some open source work for HTML spell checking on Codeplex: see the Spello project.