tags:

views:

194

answers:

3

Hi

I am doing mail parsing application which required to convert the HTML file to Plain Text. regarding this i have found some scripts which does conversion. I want to do same thing in C++.

So please suggest me any Cross platform and open source C++ libraries for converting HTML to Plain Text.

Thanks in advance Regards Subbi

+1  A: 

Try using regular expression extracting html tags and save result as file text. But it not simple. Use this help class DEELX - Regular Expression Engine.

lsalamon
Thank your for the link to DEELX. I dont always like having to deal with including boost.
mfperzel
A: 

Take a look at html2text. It's a command tool and not a pure lib, but contains code which strips and converts html. So you should be able to use it.

Martin Wickman
A: 

As 'obvious' as it may sound you can just keep all the text between > & <

Eugen Constantin Dinca
I think you have misinterpreted my question. my desire is to convert HTML to Text.
subbi
@subbi : the HTML tags are enclosed between < (i.e.  , <)
Eugen Constantin Dinca