views:

15

answers:

0

Hi,

There seem to be a lot of tools/components to convert HTML to PDF. I am wondering if there is a .Net component or library that can go the other way around. Given a PDF, convert it into HTML or XML or any other sort of structured format that can maintain layout information such as headings/titles/paragraphs etc.

iTextSharp, AbcPDF, TallPDF all seem to have excellent PDF generation capabilities, but I haven't seen a good component (not a stand alone tool or exe) that can go the other way around and generate a structured markup from PDF.

Any help would be greatly appreciated.