djvu

Libraries for parsing PDF, PostScript, and/or DjVu

What I want to do is pretty simple: given a PDF/PS/DjVu file containing a paper/book, find the authors and title of the paper (any other metadata would be good, but less needed). This recognition doesn't have to be perfect, but I'd like to make it as good as I can. I am looking for open-source .NET and/or Java libraries (preferably .NET)...

Does anybody use DjVu files in their production tools?

When it comes to archiving and document portability, it's all about PDF. I heard about DjVu some years ago, and it now seems mature enough for serious use. The benefits seem to be a small file size and a fast open/read experience. But I have absolutely no feedback on how good or bad it is in the real world: Is it technically ...

How to extract text from DjVu and other ebook formats (possibly in Python)

I have a collection of ebooks in DjVu, PDF, and CHM formats and I am looking for a way to search their content for keywords. I have been researching this and found a couple of suggestions for parsing PDF content, but there seems to be no way to convert the content of a DjVu file into text. By any chance, does anyone know a way to decode djvu content into tex...

HTML or PDF from mobile

I have a Nokia 2700. How can I read HTML and PDF/DjVu files on it? Or is it impossible? ...