I want to be able to read the content of pdf files. I need to do that with C on Linux.
The closer i can get to this was here but I think Haru can only create pdf and is not able to read them (not 100% sure).
PS: I only need the plain text from pdf
I want to be able to read the content of pdf files. I need to do that with C on Linux.
The closer i can get to this was here but I think Haru can only create pdf and is not able to read them (not 100% sure).
PS: I only need the plain text from pdf
How well do you need to parse them? Just extracting strings should be relatively easy, fully accurate rendering is harder. Take a look at the source for evince or ghostscript?
This is for C++ but might be a good starting point for understanding PDF structure http://www.codeproject.com/KB/cpp/ExtractPDFText.aspx (sorry wrong link before)
Check out libpoppler. I've never used it work extracting text, just querying PDF attributes. It's pretty easy to use.
Another possible, though I've never used it is VersyPDF. It claims to allow you to edit PDFs ... http://versypdf.sybrex-systems-ltd.qarchive.org/
Hi every one,
Iam c application programmer,I want read opened pdf files through C Code,Iam able to read opened text and jpg files but not able read opened pdf files. I am able to read not opened pdf files but when iam reading pdf file through C code at that time if manualy iam trying to open pdf file.I can't open PDf file.
Plz suggest me solution for above problem,,,,,,
Thanks in advance
regards Sunil Kumar G