pdf

PDF Text Extraction at hyperlink locations

Anybody know of a (FREE) SDK of some sort that can start text extraction at the point in the PDF Document where a hyperlink takes you to (within the same PDF document)? The links end up taking us to specific points on specific pages. More specifically we need a program that can parse a pdf document that holds questions and answers to a...

setting a thumbnail image for pdf file when seen in explorer file list

Hi, 1) I'm trying to set the thumbnail image that you see in explorer for a pdf. At the moment, all the thumbnails are just the Abode PDF logo ... I'd like to set it to an image of my choice. I have looked at the Win 7 shell pack api, and cannot find any way to set the image, only ways to extract the (default) image. Can this even be do...

Choosing a cross-platform library for PDF rendering and analysis app (preferably using C#)

I am planning to write an app that can open and display PDF documents, and perform OCR on vector graphic elements within the PDFs. The user must be able to select regions of the document and I need to draw real-time annotations on the document. I don't need to alter or save the document itself. I have plenty of experience with C# and WP...

Is there Python PDF metadata writing library for Windows?

Hi, can someone point me to Python PDF package that can do metadata writing? I was surprised that I couldn't find any. I mean I found Python XMP Toolkit, but building Exempi on cygwin is nightmare I want to avoid. Thanks ...

making a pdf from html using c#

there is a way, a collegue of mne is usng it he will not tell me how so help folks! please note i c# .net framework only, not itextsharp or any other utility ...

Alternative to TCPDF for large-volume pdf generation

I generate pdf documents with TCPDF. It's fairly easy to setup and very good for creating pdf files with <10 pages. I can generate 50 pages long PDF files within the 60 seconds default php timeout and can generate upto 200 pages when the timeout is set to 300 (SET_TIME_LIMIT(300)). I have also allocated 256 mb memory in the PHP ini....

problem when parsing pdf files

I use htmlparser 1.6 to parse web sites. The problem is that when I parse pdf web sites, I obtain in the output file strange characters like ØÇÁÖÜ/:?ÖQØ?WÕWÏ This is a fragment of my code : try { parser = new Parser (); if (1 < args.length) filter = new TagNameFilter (args[1]); else { filter = n...

android : how to make my apps open specific PDF file on click

Hello, I am wondering if my codes here can view a specific pdf file using existing PDF viewer in Android Phone (HTC Desire).. If i would like to open pdf files from local folder.. What should i do? public class ghcm_Submenu1 extends Activity { private ListView lv1; private String lv_arr[]= {"item1", "item2"}; @Override public ...

Parsing PDF files hosted in web servers

I have used iText to parse pdf files. It works well on local files but I want to parse pdf files which are hosted in web servers like this one: "http://protege.stanford.edu/publications/ontology_development/ontology101.pdf" but I don't know how??? Could you please answer me how to do this task using iText or other libraries... thx ...

JsDoc to PDF renderer

Does anyone know a tool to convert JsDoc to PDF ? Or you have to do that trough convert HTML files into PDF ? Thanks for answer ! ...

Displaying PDF documents on iPad - Color Problems

I've built a PDF reader for the iPad and I've noticed some color problems when doing a side-by-side comparison of the document in preview verus the simulator and device. The best way to describe it is to say that the colors have become more intense. Any discrepancies between similar colors used in close proximity have become more notic...

read pdf files using java

Hello, I want to parse pdf websites. Can anyone say how to extract all the words (word by word) from a pdf file using java. The code below extract content from a pdf file and write it in another pdf file. I want that the program write it in a text file. import java.io.FileOutputStream; import java.io.IOException; import com.itextpd...

Inline images using itextsharp xmlparser

I've needed to generate some very complex dynamic pdfs for a client. After messing around with several frameworks I ended up building an XML pseudo-language that translates to the itextsharp xml definition. I then use the xml parser to build the document. I have everything built, it works really well except for check-marks, which I can'...

Ghostscript functionality

Hello, is it possible to use ghostscript to find out the resolution of an image within a PDF file? thanks in advance. John ...

Can't see Acrobat JS console when PDF is in an FF browser window.

Windows 7 (64 bit), Acrobat Pro 9.4(32 bit), FF 3.6.11(32 bit) In Firefox, I briefly see the JS console flash and then vanish... but it stays empty. IE will show the JS console if it's already open, and displays a brief error message and then crashes Acrobat: "Exception in line 18 of function top_level, script Page:Open". I don't hav...

Page automatically printed from winform application is not oriented correctly

We are printing pdf from winform application. We are creating pdf file using infragistics report and printing it without opening it. following is the source code for printing it using default printer. protected Report _report = new Infragistics.Documents.Report.Report(); var printDocument = new PrintDocument(); _report.Preferences.Pri...

What is the best approach to make a pdf document?

Hi there. Im trying to make a pdf document, and so far I can note two things, if I make a document using Word and I try to export it to pdf, it doesn't work very well. If I try to use Adobe Acrobat Professional to make a document, it works ok, but the editor per se sucks big time, IMO of course. So, how do you make your own pdf docume...

How do I modify the Magento Order Slip to show a custom product attribute?

I need to display an additonal custom attribute from a product on the Order Slip PDF so that our distributor can find the proper product id. We have 3 different types of SKUs being used; one is our own, the other is from our supplier and another is from the manufacturer. Which part of Magento do I have to change in order to add the cust...

PDF to HTML via API or through ColdFusion?

This has been asked many times, i'm sure, but i was curious if it's possible to upload a PDF to some site, keep it private, and get back HTML/CSS? Or, is there a way with ColdFusion to do this? It needs to be damn near the same as the PDF. I heard through a developer at a hackathon there was a PDF converter that uses the Webkit engine, b...

How to send a PDF to the browser from a Java webapplication?

I have a java class that generates a PDF file to a folder in my computer. I have managed to connect this class to a link on a web application and when i click this link it generates the pdf and writes it to the folder on my computer. I would want to change this and have the link send the pdf to the browser instead. How can i do this? Th...