tags:

views:

33

answers:

0

Hi,

I want to retrieve following information from a PDF file using PHP.

  1. Total no of pages with individual page sizes
  2. PDF print size and bounding box dimensions
  3. Colorspace (RGB/CMYK) used in PDF
  4. To confirm that font is either embedded or outlined
  5. Total no of objects and their list along with their types
  6. For all the image objects : a) height and width in pixels b) DPI stored in header c) colorspace d) bit depth e) x and y position in PDF f) Is image grayscaled ?
  7. List of all ICC profiles like dot gains
  8. Save pages in jpg (highest resolution image object should visible as its original size)
  9. Save pages as thumbnails
  10. If crop box is present, get its x-y location in PDF and size

I've tried (though not extensively) Zend_pdf, fpdf, tcpdf, imagemagick (convert and identify tools) but none of them give me complete information. For some reasons, I couldn't use Imagick PHP class on my host. pdflib is also not an option as I'm looking for free solutions.

Any help on this would be appreciated. Thanks.