Hi,
I want to retrieve the following information from a PDF file using PHP:
- Total number of pages, with the size of each page
- PDF print size and bounding box dimensions
- Colorspace (RGB/CMYK) used in the PDF
- Whether each font is embedded or outlined
- Total number of objects, with a list of them and their types
- For every image object: a) height and width in pixels, b) DPI stored in the header, c) colorspace, d) bit depth, e) x and y position in the PDF, f) whether the image is grayscale
- List of all ICC profiles (e.g. dot gain profiles)
- Save pages as JPG (the highest-resolution image object should be visible at its original size)
- Save pages as thumbnails
- If a crop box is present, its x-y location in the PDF and its size
I've tried (though not extensively) Zend_Pdf, FPDF, TCPDF, and ImageMagick (the convert and identify tools), but none of them gives me the complete information. For some reason, I couldn't use the Imagick PHP class on my host. PDFlib is also not an option, as I'm looking for free solutions.
Any help on this would be appreciated. Thanks.