ghostscript

PDF on Linux: Combine font subsets and replace Type 3 with Type 1

I have a PDF file that I'd like to post-process on Linux. In particular, I'd like to: (1) replace Type 3 fonts with Type 1 fonts, and (2) replace multiple subsets of the same font with a single subset (the subsets are the result of including figures in LaTeX, where each figure contains a subsetted font). With Windows these two steps are possible w...

Recommendations for PDF text extraction

Hello, can anyone recommend a library/API for extracting the text and images from a PDF? We need to be able to get at text that is contained in pre-known regions of the document, so the API will need to give us positional information for each element on the page. We would like that data to be output in XML or JSON format. We're curren...

Process the data of an image, such as a PDF, using PDFCreator

Hi all. Maybe you can help me with my project. I'm using PDFCreator as a virtual printer to print some images to a file. The output can be a PDF or any type of image, but I need to extract data from it. Can it be done? I'm using C#. ...

Converting .ps to .pdf and removing a single page.

Hi there, I have a PostScript file of a poster made in LaTeX, and want to convert it to a PDF (I'm using Ubuntu). I am using ps2pdf, but if possible I would like to remove the first, blank page and keep only the second page. Is there a command that allows this? Been trying to find one for longer than would seem sensible! Thanks! ...

Why does a .png converted from gnuplot PostScript output have a transparent background?

I used gnuplot (4.2) to write a graph to a PostScript file. But when I convert this .ps file to a .png file using the 'convert file.ps file.png' command, I get a PNG image that has no background (it is transparent). Why does this happen, and how do I get a PNG image with a white background? ...

How can I make a program overlay text on a PostScript file?

I have some graphs in PostScript format, generated by gnuplot. I need to place some identifying information on the graph. How can I script some instructions to do that? I want to write a number at the top right corner of the graph (a .ps file). ...

Will the 32-bit version of Ghostscript work on 64-bit systems?

I'm going to redistribute gsdll32.dll, the main library of Ghostscript. There are two versions available for download, 32-bit and 64-bit. Will the first one work on 64-bit systems, or do I need to package the two versions separately? ...

PHP: Converting PDFs to images with -dUseCropBox

I'm trying to convert a PDF to an image and I need to make sure that the -dUseCropBox parameter is specified when calling Ghostscript. Can this be done? convert "/var/www/vhosts/site.co.uk/httpdocs/uploads/source_pdf/PP4SDpdf.pdf" -resize 500X500 "/var/www/vhosts/site.co.uk/httpdocs/uploads/image_pdf/SaturdayTest.jpg" It works wel...

Methods of Parsing Large PDF Files

I have a very large PDF file (200,000 KB or more) which consists of a series of pages containing nothing but tables. I'd like to somehow parse this information using Ruby, and import the resultant data into a MySQL database. Does anyone know of any methods for pulling this data out of the PDF? The data is formatted in the following manne...

Ghostscript Error

On a particular PDF, when running the following command: gs -dSAFER -dBATCH -dNOPAUSE -dQUIET -dFirstPage=1 -dLastPage=1 -sDEVICE=pdfwrite -sOutputFile=output.pdf input.pdf I get this error: GPL Ghostscript 8.71: Warning: 'loca' length 188 is greater than numGlyphs 93 in the font MGOXZX+Arial-BoldMT. Any ideas what this error means...
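
For anyone scripting the command quoted in the excerpt above, here is a minimal sketch in Python of how it can be assembled as an argument list. The helper name `first_page_command` and its `page` parameter are hypothetical; the flags themselves are copied from the excerpt. The sketch only builds and prints the command and does not address the 'loca' warning.

```python
import shlex


def first_page_command(input_pdf, output_pdf, page=1):
    """Build the Ghostscript argument list quoted in the question.

    Hypothetical helper: the name and the `page` parameter are
    illustrative; the flags are the ones shown in the excerpt.
    """
    return [
        "gs",
        "-dSAFER", "-dBATCH", "-dNOPAUSE", "-dQUIET",
        f"-dFirstPage={page}",   # first page to process
        f"-dLastPage={page}",    # last page to process
        "-sDEVICE=pdfwrite",     # write a PDF
        f"-sOutputFile={output_pdf}",
        input_pdf,
    ]


if __name__ == "__main__":
    cmd = first_page_command("input.pdf", "output.pdf")
    # Print the command; to run it, pass `cmd` to subprocess.run(cmd, check=True)
    # on a machine where the gs executable is installed.
    print(shlex.join(cmd))
```

Passing the arguments as a list avoids shell quoting problems when the file names contain spaces.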

Ghostscript Weird Image Output

We are using Ghostscript to convert a PDF to an image. Any idea why this would happen? ...

PDF: How to Optimize Filesize & Convert to PNG (embedded fonts problem)

I have a PDF with embedded fonts that I can't seem to work with. Right now, I'm using Ghostscript and trying to do two things: (1) Minimize the file size of the PDF: gswin32c -dSAFER -dBATCH -dNOPAUSE -dQUIET -sDEVICE=pdfwrite -sOutputFile=output.pdf input.pdf (2) Convert the PDF to PNG (supersampled, to be used for creating other thumbnails): gswin32c -dS...

PHP: How to tell if PDF has embedded fonts?

Given a PDF document, is there a way to tell if it has embedded fonts? I'm working on the problem posted here: http://stackoverflow.com/questions/3774995/pdf-how-to-optimize-filesize-convert-to-png-embedded-fonts-problem. Perhaps I can rework some of my application's logic based on whether a PDF has embedded fonts or not. ...

Why does gswin32.exe have a 237-character limit on the value of its sOutputFile parameter?

I am using version 8.71 and my OS is Windows XP SP3. The error I get is as follows: Error: /undefinedfilename in --showpage-- Operand stack: 1 true Execution stack: %interp_exit .runexec2 --nostringval-- --nostringval-- --nostringval-- 2 %stopped_push --nostringval-- --nostringval-- --nostringval-- false ...

Ghostscript fonts using the API

I've got a bit of an issue with Ghostscript. I'm writing an application in .NET that uses the Ghostscript API, gsdll32.dll. It works just fine for a number of PDFs, but there are a couple that have this little movie can dingbat, and Ghostscript throws an error about not having that font. I know I have to make a listing for that font in th...

Linux command-line utility to remove colors in a PDF file?

Hi, I'm searching for a Linux command-line utility/script capable of removing colors from a PDF. The output of the utility should be the same PDF, but in grayscale. Does anyone know how to do this? Thanks ...

XPS document to PDF, DOC

Which is the best third-party library to convert XPS to PDF and XPS to DOC? ...

Convert source code to syntax highlighted image

Background: OpenOffice Writer lacks the ability to link to an ASCII text source file, apply syntax highlighting, wrap it in a frame, and update the frame contents whenever the source file changes. However, OpenOffice Writer can link to images, and will update the images automatically when they change. Problem: The images need to be high...

Ghostscript using a nonlocal PDF

Is there any way to use the Ghostscript library to work with a PDF given as a byte stream, WITHOUT saving the file itself on the local machine? I'm using C#. ...

Ghostscript functionality

Hello, is it possible to use Ghostscript to find out the resolution of an image within a PDF file? Thanks in advance. John ...