ansaurus

Question

How to convert PDF binary parts into ASCII/ANSI so I can look at it in a text editor?

Answer 1

A:

Ghostscript has a small utility program, written in PostScript, in its source code repository. It's called pdfinflt.ps. If you are lucky, it is already sitting in a 'toolbin' subdirectory of your Ghostscript installation. Otherwise, get it here:

http://svn.ghostscript.com/ghostscript/trunk/gs/toolbin/pdfinflt.ps

Now run it, together with your target PDF, through the Ghostscript interpreter:

gswin32c.exe -- c:/path/to/pdfinflt.ps your-input.pdf deflated-output.pdf

pdfinflt.ps will (try to) expand all 'streams' contained in the PDF which use the following compression filters/methods: /FlateDecode, /LZWDecode, /ASCII85Decode, /ASCIIHexDecode.
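To get a feel for what three of these filters do to the bytes of a single stream, here is a minimal Python sketch. It is not how pdfinflt.ps works internally; decode_stream is a hypothetical helper, it assumes you have already cut the raw bytes between 'stream' and 'endstream' out of the file, and it ignores /LZWDecode and any /DecodeParms predictors.

```python
import base64
import zlib


def decode_stream(data: bytes, filter_name: str) -> bytes:
    """Undo one PDF stream filter (hypothetical helper, illustration only)."""
    if filter_name == "/FlateDecode":
        # Flate streams are zlib-wrapped deflate data.
        return zlib.decompress(data)
    if filter_name == "/ASCIIHexDecode":
        # Hex digits, whitespace ignored, terminated by '>'.
        hex_digits = b"".join(data.split(b">", 1)[0].split())
        if len(hex_digits) % 2:
            # An odd final digit is treated as if followed by '0'.
            hex_digits += b"0"
        return bytes.fromhex(hex_digits.decode("ascii"))
    if filter_name == "/ASCII85Decode":
        # Base-85 text terminated by '~>'; whitespace is ignored by a85decode.
        body = data.split(b"~>", 1)[0]
        return base64.a85decode(body)
    raise ValueError("filter not handled in this sketch: " + filter_name)


if __name__ == "__main__":
    content = b"BT /F1 12 Tf (Hello) Tj ET"
    packed = zlib.compress(content)
    print(decode_stream(packed, "/FlateDecode"))
```

When a stream lists several filters in an array, they would be undone one after another in the order listed, feeding the output of one call into the next.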

It will not attempt to expand streams that use /RunLengthDecode, /CCITTFaxDecode, /DCTDecode, /JBIG2Decode or /JPXDecode. (Compressed/binary fonts will also pass unchanged into the output PDF.)
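If you want a rough idea beforehand of which filters a given PDF uses (and so how much will stay binary), a quick scan for /Filter entries can help. The following Python sketch is only an approximation: count_filters is a hypothetical helper, it does a plain regex search over the raw bytes rather than parsing the PDF, and it can be fooled by binary data that happens to contain a matching byte sequence.

```python
import re
from collections import Counter

# Matches '/Filter /Name' as well as '/Filter [/Name1 /Name2]'.
FILTER_RE = re.compile(rb"/Filter\s*(\[[^\]]*\]|/[A-Za-z0-9]+)")
NAME_RE = re.compile(rb"/[A-Za-z0-9]+")


def count_filters(pdf_bytes: bytes) -> Counter:
    """Count filter names found in /Filter entries (hypothetical helper)."""
    counts = Counter()
    for match in FILTER_RE.finditer(pdf_bytes):
        for name in NAME_RE.findall(match.group(1)):
            counts[name.decode("ascii")] += 1
    return counts


if __name__ == "__main__":
    sample = (
        b"<< /Length 10 /Filter /FlateDecode >>\n"
        b"<< /Filter [/ASCII85Decode /DCTDecode] >>\n"
    )
    for name, n in sorted(count_filters(sample).items()):
        print(name, n)
```

To try it on a real file, read the file in binary mode and pass the bytes in, for example count_filters(open("your-input.pdf", "rb").read()).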

If you are in an adventurous mood, you may dare to uncomment those lines in the utility which disable /RunLengthDecode, /DCTDecode and /CCITTFaxDecode and see if it still works...

That's the best I can offer right now.

pipitas 2010-08-14 14:20:22

@pipitas: Thank you ... this works for me at least in parts. I can now better poke at all these *obj 1 0 R* parts... trying to understand that stuff better.

2010-08-14 22:17:29
