tags:

views:

40

answers:

1

My program downloads a PDF file from a source location every day. When I see the binary text of the PDF file in Notepad, I find that sometimes the PDF file has the string <!-FTCACHE-1-> at the end. Sometimes this word is missing from the PDF file.

My program downloads this PDF daily and compares it with the previous day's PDF file using the Windiff binary comparison.

99% of the time, Windiff reports differences in the PDF file just because one PDF contains the string <!-FTCACHE-1-> at the end.

Does anyone knows what the reason behind this is?

Thanks, Praveen

A: 

"FT" could be FreeType, the open source font engine. The comment probably comes from the software that generates the PDF. If you can somehow identify that, you could (assuming it is open source) perhaps take a look through it and see what causes it to emit the comment.

FreeType has a source folder dedicated to caching, the root source file there is called ftcache.c. It doesn't do a lot though, just #includes (!) the other source files.

Googling on the string you see, reveals several more or less random PDF:s that seem to contain it.

unwind
I found something with further analysis that if I delete this string from the file and check the file properties then Fast Web Access becomes true. If I again add it then Fast Web Access becomes false.So is there something in downloading the file that sometime Fast Web Access is true or false?