Hi all,
I was just wondering if anyone knew of any good libraries for parsing .doc files (and similar formats, like .odt) to extract text, yet also keep formatting information where possible for display on a website.
Capability of doing similarly for PDFs would be a bonus, but I'm not looking as much for that.
This is for a Rails proj...
I have a PHP script being served over https:// that is trying to push a PDF file to the user. The problem that in Internet Explorer 6 (which unfortunately I still have to support) is not obeying the filename being set in the header. FireFox and IE7 are both working properly.
The file name keeps coming out as a random name with the c...
using ImageMagick, what command should i use to convert a PDF to PNG? I need highest quality, smallest file size. this is what I have so far (very slow by the way):
convert -density 300 -depth 8 -quality 85 a.pdf a.png
Looking at what Gmail does when a user "view" a PDF, the quality is awesome and the file size very minimal. The DPI i...
Hi everyone,
I'd like to know (from a high level view) what would be required to take a pdf floor plan of a building and determine where exactly you are on that floor plan using GPS coordinates? In addition to location, the user would be presented with a "turn by turn" directions to another point on the map, navigating down hallways, b...
I have a pdf file with English and Hindi Text in it and I need to extract text into raw text(utf-8).
I tried using openoffice but the hindi characters get ruined
...
I'm getting characters in my PDF, i've stripped out \r\n \r \n \t, trimmed everything, decoded html entities and stripped tags. Nothing helps. The data is coming from a MySQL database.
Any help would be appreciated.
...
Hi.
I need to convert below mentioned file formats to pdf using C#/VB.Net. User will upload the file using FileUpload control and system will returns the pdf file after converting the document.
doc/docx to pdf
xls/xlsx to pdf
ppt/pps to pdf
Does ITextSharp provide such facility ? Please Only mentioned open source or free libraries.
T...
I wanted to know if there was a way, or a tool available, to output java source into a pdf that has syntax highlighting available?
...
I have been playing around with creation of pdf documents for a project that I'm working on. I would like to store the generated pdf document in a SQL database and then later be able to retrieve this pdf as well.
What are some suggestions for doing this? Can the document be stored in the database without physically creating the docume...
I have a requirement to print an existing PDF file from a Python script.
I need to be able to specify the printer in the script. It's running on Windows XP.
Any ideas what I could do?
This method looks like it would work except that I cannot specify the printer:
win32api.ShellExecute (
0,
"print",
filename,
None,
".",
0
...
I'm developing a CMS aplication in ASP .Net using WebForms and I'm looking for a way to create new PDF files based on a template.
This feature will be used to generate contracts where some placeholders will be replaced with the customer data.
What's the best approach to do that?
Edited: The templates will be static, the main content w...
I'm looking for suggestions on the easiest way to create charts and have them printed out as PDFs. This has to be done in Java.
I was looking at something like: http://jcckit.sourceforge.net/index.html
But I don't see how to turn those charts into PDFs.
Any ideas?
...
I am creating pdf in java using iText jar. Here I am manually positioning the all components like Text,Paragraph,line etc. is There Any GUI Tool available for this?. I tried it google but couldnt find.
...
the below code merges the pdf files and returns the combined pdf data. while this code runs, i try to combine the 100 files with each file approximately around 500kb, i get outofmemory error in the line document.close();. this code runs in the web environment, is the memory available to webspehere server is the problem? i read in an arti...
Hi
I want to read pdf file with original content like its font(its possible that some font size is small while some font size is big ) and paragraph and table if it is.
how its possible.
plz help.
...
I want to read an existing PDF file, get not only the text, but also the format information like: Font (Bold, Italic),paragraphs,images, tables. Basically I want to write an HTML similar to PDF.
Is there an code library for doing this? I am looking for an Open Source Library.
Regards,
Tina Agrawal
...
How do I programmatically open a password protected PDF. I want to use ruby to open a password protected PDF and read its content.
I would like to know whether its possible and if it is possible guidance on the right way to do it
...
Hello,
I've recently started using iTextSharp to generate PDF reports from data. It works quite nicely.
In one particular report, I need a section to always appear at the bottom of the page. I'm using the PdfContentByte to create a dashed line 200f from the bottom:
cb.MoveTo(0f, 200f);
cb.SetLineDash(8, 4, 0);
cb.LineTo(doc.PageSize.W...
In my application I will receive a byte stream and convert it to a pdf file in the phone memory. How do I render that to a pdf? And show it on an activity?
...
I'm dynamically generating a PDF with a few variables but also need to be able to embed an image on the PDF. Anyone have any experience doing this using ?
...