ansaurus

Question

Answer 1

+2 A:

First of all you should be closing your document.

Besides that, what I suggest doing is resaving your original Word document as a Word XML document, then changing the extension manually from .XML to .doc . Then look at the XML of the actual document you're working with and trace the content to make sure you're not accidentally editing hexadecimal values (AAA and EEE could be hex values in other fields).

Without seeing the actual Word document it's hard to say what's going on.

There is not much documentation about POI at all, especially for Word document unfortunately.

AlbertoPL 2009-05-10 21:41:10

First of all, Thank you very much for your answer...I add 'Finally' section just to close the document, thanks for your interest.The values AAA and EEE is not a valid values, i use them just for an example, so my actual values not like that, it something like : <<SubSource>>, <<Date>> and so on....For your suggestion about save the doc file as a XML file from MS Word : can i ask you a question : if i save the file as a xml file can i open the file from SAXParser and replace the text what i need to replace or it should be encrypted ???

Saeed 2009-05-11 06:46:06

Yes you can open it from SAXParser once saved to an XML.

AlbertoPL 2009-05-11 11:18:25

Answer 2

A:

Could this be the issue?

pugmarx 2009-05-11 06:53:26

Answer 3

+1 A:

You could try OpenOffice API, but there arent many resources out there to tell you how to use it.

01 2009-05-11 07:15:06

Thanks very much. I use These APIs just for open the .docx files and get the core document as an XML file and parse it using XML parser, then search for what i need using XPath, and every thing is OK. Another Solution, without OpenXML API, you can open the .docx file in MSWord 2007, save it as XML file -NOT 2003 XML-, parse the XML file it in java and replace what you need. In this solution you can replace images also. Image stored in the XML file as a Base64 encoded string and you can replace this string with another string representing the encoding of another images using Base64 encoding

Saeed 2009-05-15 14:50:51

Answer 4

A:

Hi all,,, I don't know : is its OK to answer myself, but Just to share the knowledge, I'll answer myself.

After navigating the web, the final solution i found is : The Library called docx4j is very good for dealing with MS docx file, although its documentation is not enough till now and its forum is still in a beginning steps, but overall it help me to do what i need..

Thanks 4 all who help me..

Saeed 2009-05-20 08:40:14

The documentation has improved somewhat over the last 6 months or so. See http://dev.plutext.org/svn/docx4j/trunk/docx4j/docs/Docx4j_GettingStarted.html

plutext 2010-07-16 01:11:46

Answer 5

A:

You can also try this one: http://www.dancrintea.ro/doc-to-pdf/

gusti 2009-11-19 09:20:14

ansaurus

tags:

views:

answers:

Open Microsoft Word in Java

related questions