hwpf

How to use Apache HWPF to extract text and images out of a DOC file

Hi! I downloaded Apache HWPF. I want to use it to read a DOC file and write its text into a plain text file. I don't know HWPF very well. My very simple program is here: I have three problems now: Some of the packages have errors (they can't find Apache HDF). How can I fix them? How can I use the methods of HWPF to find and extract...

Java: parsing an MS Word document using POI/HWPF

I have an MS Word document (MS Office 2003; non-XML). Within this document there is a string associated with a bookmark. Furthermore, the document contains Word macros. My goal is to read the document with Java, replace the string associated with the bookmark, and save the document back to Word format. My first approach was using Ap...

Java MS Word library

Hello, I have a document and I want to replace all template parameters such as {name} and {address} with my own values. I tried to use the POI HWPF library, but the document formatting was lost after saving. Even when I open my MS Word document with POI and save it without any changes, the formatting is lost. Formatting means centering, font size, etc. (...

Apache POI HWPF Nested Lists?

Hi, I used the following code to read list structures from a Word file using HWPF. My question is how to read a list structure that is nested, i.e. a list within a list within a list, and so on. if (p instanceof ListEntry) { ListEntry entry = (ListEntry) p; outText = entry.text(); outText = ...
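One way to picture the nested-list problem is as an indentation exercise: if every list paragraph carries a zero-based nesting level, the hierarchy can be rebuilt from the flat paragraph sequence. The sketch below is plain Java and does not call POI. The class `NestedListOutline` and its methods are hypothetical names for illustration, and it assumes the level has already been read from each paragraph (in HWPF, `Paragraph.getIlvl()`, which `ListEntry` inherits, is the likely source, but check this against the POI version in use).

```java
// Hypothetical helper: rebuilds a nested outline from (level, text) pairs.
// It assumes the levels were obtained elsewhere, e.g. from each list paragraph.
class NestedListOutline {

    // Renders one list item, indented two spaces per nesting level.
    static String indent(int level, String text) {
        StringBuilder line = new StringBuilder();
        for (int i = 0; i < Math.max(level, 0); i++) {
            line.append("  ");
        }
        return line.append("- ").append(text.trim()).toString();
    }

    // Renders all items in document order, one per line.
    static String render(int[] levels, String[] texts) {
        if (levels.length != texts.length) {
            throw new IllegalArgumentException("levels and texts must have the same length");
        }
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < levels.length; i++) {
            out.append(indent(levels[i], texts[i])).append('\n');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        int[] levels = {0, 1, 2, 0};
        String[] texts = {"Fruit", "Citrus", "Lemon", "Vegetables"};
        System.out.print(render(levels, texts));
    }
}
```

Because the level travels with each paragraph, no recursion is needed: a list within a list within a list is just a run of paragraphs whose level rises by one each time.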

Apache POI - HWPF Table of Contents

When I try to parse a DOC file with a table of contents, I get something like this: PAGEREF _Toc126303503 \h How do I seek to the specific part of the DOC file that an item in the table of contents points to? Thanks ...
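The text `PAGEREF _Toc126303503 \h` is a Word field code: the token after `PAGEREF` names a bookmark, and `\h` is a switch. A first step toward finding the target is therefore to pull the bookmark name out of the field code. The sketch below does only that, in plain Java without POI. The class `PageRefFields` and the method `bookmarkName` are hypothetical names for illustration, and looking the name up among the document's bookmarks is a separate step not shown here.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: extracts the bookmark name from a PAGEREF field code.
class PageRefFields {

    // Matches "PAGEREF <bookmark>"; switches such as \h that follow are ignored.
    private static final Pattern PAGEREF = Pattern.compile("\\bPAGEREF\\s+(\\S+)");

    // Returns the bookmark name, or an empty Optional if this is not a PAGEREF field.
    static Optional<String> bookmarkName(String fieldCode) {
        Matcher m = PAGEREF.matcher(fieldCode);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    public static void main(String[] args) {
        // Prints: _Toc126303503
        System.out.println(bookmarkName(" PAGEREF _Toc126303503 \\h ").orElse("(none)"));
    }
}
```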

Apache POI HWPF - problem converting a DOC file to PDF

Hi. I am currently working on a Java project that uses Apache POI. In this project I want to convert a DOC file to a PDF file. The conversion completes successfully, but the PDF contains only the text, without any text style or text colour. My PDF file looks black and white, while my DOC file is coloured and uses different text styles. This is my code,...