tags:

views:

33

answers:

1

I have extracted a number of OleObject files from a Word DOCX document (oleObject1.bin ... etc.) Is there Open Source code (primarily in Java) to read this. (I would only be interested in *.net code if it can be easily ported to Java). I am not interested in the rest of the Word document (in this question). There is an Apache project (http://poi.apache.org/) - I'd be interested in how well this works

+1  A: 

Apache POI - POIFS is a Java implementation of the OLE 2 Compound Document format. In my experience it works pretty well for the sort of scenario you are reffering to. Check it out.

VoidPointer
+1 for the encouragement. I'll post how I get on
peter.murray.rust