views:

15

answers:

0

Hello SO

i am having trouble trying to extract embedded documents inside Office <2003 . I am able to parse and index office files but i need to index embedded ole documents also.

How can i achieve? Any lib for python or command line programs?

i found ripole: www.pldaniels.com/ripole/ , but it just giving me Segfault..

I need native python or C or java . in platform independent manner (no reliant on COM , win32 API, or .NET)