Hello SO
i am having trouble trying to extract embedded documents inside Office <2003 . I am able to parse and index office files but i need to index embedded ole documents also.
How can i achieve? Any lib for python or command line programs?
i found ripole: www.pldaniels.com/ripole/ , but it just giving me Segfault..
I need native python or C or java . in platform independent manner (no reliant on COM , win32 API, or .NET)