I know there are already objects supporting Office 2007 files, but is there any native support for Office 2003 or earlier?
A:
There doesn't seem to be anything bundled with Zend_Search_Lucene for those.
Still, since it can index HTML documents, if you can find a way to convert your Office 2003 documents to HTML (at least for indexing, keeping the original version alongside the HTML one for consultation), you might be able to index those.
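A minimal sketch of that idea, assuming Zend Framework 1 is on the include path and that some external tool has already produced an HTML copy of each Office file. The function name `index_converted_html()` and the `original_path` field are hypothetical, chosen only for illustration.

```php
<?php
// Sketch only: assumes Zend Framework 1 is on the include path.
// index_converted_html() and the 'original_path' field name are hypothetical.

/**
 * @param string $indexDir  directory that will hold the Lucene index
 * @param array  $documents map of converted HTML path => original Office file path
 * @return int number of documents added
 */
function index_converted_html($indexDir, array $documents)
{
    require_once 'Zend/Search/Lucene.php';
    require_once 'Zend/Search/Lucene/Document/Html.php';

    // Create a fresh index; Zend_Search_Lucene::open() would reuse an existing one.
    $index = Zend_Search_Lucene::create($indexDir);

    $added = 0;
    foreach ($documents as $htmlPath => $originalPath) {
        // Parse the HTML copy so its text becomes searchable.
        $doc = Zend_Search_Lucene_Document_Html::loadHTMLFile($htmlPath);

        // Store, without indexing, the location of the original Office file
        // so search results can link back to it.
        $doc->addField(Zend_Search_Lucene_Field::UnIndexed('original_path', $originalPath));

        $index->addDocument($doc);
        $added++;
    }

    $index->commit();
    return $added;
}
```

With an index built this way, each hit returned by `$index->find(...)` should expose the stored field as `$hit->original_path`, so users can be sent to the original document rather than the HTML copy.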
Pascal MARTIN
2009-10-30 05:54:49
A:
I would recommend indexing the documents with Solr and Tika together, then querying your Solr/Lucene index from PHP over JSON. See the ExtractingRequestHandler article on the Solr wiki for more information.
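The query side of this can be sketched in plain PHP. This is an illustration under assumptions, not a full client: the base URL, the helper names `solr_select_url()` and `solr_parse_docs()`, and the `id` field in the usage comment are all hypothetical and depend on your Solr setup and schema. Indexing through Tika is a separate step, covered by the ExtractingRequestHandler article.

```php
<?php
// Sketch only: helper names and the base URL are assumptions for illustration.

/**
 * Build a Solr select URL that asks for a JSON response (wt=json).
 */
function solr_select_url($baseUrl, $query, $rows = 10)
{
    $params = array('q' => $query, 'wt' => 'json', 'rows' => $rows);
    // Pass '&' explicitly so the result does not depend on php.ini settings.
    return rtrim($baseUrl, '/') . '/select?' . http_build_query($params, '', '&');
}

/**
 * Extract the list of matching documents from a Solr JSON response body.
 * Returns an empty array if the body is not the expected JSON.
 */
function solr_parse_docs($json)
{
    $data = json_decode($json, true);
    if (!is_array($data) || !isset($data['response']['docs'])) {
        return array();
    }
    return $data['response']['docs'];
}

// Usage against a running Solr instance (URL and field name are assumptions):
// $json = file_get_contents(solr_select_url('http://localhost:8983/solr', 'budget report'));
// foreach (solr_parse_docs($json) as $doc) {
//     echo $doc['id'], "\n";
// }
```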
Brian Mansell
2009-10-30 14:36:08