I have a digital library system where I store metadata and the path to physical file in the database. The files may be anything: plain text,Word,PDF,MP3,JPEG,MP4...
How can I provide full text search to both my domain objects and the physical files (or some text extraction of the files).
Is my only choice to store the document text in the domain object? I do need to be able to retrieve a list of domain objects regardless of if the search results come from the domain object or the physical document. There is of course a possible connection using the file path and I actually drop each document in a folder named by a GUID, so the connection is there.
I need to do this in Grails, ideally using the solr or searchable plugin, but a Java solution would help.