Suppose you have huge amount of documents, a few millions, and more coming. Suppose you have to publish them on a website with hundreds of thousands of daily pageviews, and suppose you have to let the visitors perform searches on all the documents. What is the best way to store/retrieve the documents? And to perform searches on them?
Currently the system i'm talking about uses swish-e to create indexes for static html files. These files are served as static content to the users.
Is this a good way?
What's the best solution between a database to store and search the contents (MySQL) and store the contents in static files and use a tool to index them and perform searches (Swish-e)?