Best practices for combining Lucene.NET and a relational database?

tags:

lucene.net

+2 Q:

Best practices for combining Lucene.NET and a relational database?

I'm working on a project where I will have a LOT of data. It will be searchable through several forms whose searches are very efficiently expressed as SQL queries, but it also needs to be searchable via natural language processing.

My plan is to build an index using Lucene for this form of search.

My question: if I do this and run a search, Lucene returns the IDs of the matching documents in the index, and I then have to look up those entities in the relational database.

This could be done in two ways (that I can think of so far):

N separate queries, one per ID (horrible).
Pass all the IDs to a stored procedure at once (perhaps as a comma-delimited parameter). This has the downsides of being limited by the maximum parameter size and of the slow performance of a UDF that splits the string into a temporary table.
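
A possible middle ground between those two options, sketched here only as an illustration: send the IDs in fixed-size batches through a parameterized IN clause, then put the rows back into the order the search engine ranked them. This is not from the original post. Python's built-in sqlite3 stands in for the real database, the `documents` table and the `fetch_in_rank_order` helper are hypothetical names, and the `hits` list is a made-up stand-in for the IDs Lucene would return.

```python
import sqlite3


def fetch_in_rank_order(conn, ranked_ids, batch_size=500):
    """Fetch rows for ranked_ids in batches; return them in ranked_ids order.

    Hypothetical helper: assumes a table documents(id, title).
    """
    rows_by_id = {}
    for start in range(0, len(ranked_ids), batch_size):
        batch = ranked_ids[start:start + batch_size]
        # One "?" placeholder per ID keeps the query parameterized,
        # so no string splitting is needed on the database side.
        placeholders = ",".join("?" * len(batch))
        sql = f"SELECT id, title FROM documents WHERE id IN ({placeholders})"
        for row_id, title in conn.execute(sql, batch):
            rows_by_id[row_id] = title
    # The database returns rows in arbitrary order, so restore the
    # search ranking here. IDs with no matching row are skipped.
    return [(i, rows_by_id[i]) for i in ranked_ids if i in rows_by_id]


# In-memory database standing in for the real backing store.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE documents (id INTEGER PRIMARY KEY, title TEXT)")
conn.executemany(
    "INSERT INTO documents VALUES (?, ?)",
    [(1, "alpha"), (2, "beta"), (3, "gamma"), (4, "delta")],
)

# Made-up search hits, best match first; 99 has no row in the database.
hits = [3, 1, 99, 4]
results = fetch_in_rank_order(conn, hits, batch_size=2)
print(results)  # [(3, 'gamma'), (1, 'alpha'), (4, 'delta')]
```

With this shape the number of round trips is the number of hits divided by the batch size rather than N, and each batch stays under whatever parameter limit the database imposes. Whether it beats a stored procedure on a given database is something to measure rather than assume.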

I'm almost tempted to mirror everything into Lucene's index, so that I can periodically regenerate the index from the backing store but only need to access the index for the frontend.

Advice?

A:

When I ran into this problem, I went with a relational database that has full-text search built in (I used PostgreSQL 8.3, which has built-in full-text support, including stemming and thesaurus support). That way the database can answer both plain SQL and full-text queries. The downside is that you need a database with full-text search capabilities, and those capabilities might be inferior to what Lucene can do.

SztupY 2009-06-13 14:51:54
