I am looking for a very robust software search engine to integrate in a .Net web site.
The current proposed solution is Lucene.NET a stack based on Lucene. However, I would like to evaluate other search engines before making my mind up.
The feature set we need is the following:
- Ability to crawl arbitrary pages via HTTP
- Ability to parse sitemaps
- Ability to get lists of URIs to parse via a database look-up
- Ability to restrict the search to a particular language/locale
- Ability to restrict the search to a subset of the pages (e.g. via a regex on the URI)
- Speed and scalability (this is for a public website with a ton of traffic)
- Must have .NET API support or a super-easy http-based API that can be wrapped in a .NET API
- Language-dependent full-text support
Other things which would be great, but not deal-breakers if they aren't supported:
- Reporting
- Aliasing and biasing of results
- HTTP-based administration pages
- SQL Server support
What other software search engines have worked for you? Is there any you would recommend or that we should avoid?