ansaurus

Question

Answer 1

+3 A:

Instead of using Lucene, you could use Solr to index with nutch (see here), then you can connect very easily to Solr using one of the two libraries available: SolrSharp and SolrNet.

Mauricio Scheffer 2008-10-15 12:54:52

looks really good, will it be able to take my lucene indexes?

Scott Cowan 2008-10-15 14:16:45

Haven't tried, but it should... trying it is the only way to be sure :)

Mauricio Scheffer 2008-10-15 14:28:41

I'm looking at hadoop compatibility too

Scott Cowan 2008-10-15 16:50:09

Hadoop is java-only AFAIK, and I don't know its interoperability with other platforms...

Mauricio Scheffer 2008-10-17 00:49:18

I'm running everything on debian anyways even asp.net

Scott Cowan 2008-10-17 09:17:53

Answer 2

A:

Instead of using Solr, I wrote a java based indexer that runs in a cron job, and a java based web service for querying. I actually didn't index pages so much as different types of data that the .net site uses to build the pages. So there's actually 4 different indexes each with a different document structure that can all be queried in about the same way (say: users, posts, messages, photos).

By defining an XSD for the web service responses I was able to both generate classes in .net and java to store a representation of the documents. The web service basically runs the query on the right index and fills out the response xml from the hits. The .net client parses that back into objects. There's also a json interface for any client side JavaScript.

dlamblin 2008-10-15 14:41:29

Answer 3

A:

SearchBlackBox Luca.Net is a commercial Apache Lucene compatible full-text search API for .NET. It allows you to provide Lucene-powered solutions for .NET.

gimel 2008-10-15 15:38:38

Good solution but its out of our budget, we're just a poor startup that can't afford 3500 for a interop library

Scott Cowan 2008-10-15 16:49:35

link is currently broken

Mauricio Scheffer 2009-08-18 21:01:10

Answer 4

+3 A:

In case it wasn't totally clear from the other answers, Lucene.NET and Lucene (Java) use the same index format, so you should be able continue to use your existing (Java-based) mechanisms for indexing, and then use Lucene.NET inside your .NET web application to query the index.

From the Lucene.NET incubator site:

In addition to the APIs and classes port to C#, the algorithm of Java Lucene is ported to C# Lucene. This means an index created with Java Lucene is back-and-forth compatible with the C# Lucene; both at reading, writing and updating. In fact a Lucene index can be concurrently searched and updated using Java Lucene and C# Lucene processes

Winston Fassett 2008-10-27 19:25:34

how about using it with hadoop?

Scott Cowan 2008-10-30 22:43:15

How do you want to combine Lucene with Hadoop? Index data that's already in Hadoop? Store a distributed lucene index in Hadoop? The latter would probably require a special version of lucene in order to distribute/query, but maybe someone's tried to do it, but probably in java.

Winston Fassett 2008-10-31 15:02:13

Answer 5

+1 A:

I'm also working on this.

http://today.java.net/pub/a/today/2006/02/16/introduction-to-nutch-2.html

It seems you can submit your query to nutch and get the rss results back.

edit:

Got this working today in a windows form as a proof of concept. Two textboxes(searchurl and query), one for the server url and one for the query. One datagrid view.

private void Form1_Load(object sender, EventArgs e)
        {
            searchurl.Text = "http://localhost:8080/opensearch?query=";


    }

    private void search_Click(object sender, EventArgs e)
    {
        string uri;

        uri = searchurl.Text.ToString() + query.Text.ToString();
        Console.WriteLine(uri);

        XmlDocument myXMLDocument = new XmlDocument();

        myXMLDocument.Load(uri);

        DataSet ds = new DataSet();

        ds.ReadXml(new XmlNodeReader(myXMLDocument));

        SearchResultsGridView1.DataSource = ds;
        SearchResultsGridView1.DataMember = "item";

    }

Sam 2009-02-24 16:39:27

well done, We're starting to use Solr for this

Scott Cowan 2009-02-25 10:55:30

And it seems our division is probably going with windows search server express.

Sam 2009-02-25 21:58:59

Answer 6

A:

Why not switch from java lucene to the dot net version. Sure it's an investment but it's mostly a class substitution exercise. The last thing you need is more layers that add no value other than just being glue. Less glue and more stuff is what you should aim for...

mP 2009-05-18 06:38:53

lucene.net has no Hadoop provider which is why we're on solr now

Scott Cowan 2009-05-19 12:11:38

ansaurus

tags:

views:

answers:

Java Lucene integration with .Net

related questions