ansaurus

Question

Answer 1

A:

Apart from tempFinalQuery being unused and an unnecessary map lookup to get the state, there doesn't seem to be anything too egregious in the code you post. Apart from the formatting...

If all the time is taken in the Parse methods, posting their code here would make sense.

Alabaster Codify 2009-01-02 13:58:50

Answer 2

A:

Hi,

Thanks for your response..... Here is that code which is calling Parse method (in foreach loop)and is taking the maximum amount of time.

    public void BuildNearestCitiesQuery(Hashtable nearestCities, string[] fields, BooleanQuery finalQuery)
    {
        QueryParser queryParserCity = new QueryParser("city", _analyzer);
        QueryParser queryParserState = new QueryParser("state", _analyzer);


        //Base City Query
        BooleanQuery baseCityStateQuery = new BooleanQuery();
        Query queryCity = queryParserCity.Parse(this._baseCity);
        Query queryState = queryParserState.Parse(this._baseStateCode);
        baseCityStateQuery.Add(queryCity, BooleanClause.Occur.MUST); //must is like an "AND"       
        baseCityStateQuery.Add(queryState, BooleanClause.Occur.MUST);
        BooleanQuery nearestCityQuery = new BooleanQuery();
        nearestCityQuery.Add(baseCityStateQuery, BooleanClause.Occur.SHOULD); //should is like an OR
        BooleanQuery cityStateQuery = null;
        queryCity = null;
        queryState = null;

        //Nearest Cities Query
        foreach (string city in nearestCities.Keys)
        {

            BooleanQuery tempFinalQuery = finalQuery;
            cityStateQuery = new BooleanQuery();
            queryCity = queryParserCity.Parse(city);
            queryState = queryParserState.Parse(((string[])nearestCities[city])[1]);
            cityStateQuery.Add(queryCity, BooleanClause.Occur.MUST); //must is like an AND
            cityStateQuery.Add(queryState, BooleanClause.Occur.MUST);
            nearestCityQuery.Add(cityStateQuery, BooleanClause.Occur.SHOULD); //should is like an OR
        }

        finalQuery.Add(nearestCityQuery, BooleanClause.Occur.MUST);
    }

Thanks.

2009-01-09 01:25:13

Answer 3

A:

I might have missed the point of your question but do you have the possibility of storing latitude and longitude for zip codes? If that is an option, you could then compute the distance between two coordinates providing a much more straightforward scoring metric.

Sugerman 2009-06-10 19:38:35

Could you please have a look at this and comment??Thanks.http://stackoverflow.com/questions/1052086/spatialquery-for-location-based-search-using-lucene

2009-06-27 23:54:18

Answer 4

+2 A:

Not quite sure if I completely understand your code, but when it comes to geospatial search a filter approach might be more appropriate. Maybe this link can give you some ideas - http://sujitpal.blogspot.com/2008/02/spatial-search-with-lucene.html

Maybe you can use Filters for other parts of your query as well. To be honest your query looks quite complex.

--Hardy

Hardy 2009-06-12 09:01:49

Could you please have a look at this and comment??Thanks.http://stackoverflow.com/questions/1052086/spatialquery-for-location-based-search-using-lucene

2009-06-27 23:55:05

Answer 5

A:

I believe the best approach is to move the the nearest city determination into a search filter. I would also reconsider how you have the field setup; consider creating one term that has city+state so that would simplify the query.

Aaron Saunders 2009-06-13 04:26:21

Answer 6

A:

I'd suggest:

storing the latitude and longitude of locations as they come in
when a user enters a city and distance, turn that into a lat/lon value and degrees
do a single, simple lookup based on numerical distance lat/lon comparisons

You can see an example of how this works in the Geo::Distance Perl module. Take a look at the closest method in the source, which implements this lookup via simple SQL.

Anirvan 2009-06-26 18:21:31

Answer 7

A:

Agree with the others here that this smells too much. Also doing a textual search on city names is not always that reliable. There is often a bit of subjectivity between place names (particularly areas within a city which might in themselves be large).

Doing a geo spatial query is the way to go. Not knowing the rest of your set up it is hard to advise. You do have Spatial support built into Fluent to NHibernate, and SQL Server 2008 for example. You could then do a search very quickly and efficiently. However, your challenge is to get this working within Lucene.

You could possibly do a "first pass" query using spatial support in SQL Server, and then run those results through Lucene?

The other major benefit of doing spatial queries is that you can then easily sort your results by distance which is a win for your customers.

Perhentian 2009-11-06 13:22:51

ansaurus

tags:

views:

answers:

Need Lucene query optimization advice

related questions