solr

Range Queries in Haystack

I need Haystack to send Solr range queries of the type q=age:[10 TO 15]. It messes up the querystring by tokenizing and adding the default operator AND. In this example, Solr receives the query as age:[10 and TO and 15] and it doesn't work. ...

Need help to retreive data after indexing a mysql table

Hi, Even after indexing a mysql table,in solr am not able to retreive data after querying like http://localhost:8983/solr/select/?q=slno:5 My data-config.xml file is : <?xml version="1.0" encoding="UTF-8"?> <dataConfig> <dataSource type="JdbcDataSource" driver="com.mysql.jdbc.Driver" url="jdbc:...

posting nutch data into a BASIC auth secured Solr instance

Hi. I've secured a solr instance using BASIC auth, kind of how it is shown here: http://blog.comtaste.com/2009/02/securing_your_solr_server_on_t.html Now i'm trying to update my batch processes to push data into the authenticated instance. The ones using "curl" are easy, but i also have a Nutch crawl that uses the "solrindex" command to...

Solr/Lucene: Indexing facet values

For example, say I have the following facet: Colors Red (7825) Orange (2343) Green (843) Blue (5412) In my database, colors would be a table and each color would have a primary key and a name/value. When indexing with Solr/Lucene, in all of the examples I've seen, the value is indexed and not the primary key. So if I filter by the ...

Multiple solr instances within Jetty or run Multiple Jetty servers, which is less intensive?

Hi, I am about to embark upon a new linode VPS server.I currently use both Tomcat and Jetty (on my development server) to serve different Solr, but having read around a bit I realise Tomcat can be quite a resource hog, hence the suggestion for me to use Jetty. I already have a Jetty development server set up with an application in Jetty...

Solr identifying words as spelled correctly when they're not

I'm following along with the Solr SpellCheck Component documentation, but can't seem to get this to work. The spell checking components seems to be running, but Solr is identifying the word as correctly spelled when it's not. How can I fix this? Example search query results for 'applicatoin' are... <response> <lst name="responseH...

Log too big in Solrj

Hi, I’m developing a Java project where I just included Solrj. I have noticed that the log file increases tremendously in size due to all the Sorlj messages. Is there any way to disable this option? Thanks very much in advance ...

solr admin gives 404 errors after integrating nutch

I've followed the instructions from http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ Had solr up and running before that, could handle test cases, access admin pages, etc. Copied the nutch schema.xml over to solr as per instructions. Worked, could access admin. When I added in the requesthandler snippet (see 5d on the websi...

How to manage "paging" with Solr?

I have a classifieds website... I have Solr doing the searching of the classifieds, and then return ID:nrs which I then use to put into an array. Then I use this array to find any classifieds in a MySql db where the ID:s match the ID:s in the array returned by Solr. Now, because this array can be very very big (100thousand records or mo...

Solr and facet search

Does facet searching come built in when you setup your schema or do you have to do some things to set this up? Does it basically work out of the box on all the fields that you have setup to be sortable? then you just use the fq query syntax and it will return the facet xml along with the search results? Is there a nice article on this...

Solr delete not working for some reason.

Just trying to delete all the documents, and did this: http://localhost:8983/solr/update?stream.body=%3Cdelete%3E%3Cquery%3E*:*%3C/query%3E%3C/delete%3E then committed: http://localhost:8983/solr/update?stream.body=%3Ccommit/%3E I get the response: <lst name="responseHeader"> <int name="status">0</int> <int name="QTime">17</int>...

Striping HTML in SOLR for storage, not indexing

Hey guys, I've managed to strip HTML from content when indexing data in SOLR. But is it possible to strip HTML from data when simply storing data? This is my field: <field name="Content" type="textNoHTML" indexed="true" stored="true"/> And, the field type "textNoHTML" implements the solr.HTMLStripCharFilterFactory: <charFilter clas...

Need help with location aware search

Hi, I am trying location aware search with spatial example found in http://www.ibm.com/developerworks/java/library/j-spatial/#indexing.approaches. The schema.xml has geohash field ,but this field is not present in any of the .osm files(present in data folder) used to index. Am not able to understand how value is assigned to it , so th...

Syntax for hsin query in solr

I have created a database and have used dataimport for retrieving the data. I tried out the hsin query: http://localhost:8983/solr/select/?q=name:Denver AND _val_:"recip(hsin(0.611941,-1.567437,lat_rad,lng_rad,3963.205),1,1,0)"^100 Though there are records that are retived, when the value of the radius is drastically reduced(7.205), th...

How to get Solr Extraction Handler to extract multiple META tags with the same name?

I am uploading some HTML documents to the SOLR extraction handler (Solr-Cell), which have META elements in the head in this form: <meta name="product" content="firstproduct" /> <meta name="product" content="anotherproduct" /> My schema has product defined as a multi-valued field: <field name="product" type="string" indexed="true" sto...

apache solr auto suggestions

I use solr+django-haystack I set settings.HAYSTACK_INCLUDE_SPELLING = True and rebuild index I'm trying to get any suggestion using: SearchQuerySet().auto_query('tryng ani word her').spelling_suggestion() But I always get None What should I do to get at least one working suggestion ? may be I need add some configuration into solr con...

Another dateTime question

I currently have a date in this format 2010-03-03 10:39:18 which is a TIMESTAMP field in MySQL. I need to have the date in this format for a search engine called Solr: 1995-12-31T23:59:59Z Here is some text from their website about dates: Solr expects dates to be in UTC when indexing. The format for this date field is of t...

Remove Expired Records in solr query syntax

Hi, I'm having a solr query syntax issue (I think) with solr 1.4. I'm trying exclude expired records from a solr query. However if the record doesn't have an expiry record, i would like to bring that record back also. E.g. To only get a list of record that haven't expired i am using this query: expirydate:[NOW/DAY TO *] Then I thoug...

Need help to resolve xml parser error

Hi, My xml file looks like this: <?xml version="1.0" encoding="UTF-8"?> <table_data name="finals"> <row id="2706" lat="40.6894729000" lng="-73.9692572000" body="Rumor:owner does buybacks atBrooklyn Public House in Fort Greene...are the bartenders as nice? #bars #nyc #fortgreene - www.buybacknyc.com"/> <row id="2707" lat="40.712...

Why are Solr Filters disable bypassed when doing a fuzzy searches?

My Solr setup contains the Filter LowerCaseFilterFactory for indexing and querying, so - the index only contains lowercase words and - every query is supposed to be turned lowercase before used for searching. When doing a non-fuzzy search like "mix:barba", they are working fine: - barba finds Barba and BARBA - Barba finds Barba and BARB...