solr

WildcardQuery error in Solr

I use Solr to search for documents. When I search with the query "id:*", I get a query parser exception saying that it cannot parse a query with * or ? as the first character. HTTP Status 400 - org.apache.lucene.queryParser.ParseException: Cannot parse 'id:*': '*' or '?' not allowed as first character i...

How to get facet ranges in solr results?

Assume that I have a field called price for the documents in Solr and I have that field faceted. I want to get the facets as ranges of values (e.g. 0-100, 100-500, 500-1000, etc.). How can I do this? I can specify the ranges beforehand, but I also want to know whether it is possible to calculate the ranges (say for 5 values) automatically base...
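
For the case where the ranges are specified beforehand, one possible approach is to send one `facet.query` parameter per range. The sketch below only builds the request parameters; the helper name `price_range_facet_params` and the bounds are illustrative assumptions, and it does not address calculating the ranges automatically.

```python
from urllib.parse import urlencode


def price_range_facet_params(bounds, field="price"):
    """Build Solr request params with one facet.query per consecutive pair of bounds.

    `bounds` is a sorted list such as [0, 100, 500, 1000] (hypothetical values).
    """
    params = [("q", "*:*"), ("rows", "0"), ("facet", "true")]
    for low, high in zip(bounds, bounds[1:]):
        # [low TO high] is inclusive at both ends, so a document priced
        # exactly at a shared bound is counted in both neighbouring ranges.
        params.append(("facet.query", f"{field}:[{low} TO {high}]"))
    return params


params = price_range_facet_params([0, 100, 500, 1000])
print(urlencode(params))  # query string to append to the select URL
```

Each `facet.query` should come back with its own count in the facet section of the response.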

What are some Search Servers out there?

I'm looking to find alternatives to Solr from the Apache Software Foundation. For those that don't know, Solr is an enterprise search server. A client application uses a web-service-like interface to submit documents for indexing and also to perform search queries. Solr has other features built in, like caching and replication. I belie...

Java Lucene integration with .Net

I've got Nutch and Lucene set up to crawl and index some sites and I'd like to use a .NET website instead of the JSP site that comes with Nutch. Can anyone recommend some solutions? I've seen solutions where there was an app running on the index server which the .NET site used remoting to connect to. Speed is a consideration obviously ...

Using Nutch crawler with Solr

Can I integrate the Apache Nutch crawler with the Solr index server? Edit: One of our devs came up with a solution from these posts Running Nutch and Solr Update for Running Nutch and Solr Answer Yes ...

Solr - Getting facet counts without returning search results

Hi, I need to return only the facet counts from Solr. So I basically want to search over all documents and return the facet counts, but I don't want to return any search results. Is this possible? Thanks ...
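
A minimal sketch of one way to request this: match all documents, ask for zero rows, and turn faceting on. The base URL and the `category` field are assumptions made for illustration, and `facet_only_url` is a hypothetical helper.

```python
from urllib.parse import urlencode


def facet_only_url(base_url, facet_field):
    """Build a select URL that asks for facet counts and no documents."""
    params = {
        "q": "*:*",                 # match every document
        "rows": 0,                  # return no search results
        "facet": "true",            # enable faceting
        "facet.field": facet_field,
    }
    return f"{base_url}/select?{urlencode(params)}"


# Hypothetical host and field name.
url = facet_only_url("http://localhost:8983/solr", "category")
print(url)
```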

Solr Filter for Logging/Analysing Queries

I'm using Solr and wish to be able to log queries/tokens to a database for analytics for both marketing and suggested search functionality. I'm guessing that a Lucene filter would be a good way of doing this, maybe putting a filter in the query filter chain for a given field type which logs to a specified database. I'm wondering the bes...

How do we create a simple search engine using Lucene, Solr or Nutch?

Our company has thousands of PDF documents. How do we create a simple search engine using Lucene, Solr or Nutch? We'll provide a basic Java/JSP web page where people can type in words and perform basic and/or queries, then show them the document links of all matching PDFs. ...

Retrieving specific fields in a Solr query?

I am running a Solr instance on Jetty and when I search using the Solr admin panel, it returns the entire document. What should I do to get only specified fields from each Solr document returned by the search? ...
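
As a rough illustration, Solr's `fl` (field list) request parameter limits which stored fields come back for each document. The base URL, the query, and the field names below are assumptions, and `select_fields_url` is a hypothetical helper that only builds the URL.

```python
from urllib.parse import urlencode


def select_fields_url(base_url, query, fields):
    """Build a select URL whose response carries only the listed fields."""
    params = {"q": query, "fl": ",".join(fields)}
    return f"{base_url}/select?{urlencode(params)}"


# Hypothetical host, query, and field names.
url = select_fields_url("http://localhost:8983/solr", "title:lucene", ["id", "title"])
print(url)
```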

Boolean NOT in solr query

How do I select or delete all the documents from Solr using the Boolean NOT operator? I.e., how do I delete all the documents from Solr whose id does NOT start with A59? ...
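
One way this might be expressed is a delete-by-query message in Solr's XML update format, where the query matches everything and then excludes the prefix. This is a sketch under the assumption that `id` is a string field on which the prefix query `id:A59*` works; `delete_by_query_xml` is a hypothetical helper that only builds the message body.

```python
from xml.sax.saxutils import escape


def delete_by_query_xml(query):
    """Wrap a query in a Solr XML delete message, escaping XML metacharacters."""
    return f"<delete><query>{escape(query)}</query></delete>"


# "*:*" matches all documents; "-id:A59*" removes those whose id starts with A59.
body = delete_by_query_xml("*:* -id:A59*")
print(body)
```

The body would be posted to the update handler and followed by a commit. Running the same query as a normal search first is a cheap way to check which documents it would remove.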

Recommendations for a spidering tool to use with Lucene or Solr?

What is a good crawler (spider) to use against HTML and XML documents (local or web-based) and that works well in the Lucene / Solr solution space? Could be Java-based but does not have to be. ...

Is there a set of best practices for building a Lucene index from a relational DB?

I'm looking into using Lucene and/or Solr to provide search in an RDBMS-powered web application. Unfortunately for me, all the documentation I've skimmed deals with how to get the data out of the index; I'm more concerned with how to build a useful index. Are there any "best practices" for doing this? ...

Crawler/parser for Xapian

I would like to implement a search engine which should crawl a set of web sites, extract specific information from the pages and create full-text index of that specific information. It seems to me that Xapian could be a good choice for the search engine library. What are the options for a crawler/parser to integrate with Xapian? Would...

Book recommendation for Solr

I want to know how to get the best out of Solr. Which books, tutorials, reference sites should I read to learn about it? Thanks. ...

Solr: Using Regex fragmenter to extract paragraphs

Hello, I posted this message to the Solr mailing list, but I'm trying here too in case there's a Solr expert lurking around. I am trying to use the regex fragmenter and am having a hard time getting the results I want. I am trying to get fragments that start on a word character and end on punctuation, but for some reason the fragments ...

Is there an easy way to make Solr reference different indexes based on a set of credentials submitted with the request?

I'd like to have a single instance of Solr, protected by some sort of authentication, that operated against different indexes based on the credentials used for that authentication. The type of authentication is flexible, although I'd prefer to work with open standards (existing or emerging), if possible. The core problem I'm attempting ...

acts_as_solr returns all rows in the database when using the model as search query

In our application we're using acts_as_solr for search. Everything seems to be running smoothly except for the fact that using the model name as the search query returns every single row in the table. For example, let's say we have a users table. We specify acts_as_solr in our model to search the fields first name, last name and hand...

Preceding and trailing words with the searched word

How do we get some preceding and trailing words along with the word we are searching for in Solr? ...

Best full text search for mysql?

We're currently running MySQL on a LAMP stack and have been looking at implementing a more thorough, full-text search on our site. We've looked at MySQL's own freetext search, but it doesn't seem to cope well with large databases, which makes it far too slow for our needs. Our main requirements are: speed returning results simple upd...

Solr search with escaping solr reserved keywords

How do I query fields that contain Solr reserved characters such as ":"? For instance, q = 'uri:http://www.example.com' throws an error because "http://www.example.com" contains the reserved character ":" ...
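
A common approach is to backslash-escape the query parser's special characters in the value before building the query. The sketch below is an assumption-laden illustration: `escape_term` is a hypothetical helper, and the character set follows the Lucene query syntax special characters (the `/` is only special in newer parser versions, where escaping it should still be harmless).

```python
import re

# Characters the Lucene/Solr query parser treats specially.
_SPECIAL_CHARS = re.compile(r'([+\-&|!(){}\[\]^"~*?:\\/])')


def escape_term(value):
    """Put a backslash in front of every query-syntax special character."""
    return _SPECIAL_CHARS.sub(r"\\\1", value)


query = "uri:" + escape_term("http://www.example.com")
print(query)
```

Wrapping the value in double quotes, as in `uri:"http://www.example.com"`, is another option worth trying when the field is not tokenized.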