solr

Ways to send binary/structured documents to SOLR?

I am using SOLR's ExtractingRequestHandler to ingest the text of documents. The examples in the documentation all use curl to stream documents, like so: curl 'http://.../extract?literal.id=doc1&commit=true' -F "[email protected]" That works just fine, but there is this note: using "curl" or other command line tools to p...

SOLR Tika: add text of file to existing record (ExtractingRequestHandler)

I am indexing posts in SOLR with "name", "title", and "description" fields. I'd like to later be able to add a file (like a Word doc or a PDF) using Tika / the ExtractingRequestHandler. I know I can add documents like so: (or through other interfaces) curl 'http://localhost:8983/solr/update/extract?literal.id=post1&commit=true...

SOLR indexing and searching?

Currently, I'm trying to add a new field to our SOLR engine. I've added the following into the schema.xml file. <field name='FIELDNAME' type='string' indexed='true' stored='false' /> The xml passed to solr for indexing is: <FIELDNAMES> <FIELDNAME>1</FIELDNAME> : : : <F...

Solr + Jetty Gives HTTP 503 on Debian

(This is a cross-post from servefault. I'm posting it here because no one answered my post there, and I feel that this sort of hits an awkward space half-way between both stackoverflow and serverfault.) I have modified the example project included with Solr for my needs (removing things like the example stopwords and defining my own sch...

EmbeddedSolrServer in SolrJ 1.4

I am using SolrJ 1.4, I am having trouble figuring out in what jar file the EmbeddedSolrServer lives. The documentation says that its namespace is org.apache.solr.client.solrj.embedded but I don't see this namespace among the libraries that make up SolrJ 1.4, according to the SolrJ Wiki. I am also missing the CoreContainer class that is...

Search result to show hits counts on artist and title

While playing with Solr and facets, I wanted to get the count of search hits on title and artist of my music albums. I find that faceting in solr doesn't have a straight forward method to do this since every document in Solr here will have both title and artist and the result expected here is count of search hits on each. Following is a...

How do I Improve performance of WCF Web Service?

I have an MVC application that calls several different web services. While running load tests on the application, I noticed that performance (both throughput and response times) degraded quickly as the concurrent users increased. By process of elimination, I stripped everything out except for a single dummy call to the web service which ...

How to sort solr without stopwords

I'm trying to sort a solr query by a field ignoring stopwords, but can't seem to find a way to do that. For example, I want the results to be sorted like: Charlie A Fox Helicopter Is this possible? Right now the field type is defined like: <fieldType name="alphaOnlySort" class="solr.TextField" sortMissingLast="true" omitNorms="t...

Display ellipsis before and after fragment in SOLR

I have SOLR configured to return fragments with a fragsize of 500. Sometimes, the whole field is 500 characters or less, so the fragment is identical to the field. For fields that are longer than that, SOLR just returns the fragment without any indication (or so it seems) that the fragment only represents part of the content of a field...

Indexing and searching MySQL with solr

(I have put ' in the XML below to make it display) Hi all I want to index my MySQL db table with solr. I have installed the necessary java components/adaptors etc. My database is called 'test_db' and the table in it is called 'table_tb'. The table contains 2 columns (fields) -Field 1 is called 'ID' and is an autoincremented primary ke...

Indexing and searching MySQL with solr

I have set up Solr and am trying to index a simple 2 column, 2 row table (MySQL 'test_tb' tabe within database 'test_db') with (first column) unique id (in the mysql of type int) and (second column) some text. I keep getting the error: WARNING: Error creating document : SolrInputDocument[{ID_F=ID_F(1.0)={1}}] org.apache.solr.common.Solr...

do I need to rebuild my lucene index for this change?

Do I need to rebuild a Lucene index when I only add a random field to a schema? Or could I run some code to update that field without rebuilding the index? This is the field I need to add: http://lucene.apache.org/solr/api/org/apache/solr/schema/RandomSortField.html In this case, Lucene is running on Solr. ...

Solr DataImportHandler delta import queries

The delta import syntax for the Solr 1.4 data import handler allows for up to 4 queries (query, deltaImportQuery, deltaQuery & parentDeltaQuery), but I am unclear on the usage of the "query" query. In the following example, the "query" query does the same as the deltaImportQuery without the where clause. <entity name="data-table" pk="...

Installing liferay with Solr : admin console unreachable + exceptions on launching

Hi, I try to install solr 1.4.1 with Liferay 5.5-5.2.3 running with Tomcat; my system is RHEL5. Below I explain the errors I meet; then I give the details of my solr install. When I start Tomcat, I have the following exceptions : INFO: Solr home set to '/opt/apache-solr-1.4.1/example/solr/' Aug 4, 2010 10:44:26 AM org.apache.solr.co...

Is it possible to combine SOLR and NHibernate Search

I am new to SOLR but am aware that it is built on top of lucene. I was wondering if it is possible to use existing NHibernate Search, which obviously works directly with Lucene, to add/update items in SOLR? If so, is this considered a bad practice? Thank in advance, JP ...

Using multiple tokenizers in Solr

Excuse me if this is a dumb question. I was just thrown into this task, so I don't know much about Solr, indexing, etc. But basically what we want to be able to do is perform a query and get results back that are not case sensitive and that match partial words from the index. We have a Solr schema set up at the moment that has been mo...

Solr Faceting result with more than just count

This is my requirment: I have a list of products indexed in Solr and categoryid, category name and category alias are MultiValued fields. I would like to do faceting on category but want the faceting result to have all categoryID, CategoryName and Category Alias along with the count number. Is this possible? Or I have to do faceting only...

Bypassing authentication for localhost in order to implement search in Etherpad

I'm trying to implement Nutch + Solr based search engine into my Etherpad installation. The main issue I'm having is that Nutch doesn't support POST authentication. Etherpad and Nutch are installed on the same machine, so an obvious solution would be to find a way to bypass authentication for localhost. This is where I'm stuck. I don't ...

Binding SolrNet query results to a GridView

Anyone know how to bind a SolrNet result set to a GridView control? I suspect it involves ObjectDataSource but I can't seem to get the right thing thrown together. Any help would be appreciated, I need this part done quickly. ...

Solr query for items with field:value or no value for field at all

I'm trying to query Solr for entries that either have a certain value in a field OR no value for the field. The negated field value works when used by itself but causes the query to return no results when combined with anything else. Below is a response with debugQuery enabled. The config file I use can be found here. <?xml version="1.0...