I'm playing around with a Solr-powered search for my webapp, and I figured it'd be best to use the DataImportHandler to handle syncing with the app via the database. I like the elegance of just checking the last_updated_date field. Good stuff. However, I don't know how to handle deleting documents with this approach. The way I see it...
Hi,
I am using PySolr to run my search. I want to index an rss feed and was wondering if this is possible using PySolr and if so how do you do it.
I have found instructions on how to do this in Solr at http://wiki.apache.org/solr/DataImportHandler#HttpDataSource_Example
but can't find anything on how to do the equivalent in PySolr...
Hi,
I'm new to Solr and having difficulties with the DataImportHandler. I get this message when the server starts:
Jan 27, 2010 12:27:22 PM org.apache.solr.handler.dataimport.DataImportHandler processConfiguration
INFO: Processing configuration from solrconfig.xml: {config=dih-gensigdb-jdbc.xml}
Jan 27, 2010 12:27:22 PM org.apache....
I have a WordPress MU (WPMU) install that I am trying to index in Solr (1.4). The way the database is set up in WPMU is that there is a master blogs table that contains some information about each blog and the primary blog ID. This blog ID is then used to create/access the necessary database tables.
ex:
Blog ID = 3 -> wp_3_posts
Blog I...
Is it possible to do so with a transformer (data-fonfig.xml) or in schema.xml somehow?
For instance, my database returns rows with two columns: IntegerColumn, VarcharColumn
1, 'small'
2, 'medium'
3, 'small, medium'
I'd like to add the 3rd row as two separate entries in the Solr index.
1, 'small'
2, 'medium'
3, 'small'
3, 'medium...
My data-config.xml defines 3 different entities under the document tag (lets call them foo, bar and baz). When I issue a basic full import localhost:8983/solr/dataimport?command=full-import, only 2 of the 3 entities get indexed (foo and bar are in my index but baz never makes it). However, if I then issue a command to just import baz v...
Hey guys, some help here would as always be greatly appreciated.
I'm indexing data from a db using Solr. Each row in the first table, event_titles, can have more than one start date associated with it, contained in the table event_dates. Data-config is as follows;
<entity name="events"
query="select id,title_id,name,summary,descripti...