distinct SOLR field values without count

tags: facet, solr

Hi,
My question is pretty similar to this question.
The difference is that I need the least RAM-intensive way to gather information about the distinct values. I DON'T care about the actual counts in this case; I just want to know the possible values for that field.
I'm constantly running out of heap space (30 million+ documents), and there has to be some way or parameter to do this in a memory-saving way.

I don't know about RAM usage, but you might want to try field collapsing. You will find the patch for Solr here.

Jem 2010-07-14 12:40:19

That seems to be relevant only for the result set. I don't let Solr return any rows; I'm only interested in the facet fields.

Marc Seeger 2010-07-16 09:03:04

If the number of distinct values is high, you will probably need to do facet paging. Use the facet.offset and facet.limit parameters.
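As a rough illustration of facet paging, here is a minimal Python sketch. It assumes the JSON response writer with its default flat facet layout (value, count, value, count, ...), and it sets rows=0 so that no documents come back. The helper names and the `fetch` callable are hypothetical; `fetch` stands in for whatever HTTP client you use to send the parameters to your Solr select handler and parse the JSON reply. Sorting by index rather than by count is my own choice here, to keep the page order stable.

```python
from urllib.parse import urlencode


def facet_page_params(field, offset, limit):
    """Build the query parameters for one page of facet values."""
    return {
        "q": "*:*",
        "rows": 0,              # return no documents, only facet data
        "wt": "json",
        "facet": "true",
        "facet.field": field,
        "facet.sort": "index",  # assumption: index order keeps pages stable
        "facet.mincount": 1,    # skip values that match no documents
        "facet.limit": limit,
        "facet.offset": offset,
    }


def distinct_values(fetch, field, page_size=100):
    """Yield every distinct value of `field`, one page at a time.

    `fetch` is a caller-supplied (hypothetical) function that takes a dict
    of parameters and returns the parsed JSON response as a dict.
    """
    offset = 0
    while True:
        response = fetch(facet_page_params(field, offset, page_size))
        flat = response["facet_counts"]["facet_fields"][field]
        values = flat[0::2]     # even positions hold values, odd hold counts
        if not values:
            break
        yield from values
        if len(values) < page_size:
            break               # a short page means this was the last one
        offset += page_size
```

The query string for a single page can be produced with `urlencode(facet_page_params("category", 0, 100))`, where `category` is a placeholder field name. Whether paging actually lowers heap usage on the server is something to measure, since Solr may still need to build the full facet structure for the field.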

Pascal Dimassimo 2010-07-14 13:23:18

What would be "high"? The top field probably has 100 possible values.

Marc Seeger 2010-07-14 14:39:50

The default for facet.limit is 100, so that is usually not considered "high". But try facet.limit=10 and see how it goes.

Pascal Dimassimo 2010-07-14 16:25:19
