List: solr-user
Subject: How might one search for dupe IDs other than faceting on the ID field?
From: Dotan Cohen <dotancohen () gmail ! com>
Date: 2013-07-30 18:16:03
Message-ID: CAKDXFkMmWm6OadxYqOUUadJFPUmJOkxxjnjA_msUwBWEfE758Q () mail ! gmail ! com
To search for duplicate IDs, I am running the following query:
select?q=*:*&facet=true&facet.field=id&rows=0
However, since upgrading from Solr 4.1 to Solr 4.3 I receive an
OutOfMemoryError instead of the desired facet:
<response><lst name="error"><str
name="msg">java.lang.OutOfMemoryError: Java heap space</str><str
name="trace">java.lang.RuntimeException: java.lang.OutOfMemoryError:
Java heap space
at org.apache.solr.servlet.SolrDispatchFilter.sendError(SolrDispatchFilter.java:670)
at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:380)
at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:155)
at ...
Might there be a less resource-intensive way to get this information?
This is Solr 4.3 running on Ubuntu Server 12.04 in Jetty. The index
has over 100,000,000 small records, for a total of about 95 GiB of
disk space, with Solr running on its own disk. Actually, the 'disk'
is an Amazon Web Service EBS volume.
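For reference, the facet query above can be built programmatically. The
sketch below is only an illustration: the host, port, and core name
(localhost:8983, collection1) are assumptions, as is the helper name
duplicate_id_query. It adds the standard facet.mincount and facet.limit
parameters so that only IDs occurring at least twice are returned. Whether
that avoids the heap error on an index of this size is untested.

```python
from urllib.parse import urlencode


def duplicate_id_query(base_url, field="id", limit=100):
    """Build a Solr select URL that facets on `field` and keeps only
    values seen at least twice (hypothetical helper, not part of Solr)."""
    params = [
        ("q", "*:*"),              # match every document
        ("rows", 0),               # no documents needed, only facet counts
        ("facet", "true"),
        ("facet.field", field),
        ("facet.mincount", 2),     # drop values that occur only once
        ("facet.limit", limit),    # cap the number of facet values returned
        ("wt", "json"),
    ]
    return base_url.rstrip("/") + "/select?" + urlencode(params)


# Assumed local Solr URL and core name, for illustration only.
print(duplicate_id_query("http://localhost:8983/solr/collection1"))
```

The facet counts are still computed server-side over the whole id field, so
this mainly shrinks the response rather than the work done to produce it.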
--
Dotan Cohen
http://gibberish.co.il
http://what-is-what.com