[prev in list] [next in list] [prev in thread] [next in thread] 

List:       solr-user
Subject:    Re: How might one search for dupe IDs other than faceting on the ID field?
From:       Michael Della Bitta <michael.della.bitta () appinions ! com>
Date:       2013-07-30 18:43:56
Message-ID: CAPe6Lt1d8P41Qad8GuhdUmdDmUy3LdLi-eYeBQcrjrqd9h4TwQ () mail ! gmail ! com
[Download RAW message or body]


Since this is a one-time problem, Have you thought of just dumping all the
IDs and looking for dupes using sort and awk or something similar to that?

Michael Della Bitta

Applications Developer

o: +1 646 532 3062  | c: +1 917 477 7906

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions <https://twitter.com/Appinions> | g+:
plus.google.com/appinions<https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts>
                
w: appinions.com <http://www.appinions.com/>


On Tue, Jul 30, 2013 at 2:38 PM, Dotan Cohen <dotancohen@gmail.com> wrote:

> On Tue, Jul 30, 2013 at 9:23 PM, Michael Della Bitta
> <michael.della.bitta@appinions.com> wrote:
> > Are you talking about the document's ID field?
> > 
> > If so, you can't have duplicates... the latter document would overwrite
> the
> > earlier.
> > 
> > If not, sorry for asking irrelevant questions. :)
> > 
> 
> In Solr 4.1 we were using overwrite=false&allowDups=false in order to
> discard the new document, not overwrite the extant document. We knew
> at the time that the features were depreciated, and apparently
> allowDups=false stopped working in 4.3. We are testing new solutions,
> but we need to identify the dupes to get them out.
> 
> --
> Dotan Cohen
> 
> http://gibberish.co.il
> http://what-is-what.com
> 



[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic