List: solr-user
Subject: Re: Why would one not use RemoveDuplicatesTokenFilterFactory?
From: Dotan Cohen <dotancohen () gmail ! com>
Date: 2013-05-27 7:17:19
Message-ID: CAKDXFkOx-NXuz7gDgSHLh0hm-gJK6E2oEr6m3Pn-uv204u0QMQ () mail ! gmail ! com
On Sun, May 26, 2013 at 8:16 PM, Jack Krupansky <jack@basetechnology.com> wrote:
> The only comment I was trying to make here is the relationship between the
> RemoveDuplicatesTokenFilterFactory and the KeywordRepeatFilterFactory.
>
> No, stemmed terms are not considered the same text as the original word. By
> definition, they are a new value for the term text.
>
>
I see. For some reason I overlooked this key phrase of yours:
"...to remove the tokens that did not produce a stem ..."
Now it makes perfect sense.
Thank you, Jack!
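
For anyone finding this thread later, here is a rough Python sketch of how I now understand the interaction. It is not Solr code: the Token type, the function names, and the toy suffix-stripping stemmer are all made up for illustration. The assumption is that KeywordRepeatFilterFactory emits each token twice at the same position (one copy protected from stemming), and that RemoveDuplicatesTokenFilterFactory drops a token only when its text and position both match an earlier token.

```python
from typing import List, NamedTuple, Tuple


class Token(NamedTuple):
    text: str
    position: int
    keyword: bool = False  # True means "protected from stemming"


def keyword_repeat(tokens: List[Token]) -> List[Token]:
    """Emit every token twice at the same position: keyword copy first."""
    out = []
    for t in tokens:
        out.append(Token(t.text, t.position, keyword=True))
        out.append(Token(t.text, t.position, keyword=False))
    return out


def toy_stem(word: str) -> str:
    """Hypothetical stemmer for illustration only: strips 'ing' or 's'."""
    for suffix in ("ing", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def stem_filter(tokens: List[Token]) -> List[Token]:
    """Stem only the tokens that are not flagged as keywords."""
    return [t if t.keyword else t._replace(text=toy_stem(t.text)) for t in tokens]


def remove_duplicates(tokens: List[Token]) -> List[Token]:
    """Drop a token if the same text was already seen at the same position."""
    seen = set()
    out = []
    for t in tokens:
        key = (t.position, t.text)
        if key not in seen:
            seen.add(key)
            out.append(t)
    return out


def analyze(text: str) -> List[Tuple[int, str]]:
    """Run the whole simulated chain on whitespace-separated, lowercased text."""
    tokens = [Token(w, i) for i, w in enumerate(text.lower().split())]
    chain = remove_duplicates(stem_filter(keyword_repeat(tokens)))
    return [(t.position, t.text) for t in chain]


print(analyze("jumping fast"))
```

In this sketch "jumping" keeps both the original and the stem "jump" at position 0, because the stem is a new value for the term text. "fast" produces no stem, so its second copy is identical to the first and is removed.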
--
Dotan Cohen
http://gibberish.co.il
http://what-is-what.com