List:       solr-user
Subject:    Re: Why would one not use RemoveDuplicatesTokenFilterFactory?
From:       Dotan Cohen <dotancohen () gmail ! com>
Date:       2013-05-27 7:17:19
Message-ID: CAKDXFkOx-NXuz7gDgSHLh0hm-gJK6E2oEr6m3Pn-uv204u0QMQ () mail ! gmail ! com

On Sun, May 26, 2013 at 8:16 PM, Jack Krupansky <jack@basetechnology.com> wrote:
> The only comment I was trying to make here is the relationship between the
> RemoveDuplicatesTokenFilterFactory and the KeywordRepeatFilterFactory.
>
> No, stemmed terms are not considered the same text as the original word. By
> definition, they are a new value for the term text.
>
>

I see. For some reason I overlooked this key phrase of yours:
"...to remove the tokens that did not produce a stem ..."

Now it makes perfect sense.
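
To spell out the interaction for anyone reading the archive, here is a toy model in Python. It is only a sketch and not Lucene/Solr code: the function names and the suffix-stripping "stemmer" are made up for illustration. It assumes a chain of KeywordRepeatFilterFactory, then a stemmer, then RemoveDuplicatesTokenFilterFactory, where the last filter drops a token only if the same text already appeared at the same position.

```python
# Toy model of the analysis chain discussed in this thread; NOT Lucene/Solr code.
# Intermediate tokens are (text, position, is_keyword) tuples.

def keyword_repeat(tokens):
    """Emit each token twice at the same position: a keyword-marked copy, then a plain copy."""
    out = []
    for text, pos in tokens:
        out.append((text, pos, True))
        out.append((text, pos, False))
    return out

def toy_stem(tokens):
    """Hypothetical stemmer: strips a trailing 'ing' or 's' from non-keyword tokens only."""
    out = []
    for text, pos, is_kw in tokens:
        if not is_kw:
            for suffix in ("ing", "s"):
                if text.endswith(suffix) and len(text) > len(suffix) + 2:
                    text = text[:-len(suffix)]
                    break
        out.append((text, pos, is_kw))
    return out

def remove_duplicates(tokens):
    """Drop a token when the same text was already seen at the same position."""
    seen = set()
    out = []
    for text, pos, _is_kw in tokens:
        if (text, pos) not in seen:
            seen.add((text, pos))
            out.append((text, pos))
    return out

def analyze(words):
    """Run the three toy filters in order over a list of words."""
    tokens = [(word, i) for i, word in enumerate(words)]
    return remove_duplicates(toy_stem(keyword_repeat(tokens)))

# "cats" produces a stem, so both "cats" and "cat" survive at position 0.
# "fox" produces no stem, so its two identical copies collapse into one.
print(analyze(["cats", "fox"]))
```

In this model the stemmed copy survives because its text differs from the original, and only the words that the stemmer left unchanged lose their duplicate.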

Thank you, Jack!


--
Dotan Cohen

http://gibberish.co.il
http://what-is-what.com