'Re: Confused about non-tokenized fields'

[prev in list] [next in list] [prev in thread] [next in thread] 

List:       lucene-user
Subject:    Re: Confused about non-tokenized fields
From:       Gusenbauer Stefan <gusenbauer () eduhi ! at>
Date:       2005-05-28 2:10:52
Message-ID: 4297D32C.4040004 () eduhi ! at
[Download RAW message or body]

Erik Hatcher wrote:

>
> On May 27, 2005, at 12:14 PM, Gusenbauer Stefan wrote:
>
>> Max Pfingsthorn wrote:
>>
>>
>>> Hi!
>>>
>>> Thanks for the reply. I figured already that fields are actually 
>>> not tokenized... I lost track of the filenames/dirnames and there 
>>> were some duplicates...
>>>
>>> About case-insensitivity: Okay, I can make my query lower case,  but
>>> my strings in the field are not... I guess I have to do that 
>>> manually during indexing? Or is there some nicer way?
>>>
>>>
>>>
>> I think this is not a problem. This should be done automatically when
>> you make a case insensitiv search so that you don't have to think  about
>> it. If it should become a problem write another email *g*
>
>
> If you index but do not tokenize, then case is preserved from the 
> original text.  It's the tokenization process, via the specified 
> Analyzer, that typically lowercases.
>
> So, yes, you would need to do that manually on the text you hand to a 
> Field for untokenized fields.
>
>     Erik
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
>
thanks that was new to me i will be more carefull before i give out some
suggestions
stefan


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

[prev in list] [next in list] [prev in thread] [next in thread]
Configure | About | News | Add a list | Sponsored by KoreLogic