List:       lucene-dev
Subject:    [jira] Created: (LUCENE-2183) Supplementary Character Handling in
From:       "Simon Willnauer (JIRA)" <jira () apache ! org>
Date:       2009-12-28 17:12:29
Message-ID: 236487053.1262020349599.JavaMail.jira () brutus ! apache ! org

Supplementary Character Handling in CharTokenizer
-------------------------------------------------

                 Key: LUCENE-2183
                 URL: https://issues.apache.org/jira/browse/LUCENE-2183
             Project: Lucene - Java
          Issue Type: Improvement
          Components: Analysis
            Reporter: Simon Willnauer
             Fix For: 3.1


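CharTokenizer is an abstract base class for all Tokenizers that operate at the
character level. Yet those tokenizers still work on char primitives rather than
int code points, so a supplementary character (one outside the Basic
Multilingual Plane, encoded in Java as a surrogate pair) is seen as two separate
chars. CharTokenizer should operate on code points while preserving backwards
compatibility.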
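
To illustrate the difference, here is a minimal sketch that is not taken from
the Lucene code base. The class CodePointTokenizerSketch and its two methods are
hypothetical names invented for this example; only the java.lang.Character and
String calls are real JDK API. It splits text on non-letters, once per char and
once per code point.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class; not part of Lucene.
final class CodePointTokenizerSketch {

    // Tests every UTF-16 code unit on its own. Each half of a surrogate
    // pair is not a letter, so a supplementary letter splits the token.
    static List<String> tokenizeByChar(String text) {
        List<String> tokens = new ArrayList<String>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                current.append(c);
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Tests whole code points, advancing by one or two chars as needed.
    static List<String> tokenizeByCodePoint(String text) {
        List<String> tokens = new ArrayList<String>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (Character.isLetter(cp)) {
                current.appendCodePoint(cp);
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
            i += Character.charCount(cp); // 2 for a surrogate pair, else 1
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }
}
```

For the input "a\uD801\uDC00b c", where \uD801\uDC00 is the surrogate pair for
U+10400 (a letter), the char-based variant yields the tokens "a", "b" and "c",
while the code point variant keeps the first token intact as a three-letter
word followed by "c".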

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

