[prev in list] [next in list] [prev in thread] [next in thread]
List: lucene-dev
Subject: [jira] Created: (LUCENE-2183) Supplementary Character Handling in
From: "Simon Willnauer (JIRA)" <jira () apache ! org>
Date: 2009-12-28 17:12:29
Message-ID: 236487053.1262020349599.JavaMail.jira () brutus ! apache ! org
[Download RAW message or body]
Supplementary Character Handling in CharTokenizer
-------------------------------------------------
Key: LUCENE-2183
URL: https://issues.apache.org/jira/browse/LUCENE-2183
Project: Lucene - Java
Issue Type: Improvement
Components: Analysis
Reporter: Simon Willnauer
Fix For: 3.1
CharTokenizer is an abstract base class for all Tokenizers operating on a character \
level. Yet, those tokenizers still use char primitives instead of int codepoints. \
CharTokenizer should operate on codepoints and preserve bw compatibility.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic