[prev in list] [next in list] [prev in thread] [next in thread]
List: icu
Subject: Re: Getting strings in normalized decomposed state
From: "Mark Davis" <mark.davis () us ! ibm ! com>
Date: 2001-09-24 14:22:02
[Download RAW message or body]
Yes, ICU will give you either uniformally decomposed (or precomposed,
according to NFC). There is a low-level interface, or you can also use a
transliterator.
The CollationKey gives you back a sort key. For more information on that
(and the above) see the user guide.
Mark
___
Mark Davis, IBM GCoC, Cupertino
(408) 777-5850 [fax: 5892], mark.davis@us.ibm.com, president@unicode.org
http://maps.yahoo.com/py/maps.py?Pyt=Tmap&addr=10275+N.+De+Anza&csz=95014
\
"Oodi Pilzer" \
<oodi.pilzer@east.sun.com> To: "Icu" \
<icu@www-126.southbury.usf.ibm.com>
Sent by: cc: \
icu-admin@www-124.southbury.u Subject: Getting strings \
in normalized decomposed state
sf.ibm.com \
\
\
09-23-2001 13:33 \
\
\
Hi. I wonder if there is any way to use the ICU to pass in a Unicode
string
that has potentially both precomposed and decomposed characters and get
back
a string with all precomposed characters in their decomposed state. I.e.
the new string should have no precomposed characters. I realize the
Collator class does this internally but can I somehow leverage this to get
the new normalized string back?
Also, I noticed the class CollationKey but did not understand what it's use
is. Could someone who knows explain this to me.
Thanks,
Oodi
_______________________________________________
icu mailing list
icu@oss.software.ibm.com
http://oss.software.ibm.com/developerworks/oss/mailman/listinfo/icu
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic