'Re: Getting strings in normalized decomposed state'

[prev in list] [next in list] [prev in thread] [next in thread] 

List:       icu
Subject:    Re: Getting strings in normalized decomposed state
From:       "Mark Davis" <mark.davis () us ! ibm ! com>
Date:       2001-09-24 14:22:02
[Download RAW message or body]

Yes, ICU will give you either uniformally decomposed (or precomposed,
according to NFC). There is a low-level interface, or you can also use a
transliterator.

The CollationKey gives you back a sort key. For more information on that
(and the above) see the user guide.

Mark
___
Mark Davis, IBM GCoC, Cupertino
(408) 777-5850 [fax: 5892], mark.davis@us.ibm.com, president@unicode.org
http://maps.yahoo.com/py/maps.py?Pyt=Tmap&addr=10275+N.+De+Anza&csz=95014

                                                                                      \

                    "Oodi Pilzer"                                                     \

                    <oodi.pilzer@east.sun.com>          To:     "Icu" \
                <icu@www-126.southbury.usf.ibm.com>                        
                    Sent by:                            cc:                           \

                    icu-admin@www-124.southbury.u       Subject:     Getting strings \
                in normalized decomposed state              
                    sf.ibm.com                                                        \

                                                                                      \

                                                                                      \

                    09-23-2001 13:33                                                  \

                                                                                      \

                                                                                      \

Hi.  I wonder if there is any way to use the ICU to pass in a Unicode
string
that has potentially both precomposed and decomposed characters and get
back
a string with all precomposed characters in their decomposed state.  I.e.
the new string should have no precomposed characters.  I realize the
Collator class does this internally but can I somehow leverage this to get
the new normalized string back?

Also, I noticed the class CollationKey but did not understand what it's use
is.  Could someone who knows explain this to me.

Thanks,
Oodi

_______________________________________________
icu mailing list
icu@oss.software.ibm.com
http://oss.software.ibm.com/developerworks/oss/mailman/listinfo/icu

[prev in list] [next in list] [prev in thread] [next in thread]