[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] WG Update



In a message dated 2001-10-09 20:20:32 Pacific Daylight Time, 
tsenglm@cc.ncu.edu.tw writes:

>  UTF-8 can keep the case of ASCII
>  but can not preserve the case of non-ASCII . But now AMC-ACE-Z can preserve
>  one case of non-ASCII , it is an excellent improvements over UTF-8 . We
>  should consider the specification based on new technology ---AMC-ACE-Z.

I have generally stayed out of the UTF-8 versus ACE debate, but the above 
statement cannot go unpunished.  UTF-8 is a character encoding scheme for 
*every single code point* in ISO/IEC 10646.  That means every character, 
upper- and lower-case, in every script, has its own UTF-8 representation.

To say that UTF-8 does not preserve case distinctions is complete nonsense.  
It is the nameprep stage that folds away case distinctions (for better or 
worse).

-Doug Ewell
 Fullerton, California