[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [idn] WG Update
In a message dated 2001-10-09 20:20:32 Pacific Daylight Time,
tsenglm@cc.ncu.edu.tw writes:
> UTF-8 can keep the case of ASCII
> but can not preserve the case of non-ASCII . But now AMC-ACE-Z can preserve
> one case of non-ASCII , it is an excellent improvements over UTF-8 . We
> should consider the specification based on new technology ---AMC-ACE-Z.
I have generally stayed out of the UTF-8 versus ACE debate, but the above
statement cannot go unpunished. UTF-8 is a character encoding scheme for
*every single code point* in ISO/IEC 10646. That means every character,
upper- and lower-case, in every script, has its own UTF-8 representation.
To say that UTF-8 does not preserve case distinctions is complete nonsense.
It is the nameprep stage that folds away case distinctions (for better or
worse).
-Doug Ewell
Fullerton, California