
Re: [idn] Namprep-02: ß.com



At 09:30 04/02/2001 -0800, Mark Davis wrote:
>The source documents to look at are:
>
>http://www.unicode.org/Public/3.1-Update/CaseFolding-3d4.beta.txt
>
>http://www.unicode.org/unicode/reports/tr21/
>
>There it explains the treatment of es-zed.

gotcha.

what I missed was that this section:

>3.1 Case mapping
>
>For each character in the input, if there is a lowercase mapping for
>that character, the input character is changed to the mapped lowercase
>character(s). The entries in the mapping table are derived from [UTR21].

could have read:

The input string is case folded according to [UTR21] section 2.3.
In most cases this is the same as changing each input character to its
lowercase form; in some cases, however, the mapping is more complex.
The mapping table in appendix N is used. It is derived from [UTR21] by
applying the rules for equivalence classes in section 2.3.

This makes it easier to see that this is NOT a simple conversion to lowercase.
(there was no
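
To illustrate the difference between lowercasing and case folding for es-zed, here is a minimal sketch. It assumes Python's built-in str.casefold(), which applies Unicode full case folding, as a stand-in for the draft's mapping table; the two are related but not guaranteed to be identical. The helper names fold_label and lower_label are hypothetical and do not come from the draft.

```python
# Illustrative sketch only. str.casefold() applies Unicode full case
# folding; it is used here as a stand-in for the nameprep mapping table,
# which may differ in detail.

def lower_label(label: str) -> str:
    """Simple lowercasing (hypothetical helper)."""
    return label.lower()


def fold_label(label: str) -> str:
    """Case folding (hypothetical helper, not the nameprep algorithm)."""
    return label.casefold()


if __name__ == "__main__":
    name = "ß.com"
    print(lower_label(name))   # es-zed is already lowercase, so unchanged
    print(fold_label(name))    # es-zed folds to "ss"
    # After folding, "SS.com" and "ß.com" compare equal.
    print(fold_label("SS.com") == fold_label(name))
```

With lowercasing alone, "ß.com" and "SS.com" stay distinct; with folding they land in the same equivalence class, which is the point of the reworded section.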

--
Harald Tveit Alvestrand, alvestrand@cisco.com
+47 41 44 29 94
Personal email: Harald@Alvestrand.no