
Re: [idn] nameprep inconsistency (was turkish i)





On Wed, 21 Nov 2001 19:20:26 +0000 "Adam M. Costello"
<idn.amc+0@nicemice.net.RemoveThisWord> writes:
> Soobok Lee <lsb@postel.co.kr> wrote:
> 
> > Even though 0049 0307 === 0130 (modulo NFC), two have different
> > output labels.
> 
> Oh dear, that is just wrong.  It violates the Unicode principle that
> canonically equivalent strings should always be treated the same.
> 
> I think this points to a larger problem:  The Unicode Consortium has
> provided a normalization algorithm that squashes equivalent variations,
> and a folding algorithm that squashes case differences, but they haven't
> provided an algorithm that squashes both equivalent variations and case
> differences.  So the IETF has tried to build one, and we've gotten it
> wrong.
> 

BANNER! :-)  Liana

> I suggest that the Unicode Consortium should define two new algorithms:
> one that is like NFC but also squashes case, and one that is like NFKC
> but also squashes case.  Then nameprep can simply refer to one of those
> (and also specify a set of prohibited characters).
> 
> AMC
>
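
To make the "like NFC but also squashes case" idea concrete, here is a
minimal sketch in Python.  It is not nameprep and not anything defined
in this thread.  The helper name canonical_caseless is hypothetical, and
it assumes Python's unicodedata.normalize and str.casefold are acceptable
stand-ins for Unicode normalization and full case folding.  The point is
only that decomposing before folding, and again after, gives canonically
equivalent inputs such as 0049 0307 and 0130 the same output.

```python
import unicodedata

# The two spellings discussed above (code points taken from the thread).
DECOMPOSED = "\u0049\u0307"   # LATIN CAPITAL LETTER I + COMBINING DOT ABOVE
PRECOMPOSED = "\u0130"        # LATIN CAPITAL LETTER I WITH DOT ABOVE


def canonical_caseless(label: str) -> str:
    """Hypothetical helper: squash canonical variants and case together.

    Decompose first so equivalent spellings look identical before folding,
    fold case, then decompose again because folding can yield sequences
    that are not themselves normalized.
    """
    decomposed = unicodedata.normalize("NFD", label)
    folded = decomposed.casefold()  # assumption: full Unicode case folding
    return unicodedata.normalize("NFD", folded)


if __name__ == "__main__":
    # The two inputs are canonically equivalent: NFC maps one onto the other.
    print(unicodedata.normalize("NFC", DECOMPOSED) == PRECOMPOSED)
    # With normalization applied around the folding step, both agree.
    print(canonical_caseless(DECOMPOSED) == canonical_caseless(PRECOMPOSED))
```

A variant "like NFKC" would substitute compatibility decomposition, and
would likely need more than one pass; that is left out of this sketch.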