
Re: [idn] spoofing by combining diacritical marks




----- Original Message ----- 
From: "Soobok Lee" <lsb@postel.co.kr>
>
> BTW, the Unicode Standard, Chapter 2, Section 2.6, Figure 2-10 has an example:
> 
>   <latin a><combining dot below><combining dot above>
>   <latin a><combining dot above><combining dot below>
> 
>  These two sequences are defined to look the same,
>  but I can't yet find any _NORMALIZATION_ rules that unify them.
>
>  Does the Unicode Standard have any rules to unify them?

Let me answer my own question. :-)

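Yes. UTR #15 (Unicode Normalization Forms)
 http://www.unicode.org/unicode/reports/tr15/

describes canonical ordering, which sorts each run of combining
characters by canonical combining class. Combining dot below (U+0323)
has class 220 and combining dot above (U+0307) has class 230, so the
dot below always sorts first.

The two sequences in the example therefore normalize to the same
sequence in every normalization form, including NFC and NFKC.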
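
A quick way to check this is a small Python sketch using the standard
unicodedata module. It assumes the two marks in Figure 2-10 are U+0323
(combining dot below) and U+0307 (combining dot above); the names
seq1, seq2 and codepoints are only illustrative.

```python
import unicodedata

DOT_BELOW = "\u0323"  # COMBINING DOT BELOW (assumed mark from the figure)
DOT_ABOVE = "\u0307"  # COMBINING DOT ABOVE (assumed mark from the figure)

# The two orderings from the example: same appearance, different code points.
seq1 = "a" + DOT_BELOW + DOT_ABOVE
seq2 = "a" + DOT_ABOVE + DOT_BELOW


def codepoints(s):
    """Render a string as a list of U+XXXX code points for inspection."""
    return " ".join(f"U+{ord(c):04X}" for c in s)


# Canonical combining classes drive the reordering step.
print("class of dot below:", unicodedata.combining(DOT_BELOW))
print("class of dot above:", unicodedata.combining(DOT_ABOVE))

# Compare the two sequences before and after each normalization form.
print("raw equal:", seq1 == seq2)
for form in ("NFD", "NFC", "NFKD", "NFKC"):
    n1 = unicodedata.normalize(form, seq1)
    n2 = unicodedata.normalize(form, seq2)
    print(form, "|", codepoints(n1), "|", codepoints(n2), "| equal:", n1 == n2)
```

The raw strings compare unequal, while each normalized pair compares
equal, so a comparison done after normalization treats the two
spellings as the same label.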

Great!


> 
>  Soobok