[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [idn] spoofing by combining diacritical marks
----- Original Message -----
From: "Soobok Lee" <lsb@postel.co.kr>
>
> BTW, Unicode Standard Chap2. Section 2.6, Figure 2-10 has an example:
>
> <latin a><combining dot below><combining dot above>
> <latin a><combining dot above><combining dot below>
>
> These two sequences are defined to have the same look
> But, I can't find yet any _NORMALIZATION_ rules to unify them.
>
> Does unicode standards have any rules to unify them ?
Let me answer for myself. :-)
Yes. UTR15 (unicode normalization)
http://www.unicode.org/unicode/reports/tr15/
describes the concept of canonical ordering and combining class
of combining characters.
The two sequences in the example are normalized into one in NFC/NFKC.
Great!
>
> Soobok