[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [idn] SC/TC equivalence
----- Original Message -----
From: "Adam M. Costello" <amc@cs.berkeley.edu>
> liana.ydisg@juno.com wrote:
>
> > [nameprep] is the place for case folding for Latin, then it should be
> > the place for other script folding as well.
>
> That's a valid argument, but there are also some counter-arguments:
>
> Existing domain names are already case-insensitive, so IDNs ought to
> be case-insensitive for consistency with the current standard. This
> consistency constraint does not apply to SC/TC equivalence.
>
This is a really controversial issue.
NAMEPREP also applies additional equivalence rules in addition to
case-folding:
1) compatibility decomposition
2) canonical composition
What is the difference between these equivalences and SC/TC equivalence?
> With very few exceptions, case mapping is one-to-one. SC/TC mapping is
> more complex more often.
>
> The case mapping rules are already defined in the Unicode standard.
> This is not true for SC/TC mapping.
>
The Unicord Standard is not the only document we can refer to.
I have no doubt about the accuracy of CNNIC-provided SC/TC mapping tables.
> Some people have expressed a desire to be able to register the
> simplified and traditional versions of a name as separate domain names
> No one has expressed an analogous desire to be able to register the
> all-uppercase and all-lowercase versions of a name as separate domain
> names.
>
If SC/TC equivalence is like the equivalence between "humor" and "humour",
registration/dispute resolution policy may be the right path to solve this
issue.
But, if SC/TC equivalence is like the equivalence between "H" and "h",
we should fold them. BTW, in which direction ? SC->TC or TC->SC?
How can we recover the pre-normalization form of folded TC/SC domains?
--LSB