[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[idn] Re: REORDERING: stability issues and UTC solutions




----- Original Message ----- 
From: "James Seng/Personal" <jseng@pobox.org.sg>
To: "Soobok Lee" <lsb@postel.co.kr>; <idn@ops.ietf.org>; "Martin Duerst" <duerst@w3.org>
Sent: Sunday, October 21, 2001 11:22 PM
Subject: Re: REORDERING: stability issues and UTC solutions


> Firstly, lets not try to twist the whole argument to Normalization,
> which you are doing here. We are talking about Reordering so lets focus
> on it.
> 
> > My  question was that:
> >   1) newly-approved TAGALOG characters X,Y  have   NFC X -> Y,
> >   2) old Nameprep/ACE encodes X.com and Y.com as two distinct ACE
> labels,
> >       because it don't know about NFC X -> Y
> >   3) new Nameprep/ACE encodes X.com as the the same ACE label as
> Y.com's
> >       with NFC  X -> Y.
> 
> My shortest answer is for you to read UTR#15. It is already addressed
> there.

I read http://www.unicode.org/unicode/standard/policies.html and
http://www.unicode.org/unicode/reports/tr15/#3.1_Corrigendum.

Would anyone suggest more ?

Soobok Lee

> 
> >   If X.com and Y.com are taken by two distinct registrants, there will
> >     be a mess. of course, it should have be blocked by careful
> registries.
> >   Even if new ACE libaries are distributed, some old ACE tools will
> still
> >     try to send X.com dns queries which may fail.
> 
> No it wont. Patrik Faltstorm have already discuss this possiblities and
> the work around this. It was one of the version of Nameprep but I
> realised it is no longer in -06.
> 
> > Are you saying ?:
> >    Tagalog folks may come back  with complaints about old nameprep
> >    which has no NFC X->Y, saying "unfair treatment!" ??
> 
> No, I am saying REORDERING. Lets keep the topic on REORDERING.
> 
> > 1) last resort: somewhat tricky
> >
> > Future TAGALOG may provides two sets of TAGALOG basic alphabets.
> > One set A in official lexicographical ordering and the other set B
> > is in frequecy ordering (sub-optimal one OKAY) with 1:1 NFKC defined
> > from A onto B.
> > Then all tagalog basic alphabets in Set A will be "reordered by NFKC"
> > in nameprep , not in ACE. A and B share the same font.
> > Then valid ACE labels of TAGALOG script only contains characters from
> B.
> > This may have some problems in comparisons which i have no full
> analysis.
> 
> Okay. This should go into ISO10646 SC2/WG2 then.

YEs. When New script is added.

Soobok lee

> 
> > 2) UTC solution
> >
> > IF UTC accepts REORDERING as an official normalization form like
> > NF-REORDERING , then we need no such tricks like above, and
> > TAGALOG support can be done within that NF in the new
> > NAMEPREP steps:  mapping/NFKC/PROHIBIT and then NF-REORDERING .
> >
> > Frequency tables are always sub-optimal in its nature, and
> > marginal frequency fluctuation will occurs to make marginal
> > efficieny changes, but in most cases, i am sure, it will benefit
> > most of TAGALOG labels, and that's why i push REORDERING into UTC.
> >
> > In conclusion, 2) is the my preferenece.
> > It's the best way for acquiring stability and authority and
> applicability
> > of REORDERING.
> 
> Okay. This should go into UTC.
> 
> Both your solution unfortunately cannot be done in this WG.
> 
> -James Seng
>