[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] Re: Fwd: Unicode letter ballot

To: Simon Josefsson <jas@extundo.com>
Subject: Re: [idn] Re: Fwd: Unicode letter ballot
From: Soobok Lee <lsb@postel.co.kr>
Date: Thu, 28 Nov 2002 10:26:49 +0900
Cc: Harald Tveit Alvestrand <harald@alvestrand.no>, idn@ops.ietf.org
In-reply-to: <ilun0nufstq.fsf@extundo.com>
References: <0A56805A-FF88-11D6-A024-0003934B2128@cisco.com> <iluisylonnk.fsf@latte.josefsson.org> <54030000.1038405029@askvoll.hjemme.alvestrand.no> <ilun0nufstq.fsf@extundo.com>
User-agent: Mutt/1.4i

On Thu, Nov 28, 2002 at 01:20:33AM +0100, Simon Josefsson wrote:
> Harald Tveit Alvestrand <harald@alvestrand.no> writes:
> 
> > --On mandag, november 25, 2002 13:10:39 +0100 Simon Josefsson
> > <jas@extundo.com> wrote:
> >
> >> This message seems important for IDN.
> >>
> >> Note that if option A win the vote, it seems that stringprep/nameprep
> >> will have to fork the Unicode standard to remain backwards compatible
> >> when stringprep/nameprep updates its Unicode reference.
> >>
> >
> > sigh.... all 5 are beyond-BMP characters added recently.
> > if we could go back in time, we could have implemented a policy of not
> > accepting characters as stable before 2 unicode versions had gone
> > by....
> >
> > proofreading takes time.
> 
> Even two Unicode releases doesn't guarantee that the tables are
> correct.  The only proper solution I can see is to stop modifying
> published decomposition tables.  When mistakes are discovered, new
> character codes with proper decompositions should be added and the old
> character codes declared obsolete -- which is option B in the vote,
> but unfortunately neither IDN nor IETF has any voting powers (which
> suggest a methodological problem).
> 

If option B  wins, a few issues may be raised easily:
 1) added characters does not preserve the original/ideal lexicographic
    radical-based order of those CJK glyphs.
 2) How does any obsoletion of characters work ? 
    compatiblity CJK characters are NFC-ed into other CJK characters
    in stringprep and so required obsoletion(mapping to newly added ones)
    should be done _before_ the stringprep operation is taken.
    
If option A wins, I see a chaos:
 1) NFKC_3.2_nameprep(NFKC_4.0_calling_application(IDN))  
       !=  NFKC_3.2_nameprep(IDN)  
    What if future HTML/canonical XML standards adopt NFKC_4.0 ?

Soobok Lee

Follow-Ups:
- Re: [idn] Re: Fwd: Unicode letter ballot
  - From: martin@v.loewis.de (Martin v. Löwis)

References:
- [idn] Re: Fwd: Unicode letter ballot
  - From: Simon Josefsson <jas@extundo.com>
- Re: [idn] Re: Fwd: Unicode letter ballot
  - From: Harald Tveit Alvestrand <harald@alvestrand.no>
- [idn] Re: Fwd: Unicode letter ballot
  - From: Simon Josefsson <jas@extundo.com>

Prev by Date: [idn] Re: Fwd: Unicode letter ballot
Next by Date: Re: [idn] Re: Fwd: Unicode letter ballot
Previous by thread: [idn] Re: Fwd: Unicode letter ballot
Next by thread: Re: [idn] Re: Fwd: Unicode letter ballot
Index(es):
- Date
- Thread