[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] call for comments for REORDERING




----- Original Message ----- 
From: <DougEwell2@cs.com>
To: <idn@ops.ietf.org>
Cc: <lsb@postel.co.kr>; <jseng@pobox.org.sg>; <duerst@w3.org>
Sent: Friday, October 19, 2001 3:05 PM
Subject: Re: [idn] call for comments for REORDERING


> In a message dated 2001-10-18 21:33:55 Pacific Daylight Time, 
> lsb@postel.co.kr writes:
> 
> >  1) saturations in TLD namespaces would require longer names for which
> >      REORDERING is designed to give greater benefits/compression ratio.
> 
> Is it not the case that logographic/ideographic writing systems such as Han 
> and the syllable-oriented Unicode encoding of Hangul, with their large 
> numbers of characters, convey more information per character than alphabetic 
> scripts?  How long, conceptually, does a domain name really need to be?
> 
> I would probably have a warmer spot in my heart for these script-specific 
> compression proposals if I were convinced that the users of Han and Hangul 
> were somehow being cheated by not being able to register the equivalent of 
> "FourthDistrictCentralProvincialLibraryBoardOfDirectors.org" as a domain name.


First) REORDERING is  useful for very common  han/hangeul labels of length 4~8.

Second) If we take into consideration the additional length of subdomains of a
IDN, the total length of  <ML-subdomain>.<ML>.com  often  exceeds
14. ANd if we think about mailbox name in ML@Sub-ML.ML.com, it will exceed 20.
IN that case, the Sum of the lengths of  saved characters in each substring in ML@SUB-ML.ML.com will be great, around 15~20, i guess.

http://www.cnnic.net/daily/2001-8/9.shtml
will answer some of your questions.
We should not attempt to draw an upper limit on future  applications of 
han/hangeul IDN domains by imposing unfair disadvantage on max length 
of Han/hangeul labels .

Soobok Lee


> 
> >      2.0) the character frequency table are constructed from
> >           Verisign GRS' ML.com testbeds.
> >           Even for chinese han script, their
> >            registrations came from China/TAIWAN/JAPAN/KOREA and other
> >             non-asian squatters.
> 
> I know that Soobok is not specifically listing this as a goal, but I thought 
> efforts to accommodate the 15-character-and-longer Han domain names that have 
> already been registered was explicitly a non-goal, since those businesses and 
> individuals were supposed to have waited until an IDN solution was in place.
> 
> -Doug Ewell
>  Fullerton, California
>