[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] Report from the ACE design team



>From ishisone@sra.co.jp

> Could you please give some example of a name of
> a Japanese organization with a long string of characters? (actually the
> number of characters in Japanese and a rough translation will suffice,
> since I cannot read Japanese).

Sure.  I looked around and found two examples.  Both names exceed 15
characters and DUDE cannot encode them, but other more space efficient
ACEs can.

1. JPNIC (the registry of .jp domain)
JPNIC's official name consists of 25 characters.  Most of the characters
are phonograms (Katakana), though.

  U+793E U+56E3 U+6CD5 U+4EBA U+65E5 U+672C U+30CD U+30C3 U+30C8 \
  U+30EF U+30FC U+30AF U+30A4 U+30F3 U+30D5 U+30A9 U+30E1 U+30FC \
  U+30B7 U+30E7 U+30F3 U+30BB U+30F3 U+30BF U+30FC

2. A helth-insurance organization in Tokyo
This name also contains a few phonograms, but most are ideograms.

  U+6771 U+4EAC U+90FD U+60C5 U+5831 U+30B5 U+30FC U+30D3 U+30B9 \
  U+7523 U+696D U+5065 U+5EB7 U+4FDD U+967A U+7D44 U+5408

						   -- ishisone@sra.co.jp

My commend:

The DUDE has a universal appeal for most languages in UNICODE,
but disregards the readability for people not reading the local script 
or do not have local script support, where most international working
people really need.  For example, an English speaker do not have 
Chinese support on his computer, but receives mails from a few 
Chinese e-mail servers, and has no option to tell where the e-mail comes 
 from besides the .cn country code.   In Chinese, a typical name of a 
company comsists the name of its province, its city, its county or
district, 
its Xiang, or/and its individual brand name, its village name, its
genentic 
business and its corporation type.  It is 10 -13 characters long for well

known entities, and 12 - 15 as the necessary length for more localized 
names as current computerization status concerns.  To say 15 
characters is enough is a strech, beause its village size is from 4 
persons in Tibet to several thousands residents in a populated area.  
There is no problem with DUDE in general, but such a tangled email 
header of 1000 companies will give the international worker a real 
headache.

However, using standard StepCode for IDN such a name can be 
phoneticly supported.  For example: 
Liaoning22Shenyang32Nanhu22Datan22Xiehe22YiYuan14
with 50 octets for a hospital.  For a grade school e-mail address,
we need another two characters for sure and reachs the limit of 
DUDE.  With phrasal StepCode by consider the address as one 
unit for IDN, this can be 
LiaoningShenyangNanhuDatanXieheYiYuan-12 in 27 octets. 
 
I suggest the WG to work out and recommend a standarded 
StepCode configuration for IDN to complement DUDE, such 
that DUDE will be the default ACE with a higher compression rate 
in a general and StepCode be the language tagged ACE for users 
who need  
1. the readability of e-mail addresses;
2. localized compression scheme;
3. the option to specify logos or trademarks outside UNICODE 
standard.

If we can have a consensus on support another ACE to address
the above needs, I would support DUDE. 

Liana