[idn] draft about Evaluation of various ACEs with existing Chinese (BIG5) Domain Names
Dear IETF & IDN WG:
Attached is the draft of :
Evaluation of various ACEs with existing Chinese (BIG5) Domain Names
This document offers an overview of the results of applying various
ACEs to Chinese (BIG5) Domain Names, and gives a brief summary of
the evaluation results.
Thanks for any suggestions and comments!
Erin Chen
TWNIC
Internet Draft                                          Abel Yang
draft-ietf-idn-ace-eval-tw-00.txt                       Erin Chen
Jul 13, 2001                                                TWNIC
Expires Jan 13, 2002
Evaluation of various ACEs with existing Chinese (BIG5) Domain Names
Status of this memo
This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as
Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other documents
at any time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
Abstract
To help the ACE (ASCII Compatible Encoding) Design Team in the IDN
Working Group select an appropriate ACE proposal, this document
offers an overview of the results of applying various ACEs to
Chinese (BIG5) Domain Names and gives a brief summary of the
evaluation results.
1. Test data and tools
The sample data come from the TWNIC registration database. All of
the names are in BIG5 encoding. There are 28454 names in total,
distributed by length as follows:
Length of name    Number of BIG5 Domain Names
--------------    ---------------------------
 2 chars           3762
 3 chars           3914
 4 chars           4613
 5 chars           1723
 6 chars           1851
 7 chars           4282
 8 chars           1621
 9 chars           1254
10 chars           2075
11 chars           1215
12 chars            753
13 chars            646
14 chars            745
Total             28454
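As a quick consistency check, the per-length counts above can be summed and compared with the stated total. The following is only an illustrative sketch: the figures are copied from the table, while the variable names (`counts`, `short`) are assumptions introduced here.

```python
# Counts copied from the table above: BIG5 characters -> number of names.
counts = {
    2: 3762, 3: 3914, 4: 4613, 5: 1723, 6: 1851, 7: 4282, 8: 1621,
    9: 1254, 10: 2075, 11: 1215, 12: 753, 13: 646, 14: 745,
}

# The sum of all rows should equal the total stated in the draft.
total = sum(counts.values())
print(total)  # 28454

# Number of names with at most 4 characters, and their share in percent.
short = sum(n for length, n in counts.items() if length <= 4)
print(short, round(100 * short / total, 1))
```

The sum agrees with the total of 28454 given in the text.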
TWNIC accepts only 2 to 14 BIG5 characters in a Chinese Domain Name.
All of the data are in the local BIG5 (or CP950) encoding. These
data were applied to the following ACEs: RACE-03 [RACE], BRACE-00
[BRACE], LACE-01 [LACE], UTF-6-00 [UTF6], DUDE-02 [DUDE],
AMC-ACE-M-00 [AMCACEM], AltDUDE-01 [AltDUDE], AMC-ACE-O-00 [AMCACEO],
and AMC-ACE-R-01 [AMCACER].
2. Result of each ACE length
CHN_len is the length of the Chinese (BIG5) domain name, in BIG5
characters, before ACE encoding is applied. For all of the data with
the same CHN_len, max_len is the maximum length, min_len is the
minimum length, and aver_len is the average length after ACE
encoding is applied.
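The three statistics defined above could be computed along the following lines. This is a minimal sketch, not the JPNIC tool used for the evaluation: the function name `length_stats`, the input format of (CHN_len, encoded length) pairs, and the rounding of aver_len to the nearest integer are all assumptions, and the sample values are hypothetical.

```python
from collections import defaultdict

def length_stats(pairs):
    """Group (chn_len, ace_len) pairs by chn_len and return
    {chn_len: (max_len, min_len, aver_len)}."""
    groups = defaultdict(list)
    for chn_len, ace_len in pairs:
        groups[chn_len].append(ace_len)
    stats = {}
    for chn_len in sorted(groups):
        lens = groups[chn_len]
        # Assumption: aver_len is rounded to the nearest integer; the
        # draft does not say how the average was rounded.
        aver = round(sum(lens) / len(lens))
        stats[chn_len] = (max(lens), min(lens), aver)
    return stats

# Hypothetical sample: three 2-character names, three 3-character names.
sample = [(2, 8), (2, 5), (2, 8), (3, 11), (3, 6), (3, 11)]
for chn_len, (mx, mn, avg) in length_stats(sample).items():
    print(chn_len, mx, mn, avg)
```

Each output row has the same columns as the tables in this section: CHN_len, max_len, min_len, aver_len.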
2.1 AMC-ACE-M
CHN_len max_len min_len aver_len
----------- -------- -------- --------
2 8 5 8
3 11 6 11
4 17 8 14
5 17 11 17
6 21 11 20
7 24 13 22
8 27 15 25
9 30 16 28
10 33 18 31
11 36 22 34
12 39 19 37
13 42 21 40
14 45 23 43
2.2 AMC-ACE-O
CHN_len max_len min_len aver_len
----------- -------- -------- --------
2 8 5 8
3 12 6 11
4 20 9 15
5 20 11 18
6 24 11 22
7 27 13 25
8 31 15 29
9 35 16 32
10 39 18 36
11 42 24 38
12 46 19 41
13 50 21 44
14 54 23 48
2.3 AMC-ACE-R
CHN_len max_len min_len aver_len
----------- -------- -------- --------
2 8 5 8
3 12 6 12
4 20 9 16
5 20 11 19
6 24 11 23
7 28 13 26
8 32 15 30
9 36 16 34
10 40 18 38
11 44 24 41
12 48 19 44
13 52 21 48
14 56 24 51
2.4 AltDUDE
CHN_len max_len min_len aver_len
----------- -------- -------- --------
2 8 5 8
3 12 6 11
4 20 8 16
5 20 13 19
6 24 11 23
7 28 14 26
8 32 15 30
9 36 18 34
10 40 21 38
11 44 26 41
12 48 26 46
13 52 25 48
14 56 30 53
2.5 BRACE
CHN_len max_len min_len aver_len
----------- -------- -------- --------
2 7 5 7
3 10 7 10
4 17 8 14
5 17 11 17
6 20 11 20
7 23 13 23
8 28 14 26
9 30 15 30
10 33 17 33
11 36 22 36
12 39 18 39
13 42 20 42
14 46 23 46
2.6 DUDE
CHN_len max_len min_len aver_len
----------- -------- -------- --------
2 8 5 8
3 12 6 11
4 20 8 16
5 20 13 19
6 24 11 23
7 28 15 26
8 32 15 30
9 36 18 34
10 40 21 38
11 44 27 41
12 48 26 46
13 52 25 48
14 56 30 53
2.7 LACE
CHN_len max_len min_len aver_len
----------- -------- -------- --------
2 8 7 8
3 12 8 12
4 18 10 15
5 18 18 18
6 21 16 21
7 24 21 24
8 31 23 28
9 31 24 31
10 34 26 34
11 37 34 37
12 40 29 40
13 44 31 44
14 47 36 47
2.8 RACE
CHN_len max_len min_len aver_len
----------- -------- -------- --------
2 8 5 8
3 12 7 12
4 18 8 15
5 18 18 18
6 21 20 21
7 24 24 24
8 31 28 28
9 31 31 31
10 34 34 34
11 37 37 37
12 40 40 40
13 44 44 44
14 47 47 47
2.9 UTF-6
CHN_len max_len min_len aver_len
----------- -------- -------- --------
2 8 5 8
3 12 6 12
4 20 9 16
5 20 14 20
6 24 14 24
7 28 18 28
8 32 20 32
9 36 21 36
10 40 23 40
11 44 30 44
12 48 28 48
13 52 29 52
14 56 33 56
3. Result of each ACE conversion speed
          UTF-6   RACE   LACE   BRACE   DUDE   AltDUDE
MAX        0.80   0.75   0.82    0.86   0.74      0.70
MIN        0.56   0.55   0.52    0.62   0.53      0.53
AVERAGE    0.68   0.64   0.62    0.73   0.64      0.62

          AMC-ACE-M   AMC-ACE-O   AMC-ACE-R
MAX            1.12        1.92        0.99
MIN            0.86        1.69        0.67
AVERAGE        0.98        1.81        0.82
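To compare the nine ACEs at a glance, the AVERAGE row can be sorted. The sketch below copies the figures from the tables above. The draft does not state the unit of measurement, so the code assumes that the values are elapsed times and that a smaller value means a faster conversion; the names `average` and `ranking` are introduced here for illustration.

```python
# AVERAGE row copied from the two tables above (unit not stated in the draft).
average = {
    "UTF-6": 0.68, "RACE": 0.64, "LACE": 0.62, "BRACE": 0.73,
    "DUDE": 0.64, "AltDUDE": 0.62, "AMC-ACE-M": 0.98,
    "AMC-ACE-O": 1.81, "AMC-ACE-R": 0.82,
}

# Assumption: the values are elapsed times, so smaller is faster.
# sorted() is stable, so ties keep the order used in the table.
ranking = sorted(average.items(), key=lambda kv: kv[1])
for name, value in ranking:
    print(f"{name:10s} {value:.2f}")
```

Under that assumption, LACE and AltDUDE (0.62) come first, followed by RACE and DUDE (0.64).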
4. Summary of the result
The max_len of every ACE is less than 64, because a Chinese
Domain Name is limited to 2 to 14 BIG5 characters. For these sample
data, in terms of both length and speed, LACE and RACE appear to
perform better than the others.
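The length bound can be checked directly against the tables in Section 2. In each table max_len grows with CHN_len, so the row for CHN_len = 14 gives the largest value per ACE. The sketch below copies those values; the names `max_len_14` and `LIMIT` are assumptions introduced here, and the check does not account for any ACE prefix that a deployed encoding might add.

```python
# max_len at CHN_len = 14, copied from Sections 2.1 to 2.9.
max_len_14 = {
    "AMC-ACE-M": 45, "AMC-ACE-O": 54, "AMC-ACE-R": 56, "AltDUDE": 56,
    "BRACE": 46, "DUDE": 56, "LACE": 47, "RACE": 47, "UTF-6": 56,
}
LIMIT = 64  # bound quoted in the summary above

# List the ACEs from shortest to longest worst-case encoding.
for name, length in sorted(max_len_14.items(), key=lambda kv: kv[1]):
    print(f"{name:10s} {length:3d}  margin {LIMIT - length}")

# Every worst-case length stays below the bound.
assert all(length < LIMIT for length in max_len_14.values())
```

The largest worst-case length in the tables is 56, which leaves a margin of 8 below the bound.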
5. Acknowledgements
The ACE evaluation testing tool was provided by JPNIC.
TWNIC provided the real registered Chinese (BIG5) domain names.
Hsu Nai-Wen <snw@twnic.net.tw> and the TWNIC MDNS technical team
gave many suggestions.
6. Authors' Addresses
Abel Yang
Taiwan Network Information Center (TWNIC)
4F-2, No. 9, Sec. 2, Roosevelt Rd., Taipei, 100 Taiwan.
Email: abelyang@twnic.net.tw
Tel: +886-23411313#505
Erin Chen
Taiwan Network Information Center (TWNIC)
4F-2, No. 9, Sec. 2, Roosevelt Rd., Taipei, 100 Taiwan.
Email: erin@twnic.net.tw
Tel: +886-23411313#502
7. References
[RFC1035] "DOMAIN NAMES - IMPLEMENTATION AND SPECIFICATION",
RFC1035, Nov 1987, P. Mockapetris
[RACE] "RACE: Row-based ASCII Compatible Encoding for IDN",
draft-ietf-idn-race-03.txt, Nov 2000, P Hoffman
[BRACE] "BRACE: Bi-mode Row-based ASCII-Compatible Encoding
for IDN version 0.1.2"
draft-ietf-idn-brace-00.txt, Sep 2000, A Costello
[LACE] "LACE: Length-based ASCII Compatible Encoding for IDN"
draft-ietf-idn-lace-01.txt, Jan 2001, M Davis, P Hoffman
[UTF6] "UTF-6 - Yet Another ASCII-Compatible Encoding for IDN"
draft-ietf-idn-utf6-00.txt, Nov 2000, M Welter, B Spolarich
[DUDE] "Differential Unicode Domain Encoding (DUDE)"
draft-ietf-idn-dude-02.txt, Jun 2001, M Welter,
B Spolarich, A Costello
[AMCACEM] "AMC-ACE-M version 0.1.0"
draft-ietf-idn-amc-ace-m-00.txt, Feb 2001, A Costello
[AltDUDE] "AltDUDE version 0.0.2"
draft-ietf-idn-altdude-00.txt, Mar 2001, A Costello
[AMCACEO] "AMC-ACE-O version 0.0.3"
draft-ietf-idn-amc-ace-o-00.txt, Mar 2001, A Costello
[AMCACER] "AMC-ACE-R version 0.2.1"
draft-ietf-idn-amc-ace-r-01.txt, May 2001, A Costello
[AMCACEV] "AMC-ACE-V version 0.1.0"
draft-ietf-idn-amc-ace-v-00.txt, May 2001, A Costello
[AMCACEW] "AMC-ACE-W version 0.1.0"
draft-ietf-idn-amc-ace-w-00.txt, May 2001, A Costello
[MACE] "MACE: Modal ASCII Compatible Encoding for IDN"
draft-ietf-idn-mace-00.txt, Jun 2001, M Ishisone, Y Yoneya
[LDUDE] "Improving ACE using code point reordering v0.9"
draft-ietf-idn-lsb-ace-00.txt, Jun 2001, Soobok Lee
[MDNKIT] "Multilingual Domain Name tool Kit",
http://www.nic.ad.jp/jp/research/idn/mdnkit/download/
[NAMEPREP] "Preparation of Internationalized Host Names",
draft-ietf-idn-nameprep-03.txt, Feb 2001, P Hoffman,
M Blanchet