[idn] draft-ietf-idn-ace-eval-cn-00
Hello all,
Last Thursday I sent this document to internet-drafts@ietf.org
and to the co-chairs of the IDN WG, but it has not yet appeared on
the IDN mailing list.
I think it is important for people on the list to see it, so I am
sending it to the list directly.
Regards,
Sun Guonian
--
*****************************************************
* China Internet Network Information Center (CNNIC) *
* Sun Guonian No.4, S.4 Street *
* Zhong Guan Cun *
* Phone: 86-10-62553604 Haidian District *
* Email: sun@cnnic.net.cn Beijing *
* http://drop.cnnic.net.cn/~sun/ P.O.Box: 100080 *
*****************************************************
* 中国互联网络信息中心 孙国念 *
*****************************************************
Internet Draft Sun Guonian
draft-ietf-idn-ace-eval-cn-00.txt CNNIC
Jul 10, 2001
Expires Jan 10, 2002
Evaluation of various ACEs with existing Chinese Domain Names
Status of this memo
This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as
Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other documents
at any time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
Abstract
The ACE (ASCII Compatible Encoding) Design Team in the IDN Working
Group is selecting an appropriate ACE proposal. To support the work of
the ACE Design Team, this document presents the results of applying
various ACEs to existing Chinese Domain Names.
1. Test data and tools
The test data consist of 100,000 Chinese Domain Names sampled from
CNNIC's current registry database.
The ACEs applied are RACE-03 [RACE], BRACE-00 [BRACE], LACE-01 [LACE],
UTF-6-00 [UTF6], DUDE-02 [DUDE], AMC-ACE-M-00 [AMCACEM], AltDUDE-01
[AltDUDE], AMC-ACE-O-00 [AMCACEO], AMC-ACE-R-01 [AMCACER],
AMC-ACE-V-00 [AMCACEV], AMC-ACE-W-00 [AMCACEW], MACE-01 [MACE] and
LDUDE-00 [LDUDE].
2. Result of each ACE
HANZI_len is the number of characters in the Chinese string before
UCS2-ACE coding. For all sampled Chinese strings with the same
HANZI_len, max_len is the maximum length of the ACE-coded strings,
min_len is the minimum length of the ACE-coded strings, and aver_len
is the average length of the ACE-coded strings.
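The per-length statistics defined above can be sketched as follows. This
is a minimal illustration, not the tool used to produce the tables: the
function name length_stats, the encode parameter and the toy_encode
stand-in are all assumptions introduced here, and toy_encode is not any
of the ACEs under evaluation.

```python
from collections import defaultdict


def length_stats(names, encode):
    """Group ACE-coded lengths by the character count of each name.

    `encode` is a hypothetical callable that maps a Unicode string to
    its ACE-coded ASCII string. Returns a dict of the form
    {HANZI_len: (max_len, min_len, aver_len)}.
    """
    buckets = defaultdict(list)
    for name in names:
        # len() counts code points, which equals the number of Hanzi
        # for characters in the Basic Multilingual Plane.
        buckets[len(name)].append(len(encode(name)))
    return {
        n: (max(lens), min(lens), sum(lens) / len(lens))
        for n, lens in sorted(buckets.items())
    }


def toy_encode(name):
    """Stand-in encoder (NOT a real ACE): made-up prefix plus hex UTF-16."""
    return "xx--" + name.encode("utf-16-be").hex()


stats = length_stats(["中", "中国", "北京"], toy_encode)
```

Substituting a real ACE encoder for toy_encode and the sampled names for
the three-item list would yield one row of (max_len, min_len, aver_len)
per HANZI_len, in the layout of the tables that follow.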
2.1 AMC-ACE-M
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 9 9 9.0000
2 13 10 12.7385
3 17 11 15.8155
4 20 12 18.8141
5 23 13 21.7565
6 26 14 24.6305
7 29 16 27.5662
8 33 16 30.4084
9 35 30 33.3293
10 39 27 36.1929
11 41 36 39.1082
12 45 34 42.0063
13 48 42 44.8869
14 50 43 47.8597
15 53 48 50.7955
16 57 51 53.8091
17 59 53 56.5031
18 62 57 59.7467
19 65 61 62.5106
20 68 63 65.6471
2.2 AMC-ACE-O
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 9 9 9.0000
2 13 10 12.7401
3 17 11 16.2954
4 21 12 19.8272
5 25 13 23.3622
6 29 14 26.6862
7 32 16 30.3160
8 36 16 33.3543
9 40 30 36.9435
10 44 27 40.1309
11 48 36 43.5358
12 51 34 46.9934
13 55 41 50.1921
14 59 31 53.6718
15 62 48 56.4908
16 66 51 60.5728
17 68 53 63.0881
18 73 57 66.3467
19 76 61 70.4894
20 79 67 73.4706
2.3 AMC-ACE-R
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 9 9 9.0000
2 13 10 12.7401
3 17 11 16.4921
4 21 12 20.3006
5 25 13 24.0866
6 29 14 27.7128
7 33 16 31.5339
8 37 16 34.8798
9 41 31 38.7328
10 45 27 42.2981
11 49 40 45.9261
12 53 34 49.7214
13 57 45 53.1325
14 61 43 56.8372
15 65 52 60.1350
16 69 56 64.5534
17 73 53 67.0566
18 75 63 71.5600
19 80 67 74.8936
20 84 72 79.2647
2.4 AMC-ACE-V
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 8 8 8.0000
2 12 10 10.9848
3 16 11 13.9613
4 20 12 16.9441
5 23 13 19.9398
6 25 14 22.9069
7 28 16 25.9121
8 32 16 28.8512
9 35 29 31.8418
10 37 27 34.8368
11 41 35 37.7769
12 44 35 40.8079
13 47 40 43.7668
14 49 32 46.7032
15 52 45 49.6626
16 56 50 52.8026
17 58 53 55.6604
18 65 57 58.9733
19 64 60 61.8085
20 67 63 64.8235
2.5 AMC-ACE-W
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 8 8 8.0000
2 12 10 10.9848
3 16 11 13.9790
4 19 12 16.9746
5 24 13 19.9955
6 27 14 22.9755
7 29 16 26.0036
8 34 16 28.9908
9 37 29 32.0108
10 41 27 35.0201
11 46 36 38.0537
12 49 28 41.0596
13 52 42 44.1294
14 56 44 47.1385
15 56 47 50.1472
16 62 51 53.3204
17 66 54 56.3082
18 72 58 60.1333
19 69 61 62.7234
20 71 64 65.2059
2.6 AltDUDE
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 8 8 8.0000
2 12 9 11.7401
3 16 10 15.4679
4 20 11 19.2468
5 24 12 23.0600
6 28 13 26.7517
7 32 15 30.6092
8 36 15 34.1284
9 40 30 37.9871
10 44 29 41.5955
11 48 38 45.2093
12 52 26 49.0893
13 56 45 52.7266
14 60 49 56.3480
15 64 53 59.7935
16 68 57 63.8964
17 72 61 67.2956
18 76 65 71.6400
19 79 71 75.3404
20 83 72 78.3529
2.7 BRACE
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 8 8 8.0000
2 11 9 10.9402
3 14 11 13.9295
4 18 12 17.8337
5 21 14 20.9788
6 24 15 23.9537
7 27 16 26.9647
8 30 18 29.9676
9 34 31 33.8942
10 37 29 36.9766
11 40 36 39.9731
12 43 36 42.9665
13 46 43 45.9814
14 50 43 49.9406
15 53 51 52.9877
16 56 54 55.9806
17 59 52 58.8868
18 62 59 61.9200
19 66 65 65.9787
20 69 69 69.0000
2.8 DUDE
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 8 8 8.0000
2 12 9 11.7401
3 16 10 15.4679
4 20 11 19.2468
5 24 12 23.0600
6 28 13 26.7517
7 32 15 30.6092
8 36 15 34.1284
9 40 30 37.9871
10 44 29 41.5955
11 48 38 45.2093
12 52 26 49.0893
13 56 45 52.7266
14 60 49 56.3480
15 64 53 59.7935
16 68 57 63.8964
17 72 61 67.2956
18 76 65 71.6400
19 79 71 75.3404
20 83 72 78.3529
2.9 LACE
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 9 9 9.0000
2 12 11 11.9642
3 16 12 15.9868
4 19 14 18.9913
5 22 16 21.9985
6 25 17 24.9972
7 28 19 27.9995
8 32 20 31.9990
9 35 35 35.0000
10 38 36 37.9994
11 41 41 41.0000
12 44 33 43.9934
13 48 48 48.0000
14 51 51 51.0000
15 54 54 54.0000
16 57 57 57.0000
17 60 60 60.0000
18 64 64 64.0000
19 67 67 67.0000
20 70 70 70.0000
2.10 LDUDE
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 8 8 8.0000
2 12 9 11.3573
3 16 10 14.4717
4 20 11 17.5182
5 24 12 20.5703
6 28 13 23.5265
7 32 15 26.6787
8 36 15 29.6551
9 40 21 32.7320
10 44 23 35.9557
11 48 28 38.9694
12 51 26 42.1635
13 55 34 45.1844
14 60 37 48.4613
15 63 40 51.3865
16 66 43 54.9903
17 70 45 58.2767
18 75 50 61.8133
19 76 51 62.7234
20 78 57 67.9706
2.11 MACE
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 8 7 7.9850
2 12 9 10.9905
3 16 10 13.9855
4 19 11 16.9912
5 24 12 19.9935
6 26 14 22.9938
7 29 15 26.0009
8 34 15 29.0167
9 37 29 32.0114
10 41 30 35.0183
11 46 37 38.0455
12 49 29 41.0432
13 52 42 44.0883
14 56 45 47.1493
15 56 48 50.1718
16 62 53 53.3042
17 66 55 56.3837
18 70 59 60.1467
19 69 62 62.7234
20 71 64 65.2941
2.12 RACE
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 9 8 8.0014
2 12 9 11.8931
3 16 11 15.9835
4 19 12 18.9981
5 22 14 21.9994
6 25 16 24.9987
7 28 17 27.9993
8 32 19 31.9989
9 35 35 35.0000
10 38 38 38.0000
11 41 41 41.0000
12 44 44 44.0000
13 48 48 48.0000
14 51 51 51.0000
15 54 54 54.0000
16 57 57 57.0000
17 60 60 60.0000
18 64 64 64.0000
19 67 67 67.0000
20 70 70 70.0000
2.13 UTF-6
HANZI_len max_len min_len aver_len
----------- -------- -------- --------
1 8 8 8.0000
2 12 9 11.9582
3 16 10 15.9889
4 20 11 19.9985
5 24 17 23.9995
6 28 13 27.9984
7 32 21 31.9993
8 36 23 35.9989
9 40 40 40.0000
10 44 44 44.0000
11 48 48 48.0000
12 52 52 52.0000
13 56 56 56.0000
14 60 60 60.0000
15 64 64 64.0000
16 68 68 68.0000
17 72 72 72.0000
18 76 76 76.0000
19 80 80 80.0000
20 84 84 84.0000
3. Summary
ACE_name max_cstring
AMC-ACE-M 18
AMC-ACE-O 15
AMC-ACE-R 14
AMC-ACE-V 17
AMC-ACE-W 16
AltDUDE 14
BRACE 18
DUDE 14
LACE 17
LDUDE 15
MACE 16
RACE 17
UTF-6 14
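max_cstring is the largest HANZI_len for which max_len is less than
64, that is, the longest Chinese Domain Name for which every sampled
name still fits within the 63-octet limit on a DNS label [RFC1035].
For Chinese Domain Names, max_len is therefore the most significant of
the three measures, because it determines whether every name of a
given length can be encoded.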
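As a worked check of how max_cstring follows from the tables in
section 2, the sketch below derives it from the max_len columns of
sections 2.1 (AMC-ACE-M) and 2.13 (UTF-6). The function and constant
names are assumptions introduced here. The sketch stops at the first
HANZI_len whose max_len exceeds 63, which is one reading of the
definition; for the tables in this document it gives the same values
as taking the largest HANZI_len with max_len below 64.

```python
DNS_LABEL_LIMIT = 63  # maximum length of a DNS label in octets (RFC 1035)


def max_cstring(max_len_by_hanzi_len, limit=DNS_LABEL_LIMIT):
    """Return the largest HANZI_len n such that max_len <= limit for
    every HANZI_len from the smallest one up to n."""
    best = 0
    for n in sorted(max_len_by_hanzi_len):
        if max_len_by_hanzi_len[n] > limit:
            break
        best = n
    return best


# max_len column of section 2.1 (AMC-ACE-M), HANZI_len 1..20
AMC_ACE_M = dict(enumerate(
    [9, 13, 17, 20, 23, 26, 29, 33, 35, 39,
     41, 45, 48, 50, 53, 57, 59, 62, 65, 68], start=1))

# max_len column of section 2.13 (UTF-6), HANZI_len 1..20
UTF_6 = dict(enumerate(
    [8, 12, 16, 20, 24, 28, 32, 36, 40, 44,
     48, 52, 56, 60, 64, 68, 72, 76, 80, 84], start=1))

result_m = max_cstring(AMC_ACE_M)
result_utf6 = max_cstring(UTF_6)
```

With these inputs result_m is 18 and result_utf6 is 14, matching the
summary table.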
4. References
[RFC1035] "DOMAIN NAMES - IMPLEMENTATION AND SPECIFICATION",
RFC 1035, Nov 1987, P. Mockapetris
[RACE] "RACE: Row-based ASCII Compatible Encoding for IDN",
draft-ietf-idn-race-03.txt, Nov 2000, P Hoffman
[BRACE] "BRACE: Bi-mode Row-based ASCII-Compatible Encoding for IDN
version 0.1.2",
draft-ietf-idn-brace-00.txt, Sep 2000, A Costello
[LACE] "LACE: Length-based ASCII Compatible Encoding for IDN"
draft-ietf-idn-lace-01.txt, Jan 2001, M Davis, P Hoffman
[UTF6] "UTF-6 - Yet Another ASCII-Compatible Encoding for IDN",
draft-ietf-idn-utf6-00.txt, Nov 2000, M Welter, B Spolarich
[DUDE] "Differential Unicode Domain Encoding (DUDE)"
draft-ietf-idn-dude-02.txt, Jun 2001, M Welter, B Spolarich,
A Costello
[AMCACEM] "AMC-ACE-M version 0.1.0"
draft-ietf-idn-amc-ace-m-00.txt, Feb 2001, A Costello
[AltDUDE] "AltDUDE version 0.0.2"
draft-ietf-idn-altdude-00.txt, Mar 2001, A Costello
[AMCACEO] "AMC-ACE-O version 0.0.3"
draft-ietf-idn-amc-ace-o-00.txt, Mar 2001, A Costello
[AMCACER] "AMC-ACE-R version 0.2.1"
draft-ietf-idn-amc-ace-r-01.txt, May 2001, A Costello
[AMCACEV] "AMC-ACE-V version 0.1.0"
draft-ietf-idn-amc-ace-v-00.txt, May 2001, A Costello
[AMCACEW] "AMC-ACE-W version 0.1.0"
draft-ietf-idn-amc-ace-w-00.txt, May 2001, A Costello
[MACE] "MACE: Modal ASCII Compatible Encoding for IDN"
draft-ietf-idn-mace-00.txt, Jun 2001, M Ishisone, Y Yoneya
[LDUDE] "Improving ACE using code point reordering v0.9"
draft-ietf-idn-lsb-ace-00.txt, Jun 2001, Soobok Lee
[MDNKIT] "Multilingual Domain Name tool Kit",
http://www.nic.ad.jp/jp/research/idn/mdnkit/download/
5. Acknowledgements
The CNNIC Chinese Registry Service Department provided the registered
Chinese Domain Names used in this evaluation.
XiaoDong LEE, lee@cnnic.net.cn
Wang Yanfeng, wyf@cnnic.net.cn
Deng Xiang, deng@cnnic.net.cn
6. Author's Address
Sun Guonian
China Internet Network Information Center
No.4, South 4th street, Zhongguancun,
Haidian District, Beijing,
China 100080
sun@cnnic.net.cn