
[idn] draft about Evaluation of various ACEs with existing Chinese (BIG5) Domain Names



Dear IETF & IDN WG:

Attached is the draft of:
Evaluation of various ACEs with existing Chinese (BIG5) Domain Names

This document offers an overview of the results of applying various
ACEs to Chinese (BIG5) domain names, and gives a brief summary of the
evaluation results.

Any suggestions and comments are appreciated!

Erin Chen
TWNIC
 
Internet Draft                                                 Abel Yang
draft-ietf-idn-ace-eval-tw-00.txt                              Erin Chen 
Jul 13, 2001                                                       TWNIC
Expires Jan 13, 2002                                           

   Evaluation of various ACEs with existing Chinese (BIG5) Domain Names

Status of this memo

    This document is an Internet-Draft and is in full conformance with
    all provisions of Section 10 of RFC2026.

    Internet-Drafts are working documents of the Internet Engineering
    Task Force (IETF), its areas, and its working groups.  Note that
    other groups may also distribute working documents as 
    Internet-Drafts.

    Internet-Drafts are draft documents valid for a maximum of six
    months and may be updated, replaced, or obsoleted by other documents
    at any time.  It is inappropriate to use Internet-Drafts as reference
    material or to cite them other than as "work in progress."

    The list of current Internet-Drafts can be accessed at
    http://www.ietf.org/ietf/1id-abstracts.txt

    The list of Internet-Draft Shadow Directories can be accessed at
    http://www.ietf.org/shadow.html.



Abstract

    To help the ACE (ASCII Compatible Encoding) Design Team in the IDN
    Working Group select an appropriate ACE proposal, this document
    offers an overview of the results of applying various ACEs to
    Chinese (BIG5) domain names and gives a brief summary of the
    evaluation results.

1. Test data and tools

    The sample data come from the TWNIC registration database. All of
    the names are in BIG5 encoding. The sample contains 28454 names,
    distributed by length as follows:

    Length of data    Quantity of BIG5 Domain Names
    --------------    -----------------------------
     2 chars           3762
     3 chars           3914
     4 chars           4613
     5 chars           1723
     6 chars           1851
     7 chars           4282
     8 chars           1621
     9 chars           1254
    10 chars           2075
    11 chars           1215
    12 chars            753
    13 chars            646
    14 chars            745

    Total             28454
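
    The total can be checked against the per-length counts. The
    following is a minimal Python sketch; the names counts_by_length
    and share_short are illustrative and do not come from the test
    tool.

```python
# Per-length counts copied from the distribution table
# (length in BIG5 chars -> number of domain names).
counts_by_length = {
    2: 3762, 3: 3914, 4: 4613, 5: 1723, 6: 1851, 7: 4282, 8: 1621,
    9: 1254, 10: 2075, 11: 1215, 12: 753, 13: 646, 14: 745,
}

total = sum(counts_by_length.values())

# Fraction of the sample made up of short names (2 to 4 chars).
share_short = sum(n for length, n in counts_by_length.items() if length <= 4) / total

print(total)  # 28454
print(f"{share_short:.1%} of the names have 2 to 4 chars")
```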
	
    TWNIC accepts only 2 to 14 BIG5 chars in a Chinese Domain Name. All
    of the data are in the local BIG5 (or CP950) encoding. Each name
    was encoded with the following ACEs: RACE-03 [RACE], BRACE-00
    [BRACE], LACE-01 [LACE], UTF-6-00 [UTF6], DUDE-02 [DUDE],
    AMC-ACE-M-00 [AMCACEM], AltDUDE-01 [AltDUDE], AMC-ACE-O-00
    [AMCACEO], and AMC-ACE-R-01 [AMCACER].
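
    The conversion path for a single name can be sketched as follows.
    This is only an illustration under stated assumptions: the real
    test used the tool provided by JPNIC, and none of the ACEs listed
    above ships with the Python standard library, so the built-in
    "punycode" codec is used purely as a stand-in encoder. The function
    names and the sample label are hypothetical.

```python
def big5_to_code_points(raw: bytes) -> list[int]:
    """Decode a CP950 (BIG5) label and return its Unicode code points."""
    return [ord(ch) for ch in raw.decode("cp950")]


def encode_stand_in(raw: bytes) -> str:
    """Encode a CP950 label with a stand-in ACE.

    Assumption: Python's "punycode" codec replaces the ACEs evaluated
    in this document, which are not available in the standard library.
    """
    return raw.decode("cp950").encode("punycode").decode("ascii")


# Hypothetical two-character sample label (U+53F0 U+7063) in CP950 bytes.
sample = "\u53f0\u7063".encode("cp950")

code_points = big5_to_code_points(sample)
ace_label = encode_stand_in(sample)

print(len(sample), "bytes,", len(code_points), "chars")
print(ace_label, len(ace_label))
```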
    
2. Length results for each ACE
    
    CHN_len is the length of the Chinese (BIG5) domain name, in BIG5
    chars, before ACE encoding. For all names with the same CHN_len,
    max_len is the maximum length, min_len is the minimum length, and
    aver_len is the average length after ACE encoding.
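
    The three statistics can be computed per CHN_len group as in the
    sketch below. The function name, the sample pairs, and the choice
    to round aver_len to the nearest integer are assumptions; this
    document does not state how the test tool rounds.

```python
from collections import defaultdict


def length_stats(pairs):
    """Group (chn_len, ace_len) pairs by chn_len.

    Returns {chn_len: (max_len, min_len, aver_len)}. Rounding aver_len
    with round() is an assumption, not the documented behaviour of the
    test tool.
    """
    groups = defaultdict(list)
    for chn_len, ace_len in pairs:
        groups[chn_len].append(ace_len)
    return {
        chn_len: (max(v), min(v), round(sum(v) / len(v)))
        for chn_len, v in sorted(groups.items())
    }


# Hypothetical measurements: (chars before encoding, chars after encoding).
sample_pairs = [(2, 8), (2, 5), (2, 8), (3, 11), (3, 6), (3, 11)]
stats = length_stats(sample_pairs)

for chn_len, (max_len, min_len, aver_len) in stats.items():
    print(chn_len, max_len, min_len, aver_len)
```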

  2.1 AMC-ACE-M

    CHN_len    max_len  min_len  aver_len
    -------    -------  -------  --------
          2          8        5         8
          3         11        6        11
          4         17        8        14
          5         17       11        17
          6         21       11        20
          7         24       13        22
          8         27       15        25
          9         30       16        28
         10         33       18        31
         11         36       22        34
         12         39       19        37
         13         42       21        40
         14         45       23        43


  2.2 AMC-ACE-O

    CHN_len    max_len  min_len  aver_len
    -------    -------  -------  --------
          2          8        5         8
          3         12        6        11
          4         20        9        15
          5         20       11        18
          6         24       11        22
          7         27       13        25
          8         31       15        29
          9         35       16        32
         10         39       18        36
         11         42       24        38
         12         46       19        41
         13         50       21        44
         14         54       23        48


  2.3 AMC-ACE-R

    CHN_len    max_len  min_len  aver_len
    -------    -------  -------  --------
          2          8        5         8
          3         12        6        12
          4         20        9        16
          5         20       11        19
          6         24       11        23
          7         28       13        26
          8         32       15        30
          9         36       16        34
         10         40       18        38
         11         44       24        41
         12         48       19        44
         13         52       21        48
         14         56       24        51

  
  2.4 AltDUDE

    CHN_len    max_len  min_len  aver_len
    -------    -------  -------  --------
          2          8        5         8
          3         12        6        11
          4         20        8        16
          5         20       13        19
          6         24       11        23
          7         28       14        26
          8         32       15        30
          9         36       18        34
         10         40       21        38
         11         44       26        41
         12         48       26        46
         13         52       25        48
         14         56       30        53


  2.5 BRACE

    CHN_len    max_len  min_len  aver_len
    -------    -------  -------  --------
          2          7        5         7
          3         10        7        10
          4         17        8        14
          5         17       11        17
          6         20       11        20
          7         23       13        23
          8         28       14        26
          9         30       15        30
         10         33       17        33
         11         36       22        36
         12         39       18        39
         13         42       20        42
         14         46       23        46


  2.6 DUDE

    CHN_len    max_len  min_len  aver_len
    -------    -------  -------  --------
          2          8        5         8
          3         12        6        11
          4         20        8        16
          5         20       13        19
          6         24       11        23
          7         28       15        26
          8         32       15        30
          9         36       18        34
         10         40       21        38
         11         44       27        41
         12         48       26        46
         13         52       25        48
         14         56       30        53


  2.7 LACE

    CHN_len    max_len  min_len  aver_len
    -------    -------  -------  --------
          2          8        7         8
          3         12        8        12
          4         18       10        15
          5         18       18        18
          6         21       16        21
          7         24       21        24
          8         31       23        28
          9         31       24        31
         10         34       26        34
         11         37       34        37
         12         40       29        40
         13         44       31        44
         14         47       36        47
  
  2.8 RACE

    CHN_len    max_len  min_len  aver_len
    -------    -------  -------  --------
          2          8        5         8
          3         12        7        12
          4         18        8        15
          5         18       18        18
          6         21       20        21
          7         24       24        24
          8         31       28        28
          9         31       31        31
         10         34       34        34
         11         37       37        37
         12         40       40        40
         13         44       44        44
         14         47       47        47



  2.9 UTF-6

    CHN_len    max_len  min_len  aver_len
    -------    -------  -------  --------
          2          8        5         8
          3         12        6        12
          4         20        9        16
          5         20       14        20
          6         24       14        24
          7         28       18        28
          8         32       20        32
          9         36       21        36
         10         40       23        40
         11         44       30        44
         12         48       28        48
         13         52       29        52
         14         56       33        56

3. Conversion speed results for each ACE

               UTF-6   RACE    LACE    BRACE   DUDE    AltDUDE
    MAX        0.80    0.75    0.82    0.86    0.74    0.70
    MIN        0.56    0.55    0.52    0.62    0.53    0.53
    AVERAGE    0.68    0.64    0.62    0.73    0.64    0.62

               AMC-ACE-M   AMC-ACE-O   AMC-ACE-R
    MAX        1.12        1.92        0.99
    MIN        0.86        1.69        0.67
    AVERAGE    0.98        1.81        0.82
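
    This document does not state the unit or the measurement procedure
    behind the MAX, MIN and AVERAGE figures. As an illustration only,
    one way to collect such figures is to time several passes over the
    sample and report the slowest, fastest and mean pass. In the sketch
    below, the function name, the repeat count, the sample labels and
    the stand-in encoder (Python's built-in "punycode" codec, which is
    not one of the evaluated ACEs) are all assumptions.

```python
import time


def time_encoder(encode, labels, repeats=5):
    """Return (max, min, average) wall-clock seconds over `repeats` passes."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        for label in labels:
            encode(label)
        durations.append(time.perf_counter() - start)
    return max(durations), min(durations), sum(durations) / len(durations)


# Hypothetical sample: two short labels repeated to get a measurable run time.
labels = ["\u53f0\u7063", "\u7db2\u8def"] * 1000

slowest, fastest, mean = time_encoder(lambda s: s.encode("punycode"), labels)
print(f"MAX {slowest:.4f}  MIN {fastest:.4f}  AVERAGE {mean:.4f}")
```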




4. Summary of the result

    For every ACE, max_len is less than 64 (the largest observed value
    is 56), because a Chinese Domain Name is limited to between 2 and
    14 BIG5 chars. For these sample data, judging by both length and
    speed, LACE and RACE appear to perform better than the others.
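
    The length bound can be checked directly against the Section 2
    tables. The sketch below copies the max_len value at CHN_len = 14,
    which is the largest max_len in each table; the variable names are
    illustrative. Whether the measured lengths include any ACE prefix
    is not stated in this document, so the check covers only the
    figures as reported.

```python
# max_len at CHN_len = 14 for each ACE, copied from the Section 2 tables.
max_len_at_14 = {
    "AMC-ACE-M": 45, "AMC-ACE-O": 54, "AMC-ACE-R": 56, "AltDUDE": 56,
    "BRACE": 46, "DUDE": 56, "LACE": 47, "RACE": 47, "UTF-6": 56,
}

LIMIT = 64  # bound used in the summary: every max_len must be below this

largest = max(max_len_at_14.values())
all_fit = all(length < LIMIT for length in max_len_at_14.values())

print(largest, all_fit)  # 56 True
```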
        
5. Acknowledgements

    The ACE evaluation testing tool was provided by JPNIC.
    TWNIC provided the real registered Chinese (BIG5) domain names.
    Hsu Nai-Wen <snw@twnic.net.tw> and the TWNIC MDNS technical team
    gave many suggestions.

6. Authors' Addresses

    Abel Yang
    Taiwan Network Information Center (TWNIC)
    4F-2, No. 9, Sec. 2, Roosevelt Rd., Taipei, 100 Taiwan.
    Email: abelyang@twnic.net.tw
    Tel: +886-23411313#505

    Erin Chen
    Taiwan Network Information Center (TWNIC)
    4F-2, No. 9, Sec. 2, Roosevelt Rd., Taipei, 100 Taiwan.
    Email: erin@twnic.net.tw
    Tel: +886-23411313#502

7. References

    [RFC1035]  "DOMAIN NAMES - IMPLEMENTATION AND SPECIFICATION",
               RFC1035, Nov 1987, P Mockapetris
    [RACE]     "RACE: Row-based ASCII Compatible Encoding for IDN",
               draft-ietf-idn-race-03.txt, Nov 2000, P Hoffman
    [BRACE]    "BRACE: Bi-mode Row-based ASCII-Compatible Encoding
               for IDN version 0.1.2",
               draft-ietf-idn-brace-00.txt, Sep 2000, A Costello
    [LACE]     "LACE: Length-based ASCII Compatible Encoding for IDN",
               draft-ietf-idn-lace-01.txt, Jan 2001, M Davis, P Hoffman
    [UTF6]     "UTF-6 - Yet Another ASCII-Compatible Encoding for IDN",
               draft-ietf-idn-utf6-00, Nov 2000, M Welter, B Spolarich
    [DUDE]     "Differential Unicode Domain Encoding (DUDE)",
               draft-ietf-idn-dude-02.txt, Jun 2001, M Welter,
               B Spolarich, A Costello
    [AMCACEM]  "AMC-ACE-M version 0.1.0",
               draft-ietf-idn-amc-ace-m-00.txt, Feb 2001, A Costello
    [AltDUDE]  "AltDUDE version 0.0.2",
               draft-ietf-idn-altdude-00.txt, Mar 2001, A Costello
    [AMCACEO]  "AMC-ACE-O version 0.0.3",
               draft-ietf-idn-amc-ace-o-00.txt, Mar 2001, A Costello
    [AMCACER]  "AMC-ACE-R version 0.2.1",
               draft-ietf-idn-amc-ace-r-01.txt, May 2001, A Costello
    [AMCACEV]  "AMC-ACE-V version 0.1.0",
               draft-ietf-idn-amc-ace-v-00.txt, May 2001, A Costello
    [AMCACEW]  "AMC-ACE-W version 0.1.0",
               draft-ietf-idn-amc-ace-w-00.txt, May 2001, A Costello
    [MACE]     "MACE: Modal ASCII Compatible Encoding for IDN",
               draft-ietf-idn-mace-00.txt, Jun 2001, M Ishisone, Y Yoneya
    [LDUDE]    "Improving ACE using code point reordering v0.9",
               draft-ietf-idn-lsb-ace-00.txt, Jun 2001, Soobok Lee
    [MDNKIT]   "Multilingual Domain Name tool Kit",
               http://www.nic.ad.jp/jp/research/idn/mdnkit/download/
    [NAMEPREP] "Preparation of Internationalized Host Names",
               draft-ietf-idn-nameprep-03.txt, Feb 2001, P Hoffman,
               M Blanchet