[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] ACE37,AMCW and LDUDE




----- Original Message ----- 
From: "Keith Moore" <moore@cs.utk.edu>
 

> I would *really* like to avoid complex reordering of codepoints, because
> it would require fairly large tables (that would have to be implemented
> on some machines of modest capability) and would be error-prone.  
> 

It's not complex.
Its table is not so large compared to that of KC norm and
NAMEPREP. ACE & NAMEPREP will be packaged into one library
as MDNkit of JPNIC. Look into KC norm and NAMEPREP tables and
think about the reordering table, more simpler. 

Reordering reduce the length typical ACE label about 40% ~ 50%.
Please throughly test my example sources with average length of
han/hangul domains. 

> I also fear that this kind of mapping would encourage greater use
> of proxies over end-to-end implementations, with all of the implications
> for reliability, control, delay, etc. that would result from this design 
> choice.  I don't think we want to encourage a walled garden.

No proxies needed. No walls. 
All reordering is hidden and done in ACE & de-ACE libraries transparently.

No delay at all. Because what is needed is "n=maptable[n]" operation.
Single machine instruction away!!!
What i implemented in my example source is using mapping functions
with tables. IN the final version, we can use only tables.

> 
> I could see having a very simple reordering of codepoints in order
> to increase the efficiency of DUDE for ideographic languages.  But my
> (admittedly biased) sense is that we're fast approaching the point
> of diminishing returns.  Just how important is it to provide for names 
> that are one or two ideographs longer?
> 
> Keith
> 

With DUDE+REORDERING , we get 7~8 more Han ideographs. 
If you have enough time, test with many han examples.

code-block shifting does not help to reduce average XOR distance
in general cases ( good only for the first 4096 code block ).

What is important is  to reduce ACE label length of AVERAGED 
han/hangul domains. THis working group should concentrage
attentions  not to the max length issue but to  issues of
reducing ACE label length of average hangul/han domains.

Soobok Lee