[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[idn] ACE37



One thing I forgot to point out:

From: "Adam M. Costello" <amc@cs.berkeley.edu>
> It doesn't look as simple as DUDE to me.  To me it looks no simpler than
> AMC-ACE-W, which is as efficient as ACE37 for CJK, as efficient as DUDE
> for single-row scripts, and more efficient than DUDE for Latin-based
> scripts.  Whereas AMC-ACE-W's complexity is in a state machine, ACE37's
> complexity is in encoding/recognizing lots of different patterns.

DUDE uses a similar algorithm to determine the significance of diff, which
in turn determines how many b32 characters to use:

diff value                                    DUDE Form
----------------------------------------------------------------------------
diff<=0xF                                 <"0"+quartet>
0x10<=diff<=0xFF                  <"1"+quartet><"0"+quartet>
0x100<=diff<=0xFFF              <"1"+quartet><"1"+quartet><"0"+quartet>
0x1000<=diff<=0xFFFF
<"1"+quartet><"1"+quartet><"1"+quartet><"0"+quartet>
0x10000<=diff<=0xFFFFF
<"1"+quartet><"1"+quartet><"1"+quartet><"1"+quartet><"0"+quartet>
0x100000<=diff<=0x10FFFF
<"1"+quartet><"1"+quartet><"1"+quartet><"1"+quartet><"1"+quartet><"0"+quarte
t>

Here you see that there are 6 different scenarios as opposed to ACE37's 5
scenarios.  Both DUDE and ACE37 needs to determine the significant bits of
the resulting diff to come up with the correct form.  So perhaps it doesnt
look as simple (because I created these charts in the draft), but in effect
it is the same.  You could refer to the procedure comparison I did in my
draft in Section 1
<http://www.ietf.org/internet-drafts/draft-chung-idn-ace37-00.txt>


Edmon