[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] Re: permission <draft-ietf-idn-ace37-00.txt (attach)



> Hi,
>
Great hearing from you.  Actually, I sh
ould give you some credit cause you sta
rted the codepoint reordering thing fir
st, though the ACE37 idea really came a
bout earlier than I saw your draft.  An
yway, I think my code block shifting me
chanism is much more simple and can suc
cessfully get 21+ han ideographs which
is much more acceptable than DUDE's cur
rent state.

> When the diff value is less than 0xF,
what is the ACE37 form for it ?
>
When diff<=0x7F it will be in the 7bit
form, that is
<b4><b32>

the reason <b4> has to be used is that
it signifies how many more characters a
re used for the particular codepoint (or
what I termed as codepoint bracket).  If
only a <b32> is used, then you wont know
whether it is a 15 bit form or a 22 bit
form.
Therefore, if diff=0xF, ACE37=wf

> And any example C doe for ACE37  avai
lable on the web ?
>
Not yet, David & I will work on it and
post it on the web if more people are i
nterested in the idea and think that it
could be better than DUDE.  For now you
can check out my excel worksheet to see
how it works. http://www.dnsii.org/ace3
7/ace37-encode.xls (for encoding), http
://www.dnsii.org/ace37/ace37-decode.xls
(for decoding).

Edmon




> Thanks.
>
> Soobok
>
> > > >Chung & Leung
[Page 6]
> > > >ACE37       ACE Utilizing All 37
Alphanumeric Characters      July 2001
> > > >
> > > >    The following table explains
how base-4 characters are combined
> with
> > > >    base-32 characters to form a
representation of a diff (key:
> b4=base-
> > > >    4, b32=base-32):
> > > >
> > > >              diff value
|bits|       ACE37 Form
> > >
 >        -------------------------
|----|----------------------------
> > > >                  diff<=0x7F
|  7 | <b4><b32>
> > > >            0x80<=diff<=0x7FFF
| 15 | <b32><b32><b32>
> > > >          0x8000<=diff<=0x1FFFF
| 17 | w<b4><b32><b32><b32>
> > > >         0x20000<=diff<=0xFFFFF
| 20 | ww<b32><b32><b32><b32>
> > > >        0x100000<=diff<=0x10FFFF
| 22 | <b4>w<b32><b32><b32><b32>
> > > >
> > > >    Note that the "bits" column
represents the maximum number of
> > > >    significant bits for the giv
en diff value.  For example when
> > > >    diff<=0x7F, the maximum value
is 0b1111111, therefore the number of
>
>
> ----- Original Message -----
> From: "Edmon" <edmon@neteka.com>
> To: "Natalia Syracuse" <nsyracus@ietf
.org>; "David Leung"
> <david@neteka.com>; "Marc Blanchet" <
Marc.Blanchet@viagenie.qc.ca>;
> <idn@ops.ietf.org>
> Cc: <jseng@pobox.org.sg>
> Sent: Friday, July 06, 2001 12:48 AM
> Subject: [idn] Re: permission <draft-
ietf-idn-ace37-00.txt (attach)
>
>
> > Hi all,
> >
> > I was unaware that the workgroup no
longer accepts new drafts.  Anyway, I
> > have drafted a new ACE based on the
simplicity of DUDE which has hugely
> > improved compression.  Worst case s
cenario CJK could have 21 han
> characters!
> > Attached below is a copy of the dra
ft (for my original submission), you
> can
> > also find it at http://www.dnsii.or
g/idn-ace37-00.txt (easier to read) and
> > hopefully in the i-d-n.net website
soon.
> >
> > ACE37 is based on the one-pass one-
mode scheme of DUDE (diiferential XOR),
> > then utilizes a simple code block s
hifting (similar to the reference
> points
> > in the AMC series) to hugely increa
se the capacity for CJK (worst case
> > scenario 21 han characters!) and th
en utilizes base-32 for compression (as
> > in LACE) (DUDE and AMC-w/v uses bas
e-32 only for flagging).  In addition
> to
> > base-32, a base-4 scheme is introdu
ced by using the remaining characters
> > {wxyz}.  These contain 2 bits of ch
aracter information and doubles as an
> > indicator for codepoint brackets.
All the while, the algorithm is kept to
> > be as simple as DUDE.
> >
> > Hopefully you might find that it is
interesting and appropriate to be
> > considered as an ACE within the IET
F.  Afterall, it was intended to be an
> > integrated version of the three pri
mary ACEs: DUDE, LACE and the AMC
> series,
> > identified by the ACE design team r
eport.
> >
> > Looking forward to all your inputs.
> >
> > Edmon
>