[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] call for comments for REORDERING




----- Original Message ----- 
From: "James Seng/Personal" <jseng@pobox.org.sg>
To: "Soobok Lee" <lsb@postel.co.kr>; <idn@ops.ietf.org>
Sent: Friday, October 19, 2001 1:17 PM
Subject: Re: [idn] call for comments for REORDERING


> You dont get my point.
> 
> Reordering achieve shorter by putting more oftenly used characters in
> one block and others at the back. BUT this also means less oftenly used
> characters would result in a *LONGER* label then usual. 

No. Even For GROUPS of those rare cases, we get Always SHORTER labels than usual.

For example, 
if REORDERING uses a frequent characters set of length 4096 in entire 20992-letter
han ideographic script block, rarely used han letter will be put
into the subblock of length 20992-4096 = 16896 which is 20% shorter than 20992.

let's assume we have with a label consisting of only those characters from 16896 block. without reordering, they are randomly distributed in 20992 letter block.
with reordering, they are in shorter 16896-letter block.
In the latter case, we got shorter successive code distances and shorter
ACE labels than in the former one.

 Even without REORDERING, ACE has favored some of them, and disfavored
others. But, with REORDERING, BOTH groups get shorter ACE labels.
That is the virtue of REORDERING.

Soobok

> And who are we
> to say these less oftenly used characters are less important or worst,
> become invalid (too long to fit) because of this reordering?
> 
> -James Seng
>