Re: [idn] opting out of SC/TC equivalence

--On 31 August 2001 20:46 +0800
"tsenglm@計網中心.中大.tw"
<tsenglm@cc.ncu.edu.tw> wrote:

>> The problem is that such a table doesn't seem to exist. I have seen in
>> one draft (as Harald points out) a table with approximately 2000
>> characters out of the 20,000-40,000 which exist. Harald has asked valid
>> questions about what to do with the other characters.
>>
> If you ask an expert in the Chinese language to give you this "complete
> table", it will never be delivered, because the characters form an open
> set. Even those of us who use Chinese use no more than 8000 frequently
> used characters in our newspapers. So no one will answer this impossible
> problem.

Please then answer a simpler one:

When the matching algorithm you envision encounters a character for which
the tables do not say what to do, what will the algorithm do with it?
Pass it through unmodified? Reject it? Do something else which you have
not defined yet?
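
To make the question concrete, here is a minimal sketch of the first two
choices, in Python. Everything in it is an assumption made for
illustration and comes from no draft: the two-entry TC_TO_SC table, the
UnmappedPolicy names, the fold() function, and the is_han() range check
(which only covers the basic CJK Unified Ideographs block) are all
hypothetical.

```python
from enum import Enum


class UnmappedPolicy(Enum):
    """What to do with a Han character the table says nothing about."""
    PASS_THROUGH = "pass_through"
    REJECT = "reject"


# Hypothetical, deliberately tiny TC -> SC table. A real table would have
# thousands of entries and would still be incomplete.
TC_TO_SC = {
    "\u570b": "\u56fd",  # 國 -> 国
    "\u5b78": "\u5b66",  # 學 -> 学
}

# Simplified forms that the table already knows about as mapping targets.
SC_KNOWN = set(TC_TO_SC.values())


def is_han(ch):
    """Assumption: treat only U+4E00..U+9FFF as Han for this sketch."""
    return 0x4E00 <= ord(ch) <= 0x9FFF


def fold(label, policy):
    """Map TC to SC, applying `policy` to Han characters not in the table."""
    out = []
    for ch in label:
        if ch in TC_TO_SC:
            out.append(TC_TO_SC[ch])      # table gives an answer
        elif ch in SC_KNOWN or not is_han(ch):
            out.append(ch)                # already simplified, or not Han
        elif policy is UnmappedPolicy.PASS_THROUGH:
            out.append(ch)                # unknown Han: keep unmodified
        else:
            raise ValueError("no TC/SC mapping for U+%04X" % ord(ch))
    return "".join(out)
```

Under PASS_THROUGH, two names that differ only in an unlisted TC/SC pair
stay distinct; under REJECT, any name containing an unlisted character
cannot be registered at all. Either may be defensible, but one of them
(or a third rule) has to be written down.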

If you tell us this, and claim that this is acceptable treatment of the
20,000-40,000 characters not in the tables, we can at least discuss this
proposal (whether such a partial TC/SC mapping is acceptable), instead of
asking again and again what to do about them.

But it is impossible to implement an algorithm which says "this part will 
be worked out later".

                  Harald