Re: [Fwd: Need for Normalization forms "KR" was: Re: [idn] case folding]
I am kind of tired of explaining this day in and day out,
so I am going to write an I-D on this, explaining CJK canonicalization. I am
halfway through now. Give me another week or two. I put other things on hold for
this.
(yea, my todo list is getting long. sorry paul :P)
-James Seng
Mark Davis wrote:
>
> I am certainly not against this, if it is possible (and I am not an expert in this area). However, I have heard from various sources that the mappings are not trivial, and require dictionary look-up for satisfactory results. If it can be done in an algorithmic fashion, and you have the data to implement it, then it is a different matter.
>
> Mark
>
> James Seng wrote:
>
> > Mark Davis wrote:
> > > - Han duplicates. What are you thinking of here? Simplified vs. Traditional is not algorithmic -- are you thinking of radicals, or some other mapping? Do you have data for whatever mapping you are thinking of?
> >
> > Without comment on the rest, I beg to differ on this.
> >
> > I know that the common perception is that SC-TC is impossible to do without
> > considering the lexicon and context, but for domain names, it *can* be done.
> >
> > -James Seng