[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: [idn] stringprep comment 5: hangul conjoining sequence
> -----Original Message-----
> From: owner-idn@ops.ietf.org
> [mailto:owner-idn@ops.ietf.org]On Behalf Of
> Adam M. Costello
> Sent: den 11 februari 2002 22:49
> To: idn@ops.ietf.org
> Subject: Re: [idn] stringprep comment 5: hangul conjoining sequence
>
>
> Kent Karlsson <kentk@md.chalmers.se> wrote:
>
> > Compatibility (non-conjoining) Hangul letters are best prohibited.
>
> Stringprep does normalization before prohibition, so prohibiting
> compatibility characters would have no effect. After the
> normalization
> step, there are no compatibility characters in the string.
Ouch! That leads to results that are completely off the board:
NFKC or NFKD on Hangul compatibility letter sequences leads
to completely wrong results; in particular the distinction between
lead and trail consonants becomes just plain wrong.
> A trick you could play, however, would be to map the characters in
> question to an arbitrary prohibited character that survives the
> normalization step (like U+0000).
Ok, consider that my revised suggestion.
Kind regards
/kent k
> AMC
>