[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [idn] stringprep comment 5: hangul conjoining sequence

To: "'IETF idn working group'" <idn@ops.ietf.org>
Subject: RE: [idn] stringprep comment 5: hangul conjoining sequence
From: "Kent Karlsson" <kentk@md.chalmers.se>
Date: Tue, 12 Feb 2002 12:41:39 +0100
In-reply-to: <20020211214832.GD23357@nicemice.net>



> -----Original Message-----
> From: owner-idn@ops.ietf.org 
> [mailto:owner-idn@ops.ietf.org]On Behalf Of
> Adam M. Costello
> Sent: den 11 februari 2002 22:49
> To: idn@ops.ietf.org
> Subject: Re: [idn] stringprep comment 5: hangul conjoining sequence
> 
> 
> Kent Karlsson <kentk@md.chalmers.se> wrote:
> 
> > Compatibility (non-conjoining) Hangul letters are best prohibited.
> 
> Stringprep does normalization before prohibition, so prohibiting
> compatibility characters would have no effect.  After the 
> normalization
> step, there are no compatibility characters in the string.


Ouch!  That leads to results that are completely off the board:
NFKC or NFKD on Hangul compatibility letter sequences leads
to completely wrong results; in particular the distinction between
lead and trail consonants becomes just plain wrong.


> A trick you could play, however, would be to map the characters in
> question to an arbitrary prohibited character that survives the
> normalization step (like U+0000).

Ok, consider that my revised suggestion.

		Kind regards
		/kent k


> AMC
>

References:
- Re: [idn] stringprep comment 5: hangul conjoining sequence
  - From: "Adam M. Costello" <idn.amc+0@nicemice.net.RemoveThisWord>

Prev by Date: RE: [idn] Comments on IDNA/stringprep/nameprep
Next by Date: Re: [idn] Re: [bidi] END OF AYAH vs NAMEPREP
Previous by thread: Re: [idn] stringprep comment 5: hangul conjoining sequence
Next by thread: [idn] IDNA comment 1 : applications' own normalization vs stringprep
Index(es):
- Date
- Thread