[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Combining characters (was: Re: [idn] hostname historyhell)
I think [nameprep] handle them inconsistently, that is
038F; 03CE; Case map
and your latter case <A><acute> ==> > <a><acute> is
not considered. But:
0390; 03B9 0308 0301; Case map
0587; 0565 0582; Case map
do use character combination to normalize the symbols.
Liana
On Sun, 25 Nov 2001 13:59:39 +0900 "Soobok Lee" <lsb@postel.co.kr>
writes:
> (resent after correcing some formatting error)
>
> Does a combining sequence inherit the bicameral characteristic of
> the leading letter ?
>
> For example,
>
> <A acute> can be lowercased into <a acute> ? Then, <A><acute> ==>
> <a><acute> ?
> <A><acute><diaeresis>... ==> <a><acute><diaeresis>... ? I am not
> sure whether it can be.
>
> Can <I><dot-above> be lowercased into <i><dot-above> or not?
> <I dot-above> has no combined form <i dot-above>, but has <i> as its
> lowercase form
> due to its language context in turkish and azerbaijani.
> Is this case just an exception ?
>
> Soobok Lee
>
> ----- Original Message -----
> From: Soobok Lee
> To: idn@ops.ietf.org
> Sent: Sunday, November 25, 2001 1:32 PM
> Subject: Re: Combining characters (was: Re: [idn] hostname
> historyhell)
>
>
> Does a combining sequence inherit the bicameral characteristic of
> the leading letter ?
>
> For example,
>
> can be lowercased into ? Then, ==> ?
> ... ==> ... ? I am not sure whether it can be.
>
> Can be lowercased into or not?
> has no combined form , but has as its lowercase form
> due to its language context in turkish and azerbaijani.
> Is this case just an exception ?
>
> Soobok Lee
>
>
> ----- Original Message -----
> From: "Eric A. Hall"
> To: "Kenneth Whistler" ; ;
> Sent: Friday, November 23, 2001 1:33 AM
> Subject: Re: Combining characters (was: Re: [idn] hostname
> historyhell)
>
>
> >
> > "Eric A. Hall" wrote:
> >
> > > The specific concern I had is already addressed in nameprep so
> there
> > > is no need for the exception.
> >
> > What I am concerned with is the use of domain name which consist
> entirely
> > of punctuation, or in this case, which consist entirely of
> combining
> > marks. These represent a rules problem with the principles behind
> the
> > "safe-set". EG, we forbid domain names like ````````.com using
> U+0060, but
> > ````````.com may be legal if the combining character U+0300 is
> used
> > instead.
> >
> > I get lost in the shrubbery of specs, so can somebody tell me
> whether or
> > not this is a valid concern? I can't tell for certain what happens
> at the
> > end of nameprep with such a label.
> >
> > If this is a problem, a solution might be to change the
> prohibition
> > against leading hyphen to a prohibition against leading hyphen and
> > combining characters.
> >
> > --
> > Eric A. Hall
> http://www.ehsco.com/
> > Internet Core Protocols
> http://www.oreilly.com/catalog/coreprot/
> >
>
>
>