[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Combining characters (was: Re: [idn] hostname historyhell)
I can see web pages to accomodate all of this, but does
this belong to IDN? What is IDN's duty area from symbol
set perspective? I have listed three layers of symbols.
I hope we can propose an RFC to get label separator and
bidi/vertical or whatever text related issue out of our way
so that we can deal with IDN identifier symbols only.
Liana
On Sun, 25 Nov 2001 22:56:02 +0200 <rosenne@spamcop.net> writes:
> From a bidi perspective, URLs are not structured. A present day all
> ASCII URL will be presented left-to-right, because all the
> non-letters
> contained within it, such as periods and slashes, are neutral and
> the
> directionality of the surrounding letters, i.e. LTR. Trailing digits
> work out correctly too.
>
> When the URL, in the future when it is possible, will be entirely in
> Hebrew, it will display correctly with components going from right
> to
> left.
>
> Mixed language URLs may cause some unexpected results.
>
> See http://www.qsm.co.il/Hebrew/HebrewTest/url.htm
>
> Jony Rosenne
>
> > -----Original Message-----
> > From: owner-idn@ops.ietf.org [mailto:owner-idn@ops.ietf.org]
> > On Behalf Of Eric A. Hall
> > Sent: Sunday, November 25, 2001 7:43 PM
> > To: liana Ye
> > Cc: idn@ops.ietf.org
> > Subject: Re: Combining characters (was: Re: [idn] hostname
> > historyhell)
> >
> >
> >
> > liana Ye wrote:
> >
> > > I'd like to propose a more specific layering of IDN symbols:
> > >
> > > From the top where the user input buffer offers:
> > >
> > > Layer 3: label seperators and label order normalizing
> > >
> > > Layer 2: Bidi label normalizing (or verticle label normalizing)
> >
> > What is the current display order for unstructured and
> > structured data in right-to-left display systems? Does
> > unstructured data ("the files are on
> > server1") typically flow RTL, while URLs and other structured
> > data display as LTR?
> >
> > It seems that these questions are for the structured data
> > groups to handle when they decide on an output presentation
> > mechanism. But if URLs and other structured types will
> > display RTL then that may affect us as well (your Layer 3
> > label ordering in particular).
> >
> > > Layer 1.5: diacritic marks and combining symbol normalizing
> > >
> > > Layer 1: IDN identifier matching or whatever comes out
> > > of [nameprep].
> > >
> > > The reason for Layer 1.5 is that these symbols can be
> > > treated in a similar way with Han characters depending on what
> > > architecture we end up with, and what ACE will be our focus.
> >
> > --
> > Eric A. Hall
> http://www.ehsco.com/
> Internet Core Protocols
> http://www.oreilly.com/catalog/coreprot/
>
>
>