[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Just send UTF-8 with nameprep (was: RE: [idn] Reality Check)



> As I have remarked earlier, we don't need this WG to allow
> people to create random domain name labels. ACE is not
> 'domain names just as they are'. It's a clear layer violation.

That's simply BS.  ACE is an *encoding" just like UTF-8 is an *encoding*.

> You can't do any serious kind of processing (e.g. sorting,
> searching,...) on ACEd domain names.

You can do comparison of names with ACE.  Sorted ACE names are almost
as useful to humans as sorted names in UTF-8, which is to say - not very.
It's true that you cannot search for substrings of an ACE name as
easily as you can do so in UTF-8, but this hardly compels use of UTF-8
where it will break existing applications.

> >Your proposal it to break all current applications by requiring the
> >application to both do UTF-8 (which many don't do)
> 
> There are mainly three cases:
> - Those that just pass data along. Most implementations nowadays are
>    8-bit transparent, for most others, it's easy to clean them.
> - Those that do some actual character processing, but not display.
>    These will require work, but considerably less with UTF-8 than
>    with ACE.
> - Those that do display. Again, less work with UTF-8
>    than with ACE. And more work already done.

For the latter two, the burden in supporting ACE in addition to supporting
UTF-8 is small.  The application has to call API routines in either case -
the differences between ACE and UTF-8 are mostly hidden under those routines.

The other thing that you're leaving out is the additional complexity that
would be required to add UTF-8 negotiation to existing applications that
could not be expected to deal with UTF-8 names.

> >and to do nameprep (which those that currently do UTF-8 don't do).
> 
> Again, nameprep only is easier than nameprep+ACE.

It requires more code, but the application writer doesn't really notice
the difference.

Keith