[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [idn] The layers and character handling in DNS
- To: idn@ops.ietf.org
- Subject: Re: [idn] The layers and character handling in DNS
- From: Paul Hoffman / IMC <phoffman@imc.org>
- Date: Thu, 15 Feb 2001 11:02:00 -0800
- Delivery-date: Thu, 15 Feb 2001 11:03:43 -0800
- Envelope-to: idn-data@psg.com
At 9:57 PM +0900 2/15/01, ±Λ_ƒΚΊΖ-GIM_Gyeongseog wrote:
>On Thu, 15 Feb 2001, David C Lawrence wrote:
>
>> Dan writes:
>> > If I look at how I would like things to work, if could be like this:
>> >
>> > user enters name in application using LOCAL character set.
>> > ->
>> > application calls routnes like: gethostbyname
>> > ->
>> > gethostbyname converts form LOCAL character set to
>> > normalised (not namepreped) UTF-8 (or other format).
>> > it may reject the query if the name contains forbidden characters.
>> > and sends the query to DNS server.
>>
>> How does gethostbyname() know for sure what the character set of its
>> argument is?
>
>i guess that gethostbyname need be extended so that one additional
>param indicates the local char set.
And, with that information, gethostbyname then needs to translate
completely correctly and authoritatively from that charset to UTF8
(or at least to ISO/IEC 10646 code points). For this scheme to work,
we need to have authoritative and non-changing mapping tables for
every local charset. Even is this was possible (and it's not; see
below), the software would be huge.
Those of us who have just the easy Latin characters in our names keep
forgetting that many of the local charsets in the rest of the world
don't have the mapping tables needed, and won't have them any time
soon. There is wide dispute about what the mappings should be, and
even the organizations that have spent the time doing mappings
*still* are changing them. Part of this is due to the way that
ISO/IEC 10646 is designed, so it is likely that this issue will not
be resolved for decades, if ever.
Please, folks: this is a dead end. If you want to keep going down
this path, maybe preface each message you write with "In an idealized
universe where there was authoritative and non-changing mapping
tables for every local charset, ...".
--Paul Hoffman, Director
--Internet Mail Consortium