[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] What's wrong with skwan-utf8?



At 19.33 +0900 01-01-03, Martin J. Duerst wrote:
>>Also, there is a question whether UTF-8 is really what we should 
>>use. UTF-8 is not much more different than any ACE encoding because 
>>it is just yet another encoding of the 32 bit characters which is 
>>what many people feel we should start to use instead of the 7-bit 
>>which we use today. So, the transision is not from 7bit to 8bit, it 
>>is from 7bit to 32bit clean transport, and that is a completely 
>>different issue.
>
>As far as I know, the Internet, even for IPv6, is an 8-bit protocol.
>Does this imply that this should be changed? If not, wouldn't it
>just create another mismatch?

It all depends with what you mean by "is an 8-bit protocol". If you 
read RFC 821, you see that it is definitly not 8bit clean, but 
instead only handles characters in US-ASCII. Yes, sent in 8-bit 
octets in many cases, but still.

The most important issues are though:

   - Other protocol elements which are visible to users in the
     today US-ASCII only protocols.
   - How to do nameprep correctly (and implemented in a way which
     is interoperable).

Because of these things, I none of the operating systems etc which 
today use UTF-8 handle this correctly. All of them have to be changed.

     paf