[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [idn] utf8/legacy versioning
Thanks for your correction. UTF-8 (not utf8) is in that list.
But, UTF-8 is a character encoding form of UCS and does not specify the specific set of supported characters
in the numerous versions of UCS. That is the point where utf-8 and iso8859-1/ksc_5601_1987 differ.
I.e., UTF-8 defines encoding schemes over UCS which has been an open set and will remains as an open set.
UTF-8 , from its definition, cannot have versioing suffices, like "utf-8-3.2" or "utf-8-3.1".
That's why "utf-8" should not be regardsed as a "genuine" charset, IMO.
Correct me if i am wrong at some points. Thanks.
Soobok Lee
----- Original Message -----
From: "Paul Hoffman / IMC" <phoffman@imc.org>
To: "Soobok Lee" <lsb@postel.co.kr>; <idn@ops.ietf.org>
Sent: Monday, June 03, 2002 10:36 AM
Subject: Re: [idn] utf8/legacy versioning
> At 10:11 AM +0900 6/3/02, Soobok Lee wrote:
> >Moreover, It does not have "utf8" charset entry, because "utf8" is
> >just one of the encodings of the Universal
> >Character Set, not an independent charset plus encoding like "ks_c_5601-1987".
>
> Both your statement and your reasoning are wrong. UTF-8 has been a
> registered charset since RFC 2279 was issued.
>
> --Paul Hoffman, Director
> --Internet Mail Consortium