[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] utf8/legacy versioning



I'm not stepping into the issue of how to define the
charsets, but to answer Soobok Lee's specific inquiry:

> Everyone knows UCS(ISo10646) and Unicode (UTC) changes and expands over time.
> Does  UCS (ISO10646) versioning  strictly follow  Unicode Versioning (UTC) ?

The answer to that question is no. Unicode versioning is specified exactly
on the Unicode website, by references to specific sets of contributory
data files.

ISO/IEC 10646 expansion is by the publication of amendments, once they
have gone through international ballotting.

The two encoding committees, UTC and SC2/WG2, work hard to have significant
synchronization points, where the two standards can be matched up with
identical repertoires, but in between major versions it can get a little
tricky to match "versions" of the two standards.

The recent major synchronization point was Unicode 3.0. Here is the
information about that and subsequent synchronizations:

Unicode Version                   10646 Publication

Unicode 3.0                       ISO/IEC 10646-1:2000 (Second Edition)

=======================================================================

Unicode 3.1                       ISO/IEC 10646-1:2000 (Second Edition)
                                  + 2 characters from Amendment 1 to Part 1
                                  ISO/IEC 10646-2:2001

=======================================================================

Unicode 3.2                       ISO/IEC 10646-1:2000 (Second Edition)
                                  Amendment 1 to 10646-1:2000 (published 2002)
                                  ISO/IEC 10646-2:2001

=======================================================================

And prospectively, we anticipate the following synchronization in
repertoires sometime late next year:

Unicode 4.0                       ISO/IEC 10646-1:2000 (Second Edition)
                                  Amendment 1 to 10646-1:2000 (published 2002)
                                  Amendment 2 to 10646-1:2000 (to appear 2003?)
                                  ISO/IEC 10646-2:2001
                                  Amendment 1 to 10646-1:2001 (to appear 2003?)

But this situation is somewhat muddied by the fact that SC2/WG2 is also
starting on a project to merge Part 1 and Part 2 of 10646 into a single
standard, in which case it is conceivable that in 2003 or 2004 the coalesced
10646 will be reissued, presumably as ISO/IEC 10646:2003 (or 2004) (Third Edition),
rolling in the amendments. If so, we would return to a relatively simpler
publication situation, as for Unicode 3.0.

--Ken

> Then, Why can't we see "utf8-3.1" or "utf8-3.2" for Unicode 3.1 and 3.2 recpectively?
>