[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[idn] Test cases for AMC-Z
Greetings again. Some of you have asked offline for test cases for
AMC-Z. The following URL is an 8.5 MB file that expands to 34 MB. It
contains all 641883 unique IDN names from a recent snapshot of the
VGRS multilingual test bed. The contents of the file looks like:
. . .
U+D68C U+D654|dz--9x8b1d
U+D68C U+D654 U+0069|dz--i-9m3grf
U+D68C U+D68C|dz--vz8ba
U+D68C U+0031|dz--1-hq3g
. . .
All entries in the file were prepared first by converting the VGRS
zone files from RACE to U+ format, then by converting the U+ into
AMC-Z. The latter conversion was done twice, first with Adam's C code
from the draft, then using my Perl code which is based on his
pseudocode. The two outputs were identical, which is a good (but not
perfect) sign.
It would be great if other folks who have written their own AMC-Z
implementations would test against the file. (There is obviously no
need to do so if you are just using Adam's code.) Further, if other
folks have collections of example host names, please post them in a
similar format so we can all test against them.
The file is at <http://www.imc.org/nameprep/mltbd-amcz.gz>. Again,
you only want to download this if you can actually test against it
(bandwidth isn't free, y'know...). Please send any results to the
mailing list.
--Paul Hoffman, Director
--Internet Mail Consortium