[idn] Arabic and REORDERING
This posting explains how REORDERING can help even countries that use non-CJK scripts.
This is an Arabic ML.com label of length 34:
U+0627 U+0644 U+0625 U+0645 U+0627 U+0631 U+0627 U+062A
U+0627 U+0644 U+062F U+0648 U+0644 U+064A U+0629 U+0644
U+0644 U+0627 U+0633 U+062A U+0634 U+0627 U+0631 U+0627
U+062A U+0627 U+0644 U+0642 U+0627 U+0646 U+0648 U+0646
U+064A U+0629
AMC-Z : kgBDBAAAAAAARFFDC8E4AI9AV31A0AGBBAJH6CA8BR3CS ( 45 )
REORDERING+AMC-Z: mgBKMSFCEFDBXAFHEQHNKCECCAFHDBBIDBBC ( 36 )
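As a quick sanity check on the figures above, the following sketch parses the code point list, confirms that the label has 34 code points, and computes the expansion ratio (encoded length divided by code point count) for both encodings. The encoded lengths 45 and 36 are taken as reported above; the helper name parse_code_points is only illustrative, and the sketch does not implement AMC-Z or REORDERING.

```python
# First sample label, copied from the code point list above.
SAMPLE = """
U+0627 U+0644 U+0625 U+0645 U+0627 U+0631 U+0627 U+062A
U+0627 U+0644 U+062F U+0648 U+0644 U+064A U+0629 U+0644
U+0644 U+0627 U+0633 U+062A U+0634 U+0627 U+0631 U+0627
U+062A U+0627 U+0644 U+0642 U+0627 U+0646 U+0648 U+0646
U+064A U+0629
"""

def parse_code_points(text):
    """Turn 'U+0627 U+0644 ...' notation into a Python string."""
    return "".join(chr(int(token[2:], 16)) for token in text.split())

label = parse_code_points(SAMPLE)

# Encoded lengths as reported in the post (not recomputed here).
AMC_Z_LEN = 45
REORDERED_LEN = 36

ratio_bare = round(AMC_Z_LEN / len(label), 3)
ratio_reordered = round(REORDERED_LEN / len(label), 3)

print("code points:", len(label))
print("bare AMC-Z ratio:", ratio_bare)
print("REORDERING+AMC-Z ratio:", ratio_reordered)
```

For this label, bare AMC-Z needs about 1.32 output characters per code point, while REORDERING+AMC-Z needs about 1.06.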
This is another Arabic label, of length 55, constructed by
concatenating two Arabic labels of length 34 and 21:
U+0627 U+0644 U+0625 U+0645 U+0627 U+0631 U+0627 U+062A
U+0627 U+0644 U+062F U+0648 U+0644 U+064A U+0629 U+0644
U+0644 U+0627 U+0633 U+062A U+0634 U+0627 U+0631 U+0627
U+062A U+0627 U+0644 U+0642 U+0627 U+0646 U+0648 U+0646
U+064A U+0629
U+0636 U+0645 U+064A U+0631 U+0627 U+0644 U+0645 U+062A
U+0643 U+0644 U+0645 U+0627 U+0644 U+0645 U+0636 U+0627
U+0641 U+0625 U+0644 U+064A U+0647
AMC-Z : kgBAEBAAAAAAAAAAWFJDCE8GuBIF5B1A5DE43AuA1BHGBBAJGCBEEZDDC7AA2EMR1GsADQ ( 69 > 59 = 63 - 4 : error!)
REORDERING+AMC-Z: lgBABCWPKSHQLCKLDBD7AAMHLOACAHHIQKDJCECCAFKDBEDDBBIDBBCJGD ( 58 )
[ 58 / 53 = 1.094, close to the ratio for Latin labels in bare AMC-Z (1.09):
the compensation succeeded! ]
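The length check behind the "error!" note can be sketched as follows. The 63 is the DNS limit on label length in octets; the 4 is taken from the "63 - 4" above and is assumed here to be space reserved for an ACE prefix. The function name fits is only illustrative.

```python
MAX_DNS_LABEL = 63  # DNS label limit, in octets
RESERVED = 4        # from "63 - 4" in the post; assumed to be an ACE prefix
BUDGET = MAX_DNS_LABEL - RESERVED

def fits(ace_length, budget=BUDGET):
    """Return True if an encoded label of this length fits the budget."""
    return ace_length <= budget

# Encoded lengths for the 55 code point label, as reported in the post.
results = {"AMC-Z": 69, "REORDERING+AMC-Z": 58}

for name, length in results.items():
    if fits(length):
        print(f"{name}: {length} fits within {BUDGET}")
    else:
        print(f"{name}: {length} exceeds {BUDGET} by {length - BUDGET}")
```

Under these assumptions the bare AMC-Z result is 10 characters over budget, while the REORDERING+AMC-Z result fits with one character to spare.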
Yet another Arabic label, of length 47:
U+0627 U+0644 U+0625 U+0645 U+0627 U+0631 U+0627 U+062A
U+0627 U+0644 U+062F U+0648 U+0644 U+064A U+0629 U+0644
U+0644 U+0627 U+0633 U+062A U+0634 U+0627 U+0631 U+0627
U+062A U+0627 U+0644 U+0642 U+0627 U+0646 U+0648 U+0646
U+064A U+0629
U+0636 U+0645 U+064A U+0631 U+0627 U+0644 U+0645 U+062A
U+0643 U+0644 U+0645 U+0627 U+0644
AMC-Z : kgBDBAAAAAAAAATFHDCE8FrBIF1BzA1DyT7ADGBBAJGCBEZDD4AA2ER7ESD ( 58 )
REORDERING+AMC-Z: lgBBPQQIHCGHDBDZAIHHOACDHILKDGECCAFKDBBDBBIDBBCJG ( 49 )
These experiments on two Arabic samples show that
the maximum length of Arabic labels for bare AMC-Z
is about 47, while that for REORDERING+AMC-Z is about 55, give or take 1 to 2 characters.
With REORDERING, Arabic-using people will have fewer complaints about
the disadvantage imposed on them by Arabic ACE labels.
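The maximum-length estimates of about 47 and 55 can be reproduced roughly from the reported figures. The sketch below tabulates the expansion ratio for each of the three test labels and then extrapolates linearly from the longest label that still fits in 59 characters. Linear scaling is an assumption made only for illustration, since the real ratio depends on the content of the label, and max_code_points is a hypothetical helper name.

```python
BUDGET = 63 - 4  # 59 characters, as in the post

# (code points, bare AMC-Z length, REORDERING+AMC-Z length), as reported.
SAMPLES = [(34, 45, 36), (55, 69, 58), (47, 58, 49)]

def max_code_points(n_in, n_out, budget=BUDGET):
    """Largest input length whose linearly scaled output stays within budget."""
    return (budget * n_in) // n_out

for n, bare, reordered in SAMPLES:
    print(n, round(bare / n, 3), round(reordered / n, 3))

# Extrapolate from the longest sample that fits within the budget.
bare_max = max_code_points(47, 58)       # bare AMC-Z: 47 code points -> 58
reordered_max = max_code_points(55, 58)  # REORDERING+AMC-Z: 55 -> 58

print("bare AMC-Z max:", bare_max)
print("REORDERING+AMC-Z max:", reordered_max)
```

With these inputs the extrapolation gives 47 code points for bare AMC-Z and 55 for REORDERING+AMC-Z, in line with the figures quoted above.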
Soobok Lee