[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[idn] 1st stringprep issue: not answered and ignored
Is the following issue addressed or mentioned in new stringprep/nameprep draft?
<I dot above> and <I><dot above> represent the same character. that is, they are NFC-equivalent.
But, UAX21 (casefolding in stringprep) maps the latter one errornously into <i><dot above>
(very different than the correct one <i>).
In lowercaseing, <I><dot above> should be converted as a whole, instead of by char-by-char basis.
Stringprep should have contained warnings about this or included new revised casefolding operatins/tables,
but it didn't . the authors and co-chairs have ignored this repeated warnings
without any responses in this list. This is not a theoretical issue,
but a practical issue because the sequences are used by modern east-europeans.
Therefore,stringprep(NFC(x)) == stringprep(x) is not guaranteed. This will cause unnoticeable failures.
It's ridiculous that repeated warnings are ignored/unanswered and the drafts went to IESG.
This will result in downplay on overall IETF processes and its reputations.
Soobok Lee
----- Original Message -----
From: "Soobok Lee" <lsb@postel.co.kr>
To: <idn@ops.ietf.org>
Sent: Tuesday, February 12, 2002 1:39 AM
Subject: [idn] IDNA comment 1 : applications' own normalization vs stringprep
>
> In the IDNA draft version6, section 6.1, paragraph 4 and 5,
>
> "In protocols and document formats that define how to handle
> specification or negotiation of charsets, labels can be encoded in any
> charset allowed by the protocol or document format. If a protocol or
> document format only allows one charset, the labels MUST be given in
> that charset."
>
> "In any place where a protocol or document format allows transmission of
> the characters in internationalized labels, internationalized labels
> SHOULD be transmitted using whatever character encoding and escape
> mechanism that the protocol or document format uses at that place."
>
> Here,we need more security warnings in this section regarding to applications'
> own normalizations that may collided with STRINGPREP's NFC/NFKC.
>
> Like some XML and HTML standards, there may be applications that performs
> NFC,NFKC or other normalizations on their data/text contents that
> may contain IDN labels that will be stringprepped and the ACEed later time.
>
> But, even in the case NFC, stringprep(NFC(x)) == stringprep(x) is not always
> guaranteed, especially in the case of <I dot above> and <I><dot above> ( + <acute>).
> That will cause silent failures in applications.
>
> Current premature UAX15 used in stringprep and other compensating or superior normalizations that may
> be from the same UTC or other organizations may collide to each other in
> a sequence of normalizatiom processes eventually ending in stringprep
> in many mission critical applications.
>
> We need more researches and inspections for the possiblities of normalization vs normalization
> conflicts and include the warning or recommendations in the IDNA specficiations.
>
> Soobok Lee
>
>
>
>
>