At 12:28 AM 9/1/2002 -0400, John C Klensin wrote:
I believe that IDNA (and the supporting documents) are... a reasonably well-understood solution to _some_ problem. I'm not sure I know what that problem is, who cares about it, and whether it is important enough to justify changes to the way the DNS works and is interpreted.We should all be *very* worried that this major IETF effort has gone on for nearly 3 years, yet such a question can be seriously raised. (Or rather, that such a question legitimately reflects a concern among serious participants, as it clearly does.)
I know that the IDNA approach, with adequate definition of the characters to be used, will permit internationalization of low-level identifiers.That is excellent, because that is exactly what it is supposed to do. After all that is what domain names are, albeit identifiers with some mnemonic qualities.
I really hate to ask this, but I am not even sure what you mean "character matching forIf that is all we care about, then there is a case to be made that it is actually more mechanism than is needed: if the same constrained processes are to be used to access a name that cause it to be created, than the subtle issues of character matching for different codepoints may not be relevant.
By contrast, it is clear that the WG has not solved (Dave would, I think, say thatYou are prescient. Cut and paste is a *user interface* issue that already exists, far beyond domain names. And the world already has mechanisms for dealing with it.
it has no scope or charter to even examine) the set of questions associated with accurate transcription of DNS names from other environments and media.
It is equally clear that many people are focused on that problem and won't consider any "DNS internationalization" problem to be solved unless it has some adequate resolution.This is a good example of the reason "DNS Internationalization" is not a useful term. It is also a good example of the reason the target usage scenarios need to be extremely explicit, as I have tried to make them, above.
One working definition of internationalization is that the encoding/expression is "understood" by speakers of all languages.This highlights why "localization" is probably a much more useful term.
Consequently, someone sending a letter from the US to a recipient in Vietnam can write the destination address in Vietnamese and the US postal service need only understand the characters "VIETNAM" at the bottom of the destination address.IDNA accomplishes this combination of global "interpretation routing". In fact, it is inherent in domain names, and IDNA strings are valid domain names within the current DNS.
multilingual domain names may not necessarily contribute to universal ability to use the resulting strings because it may be difficult to impossible to render or enter arbitrary character sets at the user interface to a local service.Has MIME's ability to support non-ASCII characters been helpful to the overall utility of the Internet? I claim it has, even though I cannot read any of those other characters. (I am, after all, a typical American....)
We have collectively probably created some confusion for ourselves by using the term "internationalized domain names" to cover both concepts.You are exactly correct. The problematic nature of the term "internationalization" has been discussed before. I would have wished for different terminology, too.
It strikes me that the IDNA documents are more aimed at localization/multilingualization than internationalization, using the "definition" in the first paragraph above.
Concerns about how cut/paste will work are germane to the discussion about the utility of IDNs because such actions may be the ONLY way in which someone may be able to enter special character strings into text intended to represent an IDN.The technical issues about multi-data-type cut-and-paste are beyond the competence of this community.
I usually end up cutting and pasting the characters. This works because the text of email is permitted to be pretty general in its encoding. I don't know how that would work out if I were dealing with non-Latin character sets.And you do not know that it will NOT work.
One of the important questions that I sense is being asked in the discussion of IDNA is just how applications that encounter these encoded objects/strings should handle them,Is there some reason that the IETF should pursue this matter any more deeply than it has done for MIME?