[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] process

To: John C Klensin <klensin@jck.com>
Subject: Re: [idn] process
From: Erik van der Poel <erik@vanderpoel.org>
Date: Fri, 25 Feb 2005 17:07:35 -0800
Cc: idn@ops.ietf.org
In-reply-to: <421FA55B.9000308@vanderpoel.org>
References: <421B8484.3070802@vanderpoel.org> <20050223072837.GA21463~@nicemice.net> <D872CCF059514053ECF8A198@scan.jck.com> <421D8411.9030006@vanderpoel.org> <p06210208be4390618c81@[192.168.0.101]> <421E0D0C.2000309@vanderpoel.org> <p06210202be43c3888991@[192.168.0.101]> <E07CE813AD23B2D95DA0C740@scan.jck.com> <421E30F2.1040408@vanderpoel.org> <0E7F74C71945B923C52211F3@scan.jck.com> <421EA0C9.1010500@vanderpoel.org> <00a401c51af3$7863aae0$030aa8c0@DEWELL> <A574CA1BE87BFDA3C2A1AC0E@scan.jck.com> <421FA55B.9000308@vanderpoel.org>
User-agent: Mozilla Thunderbird 1.0 (X11/20041206)

If we do *not* allow these special local characters that function in the same way as the hyphen in the West, then people in other parts of the world would not only claim that our spec is unfair, they might even ignore it. If we *do* allow this Japanese example, then we have started sliding down a slippery slope that ends with a rather large extension of the LDH rule (for the rest of the world), and then the phishing problem would not be alleviated as much as we might have hoped when we started with just LDH. This would be a lot of work for little gain.

So it's a lose-lose situation.

Sorry, I said that wrong. What I meant was, "Damned if you do, damned if you don't."

However, one avenue that might be worth exploring some more is to check each registry's character table (for those that have one) and see what the Unicode category is for each character. The Japanese Katakana middle dot U+30FB has the category "Pc" which means "punctuation, connector" and LDH's hyphen U+002D has the category "Pd" which means "punctuation, dash".

http://www.unicode.org/Public/UNIDATA/UnicodeData.txt
http://www.unicode.org/Public/UNIDATA/UCD.html#General_Category_Values
http://vanderpoel.org/networking/i/idn.html (see bottom)

If it turns out that all or most of the registries that have tables are using characters with only a small number of Unicode categories, then we may wish to consider moving IDNA to that set of categories (disallowing all others). This would keep the registries happy while keeping *some* of the phishy characters out of DNS.

Erik

Follow-Ups:
- [idn] character tables
  - From: Erik van der Poel <erik@vanderpoel.org>

References:
- [idn] nameprep2 and the slash homograph issue
  - From: Erik van der Poel <erik@vanderpoel.org>
- Re: [idn] nameprep2 and the slash homograph issue
  - From: "Adam M. Costello" <idn.amc+0@nicemice.net.RemoveThisWord>
- Re: [idn] nameprep2 and the slash homograph issue
  - From: John C Klensin <klensin@jck.com>
- [idn] punctuation
  - From: Erik van der Poel <erik@vanderpoel.org>
- Re: [idn] punctuation
  - From: tedd <tedd@sperling.com>
- Re: [idn] punctuation
  - From: Erik van der Poel <erik@vanderpoel.org>
- Re: [idn] punctuation
  - From: tedd <tedd@sperling.com>
- Re: [idn] punctuation
  - From: John C Klensin <klensin@jck.com>
- Re: [idn] punctuation
  - From: Erik van der Poel <erik@vanderpoel.org>
- Re: [idn] punctuation
  - From: John C Klensin <klensin@jck.com>
- [idn] process
  - From: Erik van der Poel <erik@vanderpoel.org>
- Re: [idn] process
  - From: "Doug Ewell" <dewell@adelphia.net>
- Re: [idn] process
  - From: John C Klensin <klensin@jck.com>
- Re: [idn] process
  - From: Erik van der Poel <erik@vanderpoel.org>

Prev by Date: Re: [idn] process
Next by Date: Re: [idn] process
Previous by thread: Re: [idn] process
Next by thread: [idn] character tables
Index(es):
- Date
- Thread