[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] Re: Just send UTF-8 with nameprep




> > Using these IDNs to answer legacy queries will absolutely
> > definitely cause some kind of unintended consequence, and we
> > will get a lot of heat for the failure.
> 
> The chance that when a legacy resolver passes through a local encoding
> and this matches with a UTF-8 entry is extremely low. The reason for
> this is that UTF-8 has a very peculiar byte pattern.

I agree that it is low probability. However, for charsets like MacRoman
and Windows-1252 that reuse the C1 codepoint space, there is a measurable
probability of it happening.

IMO, if we are to take a risk, we should simply say that any label which
contains eight-bit characters must be processed through the IDN path, and
that any eight-bit queries which arrive via the legacy path are to be
rejected (we may also want a new RCODE to signify the exact failure).
Otherwise we end up in a situation where freak labels have to be compared
to the UTF-8 profile and then processed or rejected based on match level.

-- 
Eric A. Hall                                        http://www.ehsco.com/
Internet Core Protocols          http://www.oreilly.com/catalog/coreprot/