[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] case preservation



At 16:13 01/10/18 +0900, Soobok Lee wrote:

> > > There is indeed a non-zero (but very, very small) probability
> > > for such cases. But if domain names are written in lower case
> > > the way they mostly have been up to now, a word in a language
> > > written in Cyrillic looking the same as a word in a language
> > > written in Latin would be about as rare as a four-leaf clover.
> > >
>
>No. mcuh more  frequent than you guess.
>
>Cyrillic small 'a' 'e' 'o' 'c' 'p' 'x' 'y'  'i' 'j' 's' have the exactly 
>same look with latin small ones.

Yes. But 'i' 'j' 's' are not actually used in most languages
that are written in Cyrillic. And in all languages, most of
the possible letter combinations are not actually used. And
the longer a word is, the more quickly the probabilities
approach zero.


Regards,   Martin.