Re: [idn] hangul question
I got the following:
UTF16:
\u0032\u0030\u0030\u0032\u0066\u0069\u0066\u0061\uC6D4\uB4DC\uCEF5\uCE74\uB4DC\u002E\u0063\u006F\u006D
RACE: bq--3aadeabqaayaamqamyagsadgabq4nvfu3thplttuwtoa.com
Source: RACE
CharacterEncoding: ISO-8859-1
native: 2002fifa월드컵카드.com
Input: bq--3aadeabqaayaamqamyagsadgabq4nvfu3thplttuwtoa.com > 0xd000
You can check it yourself at:
http://thor.ar.com/servlet/com.ar.idns.RACEServlet
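
For anyone who wants to reproduce the result without the servlet, here is a
minimal sketch of the decoding in Python. It is not the servlet's code. It
assumes that the part after "bq--" is base32 text using the a-z, 2-7 alphabet,
and that a leading 0xD8 octet marks the rest as uncompressed UTF-16 (big
endian), which is what this particular label looks like. The function name
race_label_to_unicode is made up for illustration, and compressed RACE labels
are deliberately not handled.

```python
import base64

RACE_PREFIX = "bq--"

def race_label_to_unicode(label: str) -> str:
    """Decode one uncompressed RACE label (hypothetical helper, sketch only)."""
    if not label.lower().startswith(RACE_PREFIX):
        raise ValueError("not a RACE label")
    body = label[len(RACE_PREFIX):]
    # base64.b32decode needs the input padded with '=' to a multiple of 8 characters.
    padded = body + "=" * (-len(body) % 8)
    raw = base64.b32decode(padded, casefold=True)
    # Assumption: 0xD8 as the first octet means "no compression".
    if raw[0] != 0xD8:
        raise ValueError("compressed RACE labels are not handled in this sketch")
    # The remaining octets are read as big-endian 16-bit code units.
    return raw[1:].decode("utf-16-be")

label = "bq--3aadeabqaayaamqamyagsadgabq4nvfu3thplttuwtoa"
print(race_label_to_unicode(label) + ".com")
```

The leading 0xD8 is stripped before the octets are paired up; after that the
first eight pairs are 0032 0030 0030 0032 0066 0069 0066 0061, i.e. "2002fifa",
and the last five are the Hangul syllables shown in the UTF16 line above.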
>
> There's no valid Korean word or phrase represented by this string; in
> fact this string includes several syllables that are not normally used
> in contemporary Korean language.
>
> However, it seems to me that this is not even a valid UTF-8 encoding,
> because the string has a pattern:
>
> 1. All characters have the 0xd0 MSB.
> 2. The string of LSBs is (without leading 0x prefixes):
>
> 00 32 00 30 00 30 00 32 00 66 00 69 00 66 00 61 b6 d4 b4 d4 ce f5 ce 74
>
> Grouping each pair of adjacent bytes from the beginning, we get:
>
> 0032 0030 0030 0032 0066 0069 0066 0061 b6d4 b4d4 cef5 ce74
>
> I don't have any idea about the last four, but when interpreted as a
> UTF-16 string, the first 8 characters are: "2002FIFA". : )
>
> Hope this helped,
> Eugene
>
> --
> Eugene M. Kim <ab@astralblue.com>
>
> "Is your music unpopular? Make it popular; make music which people
> like, or make people who like your music."
>
>