
Re: draft-yergeau-rfc2279bis-02.txt for STANDARD



In message <202CFE2B-24B3-11D7-8818-0003934B2128@cisco.com>, Patrik Fältström
writes:
>
>On Friday, Jan 10, 2003, at 13:03 Europe/Stockholm, Steven M. Bellovin
>wrote:
>
>> From what others have said -- I know little of UTF
>
>Short description (just in case):
>
>When you encode a character from ISO-10646 or Unicode into UTF-8, the 
>length of the output depends on what value the codepoint has. It is a 
>variable length encoding.
>
>The wording I want should say that the encoding is not limited to what is
>described in the document, but can be longer, so people should not guess
>the size of the output without inspecting the input just because the
>document has such a limit (one can calculate the size needed in one pass,
>and then encode in a second).
>

Right, that was what I assumed, and why I worded my suggestion the way 
I did -- that the encoding could produce longer output than 4 bytes, 
and that this is a danger.
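
To make the size concern concrete, here is a minimal sketch of the two-pass
approach Patrik describes: size the output first, then encode. It assumes the
original 1-to-6-byte UTF-8 scheme (values up to 0x7FFFFFFF), not the 4-byte
limit under discussion. The names utf8_length and encode_two_pass are
hypothetical, and the sketch does not reject surrogate code points.

```python
def utf8_length(cp: int) -> int:
    """Bytes needed for one code point, assuming the 1-6 byte scheme."""
    if cp < 0:
        raise ValueError("negative code point")
    if cp < 0x80:
        return 1
    if cp < 0x800:
        return 2
    if cp < 0x10000:
        return 3
    if cp < 0x200000:
        return 4
    if cp < 0x4000000:
        return 5
    if cp < 0x80000000:
        return 6
    raise ValueError("code point too large")


def encode_two_pass(code_points):
    """Encode a list of integer code points; hypothetical helper."""
    # Pass 1: compute the exact output size from the input.
    total = sum(utf8_length(cp) for cp in code_points)
    out = bytearray(total)

    # Pass 2: encode into the preallocated buffer.
    pos = 0
    for cp in code_points:
        n = utf8_length(cp)
        if n == 1:
            out[pos] = cp
        else:
            # Fill continuation bytes (10xxxxxx) from last to first.
            for i in range(n - 1, 0, -1):
                out[pos + i] = 0x80 | (cp & 0x3F)
                cp >>= 6
            # Lead byte: n high bits set, then the remaining value bits.
            out[pos] = ((0xFF << (8 - n)) & 0xFF) | cp
        pos += n
    return bytes(out)
```

A caller that reserves a fixed 4 bytes per character would under-allocate for
any value at or above 0x200000 under this scheme, which is the danger
described above; sizing from the input avoids the guess.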

		--Steve Bellovin, http://www.research.att.com/~smb (me)
		http://www.wilyhacker.com (2nd edition of "Firewalls" book)