[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: FW: RFC2279 to Standard



Paf writes:

> The current suggestion is that we say we have only talked 
> about values up to 10ffff, but implementations should be
> prepared on higher values.
> 
> This in the security consideration section.
> 
But Ned wrote:

>   Actually, I think it is preferable to treat them as
>   protocol errors, due to the need for UTF-16 compatibility.
>

So, is that in conflict then?

Bert
> 
> On lördag, jan 11, 2003, at 00:25 Europe/Stockholm, Wijnen, 
> Bert (Bert) 
> wrote:
> 
> > Form Randy Presuhn (I think he is the best skilled from
> > the SNMP community w.r.t. UTF-8 and related topics)
> >
> > So the possible concern would/might be:
> >
> >   It DOES however mean that some older SNMP entities will
> >   accept (and play back) values that would legitimately
> >   be rejected by newer ones.
> >
> > So the statement by Ned that we should return an error:
> >   Actually, I think it is preferable to treat them as
> >   protocol errors, due to the need for UTF-16 compatibility.
> >
> > Does that mean that values above 10ffff should be rejected?
> > So if my older agents don't do that, is that a problem or
> > a non-compliance?
> >
> > Bert
> > -----Original Message-----
> > From: Presuhn, Randy [mailto:Randy_Presuhn@bmc.com]
> > Sent: vrijdag 10 januari 2003 19:42
> > To: 'Wijnen, Bert (Bert)'
> > Subject: RE: RFC2279 to Standard
> >
> >
> > Hi -
> >
> > I'm not sure why they're talking about UTF-16.  It's bogus,
> > and the surrogates it uses are not used in the UTF-8 transform.
> >
> > ISO 10646 and Unicode agreed to go from the full 31-bit range
> > down to the 0..10ffff range.  I think I raised the question
> > (along with selecting one of the normalization forms) on the
> > mailing list.  As I recall, there was only one response, and
> > the respondant didn't want to open that can of worms.
> >
> > RFC 3411 says:
> >        Since additional code points are added by
> >        amendments to the 10646 standard from time
> >        to time, implementations must be prepared to
> >        encounter any code point from 0x00000000 to
> >        0x7fffffff.  Byte sequences that do not
> >        correspond to the valid UTF-8 encoding of a
> >        code point or are outside this range are
> >        prohibited.
> >
> > The agreements on Unicode/ISO 10646 just say that those code
> > points in the 110000..7fffffff range will never be added.  If
> > the definition of "valid UTF-8 encoding" has been changed to
> > encompass a smaller range, I believe the the statement "Byte
> > sequences that do not correspond to the valid UTF-8 encoding
> > of a code point or are outside this range are prohibited"
> > remains correct, and gives an implementation grounds to reject
> > code points outside the 0..10ffff range.  It DOES however mean
> > that some older agents will accept (and play back) values that
> > would legitimately be rejected by newer ones.
> >
> >  ------------------------------------------------------
> >  Randy Presuhn          BMC Software, Inc.  1-3141
> >  randy_presuhn@bmc.com  2141 North First Street
> >  Tel: +1 408 546-1006   San José, California 95131  USA
> >  ------------------------------------------------------
> >  My opinions and BMC's are independent variables.
> >  ------------------------------------------------------
> >
>