[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Status of Operational issues with Tiny Fragments in IPv6

To: Fred Baker <fred@cisco.com>
Subject: Re: Status of Operational issues with Tiny Fragments in IPv6
From: Stig Venaas <stig.venaas@uninett.no>
Date: Mon, 29 May 2006 15:53:34 +0200
Cc: Elwyn Davies <elwynd@dial.pipex.com>, Vishwas Manral <vishwas.ietf@gmail.com>, v6ops@ops.ietf.org
In-reply-to: <F862DBB2-1923-46DF-B81E-22C5B5BF9293@cisco.com>
References: <77ead0ec0605252207v686b8355n493f47aa4d8de38c@mail.gmail.com> <4476BA9F.1040008@dial.pipex.com> <F862DBB2-1923-46DF-B81E-22C5B5BF9293@cisco.com>
User-agent: Mozilla Thunderbird 1.0.8 (X11/20060428)

Fred Baker wrote:

On May 26, 2006, at 1:21 AM, Elwyn Davies wrote:
I would suggest that rather than making this into a separate draft wecan look at improving the text in the security overview (ifnecessary) given that we are probably going to have to do *another*round on this one.
That seems rational to me.

Me too. As Pekka said any normative changes to the IPv6 protocol wouldprobably have to go elsewhere anyway.

A quick comment:

- no overlapping fragments and

Overlapping fragments is the main problem IMO. I would have liked thisto be banned so that implementations can just drop such packets. Ofcourse some hosts might still accept them, and e.g. a firewall wouldneed to keep some state to detect overlap.

- non-last fragments to be close to the guaranteed minimum MTU.
I think the latter point wants to read "non-last fragments close to thePATH MTU, and therefore at least as large as the largest fragment sizeless than or equal in size to the minimum MTU."

The draft doesn't talk about overlap, but talks about this latter point.Unless overlap is forbidden, a minimum size alone is not sufficient. Youmight have an initial fragment containing most of the datagram (ofsufficient size), and then have a second overlapping one that overwritesparts of the header.

Whatever minimum size we set, one may add so many extension headers thate.g. the transport header does not go into the first fragment anyway.

If a minimum fragment size is specified, then I think it should be abouthalf the minimum MTU. Rather than using the MTU for all fragments butthe last, it might make sense to distribute the data evenly so that allfragments are roughly the same size. See below.

There is a discussion of fragmentation and reassembly in RFC 1812section 4.2.2.7 that may be useful to reference, or at least learnlessons from. In part, this results from the behavior of NIC cards inthe late 1980's that couldn't reliably receive datagrams back to backfor very long due to chip or memory issues, and partly this is is dueto brain-dead behaviors in early end-station OS's. One issue was thatmany systems sent packets at the minimum MTU size (then 576 bytes,derived from the memory structure of the Fuzzball and BBN Routers)rather than at the path MTU, which meant that there were greateropportunities for per-datagram errors because there were more of them.There were also various strategies on how to fragment - should thefirst datagram be the smallest, the last, or should the fragmentingsystem try to make them all approximately the same size? It makes somespecific recommendations:

4.2.2.7 Fragmentation: RFC 791 Section 3.2

   Fragmentation, as described in [INTERNET:1], MUST be supported by a
   router.

   When a router fragments an IP datagram, it SHOULD minimize the  number
   of fragments.  When a router fragments an IP datagram, it SHOULD  send
   the fragments in order.  A fragmentation method that may  generate one
   IP fragment that is significantly smaller than the other MAY cause
   the first IP fragment to be the smaller one.

   DISCUSSION
      There are several fragmentation techniques in common use in the
      Internet.  One involves splitting the IP datagram into IP
      fragments with the first being MTU sized, and the others being
      approximately the same size, smaller than the MTU.  The  reason for
      this is twofold.  The first IP fragment in the sequence will be
      the effective MTU of the current path between the hosts, and the
      following IP fragments are sized to minimize the further
      fragmentation of the IP datagram.  Another technique is to split
      the IP datagram into MTU sized IP fragments, with the last
      fragment being the only one smaller, as described in  [INTERNET:1].

      A common trick used by some implementations of TCP/IP is to
      fragment an IP datagram into IP fragments that are no larger  than
      576 bytes when the IP datagram is to travel through a router.
      This is intended to allow the resulting IP fragments to pass the
      rest of the path without further fragmentation.  This would,
      though, create more of a load on the destination host, since it
      would have a larger number of IP fragments to reassemble into  one
      IP datagram.  It would also not be efficient on networks  where the
      MTU only changes once and stays much larger than 576 bytes.
      Examples include LAN networks such as an IEEE 802.5 network  with a
      MTU of 2048 or an Ethernet network with an MTU of 1500).

      One other fragmentation technique discussed was splitting the IP
      datagram into approximately equal sized IP fragments, with the
      size less than or equal to the next hop network's MTU.  This is
      intended to minimize the number of fragments that would result
      from additional fragmentation further down the path, and assure
      equal delay for each fragment.

This latter point is what I'm refering to above.

Also note that as mentioned above, some implementations send fragmentsout of order (e.g. Linux has been known to do this). The reason is thatyou don't know the total size of the datagram until you receive the lastfragment. Receiving the last fragment first means that you can allocatethe right amount of memory immediately.

Stig

      Routers SHOULD generate the least possible number of IP  fragments.

      Work with slow machines leads us to believe that if it is
      necessary to fragment messages, sending the small IP fragment
      first maximizes the chance of a host with a slow interface of
      receiving all the fragments.
I think the NIC card issue (where should the smallest fragment be?) ishistorical and not to be worried about. But the matter of generating(by whatever algorithm) the least number of fragments that canrepresent a transport-level message is, I think, important.

Follow-Ups:
- Re: Status of Operational issues with Tiny Fragments in IPv6
  - From: Fred Baker <fred@cisco.com>

References:
- Status of Operational issues with Tiny Fragments in IPv6
  - From: "Vishwas Manral" <vishwas.ietf@gmail.com>
- Re: Status of Operational issues with Tiny Fragments in IPv6
  - From: Elwyn Davies <elwynd@dial.pipex.com>
- Re: Status of Operational issues with Tiny Fragments in IPv6
  - From: Fred Baker <fred@cisco.com>

Prev by Date: RE: Status of Operational issues with Tiny Fragments in IPv6
Next by Date: Re: Status of Operational issues with Tiny Fragments in IPv6
Previous by thread: Re: Status of Operational issues with Tiny Fragments in IPv6
Next by thread: Re: Status of Operational issues with Tiny Fragments in IPv6
Index(es):
- Date
- Thread