[RRG] Comparing BGP with map-encap schemes

To: Routing Research Group <rrg@psg.com>
Subject: [RRG] Comparing BGP with map-encap schemes
From: Robin Whittle <rw@firstpr.com.au>
Date: Thu, 10 Jan 2008 18:00:47 +1100
Organization: First Principles
User-agent: Thunderbird 2.0.0.9 (Windows/20071031)

Short version: The functions and communication needs of the various
               map-encap schemes vary enormously from each other and
               from those of the BGP system.  So it is not very
               helpful to try to compare their efficiency directly
               with BGP's efficiency.

There has been discussion of whether the techniques used by a map-encap (ITR-ETR) scheme such as LISP-ALT are more efficient than using BGP to convey a similar amount of information across the Net.

BGP couldn't be used for LISP's purposes unless it was required that all ITRs and ETRs be BGP routers, or be connected very closely to them - which I think would be an unreasonable restriction.

Even if BGP could be used, I think the comparison is not very helpful, because the map-encap scheme is doing something very different from the BGP system. So the information which needs to be transacted is of a very different nature and quantity.

The BGP system provides connectivity between ISPs and PI end-user networks. It does a good job of this, and the idea is that this will continue, while the map-encap scheme provides a new type of address space (a subset of current addresses) which is suitable for multi-homed end-user networks - without directly involving the BGP system.

The BGP network needs to send a flurry of information some distance - perhaps close or perhaps a long way away - every time a router is connected or disconnected, typically when one link goes up or down, and always when a prefix is advertised or withdrawn. Some BGP messages are vital to connectivity. However, the effect of most of them is simply to enable better (or occasionally worse) paths for packets than what would have occurred without the message.

Some features of the BGP system are:

1 - There are huge peaks in message volume whenever a link/router
    up/down event affects a large number of prefixes.

2 - The routers' CPUs have a very complex job handling incoming
    BGP messages, deciding how best to forward packets addressed
    to each of 250k+ prefixes and producing outgoing messages.  They
    also have to hold the outgoing queue of messages to each peer
    according to the rate the peer can accept them.  They apply
    complex local policy while doing all this.

3 - The convergence to the final optimal (or as optimal as BGP
    can achieve) state often involves a large number of messages as
    various routers adjust themselves to different forwarding
    arrangements based on the messages received from peers.  The
    whole BGP system works via the effect of information rippling
    from one router to the next, where each router makes decisions
    about the information and so potentially transforms it, or does
    not pass it on.

4 - The load on routers and the convergence time tend to get worse
    with the growth in advertised prefixes - which is why we are
    here.


There are 3 types of map-encap scheme:

  SP-C  Slow Push, Complex ITR with TE.

     APT
     LISP-NERD


  GQ-C  Global Query network, Complex ITR with TE.

     LISP-CONS
     LISP-ALT
     TRRP


  FP-S  Fast Push, Simple ITR with no specific TE functions.
        Also supports a powerful new approach to mobility.

     Ivip

The mapping data of SP-C and GQ-C schemes is basically the same. For each micronet (a prefix which has all the same mapping information, and serves one end-user network, though each such network might have multiple micronets), there is:

   Description of the micronet by prefix and prefix length -
   micronets in these schemes are on binary boundaries.

   The addresses of at least one but typically two or more (for
   multihoming) ETRs.

   Some kind of priority information, regarding TE and/or
   selecting one ETR ahead of the other when the ITR finds two or
   more are reachable.
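
To make the shape of this data concrete, here is a minimal sketch of one such mapping record and of the choice an ITR has to make. The class and function names are hypothetical, the convention that a lower priority number means "preferred" is an assumption, and the set of reachable ETRs is simply handed in, since how reachability is determined differs between the schemes.

```python
from dataclasses import dataclass
from ipaddress import IPv4Address, IPv4Network, ip_address, ip_network
from typing import Optional, Set, Tuple


@dataclass(frozen=True)
class EtrEntry:
    address: IPv4Address
    priority: int  # assumption: lower value means more preferred


@dataclass(frozen=True)
class Mapping:
    micronet: IPv4Network          # prefix on a binary boundary
    etrs: Tuple[EtrEntry, ...]     # one or more ETRs (two or more for multihoming)


def select_etr(mapping: Mapping, reachable: Set[IPv4Address]) -> Optional[IPv4Address]:
    """Pick the most preferred ETR among those currently believed reachable."""
    candidates = [e for e in mapping.etrs if e.address in reachable]
    if not candidates:
        return None
    return min(candidates, key=lambda e: e.priority).address


# Example record using documentation addresses (not real deployments).
example = Mapping(
    micronet=ip_network("192.0.2.0/24"),
    etrs=(
        EtrEntry(ip_address("198.51.100.1"), priority=1),
        EtrEntry(ip_address("203.0.113.1"), priority=2),
    ),
)
```

The record itself rarely changes; the work is in keeping the `reachable` set accurate, which is the burden on ITRs discussed in this message.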

This data changes only slowly. It may remain the same for months or years. The rate of change is relatively low compared to Ivip's, but the mapping data is more complex and lengthy.

The SP-C and GQ-C schemes require the ITRs to decide which ETR to use, based on the likely arduous and error-prone task of determining reachability to multiple ETRs. This involves a large body of work for ITRs and ETRs, with consequent traffic.

CONS and ALT remove the need for pushing the global mapping database to ITRs and storing it in each one, but they involve the ITR being unable to tunnel traffic packets until a query and a response have traversed a global query network.

In LISP-ALT, this could easily involve a path longer than halfway round the world, since the structure of routers is determined by the aim of aggregating IP addresses, resulting in one router being in location X and the next level router for the request to be sent to being in some other location Y, which is on Earth somewhere, but not necessarily close to X. (Since IP addresses are scattered geographically all round the planet, aggregation by address means inter-router links can't be optimised according to their physical location.)

Consequently, the ALT query system will be slow, and traffic packets addressed to EID prefixes for which the ITR has no cached mapping information will need to be dropped or sent to the ETR via the ALT system itself - which would be slow, burdensome and not necessarily reliable.


Ivip's mapping information consists of:

  Description of the micronet by starting address and length.

  A single ETR address.
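
A minimal sketch of what this simpler mapping might look like in an ITR follows. The names are hypothetical, "length" is interpreted here as a count of consecutive addresses (an assumption), and micronets are assumed not to overlap. There is no priority field and no reachability input: the lookup just returns the one ETR address.

```python
import bisect
from dataclasses import dataclass
from ipaddress import IPv4Address
from typing import Iterable, Optional


@dataclass(frozen=True)
class IvipMapping:
    start: IPv4Address   # first address of the micronet
    length: int          # assumption: number of consecutive addresses
    etr: IPv4Address     # the single ETR for this micronet


class IvipMapTable:
    """Sorted table of non-overlapping micronets (assumed)."""

    def __init__(self, mappings: Iterable[IvipMapping]):
        self._mappings = sorted(mappings, key=lambda m: int(m.start))
        self._starts = [int(m.start) for m in self._mappings]

    def lookup(self, dest: IPv4Address) -> Optional[IPv4Address]:
        # Find the last micronet starting at or before dest.
        i = bisect.bisect_right(self._starts, int(dest)) - 1
        if i < 0:
            return None
        m = self._mappings[i]
        if int(dest) < int(m.start) + m.length:
            return m.etr
        return None  # dest falls in a gap between micronets
```

Because a micronet is a start address plus a length, it need not sit on a binary boundary, unlike the prefixes of the other schemes.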

The ITRs do not need to make any decisions or test reachability to ETRs, so ETRs can be very simple. (For the purposes of this discussion I am ignoring PMTUD and fragmentation problems. Ivip will involve extra ITR and ETR complexity to deal with these. The other schemes are currently not committed to solving these problems.)

Ivip mapping data changes more frequently than that of the other schemes. This is because Ivip is not directly involved in sensing reachability or in restoring multi-homed connectivity. Ivip requires the end user to have their own multihoming monitoring system - which automatically (or perhaps manually) changes the mapping to whatever the user desires.

While Ivip does not specifically provide TE, the end-user can achieve load spreading to a resolution of one IP address, by having micronets of any size, down to one IP address, and mapping them to different ETRs. (If one IP address carries too much traffic for load spreading to be achieved this way, it would be necessary for the user to use multiple DNS names etc. to spread their traffic over a handful of IP addresses to achieve the granularity of traffic volume they require.)
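
As a rough illustration of this kind of load spreading, the sketch below splits a run of addresses into single-address micronets and assigns them to ETRs in round-robin order. The function name and the (start, length, ETR) tuple layout are hypothetical, and round-robin is only one possible assignment: it balances the number of addresses per ETR, not the traffic volume each address carries.

```python
from ipaddress import IPv4Address
from typing import List, Sequence, Tuple


def spread_over_etrs(start: str, count: int,
                     etrs: Sequence[str]) -> List[Tuple[str, int, str]]:
    """Return one single-address micronet per address, as
    (start address, length, ETR) tuples, cycling through the ETRs."""
    if not etrs:
        raise ValueError("at least one ETR is required")
    base = int(IPv4Address(start))
    return [
        (str(IPv4Address(base + i)), 1, etrs[i % len(etrs)])
        for i in range(count)
    ]


# Four addresses spread over two ETRs (documentation addresses).
micronets = spread_over_etrs("192.0.2.8", 4, ["198.51.100.1", "203.0.113.1"])
```

Here 192.0.2.8 and 192.0.2.10 map to the first ETR and 192.0.2.9 and 192.0.2.11 to the second. An end-user who knows which addresses are busiest would presumably assign them by hand rather than in rotation.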

To achieve multihoming, Ivip's mapping data is changed only when an ETR actually becomes unreachable. This is far less frequent than the average rate of messages in BGP concerning the advertised prefix within which this ETR is located.

So it is not as if Ivip involves a global distribution of a BGP-like level of messages for each micronet.

There could be lots of BGP messages concerning the ETR's prefix, but as long as the ETR was still reachable, there would be no need to change the Ivip mapping in order to maintain the end-user's connectivity.
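
The decision an end-user's multihoming monitoring system makes could be sketched as below. This is an assumption about how such a monitor might behave, not part of any Ivip specification; the function name and the `is_reachable` probe are hypothetical. The point is that a mapping change is only generated when the current ETR is found unreachable, regardless of what BGP is doing with the ETR's prefix.

```python
from typing import Callable, Sequence, Tuple


def decide_mapping(current_etr: str,
                   candidates: Sequence[str],
                   is_reachable: Callable[[str], bool]) -> Tuple[str, bool]:
    """Return (ETR to map the micronet to, whether a mapping change is needed)."""
    if is_reachable(current_etr):
        # BGP churn that leaves the ETR reachable causes no mapping change.
        return current_etr, False
    for etr in candidates:
        if etr != current_etr and is_reachable(etr):
            return etr, True
    # Nothing better is available; leave the mapping alone.
    return current_etr, False
```

For example, with candidates 198.51.100.1 and 203.0.113.1, the mapping stays on 198.51.100.1 for as long as the probe succeeds, and moves to 203.0.113.1 only once it fails.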

TE would require more frequent changes to the mapping information.

When Ivip is used to provide mobility (IPv4 and IPv6, with minimal new functionality required for the mobile host and none for the correspondent hosts), there will be a mapping change every time the mobile host chooses to use a new ETR or TTR (Translating Tunnel Router - which performs its ETR functions and may be located outside the network it is currently connected to).

So both TE and mobility involve potentially high levels of mapping change. I think there needs to be some kind of charging system for mapping changes, because these changes will be sent to hundreds of thousands of ITRDs (full database ITRs) and QSDs (full database Query Servers) all over the world.

Depending on how Ivip is used for TE and mobility, the flow of mapping data is likely to be much higher than that of the other schemes.

Even if Ivip was used solely for multihoming, the rate of mapping changes would be higher than the other schemes.

However, Ivip involves a specially designed push system which will handle these messages with *far* greater ease than would be the case for them being passed peer-to-peer in BGP. (APT proposes a separate BGP system to push mapping data to each ISP.)

Ivip doesn't require that the full mapping data be pushed to every ITR. Caching ITRs and ITRFs (ITR function in sending host) can use nearby full database query servers. Ivip is more flexible regarding how far the mapping data needs to be pushed than LISP-NERD (all ITRs need the full feed) or APT (every ISP needs a full feed).
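
A caching ITR of this kind might be sketched as follows. Everything here is illustrative: the class name, the `query_server` callable standing in for a nearby full database query server, and the (start, length, ETR) answer format are all assumptions, and the sketch leaves out cache expiry and any updating of cached entries when mappings change.

```python
from ipaddress import IPv4Address
from typing import Callable, List, Optional, Tuple

# Hypothetical answer format: (micronet start, length in addresses, ETR).
Answer = Optional[Tuple[str, int, str]]


class CachingItr:
    def __init__(self, query_server: Callable[[str], Answer]):
        self._query_server = query_server   # stands in for a nearby query server
        self._cache: List[Tuple[int, int, str]] = []
        self.queries_sent = 0

    def etr_for(self, dest: str) -> Optional[str]:
        d = int(IPv4Address(dest))
        # Serve from cache when a known micronet covers the destination.
        for start, length, etr in self._cache:
            if start <= d < start + length:
                return etr
        # Otherwise ask the local full database query server.
        self.queries_sent += 1
        answer = self._query_server(dest)
        if answer is None:
            return None
        start, length, etr = answer
        self._cache.append((int(IPv4Address(start)), length, etr))
        return etr
```

Since the query goes to a nearby server holding the full database, the first packet to a new destination waits on a local round trip rather than on a query crossing a global network.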

The nature of the mapping data and the way it flows in all these schemes is very different from what happens in BGP.

Push systems are in principle much cleaner, faster and easier to predict than the only apparent alternative - the global query server systems of LISP-CONS, LISP-ALT and TRRP. I am convinced that LISP-ALT would be so slow, in many cases, that significant numbers of initial packets would be dropped, making the system unworkable.

  - Robin

  http://www.firstpr.com.au/ip/ivip/



