[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03

To: Erik Nordmark <erik.nordmark@sun.com>
Subject: Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
From: Iljitsch van Beijnum <iljitsch@muada.com>
Date: Wed, 1 Mar 2006 16:12:42 +0100
Cc: shim6-wg <shim6@psg.com>
In-reply-to: <44035798.5020705@sun.com>
References: <20060202183423.9C6B.SHINTA@sfc.wide.ad.jp> <43F27772.4000304@sun.com> <20060215141706.9C61.SHINTA@sfc.wide.ad.jp> <dd569350f929cd26466dea1d8a1d19fa@it.uc3m.es> <20060216172251.GA15792@1-4-5.net> <43F4CF7F.90804@sun.com> <43F4EF36.8030907@info.ucl.ac.be> <43F50988.3030707@tony.li> <FA1B607C-7DB1-4F97-A688-797E34492A42@muada.com> <44035798.5020705@sun.com>

[This message will go into both source address rewriting and trafficengineering.]

On 27-feb-2006, at 20:48, Erik Nordmark wrote:

There are two problems with allowing routers to rewrite sourceaddresses:1. The routers must know which packets are "legacy" and can't havetheir source address changed vs which packets are controlled byshim6 or another mechanism that can handle rewritten sourceaddresses.2. In current shim6, only previously negotiated source addressesmay be used, which means the shim6-enabled hosts in a site and therewriting routers must coordinate their efforts so correspondenthosts don't see unexpected source addresses.

FWIW draft-nordmark-shim6-esd-00.txt is on the way to the I-Ddirectory, and it has some ideas for how to address this.

Right. Lots of good points in there, but unfortunately, I disagreewith the mechanisms proposed. I really hope I'll never have to runDHCPv6 to configure my hosts, it's a big, fat, unelegant protocol.

The first issue is readily solvable by simply having shim6 hostsput a magic value in the upper 64 bits of the source address thatindicates "rewriting permitted".

Or next hdr = IPPROTO_SHIM6.

I don't find this very suitable. If we're going to send many shimmedpackets, it's more important than ever that we omit the shim headerwhenever possible. Apart from that, using the source address tosignal that the source address may be changed is much cleaner. Italso has the advantage that we can now borrow some bits to make theprocess easier. What we can do is have shim6 capable hosts emit a"source rewriting information request". That would be a packetaddressed to a shim6 correspondent that has the magic prefix in thesource address that triggers source address rewriting, and anadditional bit combination that tells the router to send back a listof prefixes it will use to rewrite. The host can then make sure thatthe correspondent knows to expect packets with these source addresses.

If this is an ordered list, the host can then use bits in the datapackets with the rewrite prefix in the source address to tell therouter which addresses it may insert. (Not sure what would happenthough if the router wants to rewrite into Y but the host only allowsX and Z.)

I've been thinking about something similar for traffic engineeringever since my message yesterday where I mentioned A6 records. Theproblem is that it's far from inconceivable that at some point, adisconnect forms between the info in the DNS and the actual state ofthe network. The way I see it, we have four ways to convey TE relatedinfo:

1. out of band end-to-end: this would be stuff in the DNS
2. out of band hop-by-hop: BGP is like this
3. in-band end-to-end: measured timing and packet loss information
4. in-band hop-by-hop: feedback from routers

The problem is that 2. needs aggregation to scale. 3. and 4. need tohave contact with the correspondent already, so it's useless in somecases, like in the case where we want one or more backup addressesthat are only tried if the primary addresses don't work. The only wayto convey this is with 1. We can either reuse SRV records forindividual services for this, which has the advantage that it'salready available today, but the disadvantage that this mechanismisn't really used and it needs to be supported on an application-by-application basis. Alternatively, we can do some magic in theresolver library to make this happen.

But this doesn't really make it possible to react to trafficengineering events in anything close to real time, if at all (DNS maynot be accessible by people who need to do TE.) The thing is, BGPisn't all that great for this either: with current multihoming, youcan't engineer traffic such that link 1 gets the first 10 megabits,then everything between 10 and 15 goes to link 2 and if there's morethan 15 Mbit it's balanced over the two links in a 2:1 ratio.(Believe me, this doesn't stop people from asking.)

But an in-band hop-by-hop TE mechanism would allow exactly this. Theway it would work is that routers are configured to provide feedbackfor packets with a shim header, if necessary. This feedback would bein the form of entries that go into the address selection policytable. The site egress router would probably want to inform hostsabout which source addresses go well with certain destinationprefixes. All routers between the source and destination (includingthe site exit router) would signal back "this prefix (which would bethe prefix that the destination for the packets falls into)preference value XXX". To avoid trouble, the preference valueshouldn't be allowed to completely override locally configured info.

Ignoring non-shim6 traffic for a moment, this would allow any routerin the path to push back traffic when the conditions warrant it. Arouter could be configured to start lowering the preference valuewhen traffic hits a certain threshold and shim6 traffic wouldautomatically be rerouted if possible. Obviously there's still thepotential for conflicting preferences.

A less fortunate side effect could be that a lot of regular trafficwould be shimmed when the initially chosed destination address isn'toptimal, which is only discovered when shim state is created afterthe session reaches a certain number of packets exchanged. So thiswould work even better if shim packets are exchanged before thesession starts, like you describe in your draft. Interestingly,suppressing the shim header makes shimming less problematic but withthe shim header suppressed the traffic engineering doesn't workhalfway through a long-lived exchange. This can be fixed byperiodically sending a packet with a shim header, though. Shimmingfor TE reasons could also be problematic when one side garbagecollects shim state too aggressively.

If we go down this road it may be useful to have one or more bits inthe shim context tag to communicate with routers, so we'd probablywant to make the context tag a bit smaller than 47 bits.

Last but not least: it's probably useful to use SRV records (if we'regoing to use those anyway) to tell hosts that:

- they shouldn't initiate shim6 (because the other end wants tocontrol when this happens or shim6 isn't supported)

- they should defer shim6 negotiation as per local policy
- they should do shim6 negotiation before starting any sessions

The latter would probably be desireable for sites that want tooptimize for TE or want to balance incoming sessions over differenthosts.

Follow-Ups:
- Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: Erik Nordmark <erik.nordmark@sun.com>
- Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: marcelo bagnulo braun <marcelo@it.uc3m.es>
- Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: Iljitsch van Beijnum <iljitsch@muada.com>

References:
- comments on draft-ietf-shim6-proto-03
  - From: Shinta Sugimoto <shinta@sfc.wide.ad.jp>
- Re: comments on draft-ietf-shim6-proto-03
  - From: Erik Nordmark <erik.nordmark@sun.com>
- Re: comments on draft-ietf-shim6-proto-03
  - From: Shinta Sugimoto <shinta@sfc.wide.ad.jp>
- TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: marcelo bagnulo braun <marcelo@it.uc3m.es>
- Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: David Meyer <dmm@1-4-5.net>
- Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: Erik Nordmark <erik.nordmark@sun.com>
- Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: Olivier Bonaventure <Bonaventure@info.ucl.ac.be>
- Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: Tony Li <tony.li@tony.li>
- Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: Iljitsch van Beijnum <iljitsch@muada.com>
- Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
  - From: Erik Nordmark <erik.nordmark@sun.com>

Prev by Date: Re: about R1bis
Next by Date: Re: shim6 @ NANOG (forwarded note from John Payne) (fwd)
Previous by thread: Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
Next by thread: Re: TE & SHIM6 (was Re: comments on draft-ietf-shim6-proto-03
Index(es):
- Date
- Thread