[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [RRG] draft-farinacci-lisp-05

To: Iljitsch van Beijnum <iljitsch@muada.com>
Subject: Re: [RRG] draft-farinacci-lisp-05
From: Dino Farinacci <dino@cisco.com>
Date: Sun, 23 Dec 2007 23:17:52 -0800
Cc: Routing Research Group list <rrg@psg.com>
In-reply-to: <02EA00D8-AFCC-4380-9D4A-CD648C20F1FA@muada.com>
References: <5A0C2670-8696-41CC-8E72-2AE623BB8371@muada.com> <4AEEC2E6-7205-49B2-ADDA-0874E4E98A02@cisco.com> <02EA00D8-AFCC-4380-9D4A-CD648C20F1FA@muada.com>

On 19 dec 2007, at 6:20, Dino Farinacci wrote:
There is no version field in the LISP header. There should be. Atleast a few "set to zero on send, ignore on receive" bits that canbe used for extensions without bumping the version number is alsoa good idea.
We decided to use control-plane type codes.
Unless I missed something, this means you can never make backwards-incompatible changes to the LISP header without running the old andnew versions on different RLOCs.

We can for control packets. But no, we can't for data packets. Wedidn't see this as a problem for GRE, when we defined it in the early90s. I don't see it as a problem here right now.

"In order to eliminate the need for a mapping lookup in thereverse direction, the ETR gleans RLOC information from the LISPheader."
I find this undesirable because this way, the behavior of thesystem can be different depending on which end initiated thecommunication. Trusting information supplied directly by the otherend is also problematic security-wise. I would much prefer it ifboth ends did an independent mapping lookup.
We stated this because there was a requirement from big contentproviders. I will loosen the language and say "MAY glean".
This is one of the big problems with GSE: if someone contacts youwith EID=windowsupdate.com and RLOC=l33th4x0r, and you trust thisrelationship, an attacker gets to redirect traffic for that EID to arandom place. This is especially bad when the attacker can set upthis state just as you're about to set up an outgoing connection tothat EID, because then they get to intercept your outgoing traffic.

Understand.

"LISP Locator Reach Bits: in the LISP header are set by an ITR toindicate to an ETR the reachability of the Locators in the sourcesite."
I wonder how useful this is in practice. First of all, having 32RLOCs is way too many.
We have received feedback that it may not be enough.  ;-)
As the LISP related documents that I've read are light on failuredetection and repair, it's hard to say anything definitive, but with32 ITRs and 32 ETRs TCP has probably long since given up when youget around to learning that it's ITR 31 that can talk to ETR 31 butthe other 1023 combinations don't work.

I find it very unlikely that all xTRs would be down.

I'd be interested in learning the rationale behind that feedback,though.

Will add.

[xTRs in ISP networks or (also) in end-user sites]
As I said at the IETF, for a CE deployment of xTRs, the most commonfailure points in the network, which affects connectivity to thesite, is the CE router going down, the CE-to-PE link going down, orthe PE router going down. Other failures are rerouted in the corebased on richness of connectivity or are damped out due toaggregation.
That's what the big ISPs tell us. Myself, I haven't had too muchtrouble with my last kilometers, so my experiences may not berepresentative, but I can tell you that routing failurs andbrownouts DO happen and anyone who cares enough about theirconnectivity to be multihomed, wants to be protected against that,too. Especially having a single ETR go down or function incorrectly(claiming incorrect (un)reachability for other ETRs serving the sameEID) and then being unreachable or having severely degradedreachability would be highly unacceptable to any multihomer thatI've ever known.

Right and encapsulating a packet isn't going to change it. And youcan't depend on LISP to solve a core routing failure.

In these cases, the loc-reach-bits are extremely effective and getsthe new status information to the other sites at data-plane rates.
Only if the return traffic uses the ITR for the forward traffic asits ETR, which makes sense in your view. However, this is a goodexample of a more general tendency in LISP to commit to a narrowmode of operation rather than to make the whole thing more open soit's easy to change the protocol later for the IETF and to makedifferent deployment tradeoffs for operators.

All xTRs from source-site to destination-site (and vice versus) willmost likely use all xTRs so traffic will be flowing out and in allthem. That's the whole point in doing multi-homing. We want to do it*better* and better means active-active and not active-backup, whichis slow converging.

I'm currently not seeing this amount of extensiblity in LISP. Thefact that all of this is happening in an IRTF wg and that we've hadnumerous previous efforts before that either failed to address theissue completely (IPv6) or created something that only solves partof the problem (shim6, and some would argue that I'm being generous)shows that closing off too many paths at this stage is probably nota good idea.

How about thinking of designing a protocol that is simple to get thejob done efficiently and incrementally. Rather than bloating it withgratuitous features which most likely won't be used.

We have received a lot of operational feedback from network operatorsabout what they want and what they don't want. We respect theiropinions and don't want to give them more.

We did consider sending a "free cache" bit in the data plane sosites that have cached state could time out the state and re-request a new mapping if they needed it, but as you might guess itwould cause a request implosion to the site.
This is the same argument that you used against my idea of having acode point in the LISP header to signal back reachability and otherinformation on request.

Right, I wasn't for what I suggested above. I told you we consideredit and threw it out.

We do this by having the ITR ask the ETR to send updates of therelevant information (not important what exactly that information isright now) by setting a code point in the LISP header. To avoidhaving to send back information that's already known, the ETR keepsa "table version" like value. I think we only need two or three bitsfor this. When the ITR asks for an update, it supplies the "tableversion" of the latest information that it learned. If the tableversion at the ETR is the same as in the ITR's request, it doesn'tdo anything. If the table version is different, the ETR sends backan update. The ITR keeps an RTT estimate and makes sure there isonly one request in flight per RTT so the ETR won't be sendingunnecessary copies of the information packets.

Let's see what the other LISP coauthors say about this. I will discussit with them offline.

/| Priority | Weight |R| Loc-AFI |Loc +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+\|Locator |+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
But since we won't use all the AFIs that are encoded, a byte shouldbe sufficient. Please comment if you think we should stick with 16-bits or compress all occurences of AFI to 8 bits.
Aren't multiprotocol BGP AFIs 16 bits? In that case, it saves IANAfrom creating another registry for something that they alreadyregister so if 16 bits is no imposition then that would be thenatural choice. But if you need the bits for something else, thenlet IANA work a bit harder for their money... Obviously, they shouldall be the same size at least in the context of LISP.

I know, but it packs so nicely right now.

Well if an ITR didn't have any data to know if the destination sitewas LISP-capable and encapsulated the packet by copying the innerDA to the outer DA, and the site was using PA addressing, thepacket would enter the destination site and travel to the host. Thehost would not recognize a destination port of 4341, so it wouldrespond with a port unreachable.
But we would not do this anymore with the advent of the lisp-interworking draft.
That's good, because it doesn't make much sense to me to tunnelpackets to destinations for which you don't know if they support thetunneling, even if we ignore for a moment how you would discover thenon-existant RLOC in that case.

Right.

So basically the use of mapping requests/replies is the onlyreliable (implicit) reachability detection mechanism. However, itis largely unspecified how ITRs should use this mechanism todetermine reachability.
That and receiving data packets from the site.
Sometimes traffic only flows in one direction. Although this is rarefor long periods unless you count asymmetric traffic flow, it's muchmore common for short times when a session goes from active to idle.

Yes, for maybe one flow, but if you look at all flows between the twosites, there are safe chances it flows bidirectionally.

I guess we have had enough discussion on this.  ;-)
Well, let me put it this way: I'll gladly forego more discussion inlieu of more consensus. Unfortunately, most people haven't spokenout in favor of an approach towards the MTU thing.

Many have spoken out to do nothing because it is really not a problem.

In my opinion, it would be beneficial to remove pretty much all ofthe text that doesn't pertain to actual LISP operation from thisdraft, and move that which is still useful (some of it is a bitstale after five iterations) to a new "LISP architecture" document.
Can you list what is in this document doesn't pertain to LISPoperation?
I have a long list of other documents that I should review at somepoint, so I don't want to go over the LISP draft again at this time.If you agree with my suggestion, just keep your eye open for textthat qualifies on the next iteration of the document. If you don't,don't.

Will do. Thanks for you comments.

Dino

--
to unsubscribe send a message to rrg-request@psg.com with the
word 'unsubscribe' in a single line as the message text body.
archive: <http://psg.com/lists/rrg/> & ftp://psg.com/pub/lists/rrg

References:
- [RRG] draft-farinacci-lisp-05
  - From: Iljitsch van Beijnum <iljitsch@muada.com>
- Re: [RRG] draft-farinacci-lisp-05
  - From: Dino Farinacci <dino@cisco.com>
- Re: [RRG] draft-farinacci-lisp-05
  - From: Iljitsch van Beijnum <iljitsch@muada.com>

Prev by Date: Re: [RRG] Tunnel fragmentation/reassembly for RRG map-and-encaps architectures
Previous by thread: Re: [RRG] draft-farinacci-lisp-05
Next by thread: [RRG] LISP etc architecture
Index(es):
- Date
- Thread