[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: review of draft-ietf-shim6-failure-detection-03.txt

To: marcelo bagnulo braun <marcelo@it.uc3m.es>
Subject: Re: review of draft-ietf-shim6-failure-detection-03.txt
From: Iljitsch van Beijnum <iljitsch@muada.com>
Date: Sat, 24 Jun 2006 12:59:11 +0200
Cc: shim6-wg <shim6@psg.com>
In-reply-to: <bf58c11aa75972c847f3c9ba354a08b3@it.uc3m.es>
References: <615BD9B54CB01B41838C323DB9E91AAB4075EB@esebe100.NOE.Nokia.com> <446DC7E4.3090501@piuha.net> <F9F7CB0B-567C-4CB0-9775-63098CEBBD22@muada.com> <4497FBEA.7020908@piuha.net> <099F9743-9BD2-43B8-B53F-A1644D1D81C9@muada.com> <bf58c11aa75972c847f3c9ba354a08b3@it.uc3m.es>

On 22-jun-2006, at 16:59, marcelo bagnulo braun wrote:

This draft works per-context exclusively. So if there are 2, 5 or10 contexts between two hosts, this means 2, 5 or 10 times theamount of work is done.

i agree that this would be a nice feature. the problem with this ishow do you identify the peer in such a way that you can probe allthe existing contexts.


Have a look at the revision of my reachability detection draft:
http://www.muada.com/drafts/draft-van-beijnum-shim6-reach-detect-00.txt

Note that this is an update of draft-ietf-shim6-reach-detect-01.txtand it's not yet posted by the secretariat.

The other option would be to use a single probe/keepalive for allthe contexts between two peers. In order to do that we need a meanto identify the peer so that the receiver of the packet canidentify all the contexts corresponding to the same hosts and applythe received packet to all the contexts.


Indeed.

BAsically this would introduce the notion of endpoint in the shimcontext/protocol (which is not present today), since today thegranularity is ulid pairs (as oposed to endpoint pairs)


Not necessarily, read my draft.

this would be a considerable change in the protocol i guess, butmay be explored if people deem it relevant.

It makes the protocol a bit more complex, but it does allow it to beused by many different protocols at the same time.

As a general comment, i am kind of worried about the complexity ofthe resulting protocol, including shim protoc and the failuredetection protocol and i would really preffer to try to simplifythe protocol rather than making it more complex, even if this meansloosing some optimization for some cases.

I suppose the case where there are multiple contexts between two hostwon't be that common that it's worth too much effort to deal with it.But if other protocols also need this, then it would be MUCH betterto have a single code base that's shared by all of them rather thanhave essentially the same thing pop up in different places.

I am concerned about having a complex protocol that may becomeerror prone (we already have feedback expressing this concern BTW)

I hate complexity as much as the next IETFer, but leaving the last10% out just because it's simpler is generally not a good solution.

However, it's important that there is fate sharing between thereachability protocol and the user protocol (shim in our case). Ithink this can be solved by having the quick reachabilityverification stuff (= FBD) encapsulated in the user protocol, butlet the full path exploration be a protocol of its own or liveunder ICMPv6 or some such.

not sure why do you think this is needed. Defining the protocolmessages in a way that they can be included in the shim6 header aswell as in the mobility header or the hip header would be goodenough to allow using the failure detection protocol in otherprotocols.... what am i missing?

See the discussion above, and the need for fate sharing between thereachability protocol and the "user" protocol. If we want thereachability detection to be shared by different users, then it canhappen that one protocol is filtered and another isn't. So weprobably want the reachability detection to be independent of the"user" protocols and then when the reachability protocol says thatsomething is reachable, the user protocol does a quick check usingits own protocol number to be sure it actually works.

Another thing that's missing completely from this draft is adiscussion of how to use address pair preference information. Thismakes it impossible to address traffic engineering needs.

well, i have been working on this and i have submitted a draftabout how to perform locator pair selection, including reachabilityinformation and also preference information from the shim protocol

you can find it at:

http://www.ietf.org/internet-drafts/draft-ietf-shim6-locator-pair-selection-00.txt

of course your feedback would be very welcome


I'll have a look at it.

i think that the definition section is very useful, because theinsight it provides about the different states of an address andaddress pairs are very important.

I agree, but my problem with the definition section is that itcontains too much stuff that shouldn't be there. It's not unusual tohave to go back to the definition section several times duringreading, so a definition section needs to be as concise as possible.

I suggest tightening the use of words like "operational", "work","reachable". They're mostly used interchangably in the draft.

i don't think this is the case.
i find this differences relevant imho


I'm not sure there is a difference, and if there is, what it is...

This doesn't say what shim6 implementers should do. In my opinion:keep using deprecated addresses as the ULID/primary locator aslong as possible, but prefer non-deprecated addresses whenselecting alternative locators.

i think this should belong to the locator selection document...


Is that a separate document???

   2.  Whenever outgoing data packets are generated

Data packets as opposed to what other types of packets?

signalling packets, such as keeplives or probes (is my understanding)


Sure, but the draft doesn't say that.

   4.  The reception of a REAP keepalive packet leads to stopping the
       timer associated with the return traffic from the peer.

So when we receive a keepalive from the other side, _we_ stopsending keepalives

as i understand it, this means that we are not expecting anotherpacket (until we send a new packet, of course)

I guess. But shouldn't this follow from the general rules rather thanbe a specific one?

The keepalives are sent at an interval of 3 seconds (or shorter, Iimagine that an implementation isn't going to keep an exact timerfor each context, any rounding must obviously be in the downdirection) and the timeout is 10 seconds. In these 10 secondsyou'd normally receive 3 keepalives, while 1 is enough to indicatethat the other side is still alive. The other 2 are only there incase of packet loss. I think that's excessive.

would you suggest it to reduce it to 2 packets every 10 secs?

That's a bit better, but actually I think 1 in 10 seconds is enough,although that means you need to take a few extra seconds before youcan time out. If you want to time out after 10 seconds then sending akeepalive after 8 would probably be a good choice.

I mean, i think this protocol will require quite a lot of finetunning based on experience and simulations of the load... i guessthat what's in the current spec are resonable values for the timebeing (i have no problem with changing them a bit, but as i said iguess in depth fine tunning will be needed once we have moreexperience...)

How is experience going to tell us anything that we don't knowalready in this case? If we go for one missed keepalive before atimeout that would be a new approach that may not work out well andthen we can go back to 5 seconds or 3 seconds, but starting at 3means a lot of packets but as good as no unnecessary triggering ofpath exploration, there won't be any surprises there.

I believe that since the id of the last received probe isincluded, the iseeyou flag is unnecessary.

you mean that if the id field is empty, this means iseeyou=no?

No, what I mean is that the value of this bit doesn't convey anyinteresting information.

Or maybe it really is a "reply requested" bit in disguise, like wediscussed earlier.

Although copying back the last seen id seems to do the job, Ican't help but feel that it would be preferable to add timers toreach round trip times and copy back more received ids and alsosent ids. This allows the receiver of a probe to determine whichof the probes that made it to the other side did so faster, so itcan select the address pair with the shortest round trip time.

i would suggest to leave this for future work, since it is addedcomplexity and it is not obvious to me that selecting the fastestone is always the best choice.... (e.g. bandwidth is not considered)

I'd say: put in the fields, this is very little extra work, and thevalues can be ignored for simplicity when desired. Then, implementerscan experiement with how they use them if they like.

The keepalive is a fairly long packet. I think just a shim headeras would be used for data packets but with no ULP following theshim header would be sufficient.

not sure what would you omit from the current packet format... imean, we need the context tag and the identifier and we need it tomake it extensible in the header....

No we don't. Data packets don't have these fields either and alsoindicate that the current context is working. Moreover: data packetsthat haven't been rewritten don't even have a shim header!

Requiring random numbers in packets that are sent ratherfrequently is a bad idea, because it depletes the typicallylimited amount of entropy that's available for strong randomnumber generation rather quickly and semi-random number generationmay be somewhat expensive (and not that good). And I don't seewhat good an id does in a keepalive anyway... Also, there may bereasons to have non-random numbers, such as ease of lookup.

i guess this i neeeded to indeed verify that the reply wasgenerated as a response to the initial packet,

Keepalives are generated autonomously, not in response to other shimpackets, so this is not relevant in this case.

I don't have a good feeling about this... It's too hard todetermine what should be happening. Maybe it would be betterrather than go down the list of packets that are sent/received anddescribe the behavior in each state, to take one state at a timeand describe what happens with packets in that state.

that would be the state machine i guess, right?


I don't know.

Then I'm ignoring this too.

But I would be happier if they'd be removed, because eitherthey're superfluous as they're not normative, or they're actuallynecessary to understand the protocol, which is even worse becausethey're not part of the normative text.

i think state machines are very useful to understand how theprotocol works and to verify that it is working and i think theseshould be included in the docuemnts

Is it really not possible to express them in ASCII so they can bemade part of the normative text?

Follow-Ups:
- Re: review of draft-ietf-shim6-failure-detection-03.txt
  - From: marcelo bagnulo braun <marcelo@it.uc3m.es>

References:
- review of draft-ietf-shim6-failure-detection-03.txt
  - From: <john.loughney@nokia.com>
- Re: review of draft-ietf-shim6-failure-detection-03.txt
  - From: Jari Arkko <jari.arkko@piuha.net>
- Re: review of draft-ietf-shim6-failure-detection-03.txt
  - From: Iljitsch van Beijnum <iljitsch@muada.com>
- Re: review of draft-ietf-shim6-failure-detection-03.txt
  - From: Jari Arkko <jari.arkko@piuha.net>
- Re: review of draft-ietf-shim6-failure-detection-03.txt
  - From: Iljitsch van Beijnum <iljitsch@muada.com>
- Re: review of draft-ietf-shim6-failure-detection-03.txt
  - From: marcelo bagnulo braun <marcelo@it.uc3m.es>

Prev by Date: RE: review of draft-ietf-shim6-failure-detection-03.txt
Next by Date: Re: review of draft-ietf-shim6-failure-detection-03.txt
Previous by thread: Re: review of draft-ietf-shim6-failure-detection-03.txt
Next by thread: Re: review of draft-ietf-shim6-failure-detection-03.txt
Index(es):
- Date
- Thread