[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [RRG] thoughts on the design space 1: the space

To: Jari Arkko <jari.arkko@piuha.net>
Subject: Re: [RRG] thoughts on the design space 1: the space
From: Iljitsch van Beijnum <iljitsch@muada.com>
Date: Thu, 31 Jul 2008 15:58:00 +0100
Cc: rrg <rrg@psg.com>
In-reply-to: <4888730E.1090508@piuha.net>
References: <4888730E.1090508@piuha.net>

Be warned: long message.

Jari: I largely agree with your conclusion on the solution direction.

However, I think the NERD data format can significantly improved.

On 24 jul 2008, at 13:18, Jari Arkko wrote:

The next choice in the aggregatable branch is whether the identifier-locator separation goes all the way to the host or not. If it goesall the way to the host, we can see different solutions based onwhich layer the design is at. Shim6, Six/One, and HIP are IP layersolutions

One issue with these solutions, and presumably with any solutions thatexposes the locators to the hosts, is that it doesn't solve therenumbering issue.

We probably want to avoid exposing locators to the end-user site inany shape or form to make sure that the end-users won't be tempted toput locators in their configurations and thus make it hard to renumberlocators, requiring us to come up with an id-loc-loc split in thefuture.

whereas multipath TCP would be a transport layer solution.

Note that multipath TCP doesn't necessarily have to know the actuallocators: it could be good enough to just indicate "please use path 1for this packet" vs "please use path 2 for this packet".

On 24 jul 2008, at 13:21, Jari Arkko wrote:

The caching design has a number of issues:
- If packets are dropped while cache entries are being fetched,there may be deterministic loss- If packets are routed through a secondary path while cache entriesare being fetched, there may be deterministic delay

Unfortunately, the loss isn't deterministic in its timing so thesending host doesn't know when to resend its packets. There are alsonot any RTT measurements that could provide guidance with thatdecision. So the choice is between relatively long delays or sending alarge number of packets, creating a SYN flood of sorts on thedestination.

When routing initial packets through the mapping system, this wouldallow for denial of service against the mapping system. This can beremedied for the most part with rate limiting, but this increases theaverage delay before a mapping can be optained; possibly to the degreeof unusability.


On 24 jul 2008, at 13:23, Jari Arkko wrote:

First, I am not at all convinced that we actually HAVE to employ acache for the forwarding lookup.

I think we tend to operate under the unstated assumption that a singlerouter must be able to resolve mappings for the full set ofdestinations. I don't think that's necessary. If one big box isexpensive to build, it makes sense to use multiple smaller ones, thateach handle part of the destination name space. As long as we buildthe mapping system such that it's easy to direct different mappings tothe appropriate router or encapsulating device, we should be able toget parallelism benefits when scaling up.

On 24 jul 2008, at 14:20, William Herrin wrote:

But we already know negative acknowledgments can't be made to generate
and return reliably during an outage event. In any operational system,
we get unexpected routing loops or a firewall blocks the way or a
router malfunctions or the network is congested or something else
happens so that the packet is silently dropped without a NAK.

That's why we detect outages through the absence of positive
acknowledgment instead.

Right. So we apply a positive acknowledgment mechanism between thetunnel/translation endpoints. Perhaps you can ask Jari if he hasdesigned one in recent years. :-)

In other words: I think that the shim6 REAP protocol can be appliedhere. REAP is very light-weight when there is either no data orbidirectional data. In shim6 it runs between the hosts in question,but it could be made to run on the translators or encapsulator/decapsulator.

I.e., we do push for the static information and pull for the volatileinformation.

On 25 jul 2008, at 19:39, Olivier Bonaventure wrote:

The only case where you would detect problems with mapping are whena single host is opening a single TCP connection. In this case, theSYN packet will be delayed by the mapping. Note that using the DNSalso causes such a delay...

No, in the normal case you shoot off a DNS request and then youcontinue when you get the reply. So there is a delay, but you get toproceed immediately when the required information is available. With acache miss in the data plane, you lose a packet and then you have toretransmit at some point. There is no way to determine what the righttime is to retransmit, so either you'll be retransmitting tooaggressively, which means the destination gets multiple copies of thepacket, or you wait longer than necessary and the user experiencesuffers.

--
to unsubscribe send a message to rrg-request@psg.com with the
word 'unsubscribe' in a single line as the message text body.
archive: <http://psg.com/lists/rrg/> & ftp://psg.com/pub/lists/rrg

References:
- [RRG] thoughts on the design space 1: the space
  - From: Jari Arkko <jari.arkko@piuha.net>

Prev by Date: Re: [RRG] thoughts on the design space 4: encapsulate vs. translate
Next by Date: [RRG]
Previous by thread: Re: [RRG] thoughts on the design space 1: the space
Next by thread: Re: [RRG] thoughts on the design space 1: the space
Index(es):
- Date
- Thread