Re: [RAM] New LISP draft available ...

Dino Farinacci <dino@cisco.com> Wed, 25 July 2007 16:20 UTC

Return-path: <ram-bounces@iab.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1IDjbD-0002Wf-Mo; Wed, 25 Jul 2007 12:20:31 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1IDjbC-0002WF-Td for ram@iab.org; Wed, 25 Jul 2007 12:20:30 -0400
Received: from sj-iport-1-in.cisco.com ([171.71.176.70] helo=sj-iport-1.cisco.com) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1IDjbB-0003zO-Hi for ram@iab.org; Wed, 25 Jul 2007 12:20:30 -0400
Received: from sj-dkim-3.cisco.com ([171.71.179.195]) by sj-iport-1.cisco.com with ESMTP; 25 Jul 2007 09:20:28 -0700
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: Ao8CAA4Vp0arR7PD/2dsb2JhbAA
X-IronPort-AV: i="4.16,581,1175497200"; d="scan'208"; a="10442793:sNHT16495542"
Received: from sj-core-5.cisco.com (sj-core-5.cisco.com [171.71.177.238]) by sj-dkim-3.cisco.com (8.12.11/8.12.11) with ESMTP id l6PGKSlE000664; Wed, 25 Jul 2007 09:20:28 -0700
Received: from xbh-sjc-231.amer.cisco.com (xbh-sjc-231.cisco.com [128.107.191.100]) by sj-core-5.cisco.com (8.12.10/8.12.6) with ESMTP id l6PGKJT5005500; Wed, 25 Jul 2007 16:20:28 GMT
Received: from xfe-sjc-211.amer.cisco.com ([171.70.151.174]) by xbh-sjc-231.amer.cisco.com with Microsoft SMTPSVC(6.0.3790.1830); Wed, 25 Jul 2007 09:20:28 -0700
Received: from [130.129.84.246] ([10.21.90.69]) by xfe-sjc-211.amer.cisco.com with Microsoft SMTPSVC(6.0.3790.1830); Wed, 25 Jul 2007 09:20:28 -0700
In-Reply-To: <46A3CEE9.5080008@gmail.com>
References: <1DFBAD55-CA28-4D44-9F96-218F5F281047@cisco.com> <46A3CEE9.5080008@gmail.com>
Mime-Version: 1.0 (Apple Message framework v752.3)
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
Message-Id: <FD40F969-AACC-4182-87F0-109C5556ACD1@cisco.com>
Content-Transfer-Encoding: 7bit
From: Dino Farinacci <dino@cisco.com>
Subject: Re: [RAM] New LISP draft available ...
Date: Wed, 25 Jul 2007 09:20:21 -0700
To: Brian E Carpenter <brian.e.carpenter@gmail.com>
X-Mailer: Apple Mail (2.752.3)
X-OriginalArrivalTime: 25 Jul 2007 16:20:28.0340 (UTC) FILETIME=[B6CB0F40:01C7CED7]
DKIM-Signature: v=0.5; a=rsa-sha256; q=dns/txt; l=5490; t=1185380428; x=1186244428; c=relaxed/simple; s=sjdkim3002; h=Content-Type:From:Subject:Content-Transfer-Encoding:MIME-Version; d=cisco.com; i=dino@cisco.com; z=From:=20Dino=20Farinacci=20<dino@cisco.com> |Subject:=20Re=3A=20[RAM]=20New=20LISP=20draft=20available=20... |Sender:=20; bh=VMMHrx27lhcVrt2fRi4Vtiou3gLyQ5JgFkalfeIzUew=; b=eS2KOQqQAPC7ABGFTwgAQkl2apCFMrS6bIhP0ZSvik22J+Fh6IgbjbrFl0WuP+0Gwdl3NU3g pQYzZchxQPoQYnEhkjzn9BXJ1NwN58VpaBOpy8lpr34KBXJIcYKWaoaF;
Authentication-Results: sj-dkim-3; header.From=dino@cisco.com; dkim=pass (si g from cisco.com/sjdkim3002 verified; );
X-Spam-Score: -4.0 (----)
X-Scan-Signature: c54bc2f42d02429833c0ca4b8725abd7
Cc: RAM Mailing List <ram@iab.org>
X-BeenThere: ram@iab.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Routing and Addressing Mailing List <ram.iab.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ram>, <mailto:ram-request@iab.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ram>
List-Post: <mailto:ram@iab.org>
List-Help: <mailto:ram-request@iab.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ram>, <mailto:ram-request@iab.org?subject=subscribe>
Errors-To: ram-bounces@iab.org

> On 2007-07-17 19:56, Dino Farinacci wrote:
>> We have a new LISP-02 draft available. The packet formats are  
>> consistent with CONS-01 as well as the implementation we are testing.
>> The draft has been submitted to the ID directory, but you can find  
>> a draft at:
>>     http://www.dinof.net/~dino/ietf/draft-farinacci-lisp-02.txt
>
> A few comments on this version...

Thanks for the comments Brian. Sorry for the delay, but I want to add  
to what Scott said. I didn't disagree with any of his comments.

> General points that aren't mentioned and perhaps should be:
>
> MTU size reduction
> Fragmentation (in IPv4)

I will add text to the -03 version. Did you have any suggestions?

> Referrals (host A passes an address for host B to host C)

As Scott said, everything is EID based. Will make that more clear in  
the document.

> > 4.1.  Packet Flow Sequence
> ...
> >    1.  host1.abc.com wants to open a TCP connection to  
> host2.xyz.com.
> >        It does a DNS lookup on host2.xyz.com.  An A record is  
> returned.
> >        This address is used as the destination EID and the locally-
> >        assigned address of host1.abc.com is used as the source  
> EID.  An
> >        IP packet is built using the EIDs in the IP header and  
> sent to
> >        the default router.
>
> 'default router' seems wrong (it would typically be understood to
> mean the default router on the host's LAN). Doesn't this mean
> 'a site egress router' (which is ingress to the LISP domain)?

The default router is the first-hop router directly connected to the  
source. In this example, the ITR is placed in the first-hop router  
but can be placed anywhere, and maybe more ideally in the CE router  
(the one connected to the CE-PE link).

> > 5.3.  Tunnel Header Field Descriptions
> ...
> >    o  The OH header Time to Live field MUST be copied from the IH  
> header
> >       Time to Live field.
> >
> >    o  The OH header Type of Service field SHOULD be copied from  
> the IH
> >       header Type of Service field.
>
> Does this apply when encapsulating across versions (4 in 6 or 6 in 4)?

Absolutely.

> In any case the terminology is slightly different in v6 headers.

I will fix that.

> In the v6 case, does it apply to the flow label too?

I am not sure, depending on the semantic of the flow label which has  
been rather unclear. What do you think?

> In 6.1.1 and elswhere you give RFC2434 as a reference for
> AFIs. There's nothing about AFIs in that RFC; in fact AFI  
> documentation
> is a bit of a mystery.

I will include a reference to http://www.iana.org/numbers.html. We  
use the term "AFI" in PIM as well.

> In 6.1.3:
>
> >    Priority:  each RLOC is assigned a priority.  Lower values are  
> more
> >       preferable.  When multiple RLOCs have the same priority,  
> they are
> >       used in a load-split fashion.  A value of 255 means the  
> RLOC MUST
> >       NOT be used.
>
> Then why waste bits sending that RLOC?

Because the cost of these bits are minimal and we don't want to  
change the record often. Want to keep the change frequency of mapping  
records low. Plus the loc-reach-bits in the encapsulation header  
refer to the position here and *could* override the 255 value.

> >    Weight:  when priorities are the same for multiple RLOCs, the  
> weight
> >       indicates how to balance traffic between them.  Weight is  
> encoded
> >       as a percentage of total packets that match the mapping  
> entry.  If
> >       a non-zero weight value is used for any RLOC, then all  
> RLOCs must
> >       use a non-zero weight value and then the sum of all weight  
> values
> >       MUST equal 100.  What did the 3rd grader say after Steve  
> Jobs gave
> >       an iPhone demo to the class?
>
> It doesn't add up?

You have to read the CONS draft.  ;-)

> >       ...  If a zero value is used for any RLOC
> >       weight, then all weights MUST be zero and the receiver of  
> the Map-
> >       Reply will decide how to load-split traffic.
>
> That MUST condition seems pointless. If there is any zero value, the
> other values don't matter.

Not true, you could have 3 locators all with non-255 priority, each  
with 0, 50, and 50 as weights. What is conveyed in this encoding is  
that the first locator is not used. When all weights are 0, the ITR  
can load-split to it's liking.

> > 6.2.  Routing Locator Selection
> ...
> >    o  Server-side returns a list of RLOC where a subset of the  
> list has
> >       the same best priority.  Client can only use the subset list
> >       according to the weighting assigned by the server-side.  In  
> this
> >       case, the server-side controls both the subset list and load-
> >       splitting across its members.
>
> That rule doesn't seem viable or enforcable. The client side may
> simply not desire to take orders on how to load share, and that
> could be a business issue (if the client has ISP preferences
> for example). I think it needs to be "Client SHOULD only use the  
> subset list
>       according to the weighting assigned by the server-side."

Well, we do want client spec-compliant implications. I realize there  
are malicious systems out there but you have to document what the  
intended rules are.

> Nit: the citations of RFC3056 and RFC2784 are inverted in the text.

Will fix. Thanks again.

Dino

_______________________________________________
RAM mailing list
RAM@iab.org
https://www1.ietf.org/mailman/listinfo/ram