Re: [Lsr] IS-IS over TCP

"Les Ginsberg (ginsberg)" <ginsberg@cisco.com> Wed, 07 November 2018 08:54 UTC

Return-Path: <ginsberg@cisco.com>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E240E1286D9 for <lsr@ietfa.amsl.com>; Wed, 7 Nov 2018 00:54:54 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -14.97
X-Spam-Level:
X-Spam-Status: No, score=-14.97 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.47, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-5, SPF_PASS=-0.001, USER_IN_DEF_DKIM_WL=-7.5] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=cisco.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id NwAkMUoyEaS3 for <lsr@ietfa.amsl.com>; Wed, 7 Nov 2018 00:54:52 -0800 (PST)
Received: from rcdn-iport-1.cisco.com (rcdn-iport-1.cisco.com [173.37.86.72]) (using TLSv1.2 with cipher DHE-RSA-SEED-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E026B123FFD for <lsr@ietf.org>; Wed, 7 Nov 2018 00:54:51 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=26880; q=dns/txt; s=iport; t=1541580891; x=1542790491; h=from:to:cc:subject:date:message-id:references: in-reply-to:mime-version; bh=D8veLG6Rj79QPwncZimNuLIbaFRsLyHuuVN4jEPLAOY=; b=lmGiAp/SGpyDPSMKKCwDTRYwfVqtmwFvBy60m7nj4PM71mkCmfG6FRU2 n2OzCoHUnBkc1OMJHSIAYupCj13BuoWc0R8O8pCWMOrUV53fKwjNeEjA4 ft+iFSlG7Qx/SLDj5/PcfzHmC2skR1vUCtRTA/sXSt1+pCWnRXe30XWgC I=;
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: A0AHAAD4p+Jb/5ldJa1kGQEBAQEBAQEBAQEBAQcBAQEBAQGBUwIBAQEBAQsBgQ12ZoECJwqDbpQvgg2XMIF6CwEBGAEMhEcCF4NJIjYLDQEDAQECAQECbRwMhToBAQEBAgEBASEKQQsFBwQCAQgRBAEBKAMCAgIlCxQJCAIEDgUIgxqBHVwID6hHgS6EP0CFKAWLeBeBQT+BEAGCFH6DGwEBAwGBdA8Qgk6CVwKIa1CFLIYsiisJAoZtihsggVaFAYoPjRWKJAIRFIEmJA0kgVVwFTuCbIIkGocQgU6FPkExjB6BHwEB
X-IronPort-AV: E=Sophos;i="5.54,475,1534809600"; d="scan'208,217";a="477912460"
Received: from rcdn-core-2.cisco.com ([173.37.93.153]) by rcdn-iport-1.cisco.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 07 Nov 2018 08:54:44 +0000
Received: from XCH-ALN-002.cisco.com (xch-aln-002.cisco.com [173.36.7.12]) by rcdn-core-2.cisco.com (8.15.2/8.15.2) with ESMTPS id wA78sirr016627 (version=TLSv1.2 cipher=AES256-SHA bits=256 verify=FAIL); Wed, 7 Nov 2018 08:54:45 GMT
Received: from xch-aln-001.cisco.com (173.36.7.11) by XCH-ALN-002.cisco.com (173.36.7.12) with Microsoft SMTP Server (TLS) id 15.0.1395.4; Wed, 7 Nov 2018 02:54:44 -0600
Received: from xch-aln-001.cisco.com ([173.36.7.11]) by XCH-ALN-001.cisco.com ([173.36.7.11]) with mapi id 15.00.1395.000; Wed, 7 Nov 2018 02:54:44 -0600
From: "Les Ginsberg (ginsberg)" <ginsberg@cisco.com>
To: Robert Raszuk <robert@raszuk.net>
CC: "hhwsmit@xs4all.nl" <hhwsmit@xs4all.nl>, Tony Li <tony.li@tony.li>, "lsr@ietf.org" <lsr@ietf.org>
Thread-Topic: [Lsr] IS-IS over TCP
Thread-Index: AQHUdYQQhdtsLtzGL0aulBBpslE59KVCif6AgAEjL2CAALcWgP//nYkg
Date: Wed, 07 Nov 2018 08:54:44 +0000
Message-ID: <202838e9138148cf95965531edfa97af@XCH-ALN-001.cisco.com>
References: <8C41CC22-2350-446F-9755-0AB1A6CE55EB@tony.li> <42003bf67cf6a6f5f2735bf867f8f1ef@xs4all.nl> <6255e2641d37460da22755c236d0f7bc@XCH-ALN-001.cisco.com> <CAOj+MMEybMra=Kgk2BrgPBGz+-rjCc6kHZfJW7yedRj1iLe=_A@mail.gmail.com>
In-Reply-To: <CAOj+MMEybMra=Kgk2BrgPBGz+-rjCc6kHZfJW7yedRj1iLe=_A@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-ms-exchange-transport-fromentityheader: Hosted
x-originating-ip: [10.24.42.236]
Content-Type: multipart/alternative; boundary="_000_202838e9138148cf95965531edfa97afXCHALN001ciscocom_"
MIME-Version: 1.0
X-Outbound-SMTP-Client: 173.36.7.12, xch-aln-002.cisco.com
X-Outbound-Node: rcdn-core-2.cisco.com
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/_E90pHFm2sMt3rF1yWsHveFn0Ok>
Subject: Re: [Lsr] IS-IS over TCP
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 07 Nov 2018 08:54:55 -0000

Robert –

The comment being discussed here is in regards to the potential benefits of altering IS-IS to use a different way of flooding over an interface.
Whether we agree/disagree on this does not have any bearing on the high amount of redundant flooding which occurs in a highly meshed network – which I think all of us agree is a problem worth addressing.

The latter is addressed in a number of drafts – including Henk/Gunter’s https://datatracker.ietf.org/doc/draft-hsmit-lsr-isis-dnfm/ .

But this thread is about https://datatracker.ietf.org/doc/draft-hsmit-lsr-isis-flooding-over-tcp/ .

Please do not confuse the two.

   Les


From: Robert Raszuk <robert@raszuk.net>
Sent: Wednesday, November 07, 2018 12:39 AM
To: Les Ginsberg (ginsberg) <ginsberg@cisco.com>
Cc: hhwsmit@xs4all.nl; Tony Li <tony.li@tony.li>; lsr@ietf.org
Subject: Re: [Lsr] IS-IS over TCP

Hi Les,

> There are existing and successful deployments of an instance with thousands of
> neighbors and thousands of nodes in a network and sub-second convergence is supported.

While I completely agree with your above observation (and as a matter of fact first person to tell me that was actually Henk around year 2000) may I ask what problem are we solving with vast majority of the proposals in space of "IGP scaling" for data center deployments currently on the table ? Are we perfecting the perfect ?

It sounds that what is really needed is a deployment guide (perhaps in the form of informational RFC) documenting that use of ISIS in 1000(s) node DC cluster is fine provided features X, Y, Z are enabled.

Otherwise folks out there in the field no matter what is done here will keep using BGP for huge 50 node DC clusters because RFC7938 says so.

Cheers,
Robert.


On Wed, Nov 7, 2018 at 5:09 AM Les Ginsberg (ginsberg) <ginsberg@cisco.com<mailto:ginsberg@cisco.com>> wrote:
Henk/Gunter -

Couple of points.

1)IS-IS PDUs are sent directly over layer 2 - whereas TCP (or other transport) are sent over Layer3.
This means we have a potential fate sharing issue where IIHs (which continue to be sent over Layer 2 in your proposal (as I understand it) may be successfully exchanged but the transport session set up to send LSPs/SNPs may fail. This is the exact issue RFC 6213 has addressed.

What should happen if IIHs are being successfully exchanged, the neighbors both indicate support for the extension, but the transport session fails to come up or goes down?
Would you follow similar behavior to RFC 6213 i.e., bring the adjacency down? If so, how would you determine when you can bring the adjacency up?

I appreciate you have indicated this as TBD in Section 7.6 of the draft, but you also say in Section 7.5 that adjacency should NOT go down when transport session fails - which implies we can have an ongoing condition where an adjacency is up but
flooding cannot be done-  which is a pathological condition.

2)Your statements regarding existing flooding limitations of IS-IS are rather dated. Many years ago implementations varied from the base specification by allowing much faster flooding and contiguous flooding bursts on an interface in support of fast convergence. There are existing and successful deployments of an instance with thousands of neighbors and thousands of nodes in a network and sub-second convergence is supported. So the statement that the existing protocol per interface flooding is a blocking factor in highly meshed topologies is not (IMO) accurate. The more accurate characterization of the flooding problem in dense topologies is the amount of redundant flooding (i.e., a node may receive many copies of a new LSP) - which your proposal does nothing to address (I understand it was not intended to address this problem which you discuss in a different draft).

My point here is that there are existing implementations which would get no benefit from your proposal. It might be argued that someone writing a new implementation may find it simpler to make use of a transport mechanism like TCP - but I do not think there is compelling data that demonstrates that the scalability of an implementation using your proposal is better than that of many existing implementations.

This then suggests that for existing implementations the main motivation to support your proposal is to help other implementations which have not optimized their existing implementations. :-)
Comments?

   Les


> -----Original Message-----
> From: Lsr <lsr-bounces@ietf.org<mailto:lsr-bounces@ietf.org>> On Behalf Of Henk Smit
> Sent: Monday, November 05, 2018 8:22 PM
> To: tony.li@tony.li<mailto:tony.li@tony.li>
> Cc: lsr@ietf.org<mailto:lsr@ietf.org>
> Subject: Re: [Lsr] IS-IS over TCP
>
>
> Thanks, Tony.
>
> We picked TCP because every router on the planet already has a TCP stack
> in it.
> That made it the obvious choice.
>
> Our draft described a TVL in the IIHs to indicate a router's
> ability to use TCP for flooding.
> That TLV has several sub-TVLs.
> 1) the TCP port-number
> 2) an IPv4 address
> 3) and/or an IPv6 address
>
> We can change the first sub-TVL so that it indicates:
> 1) 1 or 2 bytes indicating what protocol to use
> 2) the remainder of the sub-TLV is an indicator what port-number
>     or other identifier to use to connect over that protocol.
>
> This way we can start improving IS-IS with TCP today.
> And add/replace it with other protocols in the future.
>
> henk.
>
>
>
> tony.li@tony.li<mailto:tony.li@tony.li> schreef op 2018-11-06 04:51:
> > Per the WG meeting, discussing on the list:
> >
> > This is good work and I support it.
> >
> > I would remind folks that TCP is NOT the only transport protocol
> > available and that perhaps we should be considering QUIC while we’re
> > at it.  In particular, flooding is a (relatively) low bandwidth
> > operation in the modern network and we could avoid slow-start issues
> > by using QUIC.
> >
> > Tony
> >
> > _______________________________________________
> > Lsr mailing list
> > Lsr@ietf.org<mailto:Lsr@ietf.org>
> > https://www.ietf.org/mailman/listinfo/lsr
>
> _______________________________________________
> Lsr mailing list
> Lsr@ietf.org<mailto:Lsr@ietf.org>
> https://www.ietf.org/mailman/listinfo/lsr
_______________________________________________
Lsr mailing list
Lsr@ietf.org<mailto:Lsr@ietf.org>
https://www.ietf.org/mailman/listinfo/lsr