RE: [Bier] [pim] Q on the congestion awareness of routing protocols

"Jeffrey (Zhaohui) Zhang" <zzhang@juniper.net> Fri, 03 March 2023 19:02 UTC

Return-Path: <zzhang@juniper.net>
X-Original-To: tsv-area@ietfa.amsl.com
Delivered-To: tsv-area@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id CCDC6C14CEE4; Fri, 3 Mar 2023 11:02:35 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.798
X-Spam-Level:
X-Spam-Status: No, score=-2.798 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=juniper.net header.b="Wa5J1r+Z"; dkim=pass (1024-bit key) header.d=juniper.net header.b="RLedoDTj"
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 3wR_fkWKZMz2; Fri, 3 Mar 2023 11:02:31 -0800 (PST)
Received: from mx0b-00273201.pphosted.com (mx0a-00273201.pphosted.com [208.84.65.16]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 21BBAC1516F2; Fri, 3 Mar 2023 11:02:17 -0800 (PST)
Received: from pps.filterd (m0108157.ppops.net [127.0.0.1]) by mx0a-00273201.pphosted.com (8.17.1.19/8.17.1.19) with ESMTP id 323G6KVR015690; Fri, 3 Mar 2023 11:01:28 -0800
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=juniper.net; h=from : to : cc : subject : date : message-id : references : in-reply-to : content-type : content-transfer-encoding : mime-version; s=PPS1017; bh=vAbD8fl2/ZRhKfywbEA/9OJdUk5lB5yoDJ3I2PWntOI=; b=Wa5J1r+ZHmaQQzZwU1xiN+iEFbImJ36vKrJnJxvh+7s2OtBQInXaG7h+VyvkYQu7c+XD f74cyaTMqWbwD3JaI6LBqEML3BmKr7w0Wvc8fJ7hAdMS+lNlnHUmR7j+/sMHHk/AS5PL vTC2KK59t4HfAWBwZCa0NTQtiNVssWsmoVepNwm67whSbjuunMSN8wFuriM/n15WVne3 AX2cXjbmAwi9BNQsK24+ASDskaWVa8KkosQcLd+M7VNrJ7TfGWcNHVdCLAt9BVD6W5oo tX7+/1AZgHC0tOxRaq1vgw7zZKeukjiwNuay+Xm7DbyTrs2/lk0JBGkd9Hm4TKdvCDC4 Lg==
Received: from dm4pr02cu001-vft-obe.outbound.protection.outlook.com (mail-centralusazlp17012026.outbound.protection.outlook.com [40.93.13.26]) by mx0a-00273201.pphosted.com (PPS) with ESMTPS id 3p2bdem7jd-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Fri, 03 Mar 2023 11:01:28 -0800
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=AJiSZy/JJWVEo+8e/S4Ea9LL8O5fxtWZkobgCwZ33+DOJzdUMlR7LQ0UYRm/7Z8nTcTWIEg8JFwRg6XBkAC9S8Ot54yJLA1JjinAO31v1jyNZm2nkJBJzzV3eR8qzFmPkxp/7V2cM9f5xsDZXWMNZrwlW864bzJsHZs5lcVidO1cKl99i0H3OHTYFglWM60c70Fhg6yfdnj4gdT/9p8O1pEc16uYbqO7wipCv4/PSLYAyK/fbv3S/Xg7IkFYOZjJDRwLKSIWt498jUL2oJXHO7/5IDDc18fxOb8K+EGcbTezX7iyJMbyHjQrhy41T7JXhaMo5dYMMbSqVdPknRLSWQ==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=vAbD8fl2/ZRhKfywbEA/9OJdUk5lB5yoDJ3I2PWntOI=; b=FM4UwG+ZDyYnrKZJG7pIT8AgKHDGhmo6leRepfwqcIG6XGy5pR2gyozE/ZooOeVMWfyHTYP+RbJkcOsGHGJ2QO04g9NTr56DdHN73xppqVaJe1XLCtfLaDNkk6z97Z0FZMT/iBcSu9gWXzkaRd6nQqSOpw6DQljoNoFGFK+J12Fa+yJ8th6rECHHW6asK28Mb3R3TtTl81ehO377pga1YehXrsHPDQq8S0cQOgxklhpm0Y90OonoFaXdFUJ2gYzYpGY+ErPORmMVjUiNkXvAr2/p0eoM4E/sJFC7qBITt1St10OpBrjyi24+1IR4Fjq2oklOjSYFc6Tx1z7Rvsb0VA==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=juniper.net; dmarc=pass action=none header.from=juniper.net; dkim=pass header.d=juniper.net; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=juniper.net; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=vAbD8fl2/ZRhKfywbEA/9OJdUk5lB5yoDJ3I2PWntOI=; b=RLedoDTjXW5Vnwm0/n0dOslFPoLxvevScyBG08dZM1lARqSeOnWTI4/VNYYVS7KkdLKiztu5cQ4mKpBO1+Fb61XFEwQ1Cc2V5RAFhg2UtN5ZYZPY4QaxtWd17i2K+chNkoM4KOnlEcAN55qcr3nHnTHQph3uEForP7ZEO+3jEZs=
Received: from BL0PR05MB5652.namprd05.prod.outlook.com (2603:10b6:208:6a::19) by SJ0PR05MB8597.namprd05.prod.outlook.com (2603:10b6:a03:398::21) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6156.19; Fri, 3 Mar 2023 19:01:22 +0000
Received: from BL0PR05MB5652.namprd05.prod.outlook.com ([fe80::b380:a5af:145d:89c5]) by BL0PR05MB5652.namprd05.prod.outlook.com ([fe80::b380:a5af:145d:89c5%4]) with mapi id 15.20.6156.019; Fri, 3 Mar 2023 19:01:22 +0000
From: "Jeffrey (Zhaohui) Zhang" <zzhang@juniper.net>
To: Toerless Eckert <tte@cs.fau.de>
CC: Jon Crowcroft <Jon.Crowcroft@cl.cam.ac.uk>, BIER WG <bier@ietf.org>, "routing-discussion@ietf.org" <routing-discussion@ietf.org>, Matt Mathis <mattmathis=40google.com@dmarc.ietf.org>, "tsv-area@ietf.org" <tsv-area@ietf.org>, Stewart Bryant <stewart.bryant@gmail.com>, pim <pim@ietf.org>
Subject: RE: [Bier] [pim] Q on the congestion awareness of routing protocols
Thread-Topic: [Bier] [pim] Q on the congestion awareness of routing protocols
Thread-Index: AQHZCUKW2xa4j8sy606ZdmKT1G0T/a5lltWAgAuPmlCAd7iMAIABDOKQ
Date: Fri, 03 Mar 2023 19:01:22 +0000
Message-ID: <BL0PR05MB5652DF389C96CECB05B37329D4B39@BL0PR05MB5652.namprd05.prod.outlook.com>
References: <CAH56bmBnqi4peTWUXOVy0KRRXRc1L7TP+atFfVF6qb_OKBMBwg@mail.gmail.com> <C303F9BF-F96A-4710-A4B5-4228807C07F7@gmail.com> <52907137-CA5A-4042-AB2C-23FD9B032210@gmail.com> <E1p2SAw-006HQa-3s@mta0.cl.cam.ac.uk> <Y5M8RSjDuTLqJ/+v@faui48e.informatik.uni-erlangen.de> <BL0PR05MB56529DBC5D9299428B0D2A84D4E79@BL0PR05MB5652.namprd05.prod.outlook.com> <ZAFc1A5XQuKYbCsT@faui48e.informatik.uni-erlangen.de>
In-Reply-To: <ZAFc1A5XQuKYbCsT@faui48e.informatik.uni-erlangen.de>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
msip_labels: MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_Enabled=true; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_SetDate=2023-03-03T19:01:21Z; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_Method=Standard; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_Name=0633b888-ae0d-4341-a75f-06e04137d755; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_SiteId=bea78b3c-4cdb-4130-854a-1d193232e5f4; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_ActionId=590c8982-4f3e-4627-859e-5a95bb1d56ec; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_ContentBits=2
x-ms-publictraffictype: Email
x-ms-traffictypediagnostic: BL0PR05MB5652:EE_|SJ0PR05MB8597:EE_
x-ms-office365-filtering-correlation-id: b4ae4376-e9b4-450f-9aa4-08db1c19ae0a
x-ms-exchange-senderadcheck: 1
x-ms-exchange-antispam-relay: 0
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: AYbdqLrL9tjYGiyjqleS+YuaKdKNf/M5b3snRR/00ICJRlT1kodTxkZbUvpPhj2vRxaNHFQAvHkZcL2J7fPqmw4JC1RjHaNmGHsDa/gtkf8pxBebKyXsl8D9tAO39X/1UgFTzIILG5zb83sN8wDOnHGO+H4NwpfNxCcfOisyVgAcX7OzKSfc885v0hnZcUcAJZYZgsJqM+N6Olqj67lltQiFz7EF7U3D9kVz6eAIEpiBS6GJOtGPlGiz0R2KB4ku8lXne/uMiCjSZdn1GYuZcK+x8VSrAwZUG/jUpHGmnvzsPY4lDQl0BsWdlpwMH+DpVV9Ff8b5EoP2Zqrv57r53QvbvxP/ktTG0vYwJBYTHw0KtnrJN8pN3dFEUK+bt5RRFlg3Mpr45pWE8SGp2TsoTe51X/AEgGALMOiM51R/83L5mmKMnwsLCpK97hdgg+kux+7oUUIzLxMJupzbM7SZGkxuhth+69d/UVl6i254VbyfFsfKg75du4Myy0gVkMHdTra30wfEbLWfKFefzB0iDsElLfht5uxDL7+jf9yMca1rQhUQUl7XX3qHNrNapJp0BjOEYQLgtkbSUJopnjJfpc//XOLga6N06WCvBtewLW00Nsy6hWJCSg64DtsArHafG9UlJHSXShTMrMdu3CMALlvnlK+mBOrggAzlxKE04UWqJxz1p7jFV9rDNFKDgywKHS9beICNddNOS9Ij8lFY6/aky6cKoieIFS5Bxh4CD6w=
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:BL0PR05MB5652.namprd05.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230025)(4636009)(346002)(136003)(376002)(396003)(39860400002)(366004)(451199018)(38100700002)(33656002)(66574015)(71200400001)(6506007)(53546011)(966005)(66446008)(9686003)(26005)(186003)(316002)(4326008)(41300700001)(54906003)(64756008)(2906002)(66476007)(66946007)(76116006)(6916009)(8676002)(7696005)(478600001)(52536014)(5660300002)(122000001)(86362001)(38070700005)(55016003)(8936002)(66556008)(83380400001)(66899018); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata-chunkcount: 1
x-ms-exchange-antispam-messagedata-0: NcF5WznUwni9O9qo7G7rtvO0CRDFCnPDnMM8hSVrlCBVWo4NCWaiXwj/dgHz7wI7zNwsAbA96Re/4c2JHUReVCsHT1TlXYD426/YiWYD1NwLOl4YTxrDe30P2UwMmHSoK7GiRLRYHVn2SYBxXLkfFxz4DHgI/NXdBsV3R1KiL1flzpy+FQ7hxSd9lmyiLrzywpbSgmgsK8+fp8PPiICGB/xmNkWbuPIGjAJMhLpt58NvzYL1d3EwJG6eCV2N0lXbFlXI4f9/snFa7CKst7A0RF2Gr+geVsgJiK/C+/Nrc97UN2zqaINvGDum/o/fCxtpxLNhUqqG7Zo33rXwEJDkgdmeL7lE1tu55OJMBSfq+nk0lLLk4O6Q91nrFDLj+nVPh87sCBoi+54O/5YISwAH+JiMvbXAamn+VkzG/oIPzvVMUKd7UPXj6PI1RIcB7ACwTumiLyL4uZpcMoA9SSctljHvM0J/yReggH99+hH9zMk4FnpC1XsqjXvOdXz2Lrb2jkPcEHTiIZr5y7w2RXDRaJoNnhXJ1jled/bhjfSXh8T9XlJ6fDKk+vNseIktR6Wrcw5gDZRsFKP2aTBr5E7yamqNUWXYWPpGVqVhPglSQYOQnYbhQWIApYfB4182za6U1zdFVZYgHFTEHqFtoaZhCV7UwTSZ1uRKKIMIeopM7hNO+llwjVWBLqITXY0H/PXCUUPLiJcbhJUDKYp3FxqvobxPIfmDdUnjULRz7Fa5PaigHKJ7pjdrGM+m69kKHKtgE4xw/p9j5VXw1DfOoXNubka+vvENHwI6tZFQ5g7EElhJg12xKw5r6QRZ7N91197+hHfLiS6+nZ2LBYiXNHcRR78PTn8y9np3+xFPlBRJWEwtJSW4gPabhdDuIVnYQZu1kuSPz1VtdaJMMpJU6Vn4YNcnFh3bud9dsdYOvG9rZQNaJh5K1iZ6e/HrVAY71VKgfaTMBBlpNSkcuHoYIsDrAwk+cNRU/At/fKez1Qw7lbF/mkjq0kWxZdazJaq7Wr8A+tAGTvUNYyykEjpUn3DMMaDy75yd9Xff9/oIMxrUzVks3FjjXHZ91QAuzuy6eB0MVfh3oOORhv+R5Gb7must9oX0H0ft//NPuAV+73ILHGFaehDfkuVR03JV5l/BEEy42EdQJdlVD4IZyp8oSmf/E8RFJu2xOpm+Z+80egaF6aTyRwNwopmlQh11nUxSCMZhXB9XZRdvkmYLZPFVW91PPg9UiLtXdtsKP+njc3rEOD739CheK3/Q2kLLOQM2bCxV4HePkzO5M5cPnQyKsVb6JrASuJ4zpK8xiYXQKRE9rGv3lRTtMQ2efBhFokYXPP+E4a56YCQc9sh2o2+Tspz/wRB2hAVOMCDqc3S2nCBfko6uOmjosucPnpjIZw5H+A7XgZeeH0FAaRRNNmedLsAAvcLsRrifGBkn21B5MKWRFsLjsV4PCweGWCwnsrs69Zk3R1WkxSiAxG7AVFYGCvocO1IDP872eIKyXwti8PicRgIUrUvjkwgrYPy59sLnDtj05Qy/LqYxAyCi2j/CDCZ1J/I9XyjXVu/SWLvH8bYluhDy+Y0nIubmPrS5Gqwp628u
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-OriginatorOrg: juniper.net
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: BL0PR05MB5652.namprd05.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: b4ae4376-e9b4-450f-9aa4-08db1c19ae0a
X-MS-Exchange-CrossTenant-originalarrivaltime: 03 Mar 2023 19:01:22.2591 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: bea78b3c-4cdb-4130-854a-1d193232e5f4
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: OiaX5um0pFpu7NFNBtjHmc6f1++2MTpuNR6tnYNAiVPUPlwhpCMKTbY9kmoCVDiFM8/gqddFuwL5unM6UuME8w==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: SJ0PR05MB8597
X-Proofpoint-ORIG-GUID: ypgOCOrGOqnxz15tl5SPfcc0_vyXnTrO
X-Proofpoint-GUID: ypgOCOrGOqnxz15tl5SPfcc0_vyXnTrO
X-Proofpoint-Virus-Version: vendor=baseguard engine=ICAP:2.0.219,Aquarius:18.0.942,Hydra:6.0.573,FMLib:17.11.170.22 definitions=2023-03-03_04,2023-03-03_01,2023-02-09_01
X-Proofpoint-Spam-Details: rule=outbound_spam_notspam policy=outbound_spam score=0 phishscore=0 spamscore=0 priorityscore=1501 lowpriorityscore=0 suspectscore=0 malwarescore=0 adultscore=0 bulkscore=0 impostorscore=0 mlxlogscore=999 clxscore=1011 mlxscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2212070000 definitions=main-2303030161
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsv-area/u0I_oluoKoNPvo2kTL7M2uLNZGw>
X-BeenThere: tsv-area@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: IETF Transport and Services Area Mailing List <tsv-area.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsv-area>, <mailto:tsv-area-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsv-area/>
List-Post: <mailto:tsv-area@ietf.org>
List-Help: <mailto:tsv-area-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsv-area>, <mailto:tsv-area-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 03 Mar 2023 19:02:35 -0000

Hi Toerless,


Juniper Business Use Only

-----Original Message-----
From: Toerless Eckert <tte@cs.fau.de> 
Sent: Thursday, March 2, 2023 9:35 PM
To: Jeffrey (Zhaohui) Zhang <zzhang@juniper.net>
Cc: Jon Crowcroft <Jon.Crowcroft@cl.cam.ac.uk>; BIER WG <bier@ietf.org>; routing-discussion@ietf.org; Matt Mathis <mattmathis=40google.com@dmarc.ietf.org>; tsv-area@ietf.org; Stewart Bryant <stewart.bryant@gmail.com>; pim <pim@ietf.org>
Subject: Re: [Bier] [pim] Q on the congestion awareness of routing protocols

[External Email. Be cautious of content]


Jeff, *

Sorry, i forgot about this thread because of christmas, just remembering it now ;-)

I think your observations are all spot on, but the causality that is being implied is not correct.

I will claim that we will have a very hard time to make PIM be any faster without TCP than with TCP. Even getting close to the performance of TCP will be a lot of work, replicating work that was done so many times in before in TCP, or worse yet coming up with PIM specific optimizations (not enough of a market to invest a lot in that).

Zzh> If this is related to the "there are situations where a non-TCP based solution is needed even when a parallel TCP-based option is also present, so we can not simply disallow the former. We can discuss examples separately (one example is actually PIM as BIER overlay vs mLDP/BGP as BIER overlay)" point, the specific example (though it does not necessarily need BIER in the picture) is IPTV fast-channel-switching situation.
Zzh> Consider that not all channels are pre-delivered to BNGs. When a subscriber switches to a new channel, to reduce the delay you may want the line card to send out the join (at least the initial ones) to upstream (ingress PE if you're using MVPN) to quickly pull the traffic instead of going through the routing engine. If it's BGP based, it's hard to do it from the line card.
Zzh> I agree that this is special case, and it is not because non-TCP is better than TCP.
Zzh> Jeffrey

Sure we can and should also spend some some cycles to ask TCP experts which TCP feature/profile/CC we would recommend for use with PORT, but thats but a fraction of the work we would need otherwise if we wanted to re-invent the wheel or figure out which non-TCP alternative reliability sublayer we wanted to recommend/use for PIM.

I remember soemthing like 10 or more than 10 years ago to have had the discussion about TCP in routers, and indeed, earlier implementations where very bad and often sucked up CPU, but even back then, the current TCP was a minor consumer of CPU compared to the actual BGP work of best path calculation and dealing with state.

What can be a typical issue of non-ideal TCP implementations is the incast issue, when you have a single node (BGP/PIM whatever) that simulataneously receives traffic from multiple senders. And i am sure that not all TCP CC options will perform equally well. But i am quite sure that all of them perform better than datatram PIM where we simply get packet loss because of queue overrun on this receiving PIM router - and not forwarding traffic for 60 seconds or continuing to send unwanted traffic.

The other part of course is that it's easy to misimplement how the application
(PIM/BGP) ineracts with TCP: If the application is not written so that it will accept traffic from the TCP socket arbitrarily fast, then you easily run into non-idel flow-control at TCP level because you exhaust the TCP socket buffer.
So at least you need to make the TCP socket buffer sufficiently larger than the maximum amount of data you may get from your neighbors so that tcp flow control can operate at its fastest. Of course, when you have a big LAN with
20 downstream routers sending you after a reconvergence PIM joins for the same
100,000 groups, that can be a good amount of buffer memory, but in todays big routers IMHO not an issue anymore. And that will also ensure that even if the TCP implementations used are not idal for incast, that at least they will do retransmissions as faest as possible, because they never need to wait for the app (PIM). Good incast-friendly TCP implementations would of course share buffer across sockets and not require memory based on how many TCP connections you have, but only based on what your aggregate incast bandwidth is.

There are of course optimizations that can be done within PIM itself to save even more memory, but i really don't want to start explaining those details unless i am really persuaded its necesssary. Which i am totally not right now.

So, in summary: I'd rather go for PORT which gives me reliability instead of random burst join loss caused issues, even if vendors then may take one or two releases to optimize convergence speed under high load and/or lots of downstream routers! Its just going to be soo much work reduction for years to come than tinkering on a PIM-datagram specific solution.

Cheers
    Toerless

P.S.: The one past datapoint of interest you did not mention:

When we designed mLDP as part of MPLS for multicast in the 200x, we initially looked at Dino's 1998 implementation of MPLS for PIM, which actually was released to customers but only in software routers of course, so very few people had actually looked at it, and decided against it and for LDP also because to a large extend because of TCP.

[ Of course now the use of LDP in mLDP hurts given how customers want to get rid
  of LDP and think that mLDP must also go because it's LDP. If you jump on
  a protocol as a buzzword train you trive and perish with it i guess, but
  back then if it would have been just PIM over TCP with MPLS labels, it would
  have been less well accepted *sigh* ;-)]


On Sat, Dec 17, 2022 at 02:32:36PM +0000, Jeffrey (Zhaohui) Zhang wrote:
> Hi Toerless,
>
> Some late comments - first specifically on the PIM topic and then extend to the general point of congestion aware routing protocols.
>
> The TCP-based PIM protocol RFC6559 was designed to handle the congestion-on-scale problem. However, most PIM deployments have not come to the point where scaling become a acute problem where RFC6559 solution must be used, so its deployment has been limited.
>
> The congestion-on-scale point was also taken when BGP-MVPN (RFC 6514) was developed. The Rosen/PIM-MVPN was very popular and there was a big debate when BGP-MVPN was proposed. Good that it eventually got standardized and became mainstream (at least for new deployments).
>
> Someone already brought up a point of BGP updates being potentially slow. I've also heard about that many times (sometimes from known BGP experts), including when I work on BGP based multicast (beyond RFC 6559).
>
> However, there are also protocols that rely on fast convergence even though they use BGP. EVPN is one example.
>
> Then mobile network's control plane relies on UDP-based GTP-C. I wonder why they're not concerned with congestion in scaled situations.
>
> For some 5G use cases I was proposing to use BGP to propagate routing information in place of some mobile user session information, and I often get asked "can you do that very fast"?
>
> So, I am struggling with these two things:
>
> - TCP-based solutions reduce protocol messages, but BGP may be deemed slow (or should I say with uncontrolled delay), though BGP-based EVPN actually relies on fast exchange of (at least some) BGP routes (e.g., for DF election).
> - Other solutions may lead to lots of protocol messages including refreshes, but the mobile operators seem to have been fine with UDP-based control plane.
>
> As for the "a totally non-congestion aware sending of protocol packets should not be permitted anymore for new RFC IMHO and i am just baffled how this is permitted anymore by the IETF. Where is adult supervision by TSV when we need it" comment below, I have the following view:
>
> - I am not sure if this involves TSV. A protocol sending lots of protocol packets is no different from an application sending lots of application traffic as far as transport is concerned. It is ultimately an issue with protocol design itself.
> - There are situations where a non-TCP based solution is needed even when a parallel TCP-based option is also present, so we can not simply disallow the former. We can discuss examples separately (one example is actually PIM as BIER overlay vs mLDP/BGP as BIER overlay).
>
> Thanks.
>
> Jeffrey
>
>
> Juniper Business Use Only
>
> -----Original Message-----
> From: pim <pim-bounces@ietf.org> On Behalf Of Toerless Eckert
> Sent: Friday, December 9, 2022 8:47 AM
> To: Jon Crowcroft <Jon.Crowcroft@cl.cam.ac.uk>
> Cc: BIER WG <bier@ietf.org>; routing-discussion@ietf.org; Matt Mathis 
> <mattmathis=40google.com@dmarc.ietf.org>; tsv-area@ietf.org; Stewart 
> Bryant <stewart.bryant@gmail.com>; pim <pim@ietf.org>
> Subject: Re: [pim] Q on the congestion awareness of routing protocols
>
> [External Email. Be cautious of content]
>
>
> On Tue, Dec 06, 2022 at 07:15:31AM +0000, Jon Crowcroft wrote:
> > path exploration? but consider the shadow pricing...
> >
> > the tradeoff between convergence rate and congestion control seems 
> > to be something that ought to be put on a more systematic grounding
>
> You folks are all thinking way beyond the point i was making and looking for support:
>
> In PIM, we have potentially gigantic burst of datagrams without any 
> specification of pacing sent to routers across a network core (with 
> easily likelyhood of path congestion). Such a totally non-congestion 
> aware sending of protocol packets should not be permitted anymore for 
> new RFC IMHO and i am just baffled how this is permitted anymore by 
> the IETF. Where is adult supervision by TSV when we need it ;-)
>
> Yes, the incast issue is an interesting aspect, but i have not seen good simulations whether / to-what-extend it would happen in the PIM/BGP cases, but i would bet any sum, that a TCP solution, as bad as it may be will outperform the no-congestion-control periodic burst solution of (datagram) PIM.
>
> Cheers
>     Toerless
>
> _______________________________________________
> pim mailing list
> pim@ietf.org
> https://urldefense.com/v3/__https://www.ietf.org/mailman/listinfo/pim_
> _;!!NEt6yMaO-gk!ASlubqGLmV8O43aB2Lcffy5JQ7FN49DnrotemtmPtVIat4Zubv-4Dn
> JEjmh7o_4QoUn9BRIsoiEJuQ$
>

--
---
tte@cs.fau.de