[Dime] Question regarding RFC 6733 on transport end disconnections

Miguel Rodriguez Caudevilla <miguel.rodriguez.caudevilla@ericsson.com> Thu, 18 October 2018 15:57 UTC

Return-Path: <miguel.rodriguez.caudevilla@ericsson.com>
X-Original-To: dime@ietfa.amsl.com
Delivered-To: dime@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id A39521277C8 for <dime@ietfa.amsl.com>; Thu, 18 Oct 2018 08:57:06 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.364
X-Spam-Level:
X-Spam-Status: No, score=-4.364 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.064, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=ericsson.com header.b=CB0uhSRS; dkim=pass (1024-bit key) header.d=ericsson.com header.b=apgRylc+
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id QT21Od-FfBRB for <dime@ietfa.amsl.com>; Thu, 18 Oct 2018 08:57:04 -0700 (PDT)
Received: from sesbmg23.ericsson.net (sesbmg23.ericsson.net [193.180.251.37]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 77ACF12426A for <dime@ietf.org>; Thu, 18 Oct 2018 08:57:03 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; d=ericsson.com; s=mailgw201801; c=relaxed/simple; q=dns/txt; i=@ericsson.com; t=1539878221; x=1542470221; h=From:Sender:Reply-To:Subject:Date:Message-ID:To:Cc:MIME-Version:Content-Type: Content-Transfer-Encoding:Content-ID:Content-Description:Resent-Date:Resent-From: Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:In-Reply-To:References:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=SaSjjxAHs2vm2oj2cbeqkFri0F8IJKckVoeQDPNzVrY=; b=CB0uhSRSBproMG3RJNMN6DOnAy3RQnxhc/K8jli+JOxmy6x0D82bRB5BfhgYABMv zNjliPcN1qlnfpSS2x8Qt/z2aHzknHsFFhkLa8iix5jOYb49VDQWCwql9sBxGZ9V n46JW4o5H/K9s2OUM6iDttPnyXjIR60ZB/K/m1QEb4k=;
X-AuditID: c1b4fb25-a0b8c9e0000018b4-42-5bc8ad4de80c
Received: from ESESSMB501.ericsson.se (Unknown_Domain [153.88.183.119]) by sesbmg23.ericsson.net (Symantec Mail Security) with SMTP id FF.A8.06324.D4DA8CB5; Thu, 18 Oct 2018 17:57:01 +0200 (CEST)
Received: from ESESSMB505.ericsson.se (153.88.183.166) by ESESSMB501.ericsson.se (153.88.183.162) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256_P256) id 15.1.1466.3; Thu, 18 Oct 2018 17:56:53 +0200
Received: from EUR03-AM5-obe.outbound.protection.outlook.com (153.88.183.157) by ESESSMB505.ericsson.se (153.88.183.166) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256_P256) id 15.1.1466.3 via Frontend Transport; Thu, 18 Oct 2018 17:56:53 +0200
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ericsson.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=SaSjjxAHs2vm2oj2cbeqkFri0F8IJKckVoeQDPNzVrY=; b=apgRylc+x05pYRKpUR6rieN4o9V430bRbYdBW028Yx402q4Uqst0gXuiRF5XQ+gLOK01uOzxmIiD1Q035DRFX0sKVixZqHtU7RaO+LnsObO7Xnl6gZkKs86kR1LwkGaOAB/4hKEeB5NO+pHj9jse7NE0e4MYV911IC2ceQk3qUg=
Received: from AM4PR07MB3313.eurprd07.prod.outlook.com (10.171.189.30) by AM4PR07MB3332.eurprd07.prod.outlook.com (10.171.189.141) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.1250.18; Thu, 18 Oct 2018 15:56:53 +0000
Received: from AM4PR07MB3313.eurprd07.prod.outlook.com ([fe80::1889:9cfd:f542:efbd]) by AM4PR07MB3313.eurprd07.prod.outlook.com ([fe80::1889:9cfd:f542:efbd%2]) with mapi id 15.20.1250.019; Thu, 18 Oct 2018 15:56:52 +0000
From: Miguel Rodriguez Caudevilla <miguel.rodriguez.caudevilla@ericsson.com>
To: "dime@ietf.org" <dime@ietf.org>
Thread-Topic: Question regarding RFC 6733 on transport end disconnections
Thread-Index: AdRm+y6jOuN2MdVPSWqcGfypQAFPEw==
Date: Thu, 18 Oct 2018 15:56:52 +0000
Message-ID: <AM4PR07MB33135B773985A79741E8D2CCD1F80@AM4PR07MB3313.eurprd07.prod.outlook.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [195.235.15.200]
x-ms-publictraffictype: Email
x-microsoft-exchange-diagnostics: 1; AM4PR07MB3332; 6:TtjHJVDiU4tiLf/9obuWY9Q5XFeOlug4Duetu8sL/fRrW9F7JgxtVK35A7/PY6kbMHmYmhc+bE8wRUV6XMbszOzwzKDN32bQdAUVhwdbTYTw3ZKuragJNariR2U1Lt/5UphGE43PoI0yVGISqu55iP8NW8NQE7bzKjpOQa/8leHMJTHzq7sibNUT5RGtlRaNxaQBhLUS9eV+Of+BubX+qld5uifDGOzRnDxwcJL0SuDkqG1beKURkd0YKgC+MFJ9I4byOAAQn7y3LtXFNiij3Uau+5HPZt4fwxm4rzgYkKmVgm/HwxEjKroywzLqJsKN7LeXX7JE2CzyxodIQIsdle7ztcivVFHDjA9v8TXLCGnEv0H06TbG6UblP/TuIu+LF1DIKr1MA0QB4e/8c5ppCC7lSXY6UX6EIUlDZ+DOZRuAgQGUem6wMpgmV2e2PM8LvUOdamJHw1ZcKH3ZipXyNg==; 5:eAIDejIRKw94hxC6PMdj6I7hPi+Afs2vagpzP4m+GXclPlwV7ctpf16Vx7E1p43ODR0Lq5g+IYG2ahR9NzqWpvFU6gLZW9+8iObH8R68s0bqGUCsd4OW7Nrjs0p/q+ZB1blobv1BTAIFKU7QTPILrTn4hOnEBG8tkgmqSrJh7dg=; 7:0pjsT5Rth75CGvuaqpa5i67/tAoKyNRpZdN7gTd0P9zRabuK62o02s+O+ZQp+ca39t9uTd4CIgq7RLh3V+D6UORiUdyjdwrcaJ6a/6nOiAh+TXutvUO/Md/3eAHy+SugXRfsGWv+8RLd58sWl5VnE12HR3Tnol5HgScCcDEp1eXZzJDbJ9PUbxw824JtaA21tCkrEp9cBBl55pcd4DpKPIyQz1GT3IZ8NiAjtaK+o2su/KnnoAzq7J38SANpnuzo
x-ms-exchange-antispam-srfa-diagnostics: SOS;
x-ms-office365-filtering-correlation-id: 3f74f2a8-0d64-4cf6-63f5-08d635125286
x-microsoft-antispam: BCL:0; PCL:0; RULEID:(7020095)(4652040)(8989299)(5600074)(711020)(4534185)(4627221)(201703031133081)(201702281549075)(8990200)(2017052603328)(7153060)(7193020); SRVR:AM4PR07MB3332;
x-ms-traffictypediagnostic: AM4PR07MB3332:
authentication-results: spf=none (sender IP is ) smtp.mailfrom=miguel.rodriguez.caudevilla@ericsson.com;
x-microsoft-antispam-prvs: <AM4PR07MB3332686FAEDF02A19A4FAD52D1F80@AM4PR07MB3332.eurprd07.prod.outlook.com>
x-exchange-antispam-report-test: UriScan:(21532816269658)(21748063052155)(28532068793085)(190501279198761)(227612066756510);
x-ms-exchange-senderadcheck: 1
x-exchange-antispam-report-cfa-test: BCL:0; PCL:0; RULEID:(8211001083)(6040522)(2401047)(5005006)(8121501046)(93006095)(93001095)(10201501046)(3231355)(944501410)(52105095)(3002001)(149066)(150057)(6041310)(20161123562045)(20161123564045)(201703131423095)(201702281528075)(20161123555045)(201703061421075)(201703061406153)(20161123558120)(20161123560045)(201708071742011)(7699051)(76991095); SRVR:AM4PR07MB3332; BCL:0; PCL:0; RULEID:; SRVR:AM4PR07MB3332;
x-forefront-prvs: 08296C9B35
x-forefront-antispam-report: SFV:NSPM; SFS:(10009020)(136003)(396003)(366004)(39860400002)(346002)(376002)(51874003)(189003)(199004)(71190400001)(8936002)(2351001)(66066001)(86362001)(25786009)(2501003)(5630700001)(97736004)(6436002)(55016002)(6306002)(6916009)(5250100002)(54896002)(2900100001)(5640700003)(53936002)(5660300001)(236005)(105586002)(9686003)(106356001)(6506007)(26005)(7696005)(478600001)(68736007)(33656002)(2906002)(102836004)(74316002)(256004)(99286004)(476003)(790700001)(8676002)(81166006)(316002)(81156014)(1730700003)(14454004)(6116002)(3846002)(966005)(486006)(71200400001)(186003)(606006)(14444005)(7736002); DIR:OUT; SFP:1101; SCL:1; SRVR:AM4PR07MB3332; H:AM4PR07MB3313.eurprd07.prod.outlook.com; FPR:; SPF:None; LANG:en; PTR:InfoNoRecords; MX:1; A:1;
received-spf: None (protection.outlook.com: ericsson.com does not designate permitted sender hosts)
x-microsoft-antispam-message-info: lZ/NaKAghkvQhnWcNqHEayny2Grck+D6SJDf0mJo8oa0HntZX3CRupglujn1W/PiMrtRvDef/NyoTkfX1sfGAu33F4SxcaqbjAFQGos5YD/C0RlffUFrTSNQ1X2+2/VtJUNG0uKEy5BR/zLn08GL+OEx1Vh3pwbyz7uQ1BDlrN5oT/fTZLHouz3yMiatPvUvgNlGmnJ4rSSofTUAzoXW3t38wEMRISYUT+HJBG7Yq+hn9zv3eAfzxDGr87imxNhNjgInr0uLy8c9aFSjbTAqSts3vPs7LFvcGKXkyczkg21LqIx34AzRA2Z3gLihNbQB11yT3bJ5cKcrTomAiODY1CeXVwd/BzdJRoRCpbCKrqY=
spamdiagnosticoutput: 1:99
spamdiagnosticmetadata: NSPM
Content-Type: multipart/alternative; boundary="_000_AM4PR07MB33135B773985A79741E8D2CCD1F80AM4PR07MB3313eurp_"
MIME-Version: 1.0
X-MS-Exchange-CrossTenant-Network-Message-Id: 3f74f2a8-0d64-4cf6-63f5-08d635125286
X-MS-Exchange-CrossTenant-originalarrivaltime: 18 Oct 2018 15:56:52.8476 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 92e84ceb-fbfd-47ab-be52-080c6b87953f
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM4PR07MB3332
X-OriginatorOrg: ericsson.com
X-Brightmail-Tracker: H4sIAAAAAAAAA02Sa0hTYRjHeXfOzo7TxdvSfDBntDTKmnYRjDAxSihJsC8RXsiVB13OKTtm WhkqTLynNEHHbEb7UCtT1DAlzTR1KWboB6VCnJqXvGAKiprZjmeB337P///nuby8NCF9KvSg VZpURqtRquWUmKy80XRPEV5jjTr5st/zbFXxCyoEXTab1wURKFIcFMeoVWmM1j84Vpxg0Vmp FGNm+sYIk4XmMwqQEw04AMqeW0UcS3EXghlDWAES23kVwY/eMsQXZgF0fzFQXEHiUgKGdaMU 7+gFULioc8TGEdhqsimuGYWvwqB+iyhANO2KD0PfRggn78OX4M1Y3k7EFV+BtxXTQp79YLTd gDgmsQ+sbi8LOJbgaGhtGSc5Rng/rPW+3tEJ7A7fJk0C/gYM5vcDBM9uMDvxV8iNBSyHmdoY XpbBoKlwZ03AbSLIzn/lyCtgqbzcweHQN5HrCPUgWGlfdwzwheac74hfIhEWZh6TvJ4I+sUK xPNDsA3mUDx7gaXYRvKNPhAwW9flaOQJX+unHcaWEGYrBlApUhh2XcRzMhTlGUjDzgvshc+V k3am7foxqG3x5yOHQF9oE/F8FHTGKtFuvRqJLMiNZdhbSfGnz/gxWtVtlk3W+GmY1Hpk/zUf Gzd93qGh+QsdCNNI7iKZqrRGSYXKNDYjqQMBTchdJb+MdkkSp8y4z2iTb2rvqhm2Ax2gSbm7 xBbYECnF8cpUJpFhUhjtf1dAO3lkIW9n2VSjrA5yLThTnnyu5EhrU3DYg2jCvby7LLowoD+m tLMk4tNFyU/ViYa5ZzXnTb8n51L1x9fCtumKPZhRWNpcZaELjw76UdWWns7myPprI/PD6S0r Zq876kD1n7xagXdfUXioKXPJOZZNdwm83v7EOJEftDw2tzk0O+qtk5NsgvKUL6Fllf8AXx2s SDEDAAA=
Archived-At: <https://mailarchive.ietf.org/arch/msg/dime/ptA9UL1Dok6e5ysBLvRg4-0C3Pc>
X-Mailman-Approved-At: Mon, 22 Oct 2018 01:47:53 -0700
Subject: [Dime] Question regarding RFC 6733 on transport end disconnections
X-BeenThere: dime@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Diameter Maintanence and Extentions Working Group <dime.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/dime>, <mailto:dime-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/dime/>
List-Post: <mailto:dime@ietf.org>
List-Help: <mailto:dime-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/dime>, <mailto:dime-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 19 Oct 2018 06:50:07 -0000

Hello,

We are working in a product which uses an implementation of a diameter stack based on RFC 6733.
We recently had a discussions with the developers of the stack and after some thoughts and post analysis of the discussion, we personally think this could be due to an open interpretation of the RFC regarding DPR messages and how a stack implementation should deal with transport end disconnections. Let me try to explain the reason of the discussion.

We have a cluster wide distributed stack. The diameter instances in the cluster can be administratively be locked, which in consequence ends a transport end connection with a peer.

From https://tools.ietf.org/html/rfc6733#section-5.4 we interpret that the DPR/DPA flow before transport end disconnection is as follows (Simplified flow):

When a diameter instance disconnects one of its transport connections it must be notified to the peer with a DPR.
A DPA will be sent by the peer and the diameter instance initiates the transport disconnect.
The peer will receive the transport closure request and close its transport end.

What we argue is that what the peer does after this flow depends completely in the Disconnection-Reason AVP received in the DPR.

In our case, when the lock is performed we expected that the diameter stack would send a DPR with Disconnect-Cause REBOOTING, to inform the peer that the transport end connection is going to be closed and that the peer can immediately retry to connect as there are other diameter instances available to provide service in the same cluster.

The stack implementers have a different argument. The administrative lock of a single diameter instance, they consider that it is performed in a way that all the services on cluster level are available. So, it would be quite strange to indicate the peer disconnect-case REBOOTING for the loss of a single node considering we can still provide diameter service through all the other nodes in the cluster. So, the loss of a node (and connections handled by it) is a transient fault that is easily handled if the peer reconnects. They consider that this way the peer reconnects as quickly as possible in such cases. That the stack can achieve this according to the standard as well if it does not send any DPR with connection loss reasons.

They point to https://tools.ietf.org/html/rfc6733#section-2.1

A given Diameter instance of the peer state machine MUST NOT use more than one transport connection to communicate with a given peer, unless multiple instances exist on the peer, in which, case a separate connection per process is allowed.

And consider that a diameter disconnect cause (eg: REBOOTING or whatever else) sent by a diameter node to a peer actually reflects the overall state of the sending node. Since we have a cluster wide distributed implementation it would be not correct to send a disconnect case REBOOTING as that indicates the whole node is going to be rebooted. We just perform some node level operations while still providing full service to diameter peers.

We would like to have your point of view regarding this discussion.
Is this intentionally left open for the stack implementation?
Did we interpret something wrong?

Thanks in advance.

Best Regards,
Miguel Rodriguez Caudevilla