Re: [tsvwg] Switch testing at 25G with ECN --SCE Draft

Gorry Fairhurst <gorry@erg.abdn.ac.uk> Tue, 13 August 2019 07:38 UTC

Return-Path: <gorry@erg.abdn.ac.uk>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 052BA1200B1 for <tsvwg@ietfa.amsl.com>; Tue, 13 Aug 2019 00:38:05 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.898
X-Spam-Level:
X-Spam-Status: No, score=-1.898 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_NONE=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id SVf7Xfnv-jOY for <tsvwg@ietfa.amsl.com>; Tue, 13 Aug 2019 00:38:02 -0700 (PDT)
Received: from pegasus.erg.abdn.ac.uk (pegasus.erg.abdn.ac.uk [IPv6:2001:630:42:150::2]) by ietfa.amsl.com (Postfix) with ESMTP id 0468A120091 for <tsvwg@ietf.org>; Tue, 13 Aug 2019 00:38:01 -0700 (PDT)
Received: from MacBook-Pro-5.local (fgrpf.plus.com [212.159.18.54]) by pegasus.erg.abdn.ac.uk (Postfix) with ESMTPSA id DAE031B0015C; Tue, 13 Aug 2019 08:37:53 +0100 (BST)
Message-ID: <5D5268D1.6020003@erg.abdn.ac.uk>
Date: Tue, 13 Aug 2019 08:37:53 +0100
From: Gorry Fairhurst <gorry@erg.abdn.ac.uk>
Reply-To: gorry@erg.abdn.ac.uk
Organization: University of Aberdeen
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:12.0) Gecko/20120428 Thunderbird/12.0.1
MIME-Version: 1.0
To: "Scaglione, Giuseppe" <giuseppe.scaglione@hpe.com>
CC: Greg White <g.white@CableLabs.com>, Jonathan Morton <chromatix99@gmail.com>, "tsvwg@ietf.org" <tsvwg@ietf.org>
References: <A8E3F5E9-443D-4F5A-9336-9A0E2E72C278@cablelabs.com> <201908082333.x78NXS0T094756@gndrsh.dnsmgr.net> <AT5PR8401MB07070C672C9F519C05D2D3F599D60@AT5PR8401MB0707.NAMPRD84.PROD.OUTLOOK.COM> <81354E22-777C-4BDB-96E0-0B1F6C1DCCD2@cablelabs.com> <AT5PR8401MB070700703FBF2318808238CF99D60@AT5PR8401MB0707.NAMPRD84.PROD.OUTLOOK.COM> <C9E9A1C7-59CF-4414-82CE-14ABFE74ADB2@gmail.com> <ADCEE0A5-6A32-4106-8557-029C65B2D4C5@cablelabs.com> <AT5PR8401MB07074B665CF7612FAD66E13899D60@AT5PR8401MB0707.NAMPRD84.PROD.OUTLOOK.COM> <6D40C855-EE12-4148-9EE6-E67DE8ADC715@cablelabs.com> <AT5PR8401MB0707095986FAEB5F212F7B5199D30@AT5PR8401MB0707.NAMPRD84.PROD.OUTLOOK.COM>
In-Reply-To: <AT5PR8401MB0707095986FAEB5F212F7B5199D30@AT5PR8401MB0707.NAMPRD84.PROD.OUTLOOK.COM>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 8bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/IEfCY4oVI4LiuKFjuo1bsud5bzU>
Subject: Re: [tsvwg] Switch testing at 25G with ECN --SCE Draft
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 13 Aug 2019 07:38:05 -0000

It seems you are comparing with RFC3168, but have you looked also at the 
updated response in RFC8511. If you are using shallow marking, this 
would seem to be giving you the differentiation between buffer over flow 
(loss) and queue building (CE) that you mention?

Gorry

On 12/08/2019, 16:28, Scaglione, Giuseppe wrote:
> Hi Greg,
>
> My humble opinion based on the testing presented is that with SCE remarking purely function of the queue depth, the TCP stack has a notion of congestion present in the TCP connection that is independent of the buffering capability of the switch queue. The TCP stack is seeing a "5%" of packet remarked for example, which means somewhere in the network some queue is seeing "some" oversubscription period.
>
> While with CE RFC3168 you know that remarking typically only starts after a X% of queue depth (user configured). And that X is function of the queue depth and memory available, and if the X is not 'tuned' you either get not optimal link utilization or packet drops.
>
> Basically, it seems like having an 'early warning' marking makes the SW implementation of the TCP algo easier and more predictable, instead of relying on a switch configuration of X.
>
> Yes -- more testing is required.
>
> Best Regards,
> Giuseppe
>
> -----Original Message-----
> From: Greg White [mailto:g.white@CableLabs.com]
> Sent: Friday, August 9, 2019 2:54 PM
> To: Scaglione, Giuseppe<giuseppe.scaglione@hpe.com>; Jonathan Morton<chromatix99@gmail.com>
> Cc: rgrimes@freebsd.org; tsvwg@ietf.org
> Subject: Re: [tsvwg] Switch testing at 25G with ECN --SCE Draft
>
> Agreed.  I would not expect any different result either, which begs the question why does SCE need two different signals (ECT(1) and CE) in a datacenter environment?
>
> -Greg
>
>
>
> On 8/9/19, 3:43 PM, "Scaglione, Giuseppe"<giuseppe.scaglione@hpe.com>  wrote:
>
>      Greg,
>
>      We are working -- and in beta -- with having the switch natively set SCE bits instead of CE and removing the iptable configuration on the target server.  Yet, I do not expect any different result since the TCP-SCE stack would react the same.
>
>      Regards,
>      Giuseppe Scaglione
>
>
>      -----Original Message-----
>      From: Greg White [mailto:g.white@CableLabs.com]
>      Sent: Friday, August 9, 2019 2:36 PM
>      To: Jonathan Morton<chromatix99@gmail.com>; Scaglione, Giuseppe<giuseppe.scaglione@hpe.com>
>      Cc: rgrimes@freebsd.org; tsvwg@ietf.org
>      Subject: Re: [tsvwg] Switch testing at 25G with ECN --SCE Draft
>
>      Right.  Per the SCE method, the switch would mark ECT(1) using a ramp, and then when the ramp would exceed 100% marking, it would change to using CE.
>
>      The implementation discussed here marks CE using a ramp, and when the ramp would exceed 100% marking, it changes to packet drop.  This, as you said, is simply RFC3168 ECN marking, not SCE ECN marking.
>
>      I was just trying to set the record straight.  There was a claim made that a switch vendor had implemented SCE-style packet marking in hardware at 25Gbps, which wasn't accurate.
>
>      -Greg
>
>
>
>      On 8/9/19, 2:58 PM, "Jonathan Morton"<chromatix99@gmail.com>  wrote:
>
>          >  On 9 Aug, 2019, at 11:44 pm, Scaglione, Giuseppe<giuseppe.scaglione@hpe.com>  wrote:
>          >
>          >>>  Just to be super clear, this isn't a hardware implementation of SCE running at 25Gbps.
>          >
>          >  I am not sure I follow. The Test setup section of the paper clearly describes the hardware used -- severs, switch, cables.
>          >  What I cannot disclose at this point is the exact model and characteristics of the HPE Aruba Switch used. Yet, it is a "real" Ethernet Switch, providing 25Gbps connectivity, configured to do bridging across the four ports and implementing RFC3168 with the ECN remarking configuration described in my previous email and on the paper.
>
>          I think the issue is with the way the switch itself marks with CE, not ECT(1).  It's a limitation I think is worth acknowledging and, hopefully, finding a way to remove.
>
>           - Jonathan Morton
>
>
>