Re: [v6ops] draft-cc-v6ops-wlcg-flow-label-marking-00.txt

Brian E Carpenter <brian.e.carpenter@gmail.com> Fri, 30 June 2023 23:22 UTC

Return-Path: <brian.e.carpenter@gmail.com>
X-Original-To: v6ops@ietfa.amsl.com
Delivered-To: v6ops@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 177EEC14CE2B for <v6ops@ietfa.amsl.com>; Fri, 30 Jun 2023 16:22:46 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, NICE_REPLY_A=-0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2-XHndz4e3ce for <v6ops@ietfa.amsl.com>; Fri, 30 Jun 2023 16:22:42 -0700 (PDT)
Received: from mail-oa1-x2c.google.com (mail-oa1-x2c.google.com [IPv6:2001:4860:4864:20::2c]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 50CECC15256B for <v6ops@ietf.org>; Fri, 30 Jun 2023 16:22:42 -0700 (PDT)
Received: by mail-oa1-x2c.google.com with SMTP id 586e51a60fabf-1b01b43577fso2146273fac.0 for <v6ops@ietf.org>; Fri, 30 Jun 2023 16:22:42 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20221208; t=1688167361; x=1690759361; h=content-transfer-encoding:in-reply-to:content-language:references :cc:to:subject:from:user-agent:mime-version:date:message-id:sender :from:to:cc:subject:date:message-id:reply-to; bh=3su1r0IsOcTmIISeYU0QG6tdjPPBHaYg6EMVtph81Tk=; b=NQrxUL2AzFrWhndbQPb+U3gJTEMLgGvEnuXVFAkw2qqYHLH3OvLAniQMxpg3eHM8Pl KrZAvwuXrf6fBCIox/cYKvB9SB5xHGkQVZliLNyAy0gUAZvzUOxbdX8yd78rFhJh0324 eaJWulVn2R5l4u/X2L8cHH7z8JpQUgN+s5Lz9XwASBISJ9DcTVeznib0nXt5b8ghaqPx SqdixZyPHptPR8T7kPqGjAD9pB7ggyPfVz/pThKHdtFNNTIH65tpX1vTSOJ/jypTm7SK MIO//Wz3lqBf99dxMBjZyf+thUWCMfFJ2NOayiDU7gHsX/uarKIy8opad8Hy3gjiBUrM NJoQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1688167361; x=1690759361; h=content-transfer-encoding:in-reply-to:content-language:references :cc:to:subject:from:user-agent:mime-version:date:message-id:sender :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=3su1r0IsOcTmIISeYU0QG6tdjPPBHaYg6EMVtph81Tk=; b=RrmVOWZ7iYpGWWgBkv7jTq7E1Vi2exgrSytAAnOBUFMlZBvkp1BrnDt8Iqn8xDh9dB 9FPWkOEjmdUP2FCHbZyuNCZX+VjtbKp8CVaAc4ufxesQAj3q+/ubllebScZilTJRr/yC FXYYF8XPTb+3BNMIaJJJo1VaXWK14dKe3+dsuFteUflP2SOg/ducukCthSxDiMFPq+sp Y7NZ25o3tZs2C2lMYgH+lNszdOVMHGITBMDHWCMKHzTalIWtM6FHGI4Dm2nygcatvzOs dYNyALGSzNZyOb601LYeiZD7mnv5ugbYom2xJ/xLa0xB7w5lEyqNOqRtn3RKaF02MEfi i1TA==
X-Gm-Message-State: AC+VfDzJHT0iCaqy7wHnQVCGNmvSt/KnH3zZLMvzZojn2Va1f3EOTTXu uYL66PhqE2OM94PrP4J7gc0NfjflO7E=
X-Google-Smtp-Source: APBJJlGY54VQdgcJ923d6oOVeO8kpAmqv6QVpyXssPCJ7GZc/EVCTjDZHQhiGefx4oLi5F8QynRdYQ==
X-Received: by 2002:a05:6870:b14c:b0:1b0:1c08:857f with SMTP id a12-20020a056870b14c00b001b01c08857fmr5225679oal.49.1688167361424; Fri, 30 Jun 2023 16:22:41 -0700 (PDT)
Received: from ?IPV6:2406:e003:1184:f001:9991:d1ad:8c20:42bd? ([2406:e003:1184:f001:9991:d1ad:8c20:42bd]) by smtp.gmail.com with ESMTPSA id co21-20020a17090afe9500b00262dbf8648esm10006716pjb.34.2023.06.30.16.22.39 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 30 Jun 2023 16:22:40 -0700 (PDT)
Sender: Brian Carpenter <becarpenter46@gmail.com>
Message-ID: <a18182e1-dcf5-e943-128c-df2a7f44668d@gmail.com>
Date: Sat, 01 Jul 2023 11:22:35 +1200
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.10.0
From: Brian E Carpenter <brian.e.carpenter@gmail.com>
To: Tim Chown <tjc.ietf@gmail.com>
Cc: "Dale W. Carder" <dwcarder@es.net>, v6ops <v6ops@ietf.org>, Shawn McKee <smckee@umich.edu>
References: <ZJ3g8L16euGgDLuI@dwc-desktop.local> <9e2081c4-896f-9a7b-71e3-5a88457d4659@gmail.com> <1237C505-BE2F-4025-A29E-25B1E4B7C8E5@gmail.com>
Content-Language: en-US
In-Reply-To: <1237C505-BE2F-4025-A29E-25B1E4B7C8E5@gmail.com>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: base64
Archived-At: <https://mailarchive.ietf.org/arch/msg/v6ops/Axagh1PB5lXuHDp9L0pFqXvigBE>
Subject: Re: [v6ops] draft-cc-v6ops-wlcg-flow-label-marking-00.txt
X-BeenThere: v6ops@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: v6ops discussion list <v6ops.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/v6ops>, <mailto:v6ops-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/v6ops/>
List-Post: <mailto:v6ops@ietf.org>
List-Help: <mailto:v6ops-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/v6ops>, <mailto:v6ops-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 30 Jun 2023 23:22:46 -0000

On 30-Jun-23 21:02, Tim Chown wrote:
> Hi Brian,
> 
>> On 29 Jun 2023, at 23:50, Brian E Carpenter <brian.e.carpenter@gmail.com> wrote:
>>
>> Hi Dale,
>>
>> Thanks for documenting this.
>>
>> As well as the impact on ECMP/LAG (where you might add a reference to
>> RFC 6438), could you also mention the impact on server load balancing
>> (RFC 7098)? The real issue here is that with only 5 random bits, the
>> probability of two very large flows hashing to the same path or server
>> is higher than we'd like.
> 
> We can add that pointer.
> 
>> But since the WLCG flows are *all* very large,
>> this presumably doesn't matter so much in your use case?
> 
> Well, actually they’re not. Many transfers are aggregates of a high number of lower throughout flows, moreso perhaps that there’s been a shift away from GridFTP to XRootD.

OK. But in that case I think you do need to discuss the impact on ECMP/LAG a bit more deeply. Is ECMP/LAG a factor in WLCG, and if so, is your traffic mixed in with other general purpose traffic that has standard flow labels? Does an effective 1/32 risk of flow label collision significantly affect the traffic distribution between ECMP/LAG paths?

As far as server load balancing goes, I assume that is either absent in WLCG or is explicitly under your control, which means you can decide what matters and what doesn't.

Perhaps there's a bit of background needed here, to explain whether WLCG traffic is on dedicated paths and what the traffic profiles look like. (How many people here know anything about GridFTP or XRootD?) This determines whether non-standard flow label usage is harmful, and whether this technique can be used safely in other use cases.

     Brian

> 
>> I'd never thought about using a Hamming code to generate flow labels.
>> But since a given flow must use the same flow label throughout, you can
>> only use the regular 5-tuple as input to the Hamming code. I don't
>> believe that a Hamming code will give you a discrete uniform distribution,
>> but I'm not an expert in that area.
> 
> This would be good to get some expert feedback on.
> 
> Note that traffic for a single excitement (community) and application will share 15 common bits, so it’s the other 5 bits we would like to see as effectively randomised and distributed as possible.
> 
>> An expert known as ChatGPT says:
>> "No, a Hamming code does not produce a discrete uniform distribution...
>> The non-uniformity of the distribution arises from the fact that Hamming
>> codes prioritize error detection and correction capabilities over
>> uniformity of codeword distribution."
> 
> :)
> 
>> So I think that simply using a stateless hash is the best way. These
>> days I like FNV (draft-eastlake-fnv) which is easy and cheap to implement.
> 
> This is all good discussion, more feedback very welcome.
> 
> We’d like to produce a -01 before the cutoff.
> 
> Thanks,
> Tim
> 
>> Regards
>>    Brian Carpenter
>>
>> On 30-Jun-23 07:52, Dale W. Carder wrote:
>>> Hi folks,
>>> We have submitted this draft documenting our experimental use of the
>>> IPv6 flow label for packet marking and the considerations we had made
>>> along the way that sort of got us to this point.
>>> I'm hoping we could get some initial discussion underway here before
>>> IETF 117.  Thanks!
>>> Dale
>>> ----- Forwarded message from internet-drafts@ietf.org -----
>>>> Date: Thu, 29 Jun 2023 12:18:53 -0700
>>>> From: internet-drafts@ietf.org
>>>> Subject: New Version Notification for draft-cc-v6ops-wlcg-flow-label-marking-00.txt
>>>>
>>>>
>>>> A new version of I-D, draft-cc-v6ops-wlcg-flow-label-marking-00.txt
>>>> has been successfully submitted by Dale W. Carder and posted to the
>>>> IETF repository.
>>>>
>>>> Name:		draft-cc-v6ops-wlcg-flow-label-marking
>>>> Revision:	00
>>>> Title:		Use of the IPv6 Flow Label for WLCG Packet Marking
>>>> Document date:	2023-06-29
>>>> Group:		Individual Submission
>>>> Pages:		15
>>>> URL:            https://www.ietf.org/archive/id/draft-cc-v6ops-wlcg-flow-label-marking-00.txt
>>>> Status:         https://datatracker.ietf.org/doc/draft-cc-v6ops-wlcg-flow-label-marking/
>>>> Html:           https://www.ietf.org/archive/id/draft-cc-v6ops-wlcg-flow-label-marking-00.html
>>>> Htmlized:       https://datatracker.ietf.org/doc/html/draft-cc-v6ops-wlcg-flow-label-marking
>>>>
>>>>
>>>> Abstract:
>>>>     This document describes an experimentally deployed approach currently
>>>>     used within the Worldwide Large Hadron Collider Computing Grid (WLCG)
>>>>     to mark packets with their project (experiment) and application.  The
>>>>     marking uses the 20-bit IPv6 Flow Label in each packet, with 15 bits
>>>>     used for semantics and 5 bits for entropy.  Alternatives, in
>>>>     particular use of IPv6 Extension Headers (EH), were considered but
>>>>     found to not be practical.  The WLCG is one of the largest worldwide
>>>>     research communities and has adopted IPv6 heavily for movement of
>>>>     many tens of PB of data annually, with the ultimate goal of running
>>>>     IPv6 only.
>>>>
>>>>                                                                                    
>>>>
>>>> The IETF Secretariat
>>>>
>>>>
>>> ----- End forwarded message -----
>>> _______________________________________________
>>> v6ops mailing list
>>> v6ops@ietf.org
>>> https://www.ietf.org/mailman/listinfo/v6ops
>>
>> _______________________________________________
>> v6ops mailing list
>> v6ops@ietf.org
>> https://www.ietf.org/mailman/listinfo/v6ops
>