Re: [iccrg] Congestion Control in Data Centers and HPC/RDMA environments

Paul Congdon <paul.congdon@tallac.com> Mon, 21 October 2019 15:11 UTC

Return-Path: <paul.congdon@tallac.com>
X-Original-To: iccrg@ietfa.amsl.com
Delivered-To: iccrg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 4FA351200E7 for <iccrg@ietfa.amsl.com>; Mon, 21 Oct 2019 08:11:42 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.119
X-Spam-Level:
X-Spam-Status: No, score=-1.119 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_NEUTRAL=0.779] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=tallac-com.20150623.gappssmtp.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ueV_3mZ74JsK for <iccrg@ietfa.amsl.com>; Mon, 21 Oct 2019 08:11:40 -0700 (PDT)
Received: from mail-oi1-x236.google.com (mail-oi1-x236.google.com [IPv6:2607:f8b0:4864:20::236]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CB134120123 for <iccrg@irtf.org>; Mon, 21 Oct 2019 08:11:39 -0700 (PDT)
Received: by mail-oi1-x236.google.com with SMTP id g81so11324199oib.8 for <iccrg@irtf.org>; Mon, 21 Oct 2019 08:11:39 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=tallac-com.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=o+1/IVQHw/zsfg/ubc7jzj3cAhAasqejzy/QpLUpN8g=; b=Lq6kMgfMLwsMBdOrrmOLmE6vKk7QE1NaEFFSvgN8DKOOY7gwJ8LL7BkfGNtQyjM7W/ O6voeDaPMACI1/WrH1vD3gABXYLHh9TDv6jdQDbE+ViQVC55hPYaWglkZN1IPfi5aVjH g2txcoXEsf1ZjTg91Y8/fQ9JVvko/UYUGn8D+7ORNCoWsB7xlfZ4XmI85NT/GsDrrr/z y6L6b+dI0MaMP2JV8Wv1R+4wftNEw7K4BjonNqj0/XAfE0t99QKS1wtEUAXpQkUdz88U j+k3EUxF5kyOvTBDurVssi/ye7NnBLg8JUHqWm8czcVNaR3RLCpWVLSLtNN7Sa+u/Fxx IEwQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=o+1/IVQHw/zsfg/ubc7jzj3cAhAasqejzy/QpLUpN8g=; b=ZCtKTaztAEKAAPbOLXuU4HK3JRzgP8phNH56mcWlv/PqAVvIm4MwpDF/LgJVzgpxZt xxG5jIHZARtzPsBg0s+R48XiUNOavareY+Lcut57JR8cZHTfLpRvrdfVDf63tNgFGV92 6tp4e0tqyYWuoiS71wkuWIqqMnib4+9N8qzn7U3y2jNw7zV1RTMGJtoNhhRkEvcwLvOt Q4PkuGyiHLN6Ty3AOvdH498NFZHeYFRwhFDC0LzYwWJOEWnLUASpY9arUMdPxFeveBvg 3WFAj6JhakloGUBrSQ8/pHVQxlHORrMAjNvASmW0Vcqfa9wrKOw082vDwfeWn5oiE/jg TS+Q==
X-Gm-Message-State: APjAAAWsALPs5vu1lzKwy6rLXQXV31WrRwVyX1ZSaFgOS1VS0Veet8EQ hKU7Ov8zIPtjZ0suBLixvtZM9PWLX4JCKVgA2dq1Rwyf37ObVA==
X-Google-Smtp-Source: APXvYqyPmebGCF4spHTDk2zafUUGyZOm9lhoxJeI2sYkUeJYj6mhRqANBWrQJpPYGqUP6QUrM0P6YeRmWDKOW4YBfGk=
X-Received: by 2002:aca:318c:: with SMTP id x134mr18722670oix.41.1571670698885; Mon, 21 Oct 2019 08:11:38 -0700 (PDT)
MIME-Version: 1.0
References: <CAAMqZPuL-GdKnoEaM7A4yNKb5uxgFvr583byT2HdAxsfPDcu4g@mail.gmail.com> <7D0ABA58-F56C-4FFE-BCCF-D7CC28CD6DCF@ifi.uio.no>
In-Reply-To: <7D0ABA58-F56C-4FFE-BCCF-D7CC28CD6DCF@ifi.uio.no>
From: Paul Congdon <paul.congdon@tallac.com>
Date: Mon, 21 Oct 2019 08:11:28 -0700
Message-ID: <CAAMqZPs8P3wjFfbyomCOC=mU7o0PfcLOO1zDXr4E01HPtmCZcg@mail.gmail.com>
To: Michael Welzl <michawe@ifi.uio.no>
Cc: iccrg IRTF list <iccrg@irtf.org>
Content-Type: multipart/alternative; boundary="000000000000b62d0805956d1962"
Archived-At: <https://mailarchive.ietf.org/arch/msg/iccrg/UEF2c3M6lO_i-zopBydUD3MaBlA>
Subject: Re: [iccrg] Congestion Control in Data Centers and HPC/RDMA environments
X-BeenThere: iccrg@irtf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Discussions of Internet Congestion Control Research Group \(ICCRG\)" <iccrg.irtf.org>
List-Unsubscribe: <https://www.irtf.org/mailman/options/iccrg>, <mailto:iccrg-request@irtf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/iccrg/>
List-Post: <mailto:iccrg@irtf.org>
List-Help: <mailto:iccrg-request@irtf.org?subject=help>
List-Subscribe: <https://www.irtf.org/mailman/listinfo/iccrg>, <mailto:iccrg-request@irtf.org?subject=subscribe>
X-List-Received-Date: Mon, 21 Oct 2019 15:11:43 -0000

Yes, you are correct.

"Because congestion control is typically a key function of transport
protocols, it is hard to separate other transport layer issues from
congestion control. Considerations for deployment of transport protocols
are therefore also considered within scope. The ICCRG may also consider
congestion and protocol performance problems in general IP networks, i.e.,
not only on the global Internet. One example of such IP networks are
multi-tenant, heterogeneous datacenters, which in some sense represent a
microcosm of the larger Internet; changes to the transport protocols being
considered in that space are fundamental and can have broader implications."

One question that came up in discussions with others is can the solution be
restricted to particular topologies and domains, such as datacenters, if
the solution is not deemed 'safe' to run across the Internet at large?

Sounds like such research is in scope from the above paragraph.


On Thu, Oct 17, 2019 at 10:31 PM Michael Welzl <michawe@ifi.uio.no> wrote:

> Hi,
>
> What about adding a statement like the following to the charter?
>
> ***
> The ICCRG may also consider congestion and protocol performance problems
> in general IP networks, i.e., not only on the global Internet. One example
> of such IP networks are multi-tenant, heterogeneous datacenters, which in
> some sense represent a microcosm of the larger Internet; changes to the
> transport protocols being considered in that space are fundamental and can
> have broader implications.
> ***
>
> (turns out that it’s already there:
> https://datatracker.ietf.org/rg/iccrg/about/ )
>
> Cheers,
> Michael
>
>
> On Oct 17, 2019, at 7:54 PM, Paul Congdon <paul.congdon@tallac.com> wrote:
>
> Hello ICCRG,
>
> Jana suggested that I email the group with this call for interest on
> Congestion Control for Data Centers.  At the last several IETF meetings
> I've hosted side meetings and presented to various groups about work going
> on in Data Center congestion control and scaling HPC/RDMA networks.  We
> have been searching for a permanent venue within the IETF for this work.
>  ICCRG is primarily focused on end-to-end congestion control across the
> vast and expansive Internet.   The data center, and in particular, the
> HPC/RDMA data center network has much different characteristics.   So, the
> question to ICCRG is this; is this topic of interest to ICCRG and should
> the charter be expanded to include DC congestion control, or is this a good
> topic for a new research item in IRTF?
>
> Looking forward to hearing from you,
> Thank you,
> Paul Congdon
>
> _______________________________________________
> iccrg mailing list
> iccrg@irtf.org
> https://www.irtf.org/mailman/listinfo/iccrg
>
>
> _______________________________________________
> iccrg mailing list
> iccrg@irtf.org
> https://www.irtf.org/mailman/listinfo/iccrg
>