[Anima] ANIMA when there is a system-wide issue

Brian E Carpenter <brian.e.carpenter@gmail.com> Mon, 30 November 2020 22:02 UTC

Return-Path: <brian.e.carpenter@gmail.com>
X-Original-To: anima@ietfa.amsl.com
Delivered-To: anima@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 43A833A0B12 for <anima@ietfa.amsl.com>; Mon, 30 Nov 2020 14:02:36 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.098
X-Spam-Level:
X-Spam-Status: No, score=-2.098 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id d1AlZOAIEPEX for <anima@ietfa.amsl.com>; Mon, 30 Nov 2020 14:02:33 -0800 (PST)
Received: from mail-pl1-x62f.google.com (mail-pl1-x62f.google.com [IPv6:2607:f8b0:4864:20::62f]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id ACBA73A0B8A for <anima@ietf.org>; Mon, 30 Nov 2020 14:02:33 -0800 (PST)
Received: by mail-pl1-x62f.google.com with SMTP id x4so5540171pln.8 for <anima@ietf.org>; Mon, 30 Nov 2020 14:02:33 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:from:organization:to:message-id:date:user-agent :mime-version:content-language:content-transfer-encoding; bh=0iFAE4ei8QB1ORT37M194Q4O5e35VYMtDXfsvgRzNwA=; b=CClidWzR7fPU1nG48q62mi6HZ2xRgaaOt0g1mieQQiTwDiyF+zOdop5L03B3tZgach mXJasuzxorgH7ZQN8J7FzFpgr7HepJ102SzB900c9qeVy//Y3iZ4Ab6INOJQHRF9SLQ5 e8G8dRQd2Jrg9ipKfbio2w41CCu1+JpQb9bmvnUpDjwKLCXxwaIpjwXiy5DZAQIHOPfx XoAhfDXvTHTsCKcfRf+0dmHRd4FCpxZyQeVqnZRZPsMZNh2q4WdLqcUttTAsaXELUsnF +wJwlj+fNHBjkaXi8iThlHF9cPJK1k2X4nxrEfQD0IFVVg4m9atRJBSfrtIQZPlcDOMf 6WyA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:from:organization:to:message-id:date :user-agent:mime-version:content-language:content-transfer-encoding; bh=0iFAE4ei8QB1ORT37M194Q4O5e35VYMtDXfsvgRzNwA=; b=rX0ePa62HluEnKW3wXXQN56Nyxqrng27ewNtm2eDdKyh/zW3snzSdH3tOthIyn8/Cz tzqfh5gQ3ndXmRrVR6pf0dUb9I1G4NlYkWDFuLqgSmhZjFiLLq0J4SQMiqMppBdlslbU yEYETL5wC9i72XDIgCqqLUHjI6aWUWsze5t10SwiPg8yd5ncmLPVrZ4W2QtdRkwx+Lw4 j+T1BoMF8n4Muenj5qnVaTnguq2CokdkbUlEYtd7fDMKVBMFWZBqGw/DJp2Rt+vWpy0o 4GuQDQu+B42A5GPr8cfb7YyXn3V3Bh05KO3E5JbORL2xibNw87cPCfwr1MPgKWgQ+49k kRsA==
X-Gm-Message-State: AOAM5333eTkRuGw4614PkmMevYJqji9+hXdaa//Dx3JnQDolDf01fwsp ZfIJ2RFuxMYKlOGNIpd6lqB+c43sOGI6ng==
X-Google-Smtp-Source: ABdhPJyWXOxx1h3lf+uuSFBDUplh7lKxydOH9bRxQaj/OFIZG6VNv+CViPmNoxCW0QMC1oJHwu3Maw==
X-Received: by 2002:a17:902:b941:b029:da:8134:486a with SMTP id h1-20020a170902b941b02900da8134486amr5604371pls.37.1606773752137; Mon, 30 Nov 2020 14:02:32 -0800 (PST)
Received: from [192.168.178.20] ([151.210.131.28]) by smtp.gmail.com with ESMTPSA id v63sm17724633pfb.217.2020.11.30.14.02.30 for <anima@ietf.org> (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Mon, 30 Nov 2020 14:02:31 -0800 (PST)
From: Brian E Carpenter <brian.e.carpenter@gmail.com>
Organization: University of Auckland
To: Anima WG <anima@ietf.org>
Message-ID: <136aa329-41a5-8b65-ef9e-fadf089696eb@gmail.com>
Date: Tue, 01 Dec 2020 11:02:28 +1300
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.9.1
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Language: en-US
Content-Transfer-Encoding: quoted-printable
Archived-At: <https://mailarchive.ietf.org/arch/msg/anima/dHx2v58Y16uCDZ-UqFy0UXT-o7M>
Subject: [Anima] ANIMA when there is a system-wide issue
X-BeenThere: anima@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Autonomic Networking Integrated Model and Approach <anima.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/anima>, <mailto:anima-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/anima/>
List-Post: <mailto:anima@ietf.org>
List-Help: <mailto:anima-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/anima>, <mailto:anima-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 30 Nov 2020 22:02:36 -0000

"AWS reveals it broke itself by exceeding OS thread limits"

https://www.theregister.com/2020/11/30/aws_outage_explanation/

Especially:
"The TIFU-like post also outlines why Amazon's dashboards offered only scanty info about the incident – because they, too, depend on a service that depends on Kinesis."

Perhaps there is something we should specify in ANIMA to prevent the ANIMA infrastructure falling into this sort of trap: when there is a system-wide issue (such as hitting an O/S resource limit everywhere at the same time) it also prevents the autonomic mechanisms from working.
 
Regards
   Brian Carpenter