Re: [DNSOP] New draft on delegation revalidation

Paul Vixie <paul@redbarn.org> Thu, 28 May 2020 15:55 UTC

Return-Path: <paul@redbarn.org>
X-Original-To: dnsop@ietfa.amsl.com
Delivered-To: dnsop@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 52AFE3A0E85 for <dnsop@ietfa.amsl.com>; Thu, 28 May 2020 08:55:23 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.9
X-Spam-Level:
X-Spam-Status: No, score=-1.9 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 4vqSUmPaXavv for <dnsop@ietfa.amsl.com>; Thu, 28 May 2020 08:55:21 -0700 (PDT)
Received: from family.redbarn.org (family.redbarn.org [IPv6:2001:559:8000:cd::5]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B599F3A0E32 for <dnsop@ietf.org>; Thu, 28 May 2020 08:55:21 -0700 (PDT)
Received: from linux-9daj.localnet (vixp1.redbarn.org [24.104.150.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by family.redbarn.org (Postfix) with ESMTPSA id 25D90B074A; Thu, 28 May 2020 15:55:16 +0000 (UTC)
From: Paul Vixie <paul@redbarn.org>
To: Shumon Huque <shuque@gmail.com>, dnsop@ietf.org
Cc: "dnsop@ietf.org WG" <dnsop@ietf.org>, Petr Špaček <petr.spacek@nic.cz>
Date: Thu, 28 May 2020 15:55:15 +0000
Message-ID: <33141878.RIuage5OGR@linux-9daj>
Organization: none
In-Reply-To: <0f92991e-991a-f497-9b75-cf924512b0ba@nic.cz>
References: <CAHPuVdV9eSCLQOqMF0cq8fHcuSZs7nCgjhHMfMoaV5H=ekbtSA@mail.gmail.com> <CAHPuVdX29yLG4VFMLq8ad7wgq2N-L=FfG=f-eBd2F6aP_q6CCg@mail.gmail.com> <0f92991e-991a-f497-9b75-cf924512b0ba@nic.cz>
MIME-Version: 1.0
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain; charset="UTF-8"
Archived-At: <https://mailarchive.ietf.org/arch/msg/dnsop/Lj8TLGkKIuSULkVD42vsTttBCzg>
Subject: Re: [DNSOP] New draft on delegation revalidation
X-BeenThere: dnsop@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF DNSOP WG mailing list <dnsop.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/dnsop>, <mailto:dnsop-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/dnsop/>
List-Post: <mailto:dnsop@ietf.org>
List-Help: <mailto:dnsop-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/dnsop>, <mailto:dnsop-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 28 May 2020 15:55:23 -0000

On Thursday, 28 May 2020 14:38:11 UTC Petr Špaček wrote:
> On 25. 05. 20 5:23, Shumon Huque wrote:
> > ...
> >     Most importantly:
> >     - Does the NS affect maximum TTL of _other_ data in the zone?
> > 
> > I think there are probably different views on what should happen here.
> > Folks who want very prompt takedown of "bad" domains, will probably
> > prefer a complete pruning of the cache at the delegation point at the
> > revalidation interval, if the NS set has changed or disappeared. ...

yes, that was the original motive for revalidation itself, noting that it also 
facilitates emergency redelegation, for example, after a registrant/registrar 
account compromise. so the domain might not be bad, but the cached content 
might be poisonous in other ways.

> > When
> > this topic has come up in the past, there has been pushback from some
> > implementers that it's difficult to do this because they use a non-tree
> > data structure for the cache (a hash table most commonly). ...

i don't think we should argue the computer science behind implementing this. 
if the cache doesn't facilitate "rm -r" behaviour directly, then it might 
choose to bookend the retrievals, such that retrieving data having a bailiwick 
whose timestamp is older than the revalidation event would cause deletion and 
refetch at the time of the encounter. these details should not enter into a 
discussion of whether the system should have a capability or not.

> >     - If it does, doesn't it increase risk of thundering herd behavior?
> > 
> > Possibly, depending on how popular the zone is, and what we decide is the
> > answer to the previous question. At any rate, implementers should always
> > employ strategies that bound how much work resolvers can be caused to do.
> I'm not concerned about any single resolver instance, I'm more concerned
> about large number of resolver instances doing the same thing at the same
> time.
> 
> E.g. if NS TTL was short (say 30 s) and it was used as cap on TTL of all
> other records in the zone, then each resolver instance would clear zone
> from its cache each 30 seconds. That might cause interesting behavior when
> NS TTL is shortened e.g. before NS set change etc.
> 
> I do not know if there really is a problem, I'm just trying to explain why
> potential for thundering herd needs to be be seriously analyzed.

if we can think of a way that the intervals can become synchronized, then we 
would treat this with random subtractive revalidation. to get a thundering 
herd every member of the herd would have to start their interval at the same 
time. such synchronicity is hard to trigger.

-- 
Paul