Re: [DNSOP] Priming query transport selection

Nicholas Weaver <nweaver@ICSI.Berkeley.EDU> Thu, 14 January 2010 16:40 UTC

On Jan 14, 2010, at 7:58 AM, Patrik Fältström wrote:
> 
> Please do not start talking about enforcing some fixed limit that we will laugh about 10 years from now... And if you talk about a limit, pick something very large (like 65535 that seems to be already chosen).
> 
> It is enough problems with the 512 limit of today. I do not want to have the same problems when we pass 4096.
> 
> Implementations should be free to choose an implementation limit smaller if they want to (and signal that in the EDNS0 size), but please do not say that "max value on EDNS0 size will forever be 4096" or something similar.
> 
> Be careful with the wording...

Except that the EDNS0 size is closely coupled to UDP and its unreliable nature: this message size limit is irrelevant for TCP or any other reliable protocol.

It is highly unlikely that the network's MTU will expand beyond 1500B: there is too much deployed Ethernet, and >1500B MTUs don't really benefit things anyway, because the overhead reduction from going to a higher MTU is near zero (Amdahl's law).
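
As a rough illustration of the overhead point, here is a minimal sketch comparing the payload share of the bytes on the wire for a 1500B MTU and a 9000B jumbo MTU. The constants are my assumptions, not measurements: 38B of Ethernet framing per packet (header, FCS, preamble, inter-frame gap) and 40B of IPv4 + TCP headers without options. The function name is hypothetical.

```python
# Assumed per-packet costs (illustrative, not measured):
ETHERNET_FRAMING = 38  # 14 header + 4 FCS + 8 preamble + 12 inter-frame gap
IP_TCP_HEADERS = 40    # 20 IPv4 + 20 TCP, no options


def wire_efficiency(mtu):
    """Fraction of bytes on the wire that are application payload."""
    payload = mtu - IP_TCP_HEADERS
    on_wire = mtu + ETHERNET_FRAMING
    return payload / on_wire


for mtu in (1500, 9000):
    print(f"MTU {mtu}: {wire_efficiency(mtu):.1%} of wire bytes are payload")
```

Under these assumptions the payload share goes from roughly 95% to roughly 99%, so there are only a few percent left to win by raising the MTU.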

Which means that the number of fragments, ALL of which need to be received correctly, goes up linearly with the size of the message.
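
To put numbers on the fragment count, here is a small sketch assuming IPv4 with a 20B header and no options, an 8B UDP header, a 1500B path MTU, and independent loss of each fragment. The 1% loss rate and the function names are illustrative assumptions only.

```python
import math

IPV4_HEADER = 20  # bytes, assuming no IP options
UDP_HEADER = 8    # bytes


def fragment_count(dns_bytes, mtu=1500):
    """Number of IPv4 fragments needed for one UDP datagram."""
    # Every fragment except the last carries a multiple of 8 bytes.
    per_fragment = (mtu - IPV4_HEADER) // 8 * 8
    ip_payload = dns_bytes + UDP_HEADER
    return math.ceil(ip_payload / per_fragment)


def delivery_probability(dns_bytes, loss=0.01, mtu=1500):
    """Chance that every fragment arrives, assuming independent loss."""
    return (1.0 - loss) ** fragment_count(dns_bytes, mtu)


for size in (512, 4000, 40000):
    n = fragment_count(size)
    p = delivery_probability(size)
    print(f"{size:>6}B message: {n:>2} fragment(s), P(all arrive) = {p:.3f}")
```

With these assumptions a 4000B message needs 3 fragments and a 40,000B message needs 28, and losing any single one of them loses the whole datagram.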

Even WITH a larger MTU, bit errors become more common as the datagram grows. So, at a minimum, you'd expect many more failures, dropped packets, etc., with a 40,000B datagram than with a 4000B datagram. And DNS over UDP is already unreliable enough, at least when you consider it all the way to the end host with a reasonable timeout on lookups.
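
The bit-error point can be sketched the same way. This assumes independent bit errors at a fixed rate. The 1e-7 rate below is a made-up figure for illustration, not a measurement of any real link, and the function name is hypothetical.

```python
import math


def corruption_probability(datagram_bytes, bit_error_rate=1e-7):
    """Chance that at least one bit of the datagram is corrupted,
    assuming independent bit errors at a fixed rate."""
    bits = 8 * datagram_bytes
    # 1 - (1 - ber)^bits, computed via log1p/expm1 for numerical stability.
    return -math.expm1(bits * math.log1p(-bit_error_rate))


small = corruption_probability(4000)
large = corruption_probability(40000)
print(f"4000B:  {small:.4%}")
print(f"40000B: {large:.4%}")
print(f"ratio:  {large / small:.1f}x")
```

At low error rates the failure probability scales almost linearly with datagram size, so a datagram ten times larger fails close to ten times as often, whatever the MTU.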

Thus, given the nature of UDP, it is highly unlikely that you'd ever want to send UDP datagrams of ~10K bytes or more.