Re: [DNSOP] draft-liman-tld-names-04

Andrew Sullivan <ajs@shinkuro.com> Tue, 16 November 2010 16:47 UTC

Return-Path: <ajs@shinkuro.com>
X-Original-To: dnsop@core3.amsl.com
Delivered-To: dnsop@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 350D73A6DB7 for <dnsop@core3.amsl.com>; Tue, 16 Nov 2010 08:47:38 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -103.522
X-Spam-Level:
X-Spam-Status: No, score=-103.522 tagged_above=-999 required=5 tests=[AWL=1.077, BAYES_00=-2.599, GB_I_LETTER=-2, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Gm9QZ7k+kmJy for <dnsop@core3.amsl.com>; Tue, 16 Nov 2010 08:47:37 -0800 (PST)
Received: from mail.yitter.info (mail.yitter.info [208.86.224.201]) by core3.amsl.com (Postfix) with ESMTP id 4CFCB3A6DB2 for <dnsop@ietf.org>; Tue, 16 Nov 2010 08:47:37 -0800 (PST)
Received: from crankycanuck.ca (69-196-144-230.dsl.teksavvy.com [69.196.144.230]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mail.yitter.info (Postfix) with ESMTPSA id 3416B1ECB442 for <dnsop@ietf.org>; Tue, 16 Nov 2010 16:48:20 +0000 (UTC)
Date: Tue, 16 Nov 2010 11:48:18 -0500
From: Andrew Sullivan <ajs@shinkuro.com>
To: dnsop@ietf.org
Message-ID: <20101116164818.GN1389@shinkuro.com>
References: <4CDE6D2D.40805@necom830.hpcl.titech.ac.jp> <4819246C-FF1F-456D-9732-51510F0537A1@frobbit.se> <4CE0F829.1010605@necom830.hpcl.titech.ac.jp> <7F03C666-16F8-49E4-BC56-F5DD441DD970@frobbit.se> <4CE1A110.1060403@necom830.hpcl.titech.ac.jp> <20101115213532.GD322@shinkuro.com> <04856F66-598D-43CC-8164-90178A6F2952@virtualized.org> <4CE283DA.5080606@abenaki.wabanaki.net> <20101116145308.GG1389@shinkuro.com> <alpine.LSU.2.00.1011161601450.14239@hermes-2.csi.cam.ac.uk>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <alpine.LSU.2.00.1011161601450.14239@hermes-2.csi.cam.ac.uk>
User-Agent: Mutt/1.5.18 (2008-05-17)
Subject: Re: [DNSOP] draft-liman-tld-names-04
X-BeenThere: dnsop@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IETF DNSOP WG mailing list <dnsop.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/dnsop>, <mailto:dnsop-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/dnsop>
List-Post: <mailto:dnsop@ietf.org>
List-Help: <mailto:dnsop-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/dnsop>, <mailto:dnsop-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 16 Nov 2010 16:47:38 -0000

On Tue, Nov 16, 2010 at 04:06:37PM +0000, Tony Finch wrote:

> According to my reading of RFC 1123 it is allowed, and IDNA depends on it
> being allowed. The normative text says that anything that isn't a dotted
> quad IP address should be treated as a domain name.

Define what you mean by "normative text".  You're quite right that the
"will be alphabetic" restriction is in the Discussion, and therefore
can be construed as not actually part of the protocol (that's actually
how I read it).  But the argument has been that the Discussion segment
amounts to a constraint on the protocol, or anyway may well have been
interpreted that way in running code.  We don't know.  (And of course,
as usual in the DNS, we don't have any way of checking.)

You're quite right that the "will be alphabetic" stricture has to be
released for TLDs to be able to do IDNA at all: every one of them
contains "--" and that's not alphabetic.  So I read the document as
trying to relax that rule just exactly enough to allow A-labels in the
TLDs, without relaxing everything.  The "don't relax everything" goal
is to make the smallest change needed so that if there are any bad
effects, they're not magnified more than they ought to be.

Given what I have seen in other TLD-checking type code, I'd bet a
pretty good lunch that anything _starting_ with a digit in the top
level will run into all sorts of troubles.  There's probably some
heuristic software out there that checks the top-most label, looks for
a digit in the first character, and treats the set of labels as an IP
address if there is one.

To me, the justification for the U-label having to be a letter, then,
is that such a label could get passed in as a U-label form, and if the
carelessly-written code is running in a Unicode environment it will do
exactly the same thing as I just described.  So the draft seems to me
to make the most conservative change it can.  Even if I don't like it,
the top level is different from other levels in the DNS, and we need
to be careful there.

A

-- 
Andrew Sullivan
ajs@shinkuro.com
Shinkuro, Inc.