Re: [DNSOP] Terminology: IDN

Paul Hoffman <paul.hoffman@vpnc.org> Mon, 04 May 2015 15:24 UTC

Return-Path: <paul.hoffman@vpnc.org>
X-Original-To: dnsop@ietfa.amsl.com
Delivered-To: dnsop@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id DE5141A7113 for <dnsop@ietfa.amsl.com>; Mon, 4 May 2015 08:24:32 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.347
X-Spam-Level:
X-Spam-Status: No, score=-1.347 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HELO_MISMATCH_COM=0.553] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id afQMWfzyLsZT for <dnsop@ietfa.amsl.com>; Mon, 4 May 2015 08:24:32 -0700 (PDT)
Received: from proper.com (Opus1.Proper.COM [207.182.41.91]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1A5F91A702E for <dnsop@ietf.org>; Mon, 4 May 2015 08:24:32 -0700 (PDT)
Received: from [10.20.30.101] (50-1-98-218.dsl.dynamic.fusionbroadband.com [50.1.98.218]) (authenticated bits=0) by proper.com (8.15.1/8.14.9) with ESMTPSA id t44FO9qD087818 (version=TLSv1 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Mon, 4 May 2015 08:24:11 -0700 (MST) (envelope-from paul.hoffman@vpnc.org)
X-Authentication-Warning: proper.com: Host 50-1-98-218.dsl.dynamic.fusionbroadband.com [50.1.98.218] claimed to be [10.20.30.101]
Content-Type: text/plain; charset="us-ascii"
Mime-Version: 1.0 (Mac OS X Mail 8.2 \(2098\))
From: Paul Hoffman <paul.hoffman@vpnc.org>
In-Reply-To: <D16D027D.B573%edward.lewis@icann.org>
Date: Mon, 04 May 2015 08:24:10 -0700
Content-Transfer-Encoding: quoted-printable
Message-Id: <A4A329B9-6E42-43A4-B7D4-F02788EFAD62@vpnc.org>
References: <3FB18AE0-8769-4B73-9964-8F0D1AB1FA54@vpnc.org> <D16D027D.B573%edward.lewis@icann.org>
To: Edward Lewis <edward.lewis@icann.org>
X-Mailer: Apple Mail (2.2098)
Archived-At: <http://mailarchive.ietf.org/arch/msg/dnsop/ZTqBRXV6ApJtwuNpjPOesxhAFtk>
Cc: dnsop WG <dnsop@ietf.org>
Subject: Re: [DNSOP] Terminology: IDN
X-BeenThere: dnsop@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: IETF DNSOP WG mailing list <dnsop.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/dnsop>, <mailto:dnsop-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/dnsop/>
List-Post: <mailto:dnsop@ietf.org>
List-Help: <mailto:dnsop-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/dnsop>, <mailto:dnsop-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 04 May 2015 15:24:33 -0000

On May 4, 2015, at 8:13 AM, Edward Lewis <edward.lewis@icann.org> wrote:
> I'm going to ask this as a question, not a comment, directed to those who
> know something about IDN.
> 
> Isn't IDNA2008 a "convention" on top of the representation?  

IDNA2008 is a set of standards-track documents. It is the only standardized way to convert between DNS names and representation of non-ASCII characters in the DNS.

> And, this
> isn't about display, it's about, well, encapsulation, maybe?

It is about those, and about on-the-wire representation as well.

>  And isn't
> IDN tied to Unicode?

For someone being pedantic, your use of "tied to" seems awfully vague. :-) IDNA2008 refers normatively to the Unicode Standard, yes.

> I ask for a few reasons - "non-ASCII" might include raw binary or bit
> encodings other than Unicode.

It does. And there is no standardized way of representing those in domain names in applications yet. If someone creates one and gets it standardized, we can update the definition. I'm willing to bet this will never happen.

> The punycode algorithm maps Unicode (code points) to DNS-label compatible
> strings.  (No-case, etc.)

Yes.

> IDNA 2008 is a treatment of labels generated by the punycode algorithm to
> achieve a number of other goals.

It is also a standardized definition of how to generate those labels.

> (I guarantee you I am missing something in the above, like Normalization
> Forms and such.)

Yes. But note that I'm not suggesting that you read the RFCs, just answering your questions.

> When I use "IDN" I think of a representation of a Unicode string in a DNS
> label according to a bunch of conventions.  

Good! That's a reasonable guess, although a subset of what it really is.

> Perhaps IDN is taking on a
> wider meaning.  I don't know.

The purpose of the terminology draft is to point people towards the general definition of a term, and give pointers to the exact definitions where possible. For "IDN", I believe this is the best we can do. Other suggestions are welcome.

--Paul Hoffman