Re: [EAI] I-D Action: draft-ietf-eai-mailinglistbis-00.txt

John C Klensin <klensin@jck.com> Sun, 27 November 2011 16:04 UTC

Return-Path: <klensin@jck.com>
X-Original-To: ima@ietfa.amsl.com
Delivered-To: ima@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3BC1D21F8B89 for <ima@ietfa.amsl.com>; Sun, 27 Nov 2011 08:04:50 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.025
X-Spam-Level:
X-Spam-Status: No, score=-2.025 tagged_above=-999 required=5 tests=[AWL=-0.326, BAYES_00=-2.599, J_CHICKENPOX_34=0.6, MIME_8BIT_HEADER=0.3]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id l5Ese66bpv9Q for <ima@ietfa.amsl.com>; Sun, 27 Nov 2011 08:04:49 -0800 (PST)
Received: from bs.jck.com (ns.jck.com [209.187.148.211]) by ietfa.amsl.com (Postfix) with ESMTP id 2CCFC21F8B9A for <ima@ietf.org>; Sun, 27 Nov 2011 08:04:44 -0800 (PST)
Received: from [127.0.0.1] (helo=localhost) by bs.jck.com with esmtp (Exim 4.34) id 1RUhDW-0009kU-6N; Sun, 27 Nov 2011 11:04:34 -0500
Date: Sun, 27 Nov 2011 11:04:33 -0500
From: John C Klensin <klensin@jck.com>
To: Frank Ellermann <hmdmhdfmhdjmzdtjmzdtzktdkztdjz@gmail.com>, "Martin J. Dürst" <duerst@it.aoyama.ac.jp>
Message-ID: <9FA39809C7B0482D83BD9A46@PST.JCK.COM>
In-Reply-To: <CAHhFybqf-JdrxuCj0MuRiShUksysHLNMt6=fyXAyUcYuLf71Sg@mail.gmail.com>
References: <20111126051407.96953.qmail@joyce.lan> <4ED182D1.2020901@it.aoyama.ac.jp> <CAHhFybqf-JdrxuCj0MuRiShUksysHLNMt6=fyXAyUcYuLf71Sg@mail.gmail.com>
X-Mailer: Mulberry/4.0.8 (Win32)
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline
Cc: ima@ietf.org
Subject: Re: [EAI] I-D Action: draft-ietf-eai-mailinglistbis-00.txt
X-BeenThere: ima@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "EAI \(Email Address Internationalization\)" <ima.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ima>, <mailto:ima-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ima>
List-Post: <mailto:ima@ietf.org>
List-Help: <mailto:ima-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ima>, <mailto:ima-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 27 Nov 2011 16:04:50 -0000

--On Sunday, November 27, 2011 02:11 +0100 Frank Ellermann
<hmdmhdfmhdjmzdtjmzdtzktdkztdjz@gmail.com> wrote:

> On 27 November 2011 01:22, "Martin J. Dürst" wrote:
>...
>>>   Note that discussion on whether internationalized domain
>>> names   should be percent-encoded or puny-coded, is
>>> ongoing; see<xref   target="I-D.duerst-iri-bis" />.
> 
>> Why not just point to RFC 3986, last two paragraphs of
>> section 3.2.2
>> (http://tools.ietf.org/html/rfc3986#section-3.2.2)?
> 
>> That's an Internet Standard rather than an "ongoing
>> discussion".
> 
> All those "<reg-name> outside of DNS" considerations in STD 66
> are IMO irrelevant for UTF8SMTPBIS addresses.

Largely concur, especially if the IRI efforts to retroactively
revise the URI spec are relevant.   From my personal point of
view, there are a relatively small number of possibilities here:

(1) RFC3986/STD66 is the controlling spec.  If so, it is
irrelevant to this WG: email addresses are not inherently URIs
and the WG's decision that encoded forms in other than UTF-8
--especially encoded forms that interact with the %-routing
convention-- makes RFC 3986's notion of
internationalization-by-%-encoding inappropriate.

(2) The IRI WG has determined that RFC 3986 has outlived its
usefulness (STD or not) and that IRIs are the new URIs -- a
direct-use (protocol element) form, not user-friendly
presentation of URIs.  Email addresses may _still_ be irrelevant
to IRIs and vice versa, but the other question then becomes
whether the right answer for URIng involves further tuning (or,
depending on one's point of view, piling onto) the IRI spec or
justifies some rethinking.  

If an email address is transmitted via an IRI form (e.g., in an
extended  MAILTO URL), the question of when the string stops
being an IRI (or IRI encoded in a URI) and starts being an email
address becomes pretty important.

>...
  If that's
> correct we end up with the last two sentences in RFC 3986
> 3.2.2:
> 
> | When a non-ASCII registered name represents an
> internationalized | domain name intended for resolution via
> the DNS, the name must be | transformed to the IDNA encoding
> [RFC3490] prior to name lookup. | URI producers should provide
> these registered names in the IDNA | encoding, rather than a
> percent-encoding, if they wish to maximize | interoperability
> with legacy URI resolvers.

> That is an ordinary MUST, and obviously the tool or party
> knowing how it works cannot delegate "punycoding" to another
> tool or party which might not know how it works.

Right.  However, the discussion in the recent IRI WG f2f about
the RFC 6055 implications of that statement take us back to the
question of whether 3986 is still relevant and, more important,
into a possible dilemma: Suppose one "tool or party" knows how
the IDNA encoding works but doesn't know whether the string will
be interpreted in a public DNS environment or so other
environment (one that does not use IDNA encoding).  So it can't
apply the IDNA encoding (especially the IDNA2003 one, which may
lose information) without risking considerable damage if the
string is interpreted and looked up in an non-public-DNS/IDNA
context.  Conversely, if the Unicode string is (as nearly as
possible) preserved until lookup time, the "tool or party" doing
the lookup might, given your implied hypothesis above, not know
how to apply the IDNA encoding.  RFC 6055 effectively argues for
preserving the Unicode form as long as possible, but that may
not always be the right answer (or "as long as possible" may
need careful interpretation) and, more important, %-encoded
UTF-8 is not "the Unicode form", especially since the email
sender or intermediary cannot know what the delivery server does
about normalization (once again, the "no one interprets an email
address (especially a local part) other than the delivery
server" rule comes into effect... and somewhat contradicts all
of the permitted and required transformations in the current and
proposed IRI specs).

For the purposes of this WG, I think we need to concentrate on
email addresses, not IRIs or URIs.  If mailto doesn't work in
the headers, let's figure out what, from an email point of view,
works and is needed and then work backwards to mailto (or
MAILTobis) and whatever _we_ need from URIbis or IRIbis, rather
than taking those specs as given or restrictive.

Just my personal opinion, of course.

    john