Re: [Idna-update] IDNA and combining sequences (was: Re: Expiration impending: <draft-klensin-idna-rfc5891bis-01.txt>)

John C Klensin <john-ietf@jck.com> Sat, 10 March 2018 13:51 UTC

Return-Path: <john-ietf@jck.com>
X-Original-To: idna-update@ietfa.amsl.com
Delivered-To: idna-update@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B3B5C124D37 for <idna-update@ietfa.amsl.com>; Sat, 10 Mar 2018 05:51:45 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.91
X-Spam-Level:
X-Spam-Status: No, score=-1.91 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, T_RP_MATCHES_RCVD=-0.01] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id UO8SGwjAWtYM for <idna-update@ietfa.amsl.com>; Sat, 10 Mar 2018 05:51:44 -0800 (PST)
Received: from bsa2.jck.com (ns.jck.com [70.88.254.51]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 530561205F0 for <idna-update@ietf.org>; Sat, 10 Mar 2018 05:51:44 -0800 (PST)
Received: from [198.252.137.10] (helo=PSB) by bsa2.jck.com with esmtp (Exim 4.82 (FreeBSD)) (envelope-from <john-ietf@jck.com>) id 1eueuR-0006XF-Vh; Sat, 10 Mar 2018 08:51:39 -0500
Date: Sat, 10 Mar 2018 08:51:34 -0500
From: John C Klensin <john-ietf@jck.com>
To: =?UTF-8?Q?Patrik_F=C3=A4ltstr=C3=B6m?= <paf@frobbit.se>
cc: Asmus Freytag <asmusf@ix.netcom.com>, idna-update@ietf.org
Message-ID: <D26CE952D968BBEC0AB96A76@PSB>
In-Reply-To: <516E58F3-015D-4AD7-A3FD-0749A6890245@frobbit.se>
References: <C4FBCF12821031786F472AA2@PSB> <02c29140-29f1-cc81-8c4f-8249d0f23b2c@ix.netcom.com> <1E562CDE39B4224F227E765D@PSB> <516E58F3-015D-4AD7-A3FD-0749A6890245@frobbit.se>
X-Mailer: Mulberry/4.0.8 (Win32)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline
X-SA-Exim-Connect-IP: 198.252.137.10
X-SA-Exim-Mail-From: john-ietf@jck.com
X-SA-Exim-Scanned: No (on bsa2.jck.com); SAEximRunCond expanded to false
Archived-At: <https://mailarchive.ietf.org/arch/msg/idna-update/N2f8m4Nmv9UNWsP36ytvMJO_Z_4>
Subject: Re: [Idna-update] IDNA and combining sequences (was: Re: Expiration impending: <draft-klensin-idna-rfc5891bis-01.txt>)
X-BeenThere: idna-update@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: "Internationalized Domain Names in Applications \(IDNA\) implementation and update discussions" <idna-update.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idna-update>, <mailto:idna-update-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idna-update/>
List-Post: <mailto:idna-update@ietf.org>
List-Help: <mailto:idna-update-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idna-update>, <mailto:idna-update-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 10 Mar 2018 13:51:46 -0000


--On Saturday, March 10, 2018 09:06 -0400 Patrik Fältström
<paf@frobbit.se> wrote:

> I think we should do some scenario planning here. Remember
> that we do not have a world where IDNA2008 based on Unicode
> 6.x is what people use. People use all different kind of mix
> between IDNA2008, IDNA2003 and Unicode versions. I have myself
> worked with the curl library (that uses libidn) and to be
> honest, I do not think people KNOW what they use. Or they
> know, and they know they violate the rules. For the contracted
> parties, they have the LGR coming down the road anyways, so...
> 
> And *if* we go down this path, is it "enough" to do in the LGR
> (i.e. ICANN) or should IETF do some adoption (and W3C?), or
> should IETF say "we can move forward in a more safe way AS WE
> KNOW ICANN DO LGR"...

As I said at far more length in another note, the LGR is (at
least by its charter/ Procedure) designed for, and applicable
only to, the root.  While those code points would probably be
safe to use in any zone, trying to apply them globally would be
unduly restrictive, would violate the "administratively
distributed hierarchy" principle, and, IMO, just wouldn't fly..
The latter would probably quickly reach the point that attempts
to apply the LGR to labels at the second level and beyond would
encourage non-compliance and daring ICANN to do anything about
it.     

As an example of the overly restrictive part, I assume that the
LGR rules, like the ccTLD Fast Track Procedures, would prohibit
the use of "ур" (U+0443 U+0440) or other permutations of those
characters) in to root on the grounds of either intrinsic
confusability or the potential conflict with the 3166-1 alpha-2
list.  However, if we accept the logic that justifies IDN TLDs,
that sequuence would be acceptable in a subtree of a domain
whose TLD label was in Cyrillic and that published and applied a
policy that its subdomains were entirely Cyrillic.

> Or...

I think the above puts us well into the "or..." range.

best,
   john