Re: [I18ndir] Guidance on Return of A-Labels in a URL?

"Hollenbeck, Scott" <shollenbeck@verisign.com> Fri, 11 September 2020 11:38 UTC

Return-Path: <shollenbeck@verisign.com>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id DF56C3A0F22 for <i18ndir@ietfa.amsl.com>; Fri, 11 Sep 2020 04:38:26 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.099
X-Spam-Level:
X-Spam-Status: No, score=-2.099 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=verisign.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 1z-2TTnP0Ry2 for <i18ndir@ietfa.amsl.com>; Fri, 11 Sep 2020 04:38:25 -0700 (PDT)
Received: from mail3.verisign.com (mail3.verisign.com [72.13.63.32]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 25F503A0F20 for <i18ndir@ietf.org>; Fri, 11 Sep 2020 04:38:25 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=verisign.com; l=10560; q=dns/txt; s=VRSN; t=1599824307; h=from:to:cc:date:message-id:references:in-reply-to: content-transfer-encoding:mime-version:subject; bh=9f/LeOcEhHwrq17BuvyizmJ3RmLUF42tdbjLdC52k5w=; b=nwKpD1syfDsInudXdTGYmTUbOAPy4v4yjJh3/Pu0bXu0miDaoWncYn7D 5Yhjls4LQ6ST/b0hd8QuW91d8dmyvuWUbATck0o43ujJFbqCc3dvi+UMW dBQ3zqpEc2IwQYrOgs20oZVS/XjDiTUZq4FdTjCJ71Tug84RjlxUWQ9AF 6wQ4fFc2GX/glSmiM16Yn1Tqxb8fdgYMxlfg22XSmRi2rbpvBDEvrrHfy FSNXP0DRZUhtD+UXh6YbIhbqAE7JfBfWHgmEKFYzLK2OFoUBQHcWHxiZA 4mXUgtRcnDnVrqhznmQEWzdIp6RqMna2NXM8zgwB1+xyP7UbhW4Ae+6/g Q==;
IronPort-SDR: 03eE0SHSX3By1arj2nqGsYaTrJ/cpmYi7BibnueTPZ2n+/VpXc+BQC9fdCDlOogVTe8s5+lCjj 9vGRinI2ogNGhX57dq4A3nX3ELzCXDg9nJNDpPhbcQhxB1Rtzi39UjbC1qz7Vhy9q6LNchie3V ZQ7T/8vznmrEvi4VP/AcWEj0H8v6ML3q8eSVgrrEim/xqii4GDLrbzHjZJ7x9/WGQVfQ9QKCEz uO7qaS8zoyiZHLvjGdVesqFcC+rKQt5vRB6/Khl3or/adj1xMqscuBvQtF3ioKz0ODLZ9R/Qq6 0hs=
X-IronPort-AV: E=Sophos;i="5.76,415,1592870400"; d="scan'208";a="2963350"
IronPort-PHdr: 9a23:R4nK6hMfvh3rZtoKJDgl6mtUPXoX/o7sNwtQ0KIMzox0K/38osbcNUDSrc9gkEXOFd2Cra4d1ayP7vGrATNIyK3CmUhKSIZLWR4BhJdetC0bK+nBN3fGKuX3ZTcxBsVIWQwt1Xi6NU9IBJS2PAWK8TW94jEIBxrwKxd+KPjrFY7OlcS30P2594HObwlSizexfLJ/IA+roQjRssQajpduJ6QswRbVv3VEfPhby3l1LlyJhRb84cmw/J9n8ytOvv8q6tBNX6bncakmVLJUFDspPXw7683trhnDUBCA5mAAXWUMkxpHGBbK4RfnVZrsqCT6t+592C6HPc3qSL0/RDqv47t3RBLulSwKMSMy/mPKhcxqlK9VoAyvqQFjw4DaY4+VOvhxfqLBct0VSmVMRdpRWDdDAo+gc4cDE+gMMOBFpIf9vVsOqh6+CBGiCO3tzT9Ignv20rM80+s6Dw7JwA8gE8oTu3rJsNr1M7sSUfy7wKLVyjjDdPNW2TD56IjMbB8hp+qDUqxsfsrS0kQvCR3Kjk+RqYz+PjOV2eINv3KH4OpnUOKikmgqoBx+rTaz3MkjkJXJhp4LxVDe8yV02Ig4KN2mREB7YdOpDZlduiCVOoZoTM0uX25mtTgkx7AFuZO2fSoHxZAoyRPfd/CKd5WF7gz+WeuNPzt2inNodry5ih2v/0agzej8WdO10FZMtidKjtjMtmsR2BzU8MSIVvx9/kK51TaOyQ/T9u9ELVozlarBJJ4sxKM7mJkLsUnbAyP6hF/6gLKUe0gq4OSk9uTqb7v8qpKTM4J4khzyPrg0lsCiA+k0LhICU3WU9Om/zrHv4E70TbNXhfMsiKbZqorVJcEDq668BA9az5gs5g6kDzejzNQYhX4HLE9ZeB6fj4jmJVXOIPfgAPqin1qiiCpnyPDeMLPuA5rBMmbPnK3/fblh7E5c0hIzwcpF6J1JEL0BPujzWlT3tNDCEhA5NAm0z/7mCNV7yIweRXqCDrKFPK/IrFOF5OwiL/ORaIIVtjvxMfco6vr2gX88g1AdfK2p3ZUNaHC/G/RrO16ZYXrrgtcFFWcFoAw+Q/f0h1KeUj5efGiyULwm5jE6E4KmDIjDRoa3jLOd2ye7G4VaZnpaBVCUDXfoa4KEVu8UaC2IPs9hiD0EVbu5RoIuyR2uuhX6y718IeXP+y0YqYjv1MNr6ODIiB4y8CZ7D96B3G2XTmF7gH8IRzEs0KB4u0x9xU+J0bJkjPxACdxT+/RJXx84NZ7bz+x6Dcn+WgPfcdeVRlarWdemDi0yTt4r39AOZEN9Ec24jh/fxyqqH6MVl7uTCZw1767R3H7xKNt8y3bcz6QhjkcpQtdBNWG8ga5/7QfTVMb1lBDTj6+2b6UfiXKV+n2Ex2CPoAdDXSZ8VKzfVjYea1fY69Pj6QmKG6enFa4mMV4dkcCYLaFAa8GvllhuSPLqIt+YYm+tlSG3HxnegvvGbo3qfSAc0TnTIEoPkEYY+nPMfVw8ASen5mPTEDJGF1nsJUjr9L8thmm8Sxp+7waOa0Bn3be++VpdvvebV+9ZluYfuCAlrzhyFlu23PrIBsCBvAtueuNXZtZrswQP7n7QqwEoZs/oFKtlnFNLKwk=
X-IPAS-Result: A2HxAACtYFtf/zGZrQpWCRsBAQEBAQEBAQUBAQESAQEBAwMBAQFAgU+DGoE0CoQukTGbXz0LAQEBAQEBAQEBCAEjDAQBAQKESQIXgg4lOBMCAwEBCwEBAQUBAQEBAQYDAQEBAoZFDII3Ins9CT0BAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBBAINVEkBAR0BAQEBAyMROgsMBAIBCBEEAQEBAgImAgICMBUICAEBBAENBQiDH4MLsnh2gTKEOwGGI4EOKoZahm2BQj6BEYJiLj6CIzkCAoExFC+DAIJgBI98gyWHFZtAgQgDB4JliGuRQyqDCY8njjCSWYpSlQwCBAIEBQIVgWtogRNwUIJpCUcXAg2OKxeDTopWdAILKgIGAQkBAQMJjRMtgQaBEQEB
Received: from BRN1WNEX02.vcorp.ad.vrsn.com (10.173.153.49) by BRN1WNEX02.vcorp.ad.vrsn.com (10.173.153.49) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.1979.3; Fri, 11 Sep 2020 07:38:22 -0400
Received: from BRN1WNEX02.vcorp.ad.vrsn.com ([fe80::7c0a:1cc:5def:9dde]) by BRN1WNEX02.vcorp.ad.vrsn.com ([fe80::7c0a:1cc:5def:9dde%4]) with mapi id 15.01.1979.003; Fri, 11 Sep 2020 07:38:22 -0400
From: "Hollenbeck, Scott" <shollenbeck@verisign.com>
To: "patrik=40frobbit.se@dmarc.ietf.org" <patrik=40frobbit.se@dmarc.ietf.org>, "john-ietf@jck.com" <john-ietf@jck.com>
CC: "i18ndir@ietf.org" <i18ndir@ietf.org>
Thread-Topic: [EXTERNAL] Re: [I18ndir] Guidance on Return of A-Labels in a URL?
Thread-Index: AdaHbyMe55MY1ZKMSpW68jWImAZiOQAY6vkAAAXkcoAACtWNAAAGjbuw
Date: Fri, 11 Sep 2020 11:38:22 +0000
Message-ID: <1c85b426a7274fb3a7b22a158a20d556@verisign.com>
References: <326c954da33646f79a4e3bc4f27b7cb7@verisign.com> <0864F4FF-A615-451E-8828-433F3098A599@frobbit.se> <4D25DED743AC92D995786A92@PSB> <B6ED8D5D-39D3-4D12-AF2F-BD2C18B70393@frobbit.se>
In-Reply-To: <B6ED8D5D-39D3-4D12-AF2F-BD2C18B70393@frobbit.se>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [10.170.148.18]
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/u7-29secymHZIn0aK6pSE24JdAE>
Subject: Re: [I18ndir] Guidance on Return of A-Labels in a URL?
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 11 Sep 2020 11:38:27 -0000

Thanks, folks, this was all very helpful. I think I have what I need.

Scott

> -----Original Message-----
> From: I18ndir <i18ndir-bounces@ietf.org> On Behalf Of Patrik Fältström
> Sent: Friday, September 11, 2020 12:30 AM
> To: John C Klensin <john-ietf@jck.com>
> Cc: Hollenbeck, Scott <shollenbeck@verisign.com>; i18ndir@ietf.org
> Subject: [EXTERNAL] Re: [I18ndir] Guidance on Return of A-Labels in a URL?
>
> Scott: TL;DR of all of our messages are: A-Label and U-Labels are 1:1
> mappings to each other. The question is what you want to happen (and who
> is doing validation) of:
>
> - Strings starting with xn-- that are not A-Labels
> - Unicode strings that are not U-Labels
> - URLs that do not follow RFC 3986 or 3987
>
> In this mix you absolutely must include (for natural reasons) people that do
> believe IETF, W3C are nut heads, including for example the ccTLDs registries
> that do provide domain names not IDNA-2008 conformant. Those following
> TR#46 and not IETF etc.
>
> Basically, as with all protocol design, the question is not what you do with the
> ones that do follow the specification, but the ones that do not.
>
>    Patrik
>
> On 11 Sep 2020, at 1:19, John C Klensin wrote:
>
> > Scott,
> >
> > Let me add a slightly different perspective (and a warning) to what
> > Patrik and John Levine have said without in any way disagreeing with them.
> It will also give you a somewhat deeper analysis and set of references to
> refer to if needed.
> >
> > As long as IDNA2008 is strictly followed, then it really makes no
> > difference whether U-labels and A-labels are returned and the choice
> > should be made according to what works best for the application
> > protocol.  If the URLs you are referring to are strictly conformant to
> > RFC 3986, there there is a strong case for using A-labels because the
> > alternative is %-encoding, presumably of UTF-8 [1].  That form is hard to
> read, less optimized for DNS labels, much less compact, etc., than the A-label
> form.
> >
> > _However_ note "strictly" above.  There are communities out there that
> > consider 3986 (and 3987) obsolete and are developing their own URL specs.
> There are overlapping ones that use or assume profiles of Unicode UTR#46
> which involve some specific mappings and, for all intents and purposes, rely
> on IDNA2003.
> > For IDNA2003 and those profiles of UTR#46, one cannot make an identity
> comparison between native character Unicode strings (UTF-8 or otherwise)
> without IDNA-specific processing.
> > Whatever the arguments for and against reliance on UTR#46 rather than
> > strict conformance to IDNA2008, that is a very strong argument for RDAP
> (and other protocols involving registration data) work in terms of  A-labels
> rather than anything else.
> >
> > FWIW and AFAIK, while, as Pstrik indicates, IETF has not taken a
> > position on this issue for RDAP and similar protocols, ICANN has.
> > Long ago, when the Board created two committees chaired by Katoh-san,
> > there were conclusions that, regardless of what interfaces registrars
> > chose to present to users and what they accepted as input, all registration
> databases and access protocols [2] should work strictly in terms of already
> mapped and processed Punycode-encoded form and that, if native
> characters were presented to users from those systems, they should be the
> result of
> >    ToUnicode(ToASCII(user-supplied-string)
> > and not the user supplied string.   Those conclusions and the reasoning that
> went into them strongly influenced RFC 4690 and hence IDNA2008 but, more
> important from your standpoint, I don't believe ICANN has ever deprecated
> or formally abandoned them.
> >
> >      john
> >
> >
> >
> > [1] See the rather flexible text in Section 2.5 of 3986.
> >
> > [2] While WHOIS was the only game in town then, the Whois++ specs had
> > been published a half-dozen years earlier and the debates about
> > databases and database access permissions and tools were well
> > underway.
> >
> >
> >
> > --On Thursday, September 10, 2020 22:30 +0200 Patrik Fältström
> > <patrik=40frobbit.se@dmarc.ietf.org> wrote:
> >
> >> Now some time...
> >>
> >> Regarding the use of U-Label and A-Label nothing specific is said
> >> about RDAP.
> >>
> >> What you have is RFC 5890 2.3.2.6. Domain Name Slot and some previous
> >> sections that talks about the equivalence between U-Label and A-Label
> >> which is a 1:1 mapping (compared to earlier versions of IDNA
> >> standard).
> >>
> >> Note section 2.3.2.1.  IDNA-valid strings, A-label, and U-label in
> >> the same RFC which says:
> >>
> >>> A "U-label" is an IDNA-valid string of Unicode characters, in
> >>> Normalization Form C (NFC)...
> >>
> >> My view is that it is up to the protocol that pass around domain
> >> names in a Domain Name Slot how to handle the situation, and because
> >> of that ensure that if U-Label is in use that the requirements on
> >> that protocol element matches the definition of a U-Label, and not
> >> just "random Unicode code points in some random encoding and
> unknown
> >> normalisation".
> >>
> >> I can see situations where you in RDAP do want to be able to send
> >> random Unicode Code Points just with the intention to do some search
> >> on the server, but that is then NOT a U-Label. The same way I do see
> >> interest in sending A-Labels to be sure the string is stable and as
> >> it was supposed to be during the whole transaction.
> >>
> >> Does this help?
> >>
> >>    Patrik
> >>
> >> On 10 Sep 2020, at 14:41, Hollenbeck, Scott wrote:
> >>
> >>> Does anyone know of any text in any of the IDN RFCs that would
> >>> support a recommendation to return A-Labels in RDAP response URLs?
> >>>
> >>> Marc Blanchet wrote an I-D
> >>> (https://secure-web.cisco.com/1PK_rOPig5YMI6-
> Qrt1gNz_5CyTTbwNMDx9crt
> >>>
> zA3SB6VurxRB5Ef60dnxyG8iYiput8zffNsdFACm4iiFSgWTF1hOv9kwCSBO84Ts
> yDjS
> >>> dbIhE9w6ncWvlZvicjEkT29zcPEe-
> klPnwK0GqxnZjmVwH1qxl5PQafux34UVebh6qmB
> >>> 0M-
> zrTNAdTcdZ05_OcpO_G2jVq52uQcyZtgBLxPXo1he2ZbuhfZ7gsPG9tX0awAWe
> ZVM
> >>>
> S8eZ48me1v2jB3e3Lzlzic3ba832W4eUDXOmHW3uW3oydCDARwe6D1pF14/h
> ttps%3A%
> >>> 2F%2Fwww.ietf.org%2Farchive%2Fid%2Fdraft-blanchet-regext-rdap-d
> >>> eployfindings-05.txt) a while ago in which he described RDAP
> >>> deployment findings. In Section 3.6, he suggests that "All links of
> >>> any "rel" types should always be returned in the A-Label form for
> >>> IDNs in the href or value members, independent of if the query was a
> >>> U-Label or A-Label or a mix". That seems like a good idea since a
> >>> server doesn't know what a client is capable of consuming, but I was
> >>> hoping to support this recommendation with a reference to something
> >>> in IDNA. I didn't see anything obvious.
> >>>
> >>> I sent this same basic question to both Marc and Patrik
> >>> individually. I haven't heard back from either of them, so I
> >>> apologize if I'm "jumping the gun" by asking he directorate without
> >>> waiting to see if they respond.
> >>>
> >>> Scott
> >>>
> >>> --
> >>> I18ndir mailing list
> >>> I18ndir@ietf.org
> >>> https://secure-
> web.cisco.com/10_phtaKsvsQuH07jrEHgj7uHPlUIi0R4tmzOwX
> >>>
> BgYAvKcIGjSJ1xKu03S87xFRePOw3QY8p4N6qNHeOiVqE1dVDf2BzJShA4JjDcn
> Mz2LN
> >>> BQ_IGd4g_ih0MWa7jdGcS_IUeRqR_2JzIIzgf1-
> oTu0IEyMbKZLOBYoCtc2r83kkio3P
> >>> qZd2s6GfeJfV75tpIUWWXduizU-iXfhAjY-
> G55LK27dEWEQEgCMf2WlhM1IhdTKnXmS3
> >>>
> ebpRpvAwFm6escwmupFsH8DFWGkKsFU2Wh4fVpNbtvCuI51ukotJo7AJE/ht
> tps%3A%2
> >>> F%2Fwww.ietf.org%2Fmailman%2Flistinfo%2Fi18ndir