Re: [urn] Benjamin Kaduk's Discuss on draft-hakala-urn-nbn-rfc3188bis-01: (with DISCUSS and COMMENT)

Peter Saint-Andre <stpeter@mozilla.com> Thu, 07 June 2018 22:29 UTC

Return-Path: <stpeter@mozilla.com>
X-Original-To: urn@ietfa.amsl.com
Delivered-To: urn@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9DCEA13117D for <urn@ietfa.amsl.com>; Thu, 7 Jun 2018 15:29:41 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.701
X-Spam-Level:
X-Spam-Status: No, score=-2.701 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=mozilla.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Wx0r_lJr9y5n for <urn@ietfa.amsl.com>; Thu, 7 Jun 2018 15:29:40 -0700 (PDT)
Received: from mail-io0-x236.google.com (mail-io0-x236.google.com [IPv6:2607:f8b0:4001:c06::236]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 6D3D3131170 for <urn@ietf.org>; Thu, 7 Jun 2018 15:29:36 -0700 (PDT)
Received: by mail-io0-x236.google.com with SMTP id l25-v6so13669151ioh.12 for <urn@ietf.org>; Thu, 07 Jun 2018 15:29:36 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=mozilla.com; s=google; h=subject:to:cc:references:from:openpgp:autocrypt:message-id:date :user-agent:mime-version:in-reply-to; bh=eIvAiCBXxFTqvQtDziATRg6mJzVZ5arRVe1cNu3lbps=; b=ZpohBZ5WR2SXkaDpS+RE0kwCIMJk/4gqUbtOTKmw67Q6P71A/pVz6/uv5pJKAv7ERO xrkRm3vnq2LCYVtCsMNUXO6H2aN9/5IAwmtEjbmcKe98N1v5DPI81fyI+cBGma/WnLnY YH631f87q1BcsauBSrW/I8LlWcTk62/Ai64vg=
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:openpgp:autocrypt :message-id:date:user-agent:mime-version:in-reply-to; bh=eIvAiCBXxFTqvQtDziATRg6mJzVZ5arRVe1cNu3lbps=; b=WkjhstIBR2kf2G74xfneLbp2bXCpVaAeGHHCg/lkn0E5Q1q4iv+hTf3+wCgTiDk0zl ncKw5QvhVdu/bxqaWecRzpQlM9IFTkzingIa0AMXQCJvKFx5anRsLCu8n47LkpYZ1WZ6 PnKz6VlnutIqtOxqhn90cHs2Y0yqZcMhN94L+cTDPPctlTUVtDcE2njpYWFZg0Y96ujt 4Ace9k/Sc+Sc/nf/Qd51dGFxE9RL19pX8yeHVg1psLbtOe+6Dx7eej9SX1DbCSjSQXMY r9YD9IYrwXsVxAxrOJRStEFCAMh95F0Wk1+Px+d2EbpG1GfIxAno0ZcAPE8KQCJeIeh3 K+Ig==
X-Gm-Message-State: APt69E3QjQXcUjoP94POOGvVKf/fgxppLZHgNFQy9hBgcvqoDi9BVwTz 9njWOdTY1uVbla3XE9xJ+lJuR+Z33cI=
X-Google-Smtp-Source: ADUXVKI7PzYRh4A823VcEfJNp9ZTJsWp7lsbJwocX9KJ4ktSjCN3upd48yIdHK69F9Squ5decxpTfQ==
X-Received: by 2002:a6b:9e88:: with SMTP id h130-v6mr3358404ioe.283.1528410575610; Thu, 07 Jun 2018 15:29:35 -0700 (PDT)
Received: from dragon.local ([76.25.3.152]) by smtp.gmail.com with ESMTPSA id z85-v6sm1483604ita.1.2018.06.07.15.29.34 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Thu, 07 Jun 2018 15:29:34 -0700 (PDT)
To: Adam Roach <adam@nostrum.com>, Benjamin Kaduk <kaduk@mit.edu>, The IESG <iesg@ietf.org>
Cc: "urn@ietf.org" <urn@ietf.org>, draft-hakala-urn-nbn-rfc3188bis@ietf.org
References: <152837409539.30768.4568779645299135020.idtracker@ietfa.amsl.com> <6a1a100c-3bc0-76d3-3ae4-047d37906bfc@mozilla.com> <7161340e-014b-3740-83ed-39f4db3a30c0@nostrum.com>
From: Peter Saint-Andre <stpeter@mozilla.com>
Openpgp: preference=signencrypt
Autocrypt: addr=stpeter@mozilla.com; prefer-encrypt=mutual; keydata= xsFNBFonEf4BEADvZ+RGsJoOyZaw2rKedB9pBb2nNXVGgymNS9+FAL/9SsfcrKaGYSiWEz7P Lvc97hWH3LACFAHvnzoktv+4IWHjItvhdi9kUQ3Gcbahe55OcdZuSXXH3w5cHF0rKz9aYRpN jENqXM5dA8x4zIymJraqYvHlFsuuPB8rcRIV9SKsvcy14w9iRqu770NjXfE/aIsyRwwmTPiU FQ0fOSDPA/x2DLjed/GYHem90C5vF4Er9InMqH5KAMLnjIYZ9DbPx5c5EME4zW/d648HOvPB bm+roZs4JTHBhjlrTtzDDpMcxHq1e8YPvSdDLPvgFXDcTD4+ztkdO5rvDkbc61QFcLlidU8H 3KBiOVMA/5Rgl4lcWZzGfJBnwvSrKVPsxzpuCYDg01Y/7TH4AuVkv5Na6jKymJegjxEuJUNw CBzAhxOb0H9dXROkvxnRdYS9f0slcNDBrq/9h9dIBOqLhoIvhu+Bhz6L/NP5VunQWsEleGaO 3gxGh9PP/LMyjweDjPz74+7pbyOW0b5VnIDFcvCTJKP0sBJjRU/uqmQ25ckozuYrml0kqVGp EfxhSKVqCFoAS4Q7ux99yT4re2X1kmlHh3xntzmOaRpcZsS8mJEnVyhJZBMOhqE280m80ZbS CYghd2K0EIuRbexd+lfdjZ+t8ROMMdW5L51CJVigF0anyYTcAwARAQABzSdQZXRlciBTYWlu dC1BbmRyZSA8c3RwZXRlckBtb3ppbGxhLmNvbT7CwZQEEwEIAD4WIQQ1VSPTuPTvyWCdvvRl YYwYf2gUqQUCWicR/gIbIwUJCWYBgAULCQgHAgYVCAkKCwIEFgIDAQIeAQIXgAAKCRBlYYwY f2gUqdaREAChG8qU1853mP0sv2Mersns8TLG1ztgoKHvMXFlMUpNz6Oi6CjjaMNFhP7eUY4T D43+yQs7f4qCkOAPWuuqO8FbNWQ+yUoVkqF8NUrrVkZUlZ1VZBMQHNlaEwwu1CGoHsLoRohP SiZ0hpmGTWB3V6cDDK4KN6nl610WJbzE9LeKY1AxtePdJi2KM281U0Fz8ntij1jWu0gF2xU4 Sez46JDogHLWKgd0srauhcCVzZjAhiWrXp1+ryzSWYaZO8Kh8SnF1f4o6jtYikMqkxUaI5nX wvD3kNX4AMSkCAZfG7Jcfj/SLDojTcREgO87g7B9bcOOsHN4lj3lHoFV0aXpgPmjfIvAjJHu fHkXZAQAH8w0u9bgJqRn703+A4NPfLopnjegyhlNi7fQ3cMQV1H7Oj7WrB/pCcprx+1u/6Uq oTtDwWh1U5uVthVAI0QojpNWR08zABDX19TlGtVoeygaQV3CAEolxTiYQtCfVavUzUplCZ/t 3v4YiRov+NylflJd+1akyOs1IAgARf444BnoH1fotkpfXNOpp9wUXXwsQcFRdP7vpMkSCkc0 sxPNTVX3ei0QImp4NsrFdaep7LV3zEb3wkAp6KE5Qno4hVVEypULbvB0G6twNZbeRfcs2Rjp jnPb2fofvg2WhAKB20dnRfIfK8OKTD/P+JDcauJANjmekM7BTQRaJxH+ARAApPwkbOTChAQu jMvteb/xcwuL5JZElmLxIqvJhqybV7JknM+3ATyN0CTYQFvPTgIrhpk4zSn0A6pEePdK8mKK 5/aHyd7pr7rLEi1sI/X3UE8ld/E83MExksKrYbs0UX1wSQwYXU6g64KicnuP2Abqg+8wrQ18 1nPcZci9jJI75XVPnTdUpZD5aaQWGp7IJ06NTbiOk30I50ORfulgKoe4m3UfsMALFxIx3pJk oy76xC2tjxYGf+4Uq1M0iK3Wy655GrcwXq/5ieODNUcAZzvK5hsUVRodBq0Lq3g1ivQF4ba7 RQayDzlW6XgoeU49xnCr9XdZYnTnj4iaPmr2NtY6AacBwRz+bJsyugeSyGgHsnVGyUSMk8YN wZHvUykMjH21LLzIUX5NFlcumLUXDOECELCJwewui4W81sI5Sq/WDJet+iJwwylUX22TSulG VwDS+j66TLZpk1hEwPanGLwFBSosafqSNBMDVWegKWvZZVyoNHIaaQbrTIoAwuAGvdVncSQz ttC6KkaFlAtlZt3+eUFWlMUOQ9jxQKTWymyliWKrx+S6O1cr4hwVRbg7RQkpfA8E2Loa13oO vRSQy/M2YBRZzRecTKY6nslJo6FWTftpGO7cNcvbmQ6I++5cBG1B1eNy2RFGJUzGh1vlYo51 pdfSg0U1oPHBPCHNvPYCJ7UAEQEAAcLBfAQYAQgAJhYhBDVVI9O49O/JYJ2+9GVhjBh/aBSp BQJaJxH+AhsMBQkJZgGAAAoJEGVhjBh/aBSpAw0P/1tEcEaZUO1uLenNtqysi3mQ6qAHYALR Df3p2z/RBKRVx0DJlzDfDvJ2R/GRwoo+vyCviecuG2RNKmJbf1vSm/QTtbQMUjwut9mx6KCY CyKwniqdhaMBmjCfV2DB2MxxZLYMtDfx/2mY7vzAci7AkjC+RkSUByMEOkyscUydKC/ETdf9 tvI8GhTY/8Q7JSylS3lQA5pMUHiIf+KpSmqKZeBPkGc7nSKM1w1UKUvFAsyyVsiG6A/hWrTr 7tTQAl7YfjtOGE8n4IKGktvrT99bbh9wdWKZ5FdHUN9hx2Q8VP8+0lR1CH2laVFbEwCOv1vM W4cgQDLxwwpo1iOTdHBVtQDxlQ9hPMKVlB1KP9KjchxuiLc24wLmCjP3pDMml4LQxOYB34Eq cgPZ3uHvJZG309sb2wTMTWaXobWNI++ZrsRD5GTmuzF3kkx3krtrq6HI5NSaemxK6MTDTjDN Rj/OwTl0yU35eJXuuryB20GFOSUsxiw00I2hMGQ1Cy9L/+IW6Dvotd8O3LmKh2tFArzXaKLx /rZyGNurS/Go5YjHp8wdJOs7Ka2p1U31js24PMWO6hf6hIiY2WRUsnE6xZNhvBTgKOY6u0KT V6hTevFqEw7OAZDCWUoE2Ob2/oHGZCCMW5SLAMgp7eihF0kGf2S2CmpIFYXGb61hAD8SqSY7 Fn7V
Message-ID: <7aad7f1f-ca4c-51d3-a1f4-aefb919dae34@mozilla.com>
Date: Thu, 07 Jun 2018 16:29:33 -0600
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:52.0) Gecko/20100101 Thunderbird/52.8.0
MIME-Version: 1.0
In-Reply-To: <7161340e-014b-3740-83ed-39f4db3a30c0@nostrum.com>
Content-Type: multipart/signed; micalg="pgp-sha256"; protocol="application/pgp-signature"; boundary="N5fcrHKPcBkZwfFmcmIToXXy5gkpyJ6KA"
Archived-At: <https://mailarchive.ietf.org/arch/msg/urn/ppg-KapLnNauSrpGche_RtqOlBk>
Subject: Re: [urn] Benjamin Kaduk's Discuss on draft-hakala-urn-nbn-rfc3188bis-01: (with DISCUSS and COMMENT)
X-BeenThere: urn@ietf.org
X-Mailman-Version: 2.1.26
Precedence: list
List-Id: Revisions to URN RFCs <urn.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/urn>, <mailto:urn-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/urn/>
List-Post: <mailto:urn@ietf.org>
List-Help: <mailto:urn-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/urn>, <mailto:urn-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 07 Jun 2018 22:29:42 -0000

On 6/7/18 2:40 PM, Adam Roach wrote:
> On 6/7/18 3:02 PM, Peter Saint-Andre wrote:
>> By "normalize" you mean perform equivalence matching of percent-decoded
>> strings (of which Unicode normalization might be one step), right? Here
>> again I think the answer is "don't do that" because it's equivalence
>> matching is done on the percent-encoded strings.
> 
> 
> I think the concern here is the translation between percent-encoded URNs
> on the wire and the display form that users enter into lookup forms. For
> example, imagine some alt-history version of a country that decided that
> its NBNs would take the form <author-last-name>-<serial>. Encoded as a
> URN, this might look like urn:nbn:dd:roach-157.
> 
> The issue, of course, is that "urn:nbn:dd:m%c3%bcller-127" doesn't match
> "urn:nbn:dd:mu%cc%88ller-127".
> 
> So the problem doesn't occur at the level you mention; it happens
> somewhere between the keyboard and the network card of a querying user.
> That's not really this document's problem per se, but it is definitely a
> dragon that needs flagging. I'm not sure whether "don't do that then" is
> the correct advice, unless we have reason to believe that national
> libraries are hyper-aware of URN considerations when designing their NBN
> schemes. What we probably need to say is "if your national library
> defines an NBN that can contain percent-encoded characters higher than
> U+007F, then that same body needs to carefully define the canonical
> transformation from NBNs into URNs, including normalization forms."

Text along those lines seems appropriate. We might also discourage
people from defining their own canonical transformation, but rather
re-use one that's already defined (say, a PRECIS profile such as
UsernameCaseMapped).

Peter