Re: [I18nrp] Last Call: <draft-faltstrom-unicode11-05.txt> (IDNA2008 and Unicode 11.0.0) to Informational RFC

"Asmus Freytag (c)" <asmusf@ix.netcom.com> Sun, 09 December 2018 23:38 UTC

Return-Path: <asmusf@ix.netcom.com>
X-Original-To: i18nrp@ietfa.amsl.com
Delivered-To: i18nrp@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 6B48A128D68 for <i18nrp@ietfa.amsl.com>; Sun, 9 Dec 2018 15:38:58 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.69
X-Spam-Level:
X-Spam-Status: No, score=-2.69 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001, T_KAM_HTML_FONT_INVALID=0.01] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=ix.netcom.com; domainkeys=pass (2048-bit key) header.from=asmusf@ix.netcom.com header.d=ix.netcom.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ToFC0R-hvP7S for <i18nrp@ietfa.amsl.com>; Sun, 9 Dec 2018 15:38:55 -0800 (PST)
Received: from elasmtp-dupuy.atl.sa.earthlink.net (elasmtp-dupuy.atl.sa.earthlink.net [209.86.89.62]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 5CEF512D4EB for <i18nrp@ietf.org>; Sun, 9 Dec 2018 15:38:54 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ix.netcom.com; s=dk12062016; t=1544398735; bh=Dv/GSXIKv+CqnlrbHs+wGo9opwvM4brKcjPI 1gEvhPA=; h=Received:Subject:To:Cc:References:From:Message-ID:Date: User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language: X-ELNK-Trace:X-Originating-IP; b=Gmih62YOjOQnb8n4DkonhUnk8WYaIUorP ulNPka9/tQ97N1WNDiklVrCdU8zViWRe4CtYgnKx72pnlbJyA+1/OqDKhzCbCZ8PA7u 4Oaaw5dcA5GkB7oenapEuUN7qDDVsXVWcq7WEXdmocKTaL7Jw5GemVybrMT9p0B32Z5 O1fkXP+YDsZxWBhf3/BlPzS3a1a7+1YoKxDm6hSIRR7bXMF+VfgbPJucKQxoGCtIjYj EV5cPNoA8tz9qf2KcHm5mlRLqA8jhYOXHx4mWcuKNUuBFwBx91fmi3aaSSlvFX12gma /IX4zalSvDzGndxa53Fuipn+q6pMiJA8jnYuIAcFw==
DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dk12062016; d=ix.netcom.com; b=Z2FeZSi+1Sn8UxFJyv15Jv7/hLr4YTPVHO5mxFGjb41+TggDDd9i7AAtCaDdAiAbs9sAN8IOMXNixysB4iYm2UEJs94YlLqYQEjJZ531j0/7799ZULudHj0UqrZ4zNVh5/N1se3QV/bCbaYQTNEu7GcfTfc70e0N0gcS6j1D0ID2JBI7NJIal3V5bRMncmI93JpVOD1e39lxEppy+pp5RopEpE4n6GVfunW4lnyoKwUV+F6U1wbmmyG2Rgq745pSOCU1OC58GlBDyFl/HLe7k8NYkoo2LNE8dUWeoYvKNGpp9vEarGixkErn1blfSSVx5vyI/ma+KXZcVqmFt/j6Bw==; h=Received:Subject:To:Cc:References:From:Message-ID:Date:User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language:X-ELNK-Trace:X-Originating-IP;
Received: from [174.21.171.131] (helo=[192.168.1.111]) by elasmtp-dupuy.atl.sa.earthlink.net with esmtpa (Exim 4) (envelope-from <asmusf@ix.netcom.com>) id 1gW8ey-000E0k-4j; Sun, 09 Dec 2018 18:38:52 -0500
To: Larry Masinter <LMM@acm.org>, 'John C Klensin' <john-ietf@jck.com>, 'Patrik Fältström' <paf=40frobbit.se@dmarc.ietf.org>
Cc: i18nrp@ietf.org, 'Paul Hoffman' <paul.hoffman@vpnc.org>
References: <154385119878.18333.5085298134102919486.idtracker@ietfa.amsl.com> <FF6F9EB9-C73B-4EC0-AC4F-3E3BFBABA0AB@vpnc.org> <8E20D432-01B0-4B52-80BB-3348C5FE73AF@vpnc.org> <CC73FC25-92FC-4822-B267-15C41CE450F2@frobbit.se> <D81CDFF3-8CDF-4168-9CEA-E8DC3A133B73@vpnc.org> <217ede0e-ea1f-bb31-a276-f8c618c71278@ix.netcom.com> <8885EE4C-412E-4337-A099-66354A36CEA1@vpnc.org> <EC12FDAE-4ABD-4AD3-A35A-B39D2C8A0AE0@frobbit.se> <f4417f80-fa86-11e6-baf7-2365981e18b1@ix.netcom.com> <48A2A546-4FEA-4060-8706-34D210B2ABAF@frobbit.se> <055301d48dc8$0ea95120$2bfbf360$@acm.org> <07CB0B3B-E48A-40CD-BBC9-E6CAA2FB29F0@frobbit.se> <001d01d48dee$82b415c0$881c4140$@acm.org> <1f879380-f586-cddf-ae4b-62cfc106308a@ix.netcom.com> <00f301d48e63$071e9be0$155bd3a0$@acm.org> <0D2335F6D932D325C3FBA91E@PSB> <6a8c84c4-a7af-9398-e706-199a6ec61d81@ix.netcom.com> <009f01d48ecc$18d853d0$4a88fb70$@acm.org> <932251fc-d620-8d77-aaa5-684d917c9ed6@ix.netcom.com> <01ab01d49009$e69b4d70$b3d1e850$@acm.org>
From: "Asmus Freytag (c)" <asmusf@ix.netcom.com>
Message-ID: <5ccda97f-7375-afb5-6048-2cebc2cdf6bc@ix.netcom.com>
Date: Sun, 09 Dec 2018 15:38:56 -0800
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.3.3
MIME-Version: 1.0
In-Reply-To: <01ab01d49009$e69b4d70$b3d1e850$@acm.org>
Content-Type: multipart/alternative; boundary="------------F0E759325053970640195A72"
Content-Language: en-US
X-ELNK-Trace: 464f085de979d7246f36dc87813833b28d93432b0f0788b99d2e780c1e858ac235981e7521cbac1e350badd9bab72f9c350badd9bab72f9c350badd9bab72f9c
X-Originating-IP: 174.21.171.131
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18nrp/XsoVYIbxpdZ6Iy3_KSL_BD8woq0>
Subject: Re: [I18nrp] Last Call: <draft-faltstrom-unicode11-05.txt> (IDNA2008 and Unicode 11.0.0) to Informational RFC
X-BeenThere: i18nrp@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Review Procedures <i18nrp.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18nrp>, <mailto:i18nrp-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18nrp/>
List-Post: <mailto:i18nrp@ietf.org>
List-Help: <mailto:i18nrp-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18nrp>, <mailto:i18nrp-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 09 Dec 2018 23:38:58 -0000

On 12/9/2018 1:55 PM, Larry Masinter wrote:
>
> I think we’re looking at the problem from different ends of the telescope.
>
Not necessarily.

> I’m concerned with giving guidance to the use of Unicode in other 
> naming contexts, including URLs.
>
I'm interested in that aspect as well, even though my current projects 
is focused on IDN labels for domains.

Many of the considerations I mentioned apply - they are based on the 
nature of the writing systems more than on particularities specific to 
domain names.


> I think we should extend the advice to anyone assigning a “Unicode 
> name” e.g., email addresses social media handles.
>

Agreed.

Here are some potential distinctions between different types of systems 
that handle Unicode names.

1) public vs. single entity

That is, who can create names. If any user can create names, then you'll 
want some protection against malicious registrations (or some other 
means of handling duplicate handles)

For singly entity controlled naming systems, malicious use and 
inadvertent duplication would not be a concern (as long as names are 
truly assigned by that entity and not via pass-through from other data - 
like document titles).

2) registry-based vs. open naming

If there's a list of names maintained in a registry (or database of some 
sort), it's possible to query it and prevent some duplicate names from 
being added -- that allows the addition of script or writing-system 
dependent rules to manage collisions.

Without such a database, inadvertent or malicious duplicates are hard to 
police.

3) exact or permissive lookup

Some name systems perform folding (like punctuation removal for e-mail 
names). Such lookup systems could directly support linguistic and visual 
variants by mapping all of them to a "preferred" form of the label.

Exact lookup systems must mimic that with a support for variant labels. 
Unlike a label folding scheme, the variant mechanism allows more 
fine-grained control over which form(s) of the label will succeed in 
lookup (and registration).

> In particular, the  IRI document RFC3987 needs update to align its 
> recommendations with IDNA and the WHATWG URL “living standard” (which 
> mainly neglects giving any advice at all the those choosing URLs.)
>

OK; looks like there's work to do.

A./


> Larry
>
> --
>
> https://LarryMasinter.net
>
> *From:*i18nRP <i18nrp-bounces@ietf.org> *On Behalf Of *Asmus Freytag (c)
> *Sent:* Saturday, December 8, 2018 1:13 PM
> *To:* Larry Masinter <LMM@acm.org>; 'John C Klensin' 
> <john-ietf@jck.com>; 'Patrik Fältström' <paf=40frobbit.se@dmarc.ietf.org>
> *Cc:* i18nrp@ietf.org; 'Paul Hoffman' <paul.hoffman@vpnc.org>
> *Subject:* Re: [I18nrp] Last Call: <draft-faltstrom-unicode11-05.txt> 
> (IDNA2008 and Unicode 11.0.0) to Informational RFC
>
> On 12/8/2018 12:00 AM, Larry Masinter wrote:
>
>     We were discussing Patrik’s document and what advice or rules to
>     give to Registrars about being conservative,
>
>     And some questions about the motivation or intelligence of
>     Registrars and their Clients.
>
>     It would help if the Registrars could pass off responsibility to
>     their Clients; the only reason not to do so would be that the
>     clients shouldn’t be expected to know the complex rules.
>
>     The transcription problem is relatively easy to explain and
>     understand, and covers most other ways in which a name could be
>     “bad”.
>
>     These names don’t fall from the sky. Someone chooses them. Give
>     them a clear motivation.
>
> The rules are complex even if the basic requirements can be stated simply.
>
> The process of teasing them out for the 28 scripts that make up all 
> but a miniscule fraction of modern use is progressing nicely, but 
> including preparatory phases will probably have taken most of a decade 
> before completion. (I'm referring to the Root Zone LGR program).
>
> The good news is that the result of this effort will be well 
> documented (linking the final design to the underlying details of the 
> scripts and language use) and that it will also be available in a 
> machine-readable format.
>
> That means that for the first time, there would be a well-researched 
> (and well-documented) starting point for anyone trying to create and 
> enforce registry policies for pretty much any writing system used in 
> daily transactions.
>
> Policies for the second level would have to extend this template to 
> allow digits and hyphen - and perhaps relax some other restrictions 
> that are seen as relevant only to the root. This is a much smaller 
> task than starting from scratch, but for some scripts, even that task 
> is not entirely trivial.
>
> Common to the rules for most of the scripts is that they define 
> variant labels and the restrictions applicable to them (never 
> allocatable to unrelated entities and most commonly the first 
> allocated blocks all subsequent registrations of variants). Carefully 
> designed to make indistinguishable or semantically equivalent labels 
> mutually exclusive or reserved for a single applicant, these policies 
> require access to a registry to implement. Only when you know what is 
> allocated already can you evaluate whether a new label is a blocked 
> variant.
>
> Being machine readable should make it easy to integrate these rules 
> into registry operations. Therefore, I'm a bit confused as to what 
> would be gained by somehow distributing this responsibility down the 
> chain - other than surfacing restrictions built into the registry to 
> applicants.
>
> A./
>