Re: [I18ndir] Hooks - and other non-decomposing diacritics

John C Klensin <john-ietf@jck.com> Mon, 18 February 2019 23:56 UTC

Return-Path: <john-ietf@jck.com>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1630A1310B3 for <i18ndir@ietfa.amsl.com>; Mon, 18 Feb 2019 15:56:21 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.899
X-Spam-Level:
X-Spam-Status: No, score=-1.899 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_NONE=-0.0001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Tw98w1fRbW5A for <i18ndir@ietfa.amsl.com>; Mon, 18 Feb 2019 15:56:19 -0800 (PST)
Received: from bsa2.jck.com (bsa2.jck.com [70.88.254.51]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id BCB831310B0 for <i18ndir@ietf.org>; Mon, 18 Feb 2019 15:56:18 -0800 (PST)
Received: from [198.252.137.10] (helo=PSB) by bsa2.jck.com with esmtp (Exim 4.82 (FreeBSD)) (envelope-from <john-ietf@jck.com>) id 1gvsli-000F5Z-30; Mon, 18 Feb 2019 18:56:14 -0500
Date: Mon, 18 Feb 2019 18:56:08 -0500
From: John C Klensin <john-ietf@jck.com>
To: "Asmus Freytag (c)" <asmusf@ix.netcom.com>
cc: Patrik Fältström <paf@frobbit.se>, i18ndir@ietf.org
Message-ID: <8F19F97ACA9EA6BE79477B98@PSB>
In-Reply-To: <0687bcf8-21f6-af75-9c81-3a2596f637fc@ix.netcom.com>
References: <37939676-2D8A-4329-B6A0-A854F9530016@episteme.net> <f514be1d-ef7e-b32e-a304-dbb6e50ac8ae@ix.netcom.com> <0D42E9A5-B134-467A-A26B-3443ECAE1F83@frobbit.se> <b6f27274-e18f-2360-dede-67343351fbca@ix.netcom.com> <CF86A56BB400F11ACE033AF0@PSB> <96a1f45a-6052-0086-9a8b-4b58cb65c092@ix.netcom.com> <BFA865AF-6BF8-41FF-B0E4-4470F25D97BB@frobbit.se> <93ae785a-a4d4-5dbb-e8a2-ce5a1e27d843@ix.netcom.com> <B9241BCB03F01EA4F33B76CD@PSB> <ad2b6d3c-82e8-5285-c13d-746e05b1d908@ix.netcom.com> <6693BFC3-C48C-47A3-BED2-533C24C18DB5@frobbit.se> <0687bcf8-21f6-af75-9c81-3a2596f637fc@ix.netcom.com>
X-Mailer: Mulberry/4.0.8 (Win32)
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
X-SA-Exim-Connect-IP: 198.252.137.10
X-SA-Exim-Mail-From: john-ietf@jck.com
X-SA-Exim-Scanned: No (on bsa2.jck.com); SAEximRunCond expanded to false
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/BwtcDcX05rpj_wXq5F6Yf789gSE>
Subject: Re: [I18ndir] Hooks - and other non-decomposing diacritics
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 18 Feb 2019 23:56:21 -0000


--On Monday, February 18, 2019 01:08 -0800 "Asmus Freytag (c)"
<asmusf@ix.netcom.com> wrote:

>...
> However, category Virama, and category Vowel_Dependent do need
> context rules (otherwise you get structurally unsound labels).
> 
> So, you could give them a a category:
> RESTRICTED_CONTEXT_RECOMMENDED
> 
> and likewise for all the other ISC values.
> 
> This works as long as your categories are reasonably generic
> and do not try to delve into "which context rules" would be
> recommended, because that's highly script dependent. We just
> know that you shouldn't allow them unrestricted by default.

But there are no code points, at least no code points outside
the ASCII repertoire, that are "unrestricted by default".
Every code point classified as PVALID and every label containing
code points that are valid by application of appropriate
contextual rules, is _still_ restricted by the rule that
registries (at all levels of the DNS) are required to understand
what they are doing and to be responsible about it.  If we want
to give (or point to) additional advice and can figure out how
to do that, fine, but where the specification is today isn't
"unrestricted by default" and setting that up as a strawman does
not, IMO, help us make progress.

If this were the only issue and we concluded IDNA2008 should not
go further in the direction of advice than it does today, then
the best way to get draft-faltstrom-unicode11 published as
quickly as possible (see below), might include:

* Get draft-klensin-idna-rfc5891bis into shape and ready for
publication.  I think it would require a bit of work to be used
this way.  In particular, make sure it says what we want it to
say about "troublesome".

* Persuade ICANN to release "troublesome" as part of general
advice to registries, especially at the second level but for the
use of anyone who felt like using it.   ICANN's doing that would
avoid IETF constraints about informed community consensus, code
point by code point analysis and listings, and long-term
concerns about maintenance... and would almost certainly be a
lot faster than trying to develop IETF consensus about those
issues.

* Include text in draft-faltstrom-unicode11 that stresses the
upper limit nature of IDNA2008, mentions the requirement that
registries impose their own restrictions and be careful about
it, and point to draft-klensin-idna-rfc5891bis for further
discussion.   If there is really urgency about
draft-faltstrom-unicode11 (see below) I can see ways to make
that reference non-normative so it would not hold up publication
_if_ we were quite confident we were actually going to finish
and publish draft-klensin-idna-rfc5891bis.

  --or-- 

we could try to figure out if the IETF wants to go into the
recommendations business, make a recommendation about a WG or
other mechanism to accomplish that, etc.  However a good idea
that might be, nothing that has occurred in either the last
several months (before and during IETF 102) or the last four and
a half years (since the first posted version of
draft-klensin-idna-5892upd-unicode70) has given me much hope
that it would be quick.

_However_

* I still haven't seen an explanation of why publication of
draft-faltstrom-unicode11 is urgent or who is pushing for it
other than the equivalent of "we've screwed around long enough".
That isn't much of an explanation because we have screwed around
long enough with a fairly long list of i18n documents, none of
which have been singled out in this way.  Some of them haven't
even been mentioned in this discussion including the early 2014
attempt to revise the IETF character set and languages policy
document (draft-sullivan-rfc2277-bis-00).

* The above does not address issues raised and unresolved from
that list of other documents, just the relationship between the
present I-D, "troublesome", and registry responsibility.

best,
   john