Re: [I18ndir] Hooks - and other non-decomposing diacritics

"Asmus Freytag (c)" <asmusf@ix.netcom.com> Tue, 19 February 2019 00:25 UTC

Return-Path: <asmusf@ix.netcom.com>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 36E401310C3 for <i18ndir@ietfa.amsl.com>; Mon, 18 Feb 2019 16:25:29 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.997
X-Spam-Level:
X-Spam-Status: No, score=-1.997 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=ix.netcom.com; domainkeys=pass (2048-bit key) header.from=asmusf@ix.netcom.com header.d=ix.netcom.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id UYpPLE3JWZ6L for <i18ndir@ietfa.amsl.com>; Mon, 18 Feb 2019 16:25:26 -0800 (PST)
Received: from elasmtp-masked.atl.sa.earthlink.net (elasmtp-masked.atl.sa.earthlink.net [209.86.89.68]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 5832E1200ED for <i18ndir@ietf.org>; Mon, 18 Feb 2019 16:25:25 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ix.netcom.com; s=dk12062016; t=1550535926; bh=MPRo8SW55QEzsEoIebDdOp1Dzm8+Qiu+vWc2 km/05LE=; h=Received:Subject:To:Cc:References:From:Message-ID:Date: User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language: X-ELNK-Trace:X-Originating-IP; b=pwkUSYdqde1XM22sLed8OWUkQot4xEBYZ KG+PXDtlUfApW1gAl+PQF+iOR9YKimf9qAZtJV3cvZ3GEwr1wqopsPCihFF/VenAxJC zIFtnFkPxJA3ljtN6q9UB8l04XtS5Ghu7k7STqQ+r3asoX9F+r6Q7Hr1yAGWqf/AwIh DJpdLVjTwtDKpb2imjIv2dmBK0YU+bjH2dEC4De9iVXur0lK6O8Y+PHuoX4yeOo3oC9 fmeTo7x7q5Bz7Iwt4uku4qr4Ivqhwv0dXWyZCLkr7RNo/EVmRrLOIJoI9QNGHGrRYop CZ46bsMmSUaP7gB//8eivukEBmVZT7ZHG1Uvz6R3A==
DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dk12062016; d=ix.netcom.com; b=NB93XH8iL6Qzg4OXT1Uw5Fo8gRVWcqw1u6rxZjgnsj9odA2JxBwQPj2EVkq7z2L83HvW+G4p9qL5YwgI7/hzSQ5qUtnTWEN7nXaCo3g4nxQdFEJUlyCROECHlbQYrhkE7nWkphW87QUhESzNcR0IASfoFRk1iX2Ze8M4Tvk7/RSpGydpXdr7tEkRJXpLA/kE23WNSlHo6Yrn9Cv8slG11j4aLNA9LDqt0Zjm5ogfmqpE4e+B58hcB60cJuoWzTTsmmc6NaDXDqkNNtlUqEx2M6V4xbu4lbdFvh5p/8ct3R8+NrUnjENpCw3ScCw5pFkHTHxJcSmG8j2LUA3i5LNmUQ==; h=Received:Subject:To:Cc:References:From:Message-ID:Date:User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language:X-ELNK-Trace:X-Originating-IP;
Received: from [71.212.3.155] (helo=[192.168.0.5]) by elasmtp-masked.atl.sa.earthlink.net with esmtpa (Exim 4) (envelope-from <asmusf@ix.netcom.com>) id 1gvtDu-000DS0-SN; Mon, 18 Feb 2019 19:25:23 -0500
To: John C Klensin <john-ietf@jck.com>
Cc: Patrik Fältström <paf@frobbit.se>, i18ndir@ietf.org
References: <37939676-2D8A-4329-B6A0-A854F9530016@episteme.net> <f514be1d-ef7e-b32e-a304-dbb6e50ac8ae@ix.netcom.com> <0D42E9A5-B134-467A-A26B-3443ECAE1F83@frobbit.se> <b6f27274-e18f-2360-dede-67343351fbca@ix.netcom.com> <CF86A56BB400F11ACE033AF0@PSB> <96a1f45a-6052-0086-9a8b-4b58cb65c092@ix.netcom.com> <BFA865AF-6BF8-41FF-B0E4-4470F25D97BB@frobbit.se> <93ae785a-a4d4-5dbb-e8a2-ce5a1e27d843@ix.netcom.com> <B9241BCB03F01EA4F33B76CD@PSB> <ad2b6d3c-82e8-5285-c13d-746e05b1d908@ix.netcom.com> <6693BFC3-C48C-47A3-BED2-533C24C18DB5@frobbit.se> <0687bcf8-21f6-af75-9c81-3a2596f637fc@ix.netcom.com> <8F19F97ACA9EA6BE79477B98@PSB>
From: "Asmus Freytag (c)" <asmusf@ix.netcom.com>
Message-ID: <f537a3eb-5ed9-86b2-4f29-ddf53a199561@ix.netcom.com>
Date: Mon, 18 Feb 2019 16:25:22 -0800
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.5.0
MIME-Version: 1.0
In-Reply-To: <8F19F97ACA9EA6BE79477B98@PSB>
Content-Type: multipart/alternative; boundary="------------A2F6735F6CE7C8303E88A04F"
Content-Language: en-US
X-ELNK-Trace: 464f085de979d7246f36dc87813833b2817f643b8a0bc7e049e0da0f2c2ebae30075e20bea7f8155350badd9bab72f9c350badd9bab72f9c350badd9bab72f9c
X-Originating-IP: 71.212.3.155
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/2x5_teC3FU07WfwjSy2DGSCl-Ko>
Subject: Re: [I18ndir] Hooks - and other non-decomposing diacritics
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 19 Feb 2019 00:25:29 -0000

On 2/18/2019 3:56 PM, John C Klensin wrote:
>
> --On Monday, February 18, 2019 01:08 -0800 "Asmus Freytag (c)"
> <asmusf@ix.netcom.com> wrote:
>
>> ...
>> However, category Virama, and category Vowel_Dependent do need
>> context rules (otherwise you get structurally unsound labels).
>>
>> So, you could give them a a category:
>> RESTRICTED_CONTEXT_RECOMMENDED
>>
>> and likewise for all the other ISC values.
>>
>> This works as long as your categories are reasonably generic
>> and do not try to delve into "which context rules" would be
>> recommended, because that's highly script dependent. We just
>> know that you shouldn't allow them unrestricted by default.
> But there are no code points, at least no code points outside
> the ASCII repertoire, that are "unrestricted by default".

By "unrestricted by default" I mean an IDN table where this code point 
simply
appears in the list of allowed code points for the registry without any
LGR WLE or context rules or without being limited to a sequence.

I'm sorry if you misconstrued this.

> Every code point classified as PVALID and every label containing
> code points that are valid by application of appropriate
> contextual rules, is _still_ restricted by the rule that
> registries (at all levels of the DNS) are required to understand
> what they are doing and to be responsible about it.

Yes, no argument.

I was addressing the case where registries simply do not add any
restrictions of their own, even though they most probably should have.


>    If we want
> to give (or point to) additional advice and can figure out how
> to do that, fine,

good (both alternatives)


>
> If this were the only issue and we concluded IDNA2008 should not
> go further in the direction of advice than it does today, then
> the best way to get draft-faltstrom-unicode11 published as
> quickly as possible (see below), might include:
>
> * Get draft-klensin-idna-rfc5891bis into shape and ready for
> publication.  I think it would require a bit of work to be used
> this way.  In particular, make sure it says what we want it to
> say about "troublesome".

Or about "a framework for advice to registries on label generation rules 
". (*)

That's how I am thinking about this.

(*) it's not just character selection.

>
> * Persuade ICANN to release "troublesome" as part of general
> advice to registries, especially at the second level but for the
> use of anyone who felt like using it.   ICANN's doing that would
> avoid IETF constraints about informed community consensus, code
> point by code point analysis and listings, and long-term
> concerns about maintenance... and would almost certainly be a
> lot faster than trying to develop IETF consensus about those
> issues.

I think we should still have a bit more of a discussion of what I call the
"framework" -- this has more to do with the types of actions that would
be appropriate and the types of character categories.

Also, about the way all of this can be related to other Unicode properties.

Even if we end up pushing this as a request to ICANN, it would help if we
got a better idea of what we would like to see. It can't just be to ask 
ICANN
to "publish advice to registries".

>
> * Include text in draft-faltstrom-unicode11 that stresses the
> upper limit nature of IDNA2008, mentions the requirement that
> registries impose their own restrictions and be careful about
> it, and point to draft-klensin-idna-rfc5891bis for further
> discussion.   If there is really urgency about
> draft-faltstrom-unicode11 (see below) I can see ways to make
> that reference non-normative so it would not hold up publication
> _if_ we were quite confident we were actually going to finish
> and publish draft-klensin-idna-rfc5891bis.

That would work for me.


>
>    --or--
>
> we could try to figure out if the IETF wants to go into the
> recommendations business, make a recommendation about a WG or
> other mechanism to accomplish that, etc.  However a good idea
> that might be, nothing that has occurred in either the last
> several months (before and during IETF 102) or the last four and
> a half years (since the first posted version of
> draft-klensin-idna-5892upd-unicode70) has given me much hope
> that it would be quick.


Understand your reasoning for being cautious - but I thought with the
i18ndir we have at least the start of an attempt for IETF to become serious
about issues like this.

A./

PS: about "urgency" - as it stands 5892 applies to all of Unicode. With each
version, the "automatic" propagation of properties will become more of
the standard way to handle Unicode and the IANA parameters will become
less relevant.

You could say, as long as Patrik's draft doesn't contain any new exceptions
we don't need it. Not sure I buy that.



>
> _However_
>
> * I still haven't seen an explanation of why publication of
> draft-faltstrom-unicode11 is urgent or who is pushing for it
> other than the equivalent of "we've screwed around long enough".
> That isn't much of an explanation because we have screwed around
> long enough with a fairly long list of i18n documents, none of
> which have been singled out in this way.  Some of them haven't
> even been mentioned in this discussion including the early 2014
> attempt to revise the IETF character set and languages policy
> document (draft-sullivan-rfc2277-bis-00).
>
> * The above does not address issues raised and unresolved from
> that list of other documents, just the relationship between the
> present I-D, "troublesome", and registry responsibility.
>
> best,
>     john
>
>
>