Re: [I18ndir] I18ndir last call review of draft-ietf-regext-dnrd-objects-mapping-06

Asmus Freytag <asmusf@ix.netcom.com> Fri, 06 March 2020 00:14 UTC

Return-Path: <asmusf@ix.netcom.com>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 896D63A0F1D for <i18ndir@ietfa.amsl.com>; Thu, 5 Mar 2020 16:14:47 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.996
X-Spam-Level:
X-Spam-Status: No, score=-1.996 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=ix.netcom.com; domainkeys=pass (2048-bit key) header.from=asmusf@ix.netcom.com header.d=ix.netcom.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Gy-0e0xMT4gg for <i18ndir@ietfa.amsl.com>; Thu, 5 Mar 2020 16:14:45 -0800 (PST)
Received: from elasmtp-masked.atl.sa.earthlink.net (elasmtp-masked.atl.sa.earthlink.net [209.86.89.68]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 837C83A0F1C for <i18ndir@ietf.org>; Thu, 5 Mar 2020 16:14:45 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ix.netcom.com; s=dk12062016; t=1583453685; bh=jHREvFTITaPEMDcdVbenzzNiv0p7LjKrBO87 Ji/LWis=; h=Received:Subject:To:References:From:Message-ID:Date: User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language: X-ELNK-Trace:X-Originating-IP; b=b/IcLwy9ZlCjH3yZADhgT3MC01PBPxn5X oTKyeaHYXKpbu+5GWGyOaavjnOUPbDBkgS6hOz5KSuX0CHhrh2OelHGTB6roBw4pO6c vruDNjhqv0oOKhNSp1HoHLli6rHW0PuloRkSUkLN0qzkDXcP9T9uk1/IMB3i1sw341c PIC2i0fJ9DBdnV4tec3gkb6PBD0v+2EfRqdelsUKJINT19h4uCgIMNvMSB/i1vBOPH5 Q6I8tc5AYUsh5wdmkWK6tr141aNySA2Fz3IH7Tu0NgEgQLcaO0opTmGZktybAMbxJ+2 k0BCH2+ZeoCYUUQEdikhMMRWjZPxx5Vsr5QBnrE0g==
DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dk12062016; d=ix.netcom.com; b=ZOQSe4AuH5PakwWUg3zvwGXtU9LVZdAUYwQhl4TneGMGg2dVpoLtkCMWrf23UwH+cJjPYcfMaaLW4HlxoI/G9YScQDiFAZsT2naiKlcJNzPBt/cn7WmyTlEoNLZvF3JwiVp76un6NDz5U7rpe1XxkZmmp2xhBqgLVhm62q2wFW8p0aeJXNSnmg5qel6r8QnmiVDP8xE+jDi8L030iqN1I1J65UIxle03o0KJblk90eb2ZjNHSSskW/zX8zmdX36EzdAGnQMve/3nKhqq6+TlR8xYkZ8feLVQRerartWW7E0dkKhYU7jWtQ0zAAolkynM+F+A6bFjpDQfgGR5lMqnvQ==; h=Received:Subject:To:References:From:Message-ID:Date:User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language:X-ELNK-Trace:X-Originating-IP;
Received: from [75.172.97.195] (helo=[192.168.1.106]) by elasmtp-masked.atl.sa.earthlink.net with esmtpa (Exim 4) (envelope-from <asmusf@ix.netcom.com>) id 1jA0dW-000EZa-Uc for i18ndir@ietf.org; Thu, 05 Mar 2020 19:14:43 -0500
To: i18ndir@ietf.org
References: <158343520135.15044.10991712449156105132@ietfa.amsl.com> <9CD56DEFBC9108D9620ED61E@PSB>
From: Asmus Freytag <asmusf@ix.netcom.com>
Message-ID: <2cb9e78f-32dc-3e2f-ba1a-6ae0218f3ef9@ix.netcom.com>
Date: Thu, 05 Mar 2020 16:14:43 -0800
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.5.0
MIME-Version: 1.0
In-Reply-To: <9CD56DEFBC9108D9620ED61E@PSB>
Content-Type: multipart/alternative; boundary="------------BAAC8C17688EC7632E5744ED"
Content-Language: en-US
X-ELNK-Trace: 464f085de979d7246f36dc87813833b26976a2cdabd2db7a79b06a382a360a26c8f1c1a68e9af841350badd9bab72f9c350badd9bab72f9c350badd9bab72f9c
X-Originating-IP: 75.172.97.195
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/vHjk5oJR9n4sU7wkK7IhDDxJt8U>
Subject: Re: [I18ndir] I18ndir last call review of draft-ietf-regext-dnrd-objects-mapping-06
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 06 Mar 2020 00:14:48 -0000

On 3/5/2020 12:47 PM, John C Klensin wrote:
> (All list-type other than i18ndir removed)
>
> General observations about substance:
>
> (1) I completely agree with Marc about the UTF-8 issue.  Unless
> there is a really good reason for allowing either other
> normalization forms of Unicode or any other CCS, new protocols
> and IETF-specified formats should be restricted to UTF-8.  I
> would go a half-step further and suggest that any deviations
> from that principle should be explicitly discussed and justified
> in an Internationalization Considerations section, i.e., that
> section should not be just a description of what XML allows.

+1

(In fact, the i18n considerations in the draft were defective in not
discussing the CSV mode at all.)

About the your suggested "half-step further": do we need to write
a short RFC to explicitly put that on the record? Or how would you
think such a mandate would be expressed?

>
> (2) I have not studied this spec (or RFC 5732 carefully enough
> to be sure, but it appears to me that, with four normalization
> forms specified in The Unicode Standard and periodic disputes
> about preferences between NF[K]C and NF[K]D, "normalizedString"
> may be under-specified.  Clearly actual IDN labels should
> (SHOULD?) be in NFC form, but most other fields are less
> obvious.

I would say: IDN labels MUST be in NFC.

There's no reason for formal records of IDN U-labels not to be
in the form specified by IDNA.

> In addition, given increasing trend on the web 9at
> least) to do exactly what TUS says to do, which is to normalize
> only at comparison time rather than trying to carry strings
> around in normalized form, the application of that attribute all
> almost all text-type values may be inappropriate. I can find no
> evidence in the I-D that those issues were considered; the
> document should not progress until they are.

Right, there are many other contexts where it makes sense to
keep the data as submitted and not to force normalization.

However, this does not appear to be one of them.

We may need a statement specific to IETF that captures the
suggested policy.


>
> Directorate procedural issue:
>
> I want to thank Marc for copying the i18ndir list on this review.
>
> Given our low, and perhaps waning, levels of activity and the
> "no one is expert on all i18n issues" principle described in
> draft-klensin-unicode-review, I think it would be helpful if all
> requests for reviews of i18n-related documents and review
> assignments were posted to the i18ndir list.

Seems fine. Appointing a lead reviewer shouldn't make the
that information private.

> That would reduce
> the chances of duplicated effort and increase the odds of enough
> cross-checking (however slightly) to be sure that directorate
> opinions were at least consistent with each other.    Without
> that sort of communication, we are not a directorate; we would
> be, at best, a loosely-structured review team.

We would simply be a review pool (like a typing pool of old).

> With the understanding that it might not accomplish much of
> anything if we continue to be as inactive as we have been in
> recent months, is there any reason to not expose review requests
> and assignments to this list?
>
> thanks,
>     john
>
>
>
> --On Thursday, March 5, 2020 11:06 -0800 Marc Blanchet via
> Datatracker <noreply@ietf.org> wrote:
>
>> Reviewer: Marc Blanchet
>> Review result: Ready with Issues
>>
>> I was assigned by the Internationalization Directorate to do a
>> review of this document with a specific eye on
>> internationalization and also a specific request from AD to
>> look at section 10.
>>
>> I would like to point out that in some cases, the spec seem to
>> provide a choice for the implementor/deposit provider to use
>> something else than UTF-8 for the non-ascii encoding. For
>> example, section 4.6.2.1. provides a choice of encoding for
>> csv files: "encoding  Defines the encoding of the CSV file
>> with the default encoding of "UTF-8". Moreover, section 10
>> talks about UTF-8 and UTF-16 and recommends UTF-8 instead of
>> making it mandatory. At the same time, there are multiple
>> fields in this spec that are defined as UTF-8. Therefore, it
>> would be appropriate and much less prone to interoperability
>> problems to make UTF-8 the only encoding possible, specially
>> given that most protocols, data payloads and software
>> librairies are using UTF-8 encoding. If the authors agree, then
>> section 10 and 4.6.2.1 could be revised, and probably adding a
>> paragraph in section 1 or 4 that states the only possible
>> encoding is UTF-8 for both CSV and XML files.
>>
>> Section 9.14 schema has a comment on ACE name field. Wonder if
>> A-label would be more appropriate.
>>
>> Section 5.6.2.1.1. While in other parts of the spec, the
>> encoding was clearly identified as UTF-8, the definition of
>> "<rdeCsv:fUName>  Name of the NNDN in Unicode character set
>> for the <csvNNDN:fAName> field element." does not state any.
>> Might want to say it clearly as UTF-8 like others.
>