Re: [I18ndir] [art] Modern Network Unicode

John C Klensin <john-ietf@jck.com> Wed, 10 July 2019 23:34 UTC

Return-Path: <john-ietf@jck.com>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 239E4120397 for <i18ndir@ietfa.amsl.com>; Wed, 10 Jul 2019 16:34:02 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.897
X-Spam-Level:
X-Spam-Status: No, score=-1.897 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_NONE=0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id HiYvW3FvL92d for <i18ndir@ietfa.amsl.com>; Wed, 10 Jul 2019 16:33:59 -0700 (PDT)
Received: from bsa2.jck.com (ns.jck.com [70.88.254.51]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id C349012037B for <i18ndir@ietf.org>; Wed, 10 Jul 2019 16:33:59 -0700 (PDT)
Received: from [198.252.137.10] (helo=PSB) by bsa2.jck.com with esmtp (Exim 4.82 (FreeBSD)) (envelope-from <john-ietf@jck.com>) id 1hlM61-000HAa-EF; Wed, 10 Jul 2019 19:33:57 -0400
Date: Wed, 10 Jul 2019 19:33:52 -0400
From: John C Klensin <john-ietf@jck.com>
To: Asmus Freytag <asmusf@ix.netcom.com>, i18ndir@ietf.org, Carsten Bormann <cabo@tzi.org>
Message-ID: <EEF2384430D3C2B158C5C601@PSB>
In-Reply-To: <213ae024-b819-4f56-6e37-0cd53eb566c9@ix.netcom.com>
References: <0A5251342D480BA6437F7549@PSB> <B243365E-F7C5-4C53-A64F-2E3E87C4CD66@tzi.org> <248A8DD5DA0D3D34D6B6EFC9@PSB> <213ae024-b819-4f56-6e37-0cd53eb566c9@ix.netcom.com>
X-Mailer: Mulberry/4.0.8 (Win32)
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
X-SA-Exim-Connect-IP: 198.252.137.10
X-SA-Exim-Mail-From: john-ietf@jck.com
X-SA-Exim-Scanned: No (on bsa2.jck.com); SAEximRunCond expanded to false
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/F-YArhT0a3ElRO8MH1QPs_sQ2Q4>
Subject: Re: [I18ndir] [art] Modern Network Unicode
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 10 Jul 2019 23:34:09 -0000


--On Wednesday, July 10, 2019 16:00 -0700 Asmus Freytag
<asmusf@ix.netcom.com> wrote:

> On 7/10/2019 12:09 AM, John C Klensin wrote:
>> For the record, I do have one other concern.  The examples
>> above use extended Latin script.  Because of its NVT origins,
>> much of 5198 makes assumptions about that script or scripts
>> closely related to it.  If you are doing something for this
>> century and beyond, you should really think carefully about
>> the implications of scripts that are very different.
> 
> There are a few scripts where un-normalized text is
> "preferred" by the user community over NFC. In some cases, the
> most natural ordering of combining marks does not match NFC's
> canonical ordering. I other cases, NFC does not compose some
> sequences while local user communities strongly prefer the
> precomposed code points (e.g. Bengali).
> 
> Those scripts would be an exception to John's statement: " NFC
> is also a close approximation to what any sensible terminal
> driver or IME is going to produce natively from a plausible
> keyboard layout for the relevant script", a statement that
> otherwise holds well.

Indeed.   The explanation is also more detailed that I had time
for, and better than I would have written if I had tried, as to
why encouraging everything to be forced into NFC before
transmission is probably unwise.

    john