Re: [I18ndir] [art] Modern Network Unicode

Asmus Freytag <> Wed, 10 July 2019 22:59 UTC


On 7/10/2019 12:09 AM, John C Klensin wrote:
> For the record, I do have one other concern.  The examples above
> use extended Latin script.  Because of its NVT origins, much of
> 5198 makes assumptions about that script or scripts closely
> related to it.  If you are doing something for this century and
> beyond, you should really think carefully about the implications
> of scripts that are very different.

There are a few scripts where un-normalized text is "preferred" by the 
user community over NFC. In some cases, the most natural ordering of 
combining marks does not match NFC's canonical ordering. In other cases, 
NFC does not compose some sequences while local user communities 
strongly prefer the precomposed code points (e.g. Bengali).
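
Both effects can be reproduced with a minimal sketch using Python's standard unicodedata module. The Arabic sequence is an assumed illustration of the reordering case (the message does not name a script for it); the Bengali letter YYA (U+09DF) is one of the precomposed code points that Unicode lists as a composition exclusion, so NFC leaves it decomposed.

```python
import unicodedata

# Case 1 (assumed example, not named in the message): canonical reordering.
# A common typing order is BEH, SHADDA, FATHA, but SHADDA has combining
# class 33 and FATHA has class 30, so normalization swaps the two marks.
typed = "\u0628\u0651\u064E"
nfc_typed = unicodedata.normalize("NFC", typed)

# Case 2: composition exclusion. BENGALI LETTER YYA (U+09DF) has a canonical
# decomposition to YA + NUKTA (U+09AF U+09BC) and is excluded from
# recomposition, so NFC replaces the precomposed code point.
yya = "\u09DF"
nfc_yya = unicodedata.normalize("NFC", yya)

def code_points(text):
    """Render a string as a list of U+XXXX labels for inspection."""
    return [f"U+{ord(ch):04X}" for ch in text]

print(code_points(typed), "->", code_points(nfc_typed))
print(code_points(yya), "->", code_points(nfc_yya))
```

In both cases the NFC form differs from what the user typed or prefers, which is why such text tends to circulate un-normalized.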

Those scripts would be an exception to John's statement: "NFC is also a 
close approximation to what any sensible terminal driver or IME is going 
to produce natively from a plausible keyboard layout for the relevant 
script", a statement that otherwise holds well.