Re: [I18ndir] I-D on filesystem I18N
Nico Williams <nico@cryptonector.com> Wed, 08 July 2020 16:51 UTC
Return-Path: <nico@cryptonector.com>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1])
by ietfa.amsl.com (Postfix) with ESMTP id 580BB3A0F39
for <i18ndir@ietfa.amsl.com>; Wed, 8 Jul 2020 09:51:02 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5
tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1,
DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_MSPIKE_H3=0.001,
RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001,
URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key)
header.d=cryptonector.com
Received: from mail.ietf.org ([4.31.198.44])
by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024)
with ESMTP id z_GTzYjfETtL for <i18ndir@ietfa.amsl.com>;
Wed, 8 Jul 2020 09:51:01 -0700 (PDT)
Received: from camel.elm.relay.mailchannels.net
(camel.elm.relay.mailchannels.net [23.83.212.29])
(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
(No client certificate requested)
by ietfa.amsl.com (Postfix) with ESMTPS id E87C13A0EB3
for <i18ndir@ietf.org>; Wed, 8 Jul 2020 09:51:00 -0700 (PDT)
X-Sender-Id: dreamhost|x-authsender|nico@cryptonector.com
Received: from relay.mailchannels.net (localhost [127.0.0.1])
by relay.mailchannels.net (Postfix) with ESMTP id A15C1361593;
Wed, 8 Jul 2020 16:50:58 +0000 (UTC)
Received: from pdx1-sub0-mail-a38.g.dreamhost.com
(100-96-8-49.trex.outbound.svc.cluster.local [100.96.8.49])
(Authenticated sender: dreamhost)
by relay.mailchannels.net (Postfix) with ESMTPA id DBA2D36161F;
Wed, 8 Jul 2020 16:50:57 +0000 (UTC)
X-Sender-Id: dreamhost|x-authsender|nico@cryptonector.com
Received: from pdx1-sub0-mail-a38.g.dreamhost.com (pop.dreamhost.com
[64.90.62.162]) (using TLSv1.2 with cipher DHE-RSA-AES256-GCM-SHA384)
by 0.0.0.0:2500 (trex/5.18.8); Wed, 08 Jul 2020 16:50:58 +0000
X-MC-Relay: Good
X-MailChannels-SenderId: dreamhost|x-authsender|nico@cryptonector.com
X-MailChannels-Auth-Id: dreamhost
X-Illustrious-Industry: 6eb87e0539ad38e9_1594227058320_1988094108
X-MC-Loop-Signature: 1594227058320:4211443226
X-MC-Ingress-Time: 1594227058320
Received: from pdx1-sub0-mail-a38.g.dreamhost.com (localhost [127.0.0.1])
by pdx1-sub0-mail-a38.g.dreamhost.com (Postfix) with ESMTP id 4150FB4169;
Wed, 8 Jul 2020 09:50:57 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=cryptonector.com; h=date
:from:to:cc:subject:message-id:references:mime-version
:content-type:in-reply-to; s=cryptonector.com; bh=a8EQ+kirPUNwrO
g7vbZvOFF/EVc=; b=HHSd5rFpEaYJgA7eVxRrOa0SybOgwElElkk+fvdRHd1B2z
5D+zVlguZybeTdxPkUlxi3Yl6KIEEyfgiRn4KjsUAfVutwbcOEcBBT7N8vin596e
e6E1r9ZApI+fGEFdH9nMvtJ6e8i/XWncjC+1AhiiI6p0CbX+VCR04iPwW10Ec=
Received: from localhost (unknown [24.28.108.183])
(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
(No client certificate requested)
(Authenticated sender: nico@cryptonector.com)
by pdx1-sub0-mail-a38.g.dreamhost.com (Postfix) with ESMTPSA id 2A7A3B4164;
Wed, 8 Jul 2020 09:50:54 -0700 (PDT)
Date: Wed, 8 Jul 2020 11:50:49 -0500
X-DH-BACKEND: pdx1-sub0-mail-a38
From: Nico Williams <nico@cryptonector.com>
To: John R Levine <johnl@taugh.com>
Cc: i18ndir@ietf.org
Message-ID: <20200708165048.GP3100@localhost>
References: <20200706225139.GJ3100@localhost>
<20200708004250.01B111C67670@ary.qy>
<20200708013311.GZ3100@localhost>
<alpine.OSX.2.22.407.2007072143001.22521@ary.qy>
<20200708032341.GF3100@localhost>
<alpine.OSX.2.22.407.2007081147001.24605@ary.qy>
<20200708161644.GO3100@localhost>
<alpine.OSX.2.22.407.2007081222580.24789@ary.qy>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <alpine.OSX.2.22.407.2007081222580.24789@ary.qy>
User-Agent: Mutt/1.9.4 (2018-02-28)
X-VR-OUT-STATUS: OK
X-VR-OUT-SCORE: -100
X-VR-OUT-SPAMCAUSE: gggruggvucftvghtrhhoucdtuddrgeduiedrudejgddutdejucetufdoteggodetrfdotffvucfrrhhofhhilhgvmecuggftfghnshhusghstghrihgsvgdpffftgfetoffjqffuvfenuceurghilhhouhhtmecufedttdenucesvcftvggtihhpihgvnhhtshculddquddttddmnecujfgurhepfffhvffukfhfgggtuggjfgesthdtredttdervdenucfhrhhomheppfhitghoucghihhllhhirghmshcuoehnihgtohestghrhihpthhonhgvtghtohhrrdgtohhmqeenucggtffrrghtthgvrhhnpefftdektefhueetveeigfefgeejteejvdfhhefgvddtfeeujeehleeguefhgffhgfenucfkphepvdegrddvkedruddtkedrudekfeenucevlhhushhtvghrufhiiigvpedtnecurfgrrhgrmhepmhhouggvpehsmhhtphdphhgvlhhopehlohgtrghlhhhoshhtpdhinhgvthepvdegrddvkedruddtkedrudekfedprhgvthhurhhnqdhprghthheppfhitghoucghihhllhhirghmshcuoehnihgtohestghrhihpthhonhgvtghtohhrrdgtohhmqedpmhgrihhlfhhrohhmpehnihgtohestghrhihpthhonhgvtghtohhrrdgtohhmpdhnrhgtphhtthhopehnihgtohestghrhihpthhonhgvtghtohhrrdgtohhm
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/pDKyo2MBVTCh3IzBrT1ouCvl7Ow>
Subject: Re: [I18ndir] I-D on filesystem I18N
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>,
<mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>,
<mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 08 Jul 2020 16:51:02 -0000
On Wed, Jul 08, 2020 at 12:23:53PM -0400, John R Levine wrote: > So long as we make it clear what kind of advice it is, and we acknowledge > the issues in migrating from what they do now to UTF-8 nirvana, that could > be somewhat useful. Right! Thank you for seeing that. This work sure can't hurt. Migration is tough of course, but for _new_ filesystems it is a non-issue (unless the intent is to restore existing contents that needs migration). Stopping the rot is often the first step towards repairing it. Two schemes are described in my I-D: the HFS+ scheme (normalize on create and lookup) and the ZFS scheme (form-insensitive/preserving). (As you can see, there is nothing terribly original in this I-D...) The former is by far the simplest to retrofit onto existing filesystem code that doesn't index directory contents. Conversely, the latter is equally easy to retrofit into filesystems that do index directory contents. The latter yields superior user experience and interoperability, but the former would be good enough if it normalized to NFC (sadly, Apple went with NFD, but others could choose to use NFC). Those two schemes were arrived at by Apple and Sun engineers between 2 and 1.5 decades ago, respectively. To my knowledge, in all that time no better approach has been found. We should document this and take the opportunity to make small improvements, such as mapping U+2011 to U+002D and such. In the worst case, it will do as much good as the NFSv4 spec's I18N provisions (none), though also no harm. In the best case it will help make all sorts of filesystems better. Lastly, I believe this approach has wider applicability, thus documenting it may help in areas other than filesystems. Nico --
- [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Patrik Fältström
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N John C Klensin
- Re: [I18ndir] I-D on filesystem I18N Patrik Fältström
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N John C Klensin
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N John C Klensin
- [I18ndir] Do we need an I18N WG? (Re: I-D on file… Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N John C Klensin
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Asmus Freytag
- Re: [I18ndir] I-D on filesystem I18N John Levine
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N John C Klensin
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Asmus Freytag
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Asmus Freytag (c)
- Re: [I18ndir] I-D on filesystem I18N John R Levine
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Asmus Freytag (c)
- Re: [I18ndir] I-D on filesystem I18N Asmus Freytag (c)
- Re: [I18ndir] I-D on filesystem I18N Patrik Fältström
- Re: [I18ndir] I-D on filesystem I18N Patrik Fältström
- Re: [I18ndir] I-D on filesystem I18N John C Klensin
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Patrik Fältström
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Patrik Fältström
- Re: [I18ndir] I-D on filesystem I18N Asmus Freytag
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N John R Levine
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N John R Levine
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N John Levine
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams
- Re: [I18ndir] I-D on filesystem I18N Nico Williams