Re: [I18ndir] I-D on filesystem I18N

"Asmus Freytag (c)" <asmusf@ix.netcom.com> Wed, 08 July 2020 02:27 UTC

Return-Path: <asmusf@ix.netcom.com>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id DCE383A0FD5 for <i18ndir@ietfa.amsl.com>; Tue, 7 Jul 2020 19:27:20 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.996
X-Spam-Level:
X-Spam-Status: No, score=-1.996 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=ix.netcom.com; domainkeys=pass (2048-bit key) header.from=asmusf@ix.netcom.com header.d=ix.netcom.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Uc1wrKJtuW_V for <i18ndir@ietfa.amsl.com>; Tue, 7 Jul 2020 19:27:19 -0700 (PDT)
Received: from elasmtp-dupuy.atl.sa.earthlink.net (elasmtp-dupuy.atl.sa.earthlink.net [209.86.89.62]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 132AD3A0FD2 for <i18ndir@ietf.org>; Tue, 7 Jul 2020 19:27:18 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ix.netcom.com; s=dk12062016; t=1594175239; bh=ToX/uaoCeM8I7AuWpuxbyqorLPsa186SJhLs +r+EKLE=; h=Received:Subject:To:Cc:References:From:Message-ID:Date: User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language: X-ELNK-Trace:X-Originating-IP; b=Iq1gyaII9rsWJ7nLAf3ZMSwwB+HV943N7 E6QSt+laASmzpPlAigzVq64dIfhDWABttEfGzpB4wLc4/pvieVL5oQ7ce6ZUCxh2O8H tTVLU5W3qAOUzrp+m6r7mGSHwvh8hIX2QeQ2ECv+2uTjY5dSNXZTsKbngIeO9Y3sBfV fdjEWOhyYhK08GggrcuG8h2kxZ9M/Qyo0hzEFddny9Did1lB4SzFfyScl67W1b3F1te 0HH2cp4vl6Cv5S+jjGl+BQ8E+GieNOqJldlHOIRfuGDz/qL/x9FThiXOuNx+d1LGX7+ r3e0zS3UGUfZq6qfc5a0p+7lRIqvs+WZBke7V2shA==
DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dk12062016; d=ix.netcom.com; b=a/lTmBfltYozgk4keflE2PAKjGeg9mh4CanHzhJ9Q6lL5EbPiGDnaBK1Uidgg7J7KTXM5J0Da+byBpWHnswMoXx9nKlYwDL67sj1QdLJVx/Tp13FeNoFkiEuE7UpXOkwm+VUn4l2QyauQuGZkZRjIp85C8KBuzb/PXi3Sb0tbYZIu21Fv/aIvpl3YJ+mvj7VfNyy8sEk6Tdy5rrc9iIXZb/BBW33jQR7vdytjDN4NefeIxB/H5ADiC1ENTZBhzV+OfiFcc5pCmMmsYByAGIqtFQnaiQuY11lv5029DAIz4UY0QD71EbSCNdnonvMSvTZNThktb4cb5vR8tvQqQwD4g==; h=Received:Subject:To:Cc:References:From:Message-ID:Date:User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language:X-ELNK-Trace:X-Originating-IP;
Received: from [71.212.59.17] (helo=[192.168.1.106]) by elasmtp-dupuy.atl.sa.earthlink.net with esmtpa (Exim 4) (envelope-from <asmusf@ix.netcom.com>) id 1jsznp-000AjR-Ft; Tue, 07 Jul 2020 22:27:17 -0400
To: Nico Williams <nico@cryptonector.com>, John C Klensin <john-ietf@jck.com>
Cc: i18ndir@ietf.org
References: <20200706225139.GJ3100@localhost> <90740541-ab72-ffaf-ff3e-5a27b5805eae@ix.netcom.com> <9044C737C36C0787B9EAE190@PSB> <20200708014054.GB3100@localhost>
From: "Asmus Freytag (c)" <asmusf@ix.netcom.com>
Message-ID: <8679878e-5264-6647-464b-e4ce71181243@ix.netcom.com>
Date: Tue, 07 Jul 2020 19:27:15 -0700
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0
MIME-Version: 1.0
In-Reply-To: <20200708014054.GB3100@localhost>
Content-Type: multipart/alternative; boundary="------------BD5E2C2C517EED87A776A4AC"
Content-Language: en-US
X-ELNK-Trace: 464f085de979d7246f36dc87813833b22356fd30c7fd936e03746623c0a5c3df1a09b9575864ea2f350badd9bab72f9c350badd9bab72f9c350badd9bab72f9c
X-Originating-IP: 71.212.59.17
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/afgY1sUQC6ZGFgSrIj6-1t2drB8>
Subject: Re: [I18ndir] I-D on filesystem I18N
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 08 Jul 2020 02:27:21 -0000

On 7/7/2020 6:40 PM, Nico Williams wrote:
> On Tue, Jul 07, 2020 at 09:31:55PM -0400, John C Klensin wrote:
>> --On Tuesday, July 7, 2020 16:43 -0700 Asmus Freytag
>> <asmusf@ix.netcom.com> wrote:
>>> However, early case-insensitive file systems did not preserve
>>> case. Not sure how rare this has become.
>> Well, Unix, and every Unix-derived system I know of (definitely
>> including Linux, FreeBSD, and NetBSD) are case-sensitive and
>> getting anywhere near their file names with Case Folding or even
>> lower casing will cause rather interesting problems.  One the
>> case-insensitive side, I've heard of a system called "Windows"
>> that usually preserves case but does not guarantee to do so and
>> many operations, in practice, don't.
> At least ZFS supports case-insensitivity as an option.  ZFS is available
> on just about every Unix and Unix-like OS today, and is quite popular.
>
> That said, I doubt anyone enables case-insisitivity in ZFS datasets on
> Unix unless they intend to share it with Windows clients and users
> expect case-insensitivity.  Indeed, that's the reason that ZFS has this
> option at all: Windows interoperability.
>
>> In addition, Asmus caught at least one case where your
>> terminology is inconsistent with that of Unicode.  Such
> Yes yes, I beg your forgiveness.  I wrote "normal form" instead of
> "normalization form".  A silly editing error.
Not silly, it may well be that "Normal Form" would have been the better 
term. Water under the bridge.

NFC/D (no-K) use what is called "canonical de/composition" in Unicode. 
Therefore, NFKx cannot be considered "canonical" or you will confuse 
anyone who knows Unicode usage.
>
>> inconsistencies (at least if not specifically identified and
>> justified) are an invitation to reader confusion or worse.  As
>> one more example, you should not be talking about case folding
>> and then lower casing.  They are different and have different
>> implications.  And, fwiw, most of my Turkish-speaking and
> Did you not read that text?  I only mentioned lower case in the context
> of explaining in introductory text what case-folding is.  And I did NOT
> say that they are the same thing.
Nico, please understand that we find your text otherwise quite readable 
and lucid - however, John's point may be that these concepts are subject 
to confusion (whether or not you share that) and that it would be useful 
to make extra-double sure that all readers get that these are different.

There was somewhere in the earlier e-mail discussion also the mention 
that is-lowercase and to-lowercase (however you name these two concepts) 
are crucially different in that there's an expectation that is-uppercase 
= ! is-lowercase (ignoring is-titlecase for the argument), but that 
to-lowercase and to-uppercase are operations that may be inverses of 
each other. German to-lowercase(to-uppercase(ß)) != ß (but "ss").

It would be good to lay the foundation by reminding everyone of these 
subtleties.
>
>> writing colleagues would disagree that U+0131 "could" or should
>> be considered equivalent to U+0069.  In my experience, that
> The idea was to describe how an admin might square that circle in a
> case-insensitive filesystem, and then I did not endorse it.  The idea
> was to describe how custom case-folding tailorings (FYI, that's the term
> Unicode uses, 'tailoring') might come about.
>
> There will be time to address all the nits.  The big idea is more
> interesting.

The danger of reviewing the test while reading along from the start. 
However, the only way I know how to review anything that's not in a 
change-tracked editable format.

A./

>
> Nico