Re: [Ietf-languages] Forms for subtag kmpre20c

Michael Everson <everson@evertype.com> Sun, 01 December 2019 14:55 UTC

Return-Path: <everson@evertype.com>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F3F6E120077 for <ietf-languages@ietfa.amsl.com>; Sun, 1 Dec 2019 06:55:03 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -3.42
X-Spam-Level:
X-Spam-Status: No, score=-3.42 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, RCVD_IN_DNSWL_MED=-2.3, SPF_HELO_NONE=0.001, SPF_NEUTRAL=0.779] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (768-bit key) header.d=evertype.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id MUdkl44D04gy for <ietf-languages@ietfa.amsl.com>; Sun, 1 Dec 2019 06:55:01 -0800 (PST)
Received: from mork.alvestrand.no (mork.alvestrand.no [158.38.152.117]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 511EF120046 for <ietf-languages@ietf.org>; Sun, 1 Dec 2019 06:55:01 -0800 (PST)
Received: by mork.alvestrand.no (Postfix) id 4402C7C4B2C; Sun, 1 Dec 2019 15:54:59 +0100 (CET)
Delivered-To: ietf-languages@alvestrand.no
Received: from localhost (localhost [127.0.0.1]) by mork.alvestrand.no (Postfix) with ESMTP id 2CD177C4B2A for <ietf-languages@alvestrand.no>; Sun, 1 Dec 2019 15:54:59 +0100 (CET)
X-Virus-Scanned: Debian amavisd-new at alvestrand.no
Authentication-Results: mork.alvestrand.no (amavisd-new); dkim=pass (768-bit key) header.d=evertype.com
Received: from mork.alvestrand.no ([127.0.0.1]) by localhost (mork.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id sugi4QU3VTqS for <ietf-languages@alvestrand.no>; Sun, 1 Dec 2019 15:54:57 +0100 (CET)
X-Greylist: delayed 00:40:31 by SQLgrey-1.8.0
X-Greylist: from auto-whitelisted by SQLgrey-1.8.0
X-Comment: SPF skipped for whitelisted relay - client-ip=192.0.46.72; helo=pechora6.dc.icann.org; envelope-from=everson@evertype.com; receiver=ietf-languages@alvestrand.no
Received: from pechora6.dc.icann.org (pechora6.icann.org [192.0.46.72]) by mork.alvestrand.no (Postfix) with ESMTPS id ED5F37C4B24 for <ietf-languages@alvestrand.no>; Sun, 1 Dec 2019 15:54:56 +0100 (CET)
Received: from scandium.cloudhosting.co.uk (scandium.cloudhosting.co.uk [77.72.0.158]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by pechora6.dc.icann.org (Postfix) with ESMTPS id 85EFE1E027E for <ietf-languages@iana.org>; Sun, 1 Dec 2019 14:54:55 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=evertype.com; s=default; h=To:References:Message-Id: Content-Transfer-Encoding:Cc:Date:In-Reply-To:From:Subject:Mime-Version: Content-Type:Sender:Reply-To:Content-ID:Content-Description:Resent-Date: Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=/taOYdR7LWHmcNhz3c6TEeQ82YsBBYD1ZyedG9ysaHY=; b=mlSWZB//upHztPBcsSLJ3EED2 FsHl/JzAgwELRF4kpNuK88EOvrS9tcLfI6nBKmf1mC78tulVmDuJl5+G/ypvTgj73tTlqE16O3DZk bXFWIhuXRMEh2BG7A+D8BV4Hna;
Received: from cpc139632-dund15-2-0-cust239.16-4.cable.virginm.net ([92.237.223.240]:54313 helo=[192.168.0.18]) by scandium.cloudhosting.co.uk with esmtpsa (TLSv1.2:ECDHE-RSA-AES256-GCM-SHA384:256) (Exim 4.92) (envelope-from <everson@evertype.com>) id 1ibQcg-0003dP-MX; Sun, 01 Dec 2019 14:54:54 +0000
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.11\))
From: Michael Everson <everson@evertype.com>
In-Reply-To: <CANfi1Ji+v9BqLwUzOvLnOZ05SDeF189mecg8vG+dp1jyK_W=yg@mail.gmail.com>
Date: Sun, 01 Dec 2019 14:54:53 +0000
Cc: ietflang IETF Languages Discussion <ietf-languages@iana.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <A2340367-D478-4E5C-8042-71EA58AD1775@evertype.com>
References: <20191121141336.665a7a7059d7ee80bb4d670165c8327d.9a3859061b.wbe@email03.godaddy.com> <CANfi1JjyouJV-CLXdKOwvRxcFPM0csTe8=+44hszSBhVTxd-qA@mail.gmail.com> <CANfi1JjeSo2-Ez52Nu3Lcb3jC9skPp2_YWza8Xnusu0Xi8vHuA@mail.gmail.com> <CANfi1JgVZ=rc1s=ELHoS=tv9HkwuzNCP0PUAZbjXWfWX0UtEXQ@mail.gmail.com> <000501d5a31c$cb6f52e0$624df8a0$@ewellic.org> <7AAF56F5-A51D-45B2-9400-86FB94625A06@gmail.com> <CANfi1JiLLtURco13w_VMNdj9fLbOHaWG+tim5e53A=_Ohp5ZXg@mail.gmail.com> <B0BA5736-38BC-4589-8F24-F5EE59B5E280@gmail.com> <CANfi1Ji+v9BqLwUzOvLnOZ05SDeF189mecg8vG+dp1jyK_W=yg@mail.gmail.com>
To: Élie Roux <elie.roux@telecom-bretagne.eu>
X-Mailer: Apple Mail (2.3445.104.11)
X-AntiAbuse: This header was added to track abuse, please include it with any abuse report
X-AntiAbuse: Primary Hostname - scandium.cloudhosting.co.uk
X-AntiAbuse: Original Domain - iana.org
X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12]
X-AntiAbuse: Sender Address Domain - evertype.com
X-Get-Message-Sender-Via: scandium.cloudhosting.co.uk: authenticated_id: everson@evertype.com
X-Authenticated-Sender: scandium.cloudhosting.co.uk: everson@evertype.com
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/mqqnwMpBthuLFenqwpXG1r8KgyY>
Subject: Re: [Ietf-languages] Forms for subtag kmpre20c
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 01 Dec 2019 14:55:04 -0000

Doug, I’ve rejected the current version of the proposal. 

On 1 Dec 2019, at 14:26, Élie Roux <elie.roux@telecom-bretagne.eu> wrote:
> 
>> We have en (English, unspecified (apart from being modern English, not Old or Middle English), and we have en_US and en_GB for main variants, typically involving and we have en_GB-oxendict for a specific spelling variant.
>> 
>> It seems to me that you need a subtag for modern spelling; assuming that Chuon Nath is the name of the prevalent reform, for instance, you could have km-chuonnat or km-chn1938 or the like.
>> 
>> In fact if there were a need you could have subtags referrng to various editions of the reformed orthographies.
>> 
>> But really, we use subtags for what things ARE, not for what the are not.
> 
> Ok yes, makes sense. I will do that once I manage to gather more documents and info, which should happen in Spring.

Whenever you can. It doesn’t have to be a thesis.

> So, to continue the analogy with English: the -oxendict subtype refers to the general style right? What I mean is: in the same way that the Oxford Dictionary
> - has different editions with (I suppose marginally) different spelling
> - does not encompass all possible words of all possible strings tagged with -oxendict

In this case, en_GB has two main traditions of spelling. Oxford uses one which spells centre, colour, civilize, and the other tradition spells centre, colour, civilise. The use of this tag is chiefly to select a spell-check dictionary, I think. The one orthography closest to en_GB-oxendict is en_CA.

> I could propose a -dictcamb (for Dictionnaire Cambodgien), which could refer to:
> - the general work, not a specific edition

If the orthography differs from edition to edition and you need to make that distinction, you may need more than one subtag. 

> - a general style of spelling, defined by a dictionary that doesn’t encompass all possible words (especially loan ones, even though it refers to Pali a lot)
> 
> right?

I tend to think that subtags can be used to mark a variety of different contexts but selection of a spell-check dictionary is a very practical one.

> I think the main difference is that there will likely not be any new
> edition of the Dictionnaire Cambodgien (we could assume that if
> someone was to take over the project of a new edition, they would not
> keep the French name), and that people usually refer to the 1967
> (last) edition. In that case, what would be the customary type of
> langtag? "-dica1967" or "-dictcamb”?

dc1967 would be adequate I should think. It depends on the number of things you think you need to differentiate.

Michael