Re: [Ietf-languages] Fwd: Proposal for variant language subtags for German dialects

Doug Ewell <doug@ewellic.org> Tue, 12 September 2023 23:16 UTC

From: Doug Ewell <doug@ewellic.org>
To: Hugh Paterson III <sil.linguist@gmail.com>, John Cowan <cowan@ccil.org>
CC: Lisa Dücker <lisa.duecker=40uni-marburg.de@dmarc.ietf.org>, "ietf-languages@iana.org" <ietf-languages@iana.org>
Date: Tue, 12 Sep 2023 23:16:16 +0000
Message-ID: <SJ0PR03MB6598211FDF90569B00022B7CCAF1A@SJ0PR03MB6598.namprd03.prod.outlook.com>
References: <20230911135450.Horde.C3GF4Fl3isb4n4eJMJT5bTp@home.staff.uni-marburg.de> <ZP8OX/qk2NWz1tku@sources.org> <CAD2gp_T7+MXd2qr84z+39w5zgt38TM8L3gKA6FUcTe=C2j=VqA@mail.gmail.com> <CAE=3Ky8mtG7Q9SkOCGSpVXAojTJkJCwJWoDS56dEHqW9igHukw@mail.gmail.com> <CAD2gp_R1qoRST0LS=EV9rtvyW2Td7kLcx3GobFXea6kufG1jjw@mail.gmail.com> <CAE=3Ky_71yeh+-2rR6Qp+eBoLW7SHATQ8zY8tJtjy=_T+F=JVw@mail.gmail.com>
In-Reply-To: <CAE=3Ky_71yeh+-2rR6Qp+eBoLW7SHATQ8zY8tJtjy=_T+F=JVw@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/RrWGi7AFt7qU0YcAmYfhmH8v-aY>
Subject: Re: [Ietf-languages] Fwd: Proposal for variant language subtags for German dialects

There is work being done at present on a new standard (ISO 21636, “A Framework for Language Varieties”) which might cover a lot of what Hugh is talking about, whenever it is finalized and published.

Beyond that, it is always possible in BCP 47 to have both variant and private-use subtags at multiple levels, so that one could have some combination of:

en-US-engne
en-US-enboston *or* en-US-engne-enboston
en-US-ennwyork *or* en-US-engne-ennwyork
(although I doubt New Yorkers would identify themselves as part of New England)
en-US-ennwyork-enmanhtn

and the beat goes on, with any of the lower-level variants in these examples being replaced by private-use subtags:

en-US-x-engne
en-US-engne-x-enboston
etc.
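
A minimal Python sketch of how a processor might separate the registered-variant and private-use parts of tags like these. It assumes a simplified subset of the BCP 47 (RFC 5646) syntax, ignoring extlang, extension, and grandfathered tags, and the function name split_tag is made up for illustration. It checks well-formedness only, not registration; the subtags engne, enboston, etc. are hypothetical and not in the registry.

```python
import re

# Simplified subset of the RFC 5646 "langtag" syntax (an assumption for this
# sketch): language, optional script, optional region, zero or more variants,
# then an optional private-use tail introduced by the singleton "x".
LANGTAG = re.compile(
    r"^(?P<language>[a-z]{2,3})"
    r"(?:-(?P<script>[a-z]{4}))?"
    r"(?:-(?P<region>[a-z]{2}|[0-9]{3}))?"
    # A variant is 5 to 8 alphanumerics, or 4 characters starting with a digit.
    r"(?P<variants>(?:-(?:[a-z0-9]{5,8}|[0-9][a-z0-9]{3}))*)"
    # Everything after "-x" is private use; each subtag is 1 to 8 alphanumerics.
    r"(?:-x(?P<private>(?:-[a-z0-9]{1,8})+))?$",
    re.IGNORECASE,
)


def split_tag(tag):
    """Split a tag into its parts (hypothetical helper, syntax check only)."""
    m = LANGTAG.match(tag)
    if m is None:
        raise ValueError(f"not well-formed under this simplified syntax: {tag!r}")
    return {
        "language": m["language"].lower(),
        "script": m["script"].title() if m["script"] else None,
        "region": m["region"].upper() if m["region"] else None,
        "variants": [v.lower() for v in m["variants"].split("-") if v],
        "private": [p.lower() for p in (m["private"] or "").split("-") if p],
    }


if __name__ == "__main__":
    for example in (
        "en-US-engne",
        "en-US-engne-enboston",
        "en-US-x-engne",
        "en-US-engne-x-enboston",
    ):
        print(example, split_tag(example))
```

The same subtag string lands in "variants" or in "private" depending only on whether it follows the "x" singleton, which is what makes it syntactically easy to start with a private-use subtag and later replace it with a registered variant.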

As always, there is a two-edged sword when it comes to private-use coding elements. If a private code achieves widespread use, that is a good indication that the thing being privately coded needs to be assigned a formal code; but then it becomes increasingly difficult for users to migrate away from the private code, which then might have to be supported indefinitely.

If it does not achieve widespread use, despite its existence being known, that probably means it was a wise decision not to assign a formal code.

Language can be micro-analyzed down to a very fine level of detail, but the question must always be asked whether there is a broad need to interchange identifiers for different sub-sub-varieties. Many New Yorkers can distinguish English as spoken in Manhattan vs. Brooklyn vs. Queens, but would content in these varieties (not only on computers, and not only spoken) actually be tagged separately, by more than one individual or small research body, if it were possible to do so without private-use subtags?

I wish it were possible to know how people use BCP 47, and how people wish they could use it but either can’t, or don’t know that they can, similar to the way Unicode is able to use Google statistics to estimate which emoji are most popular.

--
Doug Ewell, CC, ALB | Lakewood, CO, US | ewellic.org



From: Ietf-languages <ietf-languages-bounces@ietf.org> On Behalf Of Hugh Paterson III
Sent: Tuesday, September 12, 2023 16:26
To: John Cowan <cowan@ccil.org>
Cc: Lisa Dücker <lisa.duecker=40uni-marburg.de@dmarc.ietf.org>; ietf-languages@iana.org
Subject: Re: [Ietf-languages] Fwd: Proposal for variant language subtags for German dialects

John,

Can you help clarify how many sublevels are presumed in these subtags? For example, if we assign a tag to ‘New England English’, does that preclude creating a tag for ‘New York’ and/or ‘Boston’ Englishes? If New York English has a tag, does that preclude a Manhattan English? If English is the language name and there is a hierarchy in the geographical units, does that create an assumption that the speech varieties are also hierarchical in their designation? Or is it all flat and exclusive within the context of the sub-language level?

Written varieties which have been debated in this forum (such as the German orthography) generally have dates associated with them, giving them a time depth, with the dates usually based on implementation dates. In contrast, oral records and oral realities “on the ground” do not change with discrete dates. For example, New York English “on the ground” doesn’t generally sound like Bernie Sanders or Christopher Walken. That is, they represent Brooklyn/Queens New York English, and they represent a certain time depth and social class which may have been dominant at one point for the speech they represent. However, even speakers of the same social class and racial background don’t sound the same today in younger generations. How do we model time depth for archival oral materials via subtags?