Re: [Ietf-languages] [EXTERNAL] Re: language identifiers for sign languages (incl. sgn) vs. attribute for indicating the representation of an individual language in "sign language modality"

Peter Constable <petercon@microsoft.com> Sat, 23 November 2019 00:24 UTC

From: Peter Constable <petercon@microsoft.com>
To: Doug Ewell <doug@ewellic.org>, Christian Galinski <christian.galinski@chello.at>, "'Fourney, David'" <david.fourney@usask.ca>
CC: ietf-languages <ietf-languages@iana.org>, 'Sebastian Drude' <Sebastian.Drude@outlook.com>, "Melinda_Lyons@sil.org" <Melinda_Lyons@sil.org>
Date: Sat, 23 Nov 2019 00:23:47 +0000
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/e1BHBt-yVsVRsAELQXzxqhH2Xn0>

The scope of the ‘t’ extension is linguistic content that has undergone some type of transform in its expression, and signed modality for a spoken language could be considered a transform. But the ‘t’ extension as currently defined doesn’t support this. What it supports is primarily text transformations. Also, the way the ‘t’ extension works is that the additional information declares what the content was transformed _from_, not what it is transformed _into_. For signed modality of spoken languages, what’s needed is a way to indicate signed modality as the final expression, not the source.

So, I don’t think the ‘t’ extension is appropriate.
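
The "from, not into" point can be seen in a minimal Python sketch. The two example tags are the ones given in RFC 6497; the function name split_transform and its simplified parsing are assumptions made for illustration, not a complete implementation of the extension.

```python
import re

# A 't' field key is one letter followed by one digit (e.g. "m0").
TKEY = re.compile(r"^[a-z][0-9]$")

def split_transform(tag):
    """Split a tag into (content tag, source tag named by the 't' extension).

    Simplified sketch: assumes a well-formed tag and ignores the 't' fields.
    """
    subtags = tag.lower().split("-")
    # Positions of single-character subtags (extension singletons or 'x').
    singletons = [i for i, s in enumerate(subtags) if i > 0 and len(s) == 1]
    content = "-".join(subtags[:singletons[0]] if singletons else subtags)
    source = []
    for i in singletons:
        if subtags[i] == "x":  # private use: nothing after it is an extension
            break
        if subtags[i] == "t":
            # Collect the source language subtags up to the first field key
            # or the next singleton.
            for s in subtags[i + 1:]:
                if len(s) == 1 or TKEY.match(s):
                    break
                source.append(s)
            break
    return content, "-".join(source) or None

print(split_transform("ja-t-it"))
# ('ja', 'it'): Japanese content, transformed from Italian
print(split_transform("und-Cyrl-t-und-latn-m0-ungegn-2007"))
# ('und-cyrl', 'und-latn'): Cyrillic content, transformed from Latin
```

In both cases the part after "t" names the source. There is no slot in which a result such as "signed modality" could be declared.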

I think a variant subtag “signed” or “signmod” would be better. The main problem that would arise is that this is very generic (it could be usefully applied to any oral language), which there has been resistance to in the past. A smaller issue is that, while variant tags for specific signed-modality variants could be registered, it might make sense to use a subtag sequence along the lines -signed-modvarnt, but it’s currently not possible to specify a prefix as anything other than a valid language tag. (E.g., *-signed can’t be a prefix specification.) That wouldn’t be a problem as long as the signed-modality variant is specific to a particular language, as would be the case for (e.g.) Signing Exact English.
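
The prefix constraint can be sketched as follows. This is only an illustration: neither "signed" nor "modvarnt" is a registered subtag, the table below is hypothetical, and the matching rule is a simplification of how Prefix fields in the registry are meant to be used.

```python
# Hypothetical registry data: variant subtag -> list of Prefix values.
# An empty list stands for a generic variant usable with any language.
HYPOTHETICAL_PREFIXES = {
    "signed": [],               # generic: any oral language
    "modvarnt": ["en-signed"],  # specific to English in signed modality
}

def variant_allowed(tag, variant):
    """Return True if `variant` appears in `tag` after a permitted prefix."""
    subtags = tag.lower().split("-")
    if variant not in subtags[1:]:
        return False
    prefixes = HYPOTHETICAL_PREFIXES.get(variant)
    if prefixes is None:
        return False  # not in the (hypothetical) registry
    if not prefixes:
        return True   # generic variant, no prefix restriction
    # Everything that precedes the variant must match a listed prefix.
    head = "-".join(subtags[:subtags.index(variant, 1)])
    return any(head == p or head.startswith(p + "-") for p in prefixes)

print(variant_allowed("en-signed", "signed"))           # True
print(variant_allowed("de-signed", "signed"))           # True
print(variant_allowed("en-signed-modvarnt", "modvarnt"))  # True
print(variant_allowed("de-signed-modvarnt", "modvarnt"))  # False
```

Because a prefix has to be a concrete language tag, a language-independent "*-signed" entry cannot be written; each language would need its own prefix line.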



Peter

From: Ietf-languages <ietf-languages-bounces@ietf.org> On Behalf Of Doug Ewell
Sent: Friday, November 22, 2019 1:05 PM
To: Christian Galinski <christian.galinski@chello.at>; 'Fourney, David' <david.fourney@usask.ca>
Cc: ietf-languages <ietf-languages@iana.org>; 'Sebastian Drude' <Sebastian.Drude@outlook.com>; Melinda_Lyons@sil.org
Subject: [EXTERNAL] Re: [Ietf-languages] language identifiers for sign languages (incl. sgn) vs. attribute for indicating the representation of an individual language in "sign language modality"

Hi Christian,

> Many true sign languages (see definitions below), such as “ase”
> (American Sign Language [ASL], which /fictively/ might even have a
> Newfoundland and Labrador variety – to be coded ase-CA-NL in line
> with BCP47 rules) have already a language identifier.

This example is actually not valid BCP 47 syntax. The use of ISO 3166-1 country codes as region subtags doesn't extend to appending ISO 3166-2 subdivision codes directly. You would need to use "ase-u-sd-canl" or "ase-CA-u-sd-canl". See UTS #35, Section 3.6.5.
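
A rough well-formedness check makes the difference visible. The pattern below is a simplified reading of the langtag grammar in RFC 5646 (grandfathered tags and registry validation are deliberately left out), and the names LANGTAG and is_well_formed are made up for this sketch.

```python
import re

# Simplified langtag grammar from RFC 5646, Section 2.1:
# language [-script] [-region] *(-variant) *(-extension) [-privateuse]
LANGTAG = re.compile(
    r"""^
    (?P<language>[a-z]{2,3}(?:-[a-z]{3}){0,3}|[a-z]{4,8})
    (?:-(?P<script>[a-z]{4}))?
    (?:-(?P<region>[a-z]{2}|[0-9]{3}))?
    (?P<variants>(?:-(?:[a-z0-9]{5,8}|[0-9][a-z0-9]{3}))*)
    (?P<extensions>(?:-[0-9a-wy-z](?:-[a-z0-9]{2,8})+)*)
    (?:-(?P<privateuse>x(?:-[a-z0-9]{1,8})+))?
    $""",
    re.IGNORECASE | re.VERBOSE,
)

def is_well_formed(tag):
    """Syntax check only; does not consult the subtag registry."""
    return LANGTAG.match(tag) is not None

print(is_well_formed("ase-CA-NL"))         # False
print(is_well_formed("ase-u-sd-canl"))     # True
print(is_well_formed("ase-CA-u-sd-canl"))  # True
```

"NL" after the region "CA" fits no remaining slot: it is too short to be a variant and too long to be an extension singleton. The subdivision therefore has to travel inside the 'u' extension.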

> The question to Doug is, how the BCP and Unicode rules deal with the
> above-mentioned difference between (true) “individual sign languages”
> and the “signed language modality” (as a sort of “transform” of any
> individual language)?

I don't believe there are or should be any "Unicode rules" (which I assume refers to CLDR and the 't' or 'u' extension) that deal with this.

One approach would be to request a variant subtag, such as 'signed', to represent the signed modality of a spoken language, such as (but not limited to) Signing Exact English. See RFC 5646, Section 2.2.5 for details on variant subtags and Section 3.6 for details on requesting a registration.

However, some may argue that modality is beyond the scope of BCP 47 variants and would suggest a CLDR extension to deal with this within the 't' extension framework. In that case, your best bet would be to contact cldr-contact@unicode.org.

--
Doug Ewell | Thornton, CO, US | ewellic.org


-------- Original Message --------
Subject: language identifiers for sign languages (incl. sgn) vs.
attribute for indicating the representation of an individual language in
"sign language modality"
From: "Christian Galinski" <christian.galinski@chello.at>
Date: Fri, November 22, 2019 11:48 am
To: "'Fourney, David'" <david.fourney@usask.ca>
Cc: <Melinda_Lyons@sil.org>, "'Sebastian Drude'"
<Sebastian.Drude@outlook.com>, <doug@ewellic.org>


Dear David,

First I have to apologize for my long silence – I was absorbed with work on several standards.

We are now at a crucial moment where things need to be clarified in ISO 639-4 “language coding” (and ISO/TR 21636 “Language varieties”) – including your issue of how to identify “individual sign languages” (i.e. true individual sign languages, which are not just a modality of spoken language) and the “signed language modality” (which is a signed representation of a spoken language).


  1.  concerning the difference between “individual sign languages” and “signed language modality”, the use of the language identifier “sgn” (in library use) is confined to an unidentifiable individual sign language – it is NOT referring to a “signed language modality”. According to the fundamental rules of language coding, we cannot change the scope of “sgn”, nor ignore the difference between sign language and the signed language modality.
Therefore, for the signed language modality we need an “attribute” to be added to the language identifier of an individual language, e.g. if a signed language modality of the “Signing Exact English” type is used.
  2.  However, I could not find an identifier for signed language modality, nor a mechanism for inserting an identifier for this in:
https://tools.ietf.org/html/bcp47
https://tools.ietf.org/html/rfc6497#ref-UTS35
http://unicode.org/reports/tr35/
The regular order of attributes to a language tag (language identifier) is “lang-geogr” (dialect), or “lang-script” (language written in a certain script) or “lang-script-geogr” (language in a script in a certain region). In between, a “t” (for “transform” in the meaning of transcription, transliteration, translation or other) may be inserted.

From your experience/problems with video technology (and HTML), the questions to you would be:

  1.  Many true sign languages (see definitions below), such as “ase” (American Sign Language [ASL], which /fictively/ might even have a Newfoundland and Labrador variety – to be coded ase-CA-NL in line with BCP47 rules) already have a language identifier.
Does it need another attribute to further specify them as a sign language? In that case, an attribute must be found which is different from “sgn”. What could it look like?
  2.  In the case of a signed language modality, such as “Signing Exact English”, the core language identifier for English would be “eng”. It would need an attribute to identify it as the signed language modality (which could be followed by a country code, if there are “dialects”, as in the /fictive/ eng-xxx-AUS meaning “Signing Exact English as used in Australia”). What could “xxx”, indicating “signed language modality”, look like?
  3.  It probably would not help to use an attribute identifier “Xxxx” in the slot of “script code”, as a signed language modality might slightly differ depending on the script used, even if it is the same spoken language (represented in different scripts in different areas/communities).
  4.  Could the “t” (transform) symbol be of help – as a given signed language modality somehow is a “transformation” of a spoken language?


  1.  The above questions (and the answers to them) could have an impact on ISO 639 and ISO/TR 21636 insofar as we should not formulate provisions in these documents which conflict with other standards. We should rather try to find generic solutions.

The question to Doug is, how the BCP and Unicode rules deal with the above-mentioned difference between (true) “individual sign languages” and the “signed language modality” (as a sort of “transform” of any individual language)? (See the respective terminology entries below.)

Best regards
Christian


p.s.
In the most recent revised version of ISO 639-4 we came up with the following terminology entries:
individual sign language
NOT: signed language
individual language (3.1.3) having the visual-spatial language modality (3.5.1) as basic modality
Note 1 to entry: Usually “sign language” appears as part of the name of the respective individual language.
EXAMPLE: ASL (American Sign Language)

signed language modality
NOT: sign language
visual-spatial language modality (3.5.1) that uses a combination of hand shapes, palm orientation and movement of the hand, arm or body, and facial expression