Re: [Ietf-languages] ISO TR 21636 Dimensions

Sebastian Drude <drude@xs4all.nl> Sat, 28 November 2020 00:25 UTC

Return-Path: <drude@xs4all.nl>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E07C23A0967 for <ietf-languages@ietfa.amsl.com>; Fri, 27 Nov 2020 16:25:29 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.119
X-Spam-Level:
X-Spam-Status: No, score=-2.119 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, NICE_REPLY_A=-0.001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=xs4all.nl
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id m29qZsgmtaeG for <ietf-languages@ietfa.amsl.com>; Fri, 27 Nov 2020 16:25:27 -0800 (PST)
Received: from lb1-smtp-cloud8.xs4all.net (lb1-smtp-cloud8.xs4all.net [194.109.24.21]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 89B233A0966 for <ietf-languages@ietf.org>; Fri, 27 Nov 2020 16:25:26 -0800 (PST)
Received: from cust-d2ef4cbd ([IPv6:fc0c:c138:75cc:34bc:4631:c48c:494:61cb]) by smtp-cloud8.xs4all.net with ESMTPA id io3BkU8zZDuFjio3GkstGK; Sat, 28 Nov 2020 01:25:24 +0100
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=xs4all.nl; s=s2; t=1606523124; bh=6Ei4HfK83BvE0wYEAfObgUUqrz8K7qyK4WEFvskGrSg=; h=Subject:To:From:Message-ID:Date:MIME-Version:Content-Type:From: Subject; b=b2+w9Upa6MpVFFfxUnexJsUggXXdMscPfz2V0dqApH7D8HBf/flZtjYxLRr4d172R lCNnc3Ud+is/KRZF54KuzR5kEDKJ0bj3KTV1OBDXJicbpQ29BKoh+LjeYmaE8yFqBo KOCvZzRGztNyRKXMTjiUmQAWGnWz7cslIfXSKuOY2Xlu/gzFoTLuf+WoVXG+TJMgxM j5uB2vB6lk/q6P7jdNz4BZBaKw15wsBj3UcIf0MIJGzuajLmjRPi1asSXqXNAo+Kz8 QtMqxgAuOUWCc7VVLfHIbMRbn5aTGnMOXgsfuKFOxpwGG8GRWwZsWQwZJj+jLdimxY WFscXpLd724zw==
To: Richard Wordingham <richard.wordingham=40ntlworld.com@dmarc.ietf.org>, ietf-languages@ietf.org
References: <CAKZQS29HBak-v6M2HLCpdgeZHJTFVc2W_w4G=qOK+mtPcXEenQ@mail.gmail.com> <4846f915-5706-e9dc-8b16-9f16362f82f0@xs4all.nl> <CAD2gp_SMJ_P7kyKT13Ax_ae+nr9rbTOHNn+rRp7=EKOKVSVq_g@mail.gmail.com> <236b86db-3dfb-b9e0-ab82-fa31753d0459@xs4all.nl> <20201126005439.76a9cc1d@JRWUBU2> <920a1960-ab18-aaaa-ae3e-b547314b66d7@xs4all.nl> <20201127234315.32daf238@JRWUBU2>
From: Sebastian Drude <drude@xs4all.nl>
Message-ID: <1444327f-674d-7505-3ae8-b1e4762cc1cf@xs4all.nl>
Date: Fri, 27 Nov 2020 21:25:16 -0300
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0
MIME-Version: 1.0
In-Reply-To: <20201127234315.32daf238@JRWUBU2>
Content-Type: multipart/alternative; boundary="------------8D3DA447EAE904DE6EC18A20"
Content-Language: pt-BR
X-CMAE-Envelope: MS4xfBK8ozK6+0/GPaXo5UbVztpY2m+7zlvvNJRaA6kD+D391xOfxTC1A19m5NYl6W38DtIJJGBZyoulNlsITiutbEy0UomLFIHvMSUKkRBcEpev0f4lCI9b 5gRn/+GTbyb1PG5awgXMbzHYNJtGqVE5eT2c09CweDIT1RZJqWemNBUJNyBPhDGl04CiCF7DIlVmGhDXcceuvEwr8H049g0v3K6iXRdG+lbIrYkBDDnePPf8 QvoHkonSyR1/45SpD6GLMJiiRtJmkhPqpIfqnH8y6pW8e2d/EDT91Ea7MWY62qZzryQ04euq0gK13S3pacFg6dMjtjQrx2HBUZka6D8cEAoI+ocy57c2oXL9 H5YAUrWC2fVuSD+l6ZpIpf3+rTghTg==
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/Y6ORNmryZ2Q3nZ9YOoAYFGC1w1o>
Subject: Re: [Ietf-languages] ISO TR 21636 Dimensions
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 28 Nov 2020 00:25:30 -0000

Thanks, Richard!

Yes, you have fleshed out your use cases, although again in some cases I 
lack the factual knowledge to understand the subleties involved (as in 
your pi-Lana-TH example).

I would indeed argue that all variants related to writing systems, 
insofar they are not already covered by existing (or creatable, in the 
current framework) BCP47 subtags, would go into the medium dimension as 
subvarieties within written modality.  It is true that they vary over 
time (e.g., in the case of a reform), but as you are indicating 
yourself, at any given point in time there may be more than one system 
in use in parallel, for instance by writers adhering or not to new 
rules, or along other lines (Oxford vs. Non-Oxford spelling rules for 
BE), so this is again a case where different dimensions (here time & 
medium) have influence one on another, but still need to be recognized 
as different.  So yes, even "German 1901" and "German 1996" indicate 
different written sub-/*modalities*/ (which are labelled by their year 
of introduction).

As to dialects being identified down to the street level, I doubt that 
many convincing usecases could be made to create subtag-values of such a 
fine resolution (besides by Prof. Henry Higgins, that is).


Best wishes,

Sebastian

-- 

Museu P.E. Goeldi, CCH, Linguistica ▪ Av. Perimetral, 1901
Terra Firme, CEP: 66077-530 ▪ Belém do Pará – PA ▪ Brazil
drude@xs4all.nl ▪ +55 (91) 3217 6024 ▪ +55 (91) 983733319
Priv: Tv. Juvenal Cordeiro, 184, Apt 104 ▪ 66070-300 Belém

On 27/11/2020 20:43, Richard Wordingham wrote:
> On Wed, 25 Nov 2020 23:23:08 -0300
> Sebastian Drude <drude@xs4all.nl> wrote:
>
>> thanks for your thorough thinking and your comments and questions.  I
>> will try to address them below in your text as good as I can.
>> ISO 21636 mainly proposes to differenciate between the 8 dimensions
>> clearly.  BCP 47 is fine for dialects if they coincide with usual
>> countries regions, and for writing systems (scripts and
>> orthographies) and their
>> variants.  But for time periods,  sociolects, registers, even
>> modalities (signed, whistled, drummed speech of non-sign-languages)?
>> And learners varieties and the 'anomalies' (we use the less pejorative
>> and pathologizing 'communicative functioning')?  There ISO 21636 can
>> make a contribution, I believe, (see the case at hand: sociolect?
>> register?) and that may even help with those dialects that do not
>> coincide with regional administrative borders.
> BCP 47 can handle things like the Scanian dialect of Bornholm
> (da-bornholm).  It can't handle street-by-street variations.
>
> On the other hand, BCP 47 can tempt one to some dubious practices.  For
> example, I might tag Unicode-encoded Pali in the Tham script as
> pi-Lana-TH for material from Northern Thailand but as pi-Lana-LA
> for material from North-eastern Thailand.  The latter may come unstuck
> for text to speech - I can imagine a Siamese accent being used in NE
> Thailand.
>
>>> After that comes the question of orthography within the script.
>>> I'm not sure that differences with a political tint (Russian, Lao)
>>> come within the time dimension, and the gross differences in Thai
>>> script Northern Thai (Thai names thap sap v. rup pariwat) definitely
>>> don't.
>> I do not know what you are referring to here, so I do not understand
>> at all why the time dimension would be involved.
> BCP 47 has variant subtags for spelling schemes, such as
> "1901" and "1996" for German.  These give a strong hint as to the
> period of the text described by them, so I had though they were
> categorisations along the time dimension.  You now tell me that they
> apply along the medium dimension.
>
> Some spelling differences have a political implication.  Russian
> exiles between the World Wars generally ignored the spelling reforms
> introduced after the Russian Revolution.  Rejection of the Lao
> replacement of LO LING ('lo lot' in Lao) by LO LOOT ('lo ling' in Lao)
> is reportedly a sign of political leanings.  I now gather such
> differences are to be categorised along the medium dimension.
>
>>> The same passage in the same script in Pali can have quite a
>>> variation in punctuation system.
>> Wouldn't that go into the same category as different orthographic
>> rules?
>>> Perhaps that's a separate subdimension within 'time'.
>> I confess I do not know enough about Asian languages to grasp why the
>> time dimension would be involved here -- did these rules for
>> punctuation change over time?  Then see my comment on Turkish, above.
> Yes, many Asian writing systems have recently adopted European
> punctuation.  It can be quite uneven.  A novel in a Thai magazine will
> have far more non-blank punctuation than a Thai text book.
>
>>> Similarly, Pali chants in Thai script use different writing systems
>>> for the masses and for more academic use - the former is an
>>> 'alphabet' by Daniels' definition and the latter is an abugida. Is
>>> this difference on the 'communicative functioning' dimension?
>> No, this would all go into medium, just like script and orthography,
>> but I reckon that these differences are already covered by BCP 47, and
>> would not even attempted to be addressed by any implementation of ISO
>> 21636.
> It's in scope, but no subtags have been assigned for this case.
>
>>> Old manuscript European documents can be full of abbreviations - bars
>>> for Vr and rV survived quite late in Modern English.  The
>>> abbreviations are usually expanded when such documents
>>> non-palaeographically transcribed.  Is the use of these abbreviations
>>> on the 'communicative functioning' dimension?
>> No, I would guess, same thing, orthography --> medium dimension if
>> needed, hopefully already covered by BCP 47.
> It's in scope, but no subtags have been assigned for this case.
>
> Hoping I've fleshed out my use cases,
>
> Richard.
>
> _______________________________________________
> Ietf-languages mailing list
> Ietf-languages@ietf.org
> https://www.ietf.org/mailman/listinfo/ietf-languages