Re: [Ltru] Proposed -t0- subtag

Mark Davis ☕ <mark@macchiato.com> Tue, 26 July 2011 00:21 UTC

Return-Path: <mark.edward.davis@gmail.com>
X-Original-To: ltru@ietfa.amsl.com
Delivered-To: ltru@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 20C0621F8BF0 for <ltru@ietfa.amsl.com>; Mon, 25 Jul 2011 17:21:19 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.239
X-Spam-Level:
X-Spam-Status: No, score=-1.239 tagged_above=-999 required=5 tests=[AWL=-0.047, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, HTML_FONT_FACE_BAD=0.884, HTML_MESSAGE=0.001, J_CHICKENPOX_34=0.6, MIME_8BIT_HEADER=0.3, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id OnDoAXqryu+9 for <ltru@ietfa.amsl.com>; Mon, 25 Jul 2011 17:21:17 -0700 (PDT)
Received: from mail-yw0-f44.google.com (mail-yw0-f44.google.com [209.85.213.44]) by ietfa.amsl.com (Postfix) with ESMTP id 5BAAD21F8BEF for <ltru@ietf.org>; Mon, 25 Jul 2011 17:21:17 -0700 (PDT)
Received: by ywp31 with SMTP id 31so2993005ywp.31 for <ltru@ietf.org>; Mon, 25 Jul 2011 17:21:10 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type; bh=vlqyOxua3itiBfkYtv4N9qNqLs1+VJbeCB0XgUrLS3w=; b=urXZ+A5Ba14OC+k9AbeqlLaxz2i7QfpbjIndGdtg/x4NNDmJ6fvuKAawLyVpAl2tam AfOofpo3wmpa43phFz03uAvswG6UMiw9Kdy8ztsd0iWtX7i0f2D5yUaGSV+9S3L7mSC6 /dpcb4+/MADLjL9JpvB6MS/48zRu10hu9BTJ0=
MIME-Version: 1.0
Received: by 10.150.150.33 with SMTP id x33mr4923094ybd.250.1311639669248; Mon, 25 Jul 2011 17:21:09 -0700 (PDT)
Sender: mark.edward.davis@gmail.com
Received: by 10.151.83.9 with HTTP; Mon, 25 Jul 2011 17:21:09 -0700 (PDT)
In-Reply-To: <2CB55BFC7405E94F830537BD924318D5EBF16A4E44@USSDIXMSG11.am.sony.com>
References: <20110725104245.665a7a7059d7ee80bb4d670165c8327d.ee2bf8cf5c.wbe@email03.secureserver.net> <2CB55BFC7405E94F830537BD924318D5EBF16A4E44@USSDIXMSG11.am.sony.com>
Date: Mon, 25 Jul 2011 17:21:09 -0700
X-Google-Sender-Auth: LuxCA25phzzROC7aUJO8io5WkvY
Message-ID: <CAJ2xs_Gxv00Xey7z=yg3utRWhGpMDOubY5JbzWjXYLeURCbv+w@mail.gmail.com>
From: Mark Davis ☕ <mark@macchiato.com>
To: "Broome, Karen" <Karen.Broome@am.sony.com>
Content-Type: multipart/alternative; boundary="000e0cd70eb28f142704a8ede84b"
Cc: "ltru@ietf.org" <ltru@ietf.org>, Doug Ewell <doug@ewellic.org>
Subject: Re: [Ltru] Proposed -t0- subtag
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 26 Jul 2011 00:21:19 -0000

Note that I retitled, since it is drifting away from the original topic.

===

The distinctions in tagging content that can easily be made within BCP47* as
currently constituted* are:

   1. *Written content* (in a particular script: "uz-Latn", "uz-Arab", etc.)
   2. *Non-Written content* (eg "uz-Zxxx")
   3. *Either Written or Non-Written content* (eg "uz")
   4. *Both Written and Non-Written content* ("uz-Latn, uz-Zxxx", using
   language list)

Some distinctions that are not possible to make are:

   1. *Spoken content* (excluding other non-written content like "sung
   content", "telepathic content", etc.)
   2. *Both Spoken and Written content* (eg captioned video with audio,
   excluding sung content, etc)

One on the edge would be:

   1. *Written content* (unspecified script: Zyyy would probably serve fine
   for this)

The question is whether the first four are sufficient, and if not, whether
the extra work of doing an extension is necessary, or whether the
distinctions can be handled via variants (the simpler route).

Mark
*— Il meglio è l’inimico del bene —*


On Mon, Jul 25, 2011 at 11:09, Broome, Karen <Karen.Broome@am.sony.com>wrote:

> Well, I had considered registering a singleton on behalf of the Society of
> Motion Pictures and Television Engineers -- the xml:lang compliant tags are
> now specified in many, if not most, new industry standards. (Things are
> certainly still in transition from older ISO 639 classifications.) But SMPTE
> might benefit from more motion picture specific values (possible values:
> original, dubbed, captioned, subtitled, audio description).
>
> But I also wonder about the general utility of this and hadn't really
> broached the subject of "tone" or had an immediate need for something like
> that. It's interesting, but I think maybe it goes beyond language
> identification into aspects of the delivery and content. Fun stuff to work
> on, but not sure I need it now. But I think "spoken" and "written" are two
> distinctions that definitely help identify the "language". I realize this is
> a somewhat controversial belief.
>
> Regards,
>
> Karen Broome
>
> -----Original Message-----
> From: Doug Ewell [mailto:doug@ewellic.org]
> Sent: Monday, July 25, 2011 10:43 AM
> To: ltru@ietf.org
> Cc: Broome, Karen
> Subject: Re: [Ltru] Proposed -t0- subtag
>
> "Broome, Karen" <Karen dot Broome at am dot sony dot com> wrote:
>
> > Sorry, I was thinking the original ISO registration had gone through
> > as 'No Linguistic content' but now I'm remembering that this was
> > somewhat of a partner tag for 'zxx' which does mean that. I wasn't
> > referencing the RFC text per se, which gives the user an option — and
> > I'm OK with the option. Still, I generally find this muddy. I don't
> > want to look to a script tag to define the language mode because there
> > are commonly used language modes that are not written forms. The
> > disconnect with the 'zxx' tag this tags mirrors may be another reason
> > to avoid use to indicate the language mode.
>
> I think what you are finding "muddy" is not the subtag 'Zxxx', which
> does have a fairly clear meaning in BCP 47, but rather the use of a
> script (writing system) subtag to tag content that isn't written.  I can
> see how this is unintuitive.  I think the answer is that BCP 47 tags
> aren't really designed to tag modes out of the box, and that, in turn,
> is exactly what extensions are for.
>
> If you or anyone else did want to create an extension RFC for modes
> ('m', perhaps), you'd probably want to start the discussion by
> identifying the initial values (spoken, written, signed, anything else?)
> and setting constraints.  As examples of the latter, you'd want to
> establish whether male vs. female, shouting vs. whispering, friendly vs.
> angry, handwritten vs. printed, etc. are in scope.
>
> --
> Doug Ewell | Thornton, Colorado, USA | RFC 5645, 4645, UTN #14
> www.ewellic.org | www.facebook.com/doug.ewell | @DougEwell ­
>
>
>
> _______________________________________________
> Ltru mailing list
> Ltru@ietf.org
> https://www.ietf.org/mailman/listinfo/ltru
>