Re: [Ltru] RE: ISO 639-2 decision: "mis"

"Randy Presuhn" <randy_presuhn@mindspring.com> Fri, 15 June 2007 18:46 UTC

Return-path: <ltru-bounces@ietf.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1HzGog-0005gk-8A; Fri, 15 Jun 2007 14:46:38 -0400
Received: from ltru by megatron.ietf.org with local (Exim 4.43) id 1HzGoe-0005gR-Qn for ltru-confirm+ok@megatron.ietf.org; Fri, 15 Jun 2007 14:46:36 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1HzGoe-0005gD-H0 for ltru@ietf.org; Fri, 15 Jun 2007 14:46:36 -0400
Received: from elasmtp-galgo.atl.sa.earthlink.net ([209.86.89.61]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1HzGod-0007Ty-6z for ltru@ietf.org; Fri, 15 Jun 2007 14:46:36 -0400
DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dk20050327; d=mindspring.com; b=KCYB2YLzyu3bXNipyr9gbRhuB5lh7898IAvvVN2vWkc0YMOppi9Q9uUwX+/fs2P4; h=Received:Message-ID:From:To:References:Subject:Date:MIME-Version:Content-Type:Content-Transfer-Encoding:X-Priority:X-MSMail-Priority:X-Mailer:X-MimeOLE:X-ELNK-Trace:X-Originating-IP;
Received: from [66.167.204.252] (helo=oemcomputer) by elasmtp-galgo.atl.sa.earthlink.net with asmtp (Exim 4.34) id 1HzGoc-0000JI-I3 for ltru@ietf.org; Fri, 15 Jun 2007 14:46:34 -0400
Message-ID: <010c01c7af7e$13f1c3e0$6601a8c0@oemcomputer>
From: Randy Presuhn <randy_presuhn@mindspring.com>
To: LTRU Working Group <ltru@ietf.org>
References: <E1HzDyl-0004Iy-4B@megatron.ietf.org> <002f01c7af6c$a4659d00$6601a8c0@oemcomputer> <4672D81B.4090700@yahoo-inc.com>
Subject: Re: [Ltru] RE: ISO 639-2 decision: "mis"
Date: Fri, 15 Jun 2007 11:50:42 -0700
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 7bit
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2800.1478
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1478
X-ELNK-Trace: 4488c18417c9426da92b9037bc8bcf44d4c20f6b8d69d888fa44b31bb60a93567591c007630e9094007fded893320713350badd9bab72f9c350badd9bab72f9c
X-Originating-IP: 66.167.204.252
X-Spam-Score: 0.0 (/)
X-Scan-Signature: 082a9cbf4d599f360ac7f815372a6a15
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Errors-To: ltru-bounces@ietf.org

Hi -

> From: "Addison Phillips" <addison@yahoo-inc.com>
> To: "Randy Presuhn" <randy_presuhn@mindspring.com>
> Cc: "LTRU Working Group" <ltru@ietf.org>
> Sent: Friday, June 15, 2007 11:19 AM
> Subject: Re: [Ltru] RE: ISO 639-2 decision: "mis"
>
> Randy Presuhn wrote:
> > 
> > I disagree with the phrase "or when the range of language tags supported in
> > a given application are constrained."  If the application knows what language
> > it is, it should use the correct tag, if one exists.  If the application doesn't
> > know what language is in use, "und" would be correct.
> 
> That's nice in theory. But some applications use a subset of tags (I 
> used the MARC21 example in the text) and then transmit them through 
> another system (where the larger range of RFC 4646 is available). Those 
> systems only know that the content is 'mis' (because it is tagged that 
> way) and not what the miscellaneous language happens to be.

This is an interworking problem, where data from a non-BCP 47 environment
needs to travel through a BCP 47 one.  In this particular case, things
work out.  But as for the proposed text...

...
> The 'mis' (Uncoded) primary language subtag is used to identify 
> linguistic content whose language is known but cannot otherwise be 
> identified.

The phrase "known but cannot otherwise be identified" is a bit obscure.
I suggest replacing it with "known but for which no subtag has been defined."

> It is intended for use when the range of language tags is 
> constrained or for languages not otherwise categorized.

This only makes sense if we somehow want to shoehorn MARC into being
BCP 47 compliant.  I propose deleting this sentence.

> It SHOULD NOT be 
> used except when other means of identifying the language are not 
> available.

Minor edit: I think the would be much easier to read if worded:
 It SHOULD NOT be used when other means of identifying the language are available.

> For example, a library application might be limited to the 
> set of subtags defined for use by the [MARC21] standard. The 'mis' 
> subtag might be used by this application for languages not included in 
> that set.

I think this is not a good example of a BCP 47 use of "mis".

Although I think it's quite reasonable and fortunate that MARC data
could traverse a BCP 47 environment unharmed (and this falls under the
"SHOULD NOT" rules of RFC 2119), I do *not* think we should stretch things
to where we'd claim that MARC is "compliant".  It's just a happy circumstance
that data encoded under the MARC regime is still meaningful when interpreted
under BCP 47 rules, unless tagged with "mis".

Randy



_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru