Re: [Slim] Adam Roach's No Objection on draft-ietf-slim-multilangcontent-13: (with COMMENT)

Adam Roach <adam@nostrum.com> Thu, 17 August 2017 09:54 UTC

Return-Path: <adam@nostrum.com>
X-Original-To: slim@ietfa.amsl.com
Delivered-To: slim@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5C4B4132418; Thu, 17 Aug 2017 02:54:57 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.88
X-Spam-Level:
X-Spam-Status: No, score=-1.88 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, T_SPF_HELO_PERMERROR=0.01, T_SPF_PERMERROR=0.01] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id YpyuPvKFhfK0; Thu, 17 Aug 2017 02:54:55 -0700 (PDT)
Received: from nostrum.com (raven-v6.nostrum.com [IPv6:2001:470:d:1130::1]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id A8E8A13239C; Thu, 17 Aug 2017 02:54:55 -0700 (PDT)
Received: from Svantevit.roach.at (cpe-70-122-154-80.tx.res.rr.com [70.122.154.80]) (authenticated bits=0) by nostrum.com (8.15.2/8.15.2) with ESMTPSA id v7H9slKS063636 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 verify=NO); Thu, 17 Aug 2017 04:54:49 -0500 (CDT) (envelope-from adam@nostrum.com)
X-Authentication-Warning: raven.nostrum.com: Host cpe-70-122-154-80.tx.res.rr.com [70.122.154.80] claimed to be Svantevit.roach.at
To: Alexey Melnikov <aamelnikov@fastmail.fm>, The IESG <iesg@ietf.org>
Cc: slim@ietf.org, draft-ietf-slim-multilangcontent@ietf.org, slim-chairs@ietf.org, bernard.aboba@gmail.com
References: <150292740024.12316.10186295594648032334.idtracker@ietfa.amsl.com> <1502962175.1590900.1076220224.097DD098@webmail.messagingengine.com>
From: Adam Roach <adam@nostrum.com>
Message-ID: <eee8abeb-aa90-a1a6-5961-900ca1a88c05@nostrum.com>
Date: Thu, 17 Aug 2017 04:54:47 -0500
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.10; rv:52.0) Gecko/20100101 Thunderbird/52.2.1
MIME-Version: 1.0
In-Reply-To: <1502962175.1590900.1076220224.097DD098@webmail.messagingengine.com>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: 7bit
Content-Language: en-US
Archived-At: <https://mailarchive.ietf.org/arch/msg/slim/g_lxDsAw0t9CCJmE6zZ9DdY6k2U>
Subject: Re: [Slim] Adam Roach's No Objection on draft-ietf-slim-multilangcontent-13: (with COMMENT)
X-BeenThere: slim@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: Selection of Language for Internet Media <slim.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/slim>, <mailto:slim-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/slim/>
List-Post: <mailto:slim@ietf.org>
List-Help: <mailto:slim-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/slim>, <mailto:slim-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 17 Aug 2017 09:54:57 -0000

On 8/17/17 4:29 AM, Alexey Melnikov wrote:
> Hi Adam,
>
> On Thu, Aug 17, 2017, at 12:50 AM, Adam Roach wrote:
>> Adam Roach has entered the following ballot position for
>> draft-ietf-slim-multilangcontent-13: No Objection
>> I note that this mechanism uses only language tags and implicitly leaves
>> aside
>> any discussion of script or region tags.
> Actually this is not true. Language Tags as specified in RFC 5646 can
> optionally include all of these things.
>
>> I'm going to assume this was a
>> intentional decision that was considered by the group with the end result
>> being
>> that there was no perceived need to distinguish between (e.g.) Latin and
>> Jawi
>> scripts for Malay, or (e.g.) Brazilian and Portuguese variations of 'pt'.
>> (If
>> this was not an explicit decision made by the WG, I would ask that it be
>> posed
>> to the WG for an explicit decision -- the current mechanism seems
>> somewhat
>> deficient in this regard from my admittedly brief analysis of the
>> situation.)
>>
>> I'm a little worried that an imprecise reading by implementors of the
>> citation
>> to RFC5646 may lead them to believe that script and region tags may be
>> included
>> in the Content-Language header fields.
> They are absolutely allowed.

Wow. If I had gone to implement this, I would have gotten it 100% wrong, 
and argued with anyone who told me as much. The document says:

    The Content-Language MUST comply with RFC 3282 [RFC3282] (which
    defines the Content-Language field) and BCP 47/RFC 5646 [RFC5646]
    (which defines the structure and semantics for the language code
    values).


What is worth taking note of here is the fact that this says "language 
*code* values," not "language *tag* values." RFC 5646 defines a Language 
_Tag_ (e.g., "sr-Latn-RS"), that is composed of a Language _Code_ (e.g., 
"sr") followed by an optional Script Code (e.g., "Latn") and an optional 
Country Code (e.g., "RS").

As if to reinforce this, the examples then show values of the 
Content-Language header field that consist exclusively of language 
codes, even though it predominantly uses two languages with regional 
variations (en and es): to my eye, "en" by itself looks strange, as I'm 
used to always seeing that language written as either "en-US" or "en-GB".

So I would suggest (a) the passage above be changed to say "...for the 
language tag values...", and (b) the examples be amended to show more 
than simple language codes; e.g.:

    Content-Language: en-US

    Content-Language: de

    Content-Language: es-ES, fr

    Content-Language: sr-Cyrl


/a