Re: Last Call: 'Tags for Identifying Languages' to BCP

David Hopwood <david.nospam.hopwood@blueyonder.co.uk> Thu, 25 August 2005 22:40 UTC

Received: from localhost.localdomain ([127.0.0.1] helo=megatron.ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1E8QOx-0003N9-HN; Thu, 25 Aug 2005 18:40:51 -0400
Received: from odin.ietf.org ([132.151.1.176] helo=ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1E8QOv-0003Mf-EN for ietf@megatron.ietf.org; Thu, 25 Aug 2005 18:40:49 -0400
Received: from ietf-mx.ietf.org (ietf-mx [132.151.6.1]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id SAA25434 for <ietf@ietf.org>; Thu, 25 Aug 2005 18:40:46 -0400 (EDT)
Received: from smtp-out6.blueyonder.co.uk ([195.188.213.9]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1E8QPW-0004uP-41 for ietf@ietf.org; Thu, 25 Aug 2005 18:41:27 -0400
Received: from [127.0.0.1] ([82.42.16.20]) by smtp-out6.blueyonder.co.uk with Microsoft SMTPSVC(5.0.2195.6713); Thu, 25 Aug 2005 23:41:33 +0100
Message-ID: <430E48F8.7050102@blueyonder.co.uk>
Date: Thu, 25 Aug 2005 23:40:56 +0100
From: David Hopwood <david.nospam.hopwood@blueyonder.co.uk>
User-Agent: Mozilla Thunderbird 1.0.2 (Windows/20050317)
X-Accept-Language: en-us, en
MIME-Version: 1.0
To: ietf@ietf.org
References: <E1E7bto-0003lh-0J@newodin.ietf.org> <6.2.3.4.2.20050824104625.051a4ab0@mail.jefsey.com> <430C937E.40103@blueyonder.co.uk> <6.2.3.4.2.20050825024457.03512840@mail.jefsey.com>
In-Reply-To: <6.2.3.4.2.20050825024457.03512840@mail.jefsey.com>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit
X-OriginalArrivalTime: 25 Aug 2005 22:41:33.0234 (UTC) FILETIME=[2495C120:01C5A9C6]
X-Spam-Score: 0.1 (/)
X-Scan-Signature: 538aad3a3c4f01d8b6a6477ca4248793
Content-Transfer-Encoding: 7bit
Subject: Re: Last Call: 'Tags for Identifying Languages' to BCP
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
Reply-To: david.nospam.hopwood@blueyonder.co.uk
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
Sender: ietf-bounces@ietf.org
Errors-To: ietf-bounces@ietf.org

JFC (Jefsey) Morfin wrote:
> [...] Today, the common practice of 
> nearly one billion of Internet users is to be able to turn off cookies 
> to protect their anonymous free usage of the web. Once the Draft enters 
> into action they will be imposed a conflicting privacy violation: "tell 
> me what you read, I will tell you who you are": any OPES can monitor the 
> exchange, extact these unambigous ASCII tags, and know (or block) what 
> you read. You can call these tags in google and learn a lot about 
> people. There is no proposed way to turn that personal tagging off, nor 
> to encode it.

I don't know which browser you use, but in Firefox, I can configure exactly
which language tags it sends. If it were sending other information using
language tags as a covert channel (which it *could* do regardless of the
draft under discussion), I'd expect that to be treated as at least a bug,
and if it were a deliberate privacy violation, I'd expect that to cause a
big scandal.

>>> I support it as a transition standard track RFC needed by some, as 
>>> long as it does not exclude more specific/advanced language 
>>> identification formats, processes or future IANA or ISO 11179 
>>> conformant registries.
>>
>> The grammar defined in the draft is already flexible enough.
> 
> (I suppose you mean more than just grammar. Talking of the ABNF is 
> probably clearer?).
> 
> I am certainly eager to learn how I can support modal information (type 
> of voice, accent, signs, icons, feelings, fount, etc.), medium 
> information, language references (for example is it plain, basic, 
> popular English? used dictionary, used software publisher), nor the 
> context (style, relation, etc.), nor the nature of the text (mono, 
> multilingual, human or machine oriented - for example what is the tag to 
> use for a multilingual file [printed in a language of choice]), the date 
> of the langtag version being used, etc.

I mean that the grammar is flexible enough to encode any of the above
attributes (not that it would be useful or a good idea to encode most
of them).

> The Draft has introduced the "script" subtag in addition to RFC 3066 
> (what is an obvious change). However in order to stay "compatible" with 
> RFC 3066, author says it cannot introduce a specific support of URI 
> tags.

This objection seems to be correct: URI tags include characters not
allowed by RFC 3066. But you could easily encode the equivalent information
to an URI tag, if you wanted to.

-- 
David Hopwood <david.nospam.hopwood@blueyonder.co.uk>


_______________________________________________
Ietf mailing list
Ietf@ietf.org
https://www1.ietf.org/mailman/listinfo/ietf