Re: [Ltru] [apps-discuss] Fwd: Defining a CBOR tag for RFC 5646 Language Tags

"Doug Ewell" <doug@ewellic.org> Wed, 14 May 2014 18:03 UTC

Return-Path: <doug@ewellic.org>
X-Original-To: ltru@ietfa.amsl.com
Delivered-To: ltru@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7E8081A00A6 for <ltru@ietfa.amsl.com>; Wed, 14 May 2014 11:03:06 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.001
X-Spam-Level:
X-Spam-Status: No, score=-0.001 tagged_above=-999 required=5 tests=[BAYES_20=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id KKQFaJieoL8Z for <ltru@ietfa.amsl.com>; Wed, 14 May 2014 11:03:05 -0700 (PDT)
Received: from p3plwbeout03-01.prod.phx3.secureserver.net (p3plsmtp03-01-2.prod.phx3.secureserver.net [72.167.218.213]) by ietfa.amsl.com (Postfix) with ESMTP id 5E1A71A012F for <ltru@ietf.org>; Wed, 14 May 2014 11:03:05 -0700 (PDT)
Received: from localhost ([72.167.218.244]) by p3plwbeout03-01.prod.phx3.secureserver.net with bizsmtp id 1u2v1o0025GyNsw01u2vwh; Wed, 14 May 2014 11:02:55 -0700
X-SID: 1u2v1o0025GyNsw01
Received: (qmail 24590 invoked by uid 99); 14 May 2014 18:02:55 -0000
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain; charset="utf-8"
X-Originating-IP: 208.51.143.189
User-Agent: Workspace Webmail 5.6.47
Message-Id: <20140514110254.665a7a7059d7ee80bb4d670165c8327d.d5e042b353.wbe@email03.secureserver.net>
From: Doug Ewell <doug@ewellic.org>
To: ltru@ietf.org
Date: Wed, 14 May 2014 11:02:54 -0700
Mime-Version: 1.0
Archived-At: http://mailarchive.ietf.org/arch/msg/ltru/-qMFmMl83VWDynMl-wfsMnLafDA
Cc: dave@cridland.net
Subject: Re: [Ltru] [apps-discuss] Fwd: Defining a CBOR tag for RFC 5646 Language Tags
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ltru/>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 14 May 2014 18:03:06 -0000

Mark Davis ☕️ <mark at macchiato dot com> replied to Dave Cridland:

>> Many years ago, Mark Crispin and Chris Newman had a proposal for
>> embedding language tags in invalid UTF-8; I seem to recall they
>> publicly renounced their proposal rather dramatically in favour of a
>> Unicode Consortium proposal for embedding the language tags somewhere
>> in Plane 14 - published as RFC 2482.
>>
>> The fact it was all initiated in order to support the pressing needs
>> of ACAP might give you some hints as to why it never really took off,
>> but as a counter-proposal to language tags in metadata, it might be
>> worth re-examining.
>
> The tag characters in Unicode are deprecated, and should not be used.

As much as this is true, using invalid UTF-8 sequences to encode any
sort of meta-information is a far, far worse idea.

--
Doug Ewell | Thornton, CO, USA
http://ewellic.org | @DougEwell