Re: [Ltru] Consensus call: extlang

Leif Halvard Silli <lhs@malform.no> Thu, 29 May 2008 18:31 UTC

Return-Path: <ltru-bounces@ietf.org>
X-Original-To: ltru-archive@megatron.ietf.org
Delivered-To: ietfarch-ltru-archive@core3.amsl.com
Received: from [127.0.0.1] (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 75FE228C16F; Thu, 29 May 2008 11:31:45 -0700 (PDT)
X-Original-To: ltru@core3.amsl.com
Delivered-To: ltru@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 31C993A6BA5 for <ltru@core3.amsl.com>; Thu, 29 May 2008 11:31:44 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.888
X-Spam-Level:
X-Spam-Status: No, score=-0.888 tagged_above=-999 required=5 tests=[AWL=1.711, BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id k-hNHkk8YUAC for <ltru@core3.amsl.com>; Thu, 29 May 2008 11:31:43 -0700 (PDT)
Received: from lakepoint.domeneshop.no (lakepoint.domeneshop.no [194.63.248.54]) by core3.amsl.com (Postfix) with ESMTP id 3FDA63A6800 for <ltru@ietf.org>; Thu, 29 May 2008 11:25:37 -0700 (PDT)
Received: from 10013.local (cm-84.208.108.246.getinternet.no [84.208.108.246]) (authenticated bits=0) by lakepoint.domeneshop.no (8.13.8/8.13.8) with ESMTP id m4TIPYa6001578; Thu, 29 May 2008 20:25:34 +0200
Message-ID: <483EF50F.7080307@malform.no>
Date: Thu, 29 May 2008 20:25:19 +0200
From: Leif Halvard Silli <lhs@malform.no>
User-Agent: Mozilla/5.0 (Macintosh; U; PPC Mac OS X Mach-O; en-US; rv:1.8.1b1) Gecko/20060724 Thunderbird/2.0a1 Mnenhy/0.7.4.666
MIME-Version: 1.0
To: Peter Constable <petercon@microsoft.com>
References: <422633.90603.qm@web31813.mail.mud.yahoo.com> <E19FDBD7A3A7F04788F00E90915BD36C13C2528ABB@USSDIXMSG20.spe.sony.com> <30b660a20805282114v642c07dawa905112dbd6a35f5@mail.gmail.com> <483E54DC.4050605@malform.no> <DDB6DE6E9D27DD478AE6D1BBBB835795633304E966@NA-EXMSG-C117.redmond.corp.microsoft.com>
In-Reply-To: <DDB6DE6E9D27DD478AE6D1BBBB835795633304E966@NA-EXMSG-C117.redmond.corp.microsoft.com>
Cc: "ltru@ietf.org" <ltru@ietf.org>
Subject: Re: [Ltru] Consensus call: extlang
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Sender: ltru-bounces@ietf.org
Errors-To: ltru-bounces@ietf.org

Peter Constable 2008-05-29 18.53:

>> From: ltru-bounces@ietf.org [mailto:ltru-bounces@ietf.org] On
>> Behalf Of Leif Halvard Silli
> 
> 
>> If the focus is identification, then the fallback and
>> filtering doesn't matter (or must be defined subsequently).
> 
> Sorry, but this is not a sensible approach to coding.

   [...]

 

Note the "if". And if we try to solve too many issues at once, 
then perhaps we won't solve any thing at all.

> "cmn" requires that a process have a separate business
> knowledge relating Mandarin and "Chinese", but "zh-cmn" embeds
> that knowledge into the tag itself. Since both are the same in
> terms of their identificational value, the question to be asked
> is which of these is more useful for processing. [...]


Who will be more useful - ok. But also: Who will lead to most 
processing/most tagging.

I think that 'cmn' will be perceived as 'oh, now they have decided 
something entirely new'.  We used 'zh' in all these yeaers, and 
now - why 'yue'?

>> Extlang makes identifying/finding *all* Chinese languages
>> simple,
> 
> Note that you have already introduced processing.


I know.

>> for those that need to do find them all. It would only
>> require that you ask for 'zh'. Thus with extlang one could
>> start tagging resources spesificly *immediatly*, because it
>> should be highly compatible with existing application
>> behaviour.
> 
> For applications that use right-truncation matching, but not
> lookup.


It is of course not difficult to per se to add all tags instead of 
just one or two to one's Web browser. But in my experience it is 
difficult to even add more than one. Not to mention how difficult 
it is to get application to ad sensible algoritms so that spesific 
tags does not lead you to loose content.

>> (Btw, this approach strikes me as higly compatible with
>> another popular slogan: Don't break the Web.)
> 
> Well, not all servers use right-truncation matching logic, and
> for those that don't your plan would be breaking the Web. (I'm


It would be interesting to know some Web servers and othe servers 
where it works differently.

> not saying that the no-extlang path doesn't have some of the
> same issues; either way, there are things that would need to be
> done to keep everything working smoothly if we want to
> transition from a "zh"-only past into a future that includes
> "cmn"/"yue"/etc.


But there are no Web browsers today which ask for 'cmn' or 'yue'. 
It is 'zh' they ask for. Hence there is a chance that content 
tagged as 'zh-cmn' will Just Work, today, many places. And 
therefore I view it as a good bit more compatible with existing 
content to go for Extlang.
-- 
leif halvard silli
_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www.ietf.org/mailman/listinfo/ltru