Re: [Ltru] updated demo

Felix Sasaki <felix.sasaki@fh-potsdam.de> Sun, 28 June 2009 09:07 UTC

Return-Path: <felix.sasaki@googlemail.com>
X-Original-To: ltru@core3.amsl.com
Delivered-To: ltru@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 2C0233A69B7 for <ltru@core3.amsl.com>; Sun, 28 Jun 2009 02:07:25 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.593
X-Spam-Level:
X-Spam-Status: No, score=-0.593 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, HTML_MESSAGE=0.001, MIME_8BIT_HEADER=0.3, URIBL_RHS_DOB=1.083]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 0yfWA7nYZZyy for <ltru@core3.amsl.com>; Sun, 28 Jun 2009 02:07:24 -0700 (PDT)
Received: from mail-bw0-f213.google.com (mail-bw0-f213.google.com [209.85.218.213]) by core3.amsl.com (Postfix) with ESMTP id B1C1B3A6359 for <ltru@ietf.org>; Sun, 28 Jun 2009 02:07:23 -0700 (PDT)
Received: by bwz9 with SMTP id 9so2733486bwz.37 for <ltru@ietf.org>; Sun, 28 Jun 2009 02:07:40 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=domainkey-signature:mime-version:sender:received:in-reply-to :references:date:x-google-sender-auth:message-id:subject:from:to:cc :content-type; bh=TxxXgxzde4kK1JSdcofkiqMLgS+BcBy5tlJssgDuhGA=; b=u54sCqgYCcX8QX1c4RvEzo1Jql0FKkbdjkY2laaOhfjT/Dd/nC3zhyfG34T9KM9b8n YGrwEkm+EK8tvLQ8O6c8Bw58miqC5eatTthA/HG/zyxDnzY1u/J3EdcCxiIDjGsiYcCn NiR9JV/Km56pVlBzactHvOrRbYGK0BChzARuE=
DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlemail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type; b=X7TjtlG1e2tC+3+OVaFbt/27HikHGfzh10b7x252icVg66qigbRd4dqs8P7HOQKBCW iFbHS5e6Nit6VjCKc0GJYUAkA4cSJ5M8LB1yi2yW8EOqoo7GJfipmvbXU4kbu0q0k+1j aOEAY0m8w8IrxB2x2SsHeHdrIb4tcFYDXvUuE=
MIME-Version: 1.0
Sender: felix.sasaki@googlemail.com
Received: by 10.223.122.15 with SMTP id j15mr3836772far.74.1246180060665; Sun, 28 Jun 2009 02:07:40 -0700 (PDT)
In-Reply-To: <30b660a20906271138o186f82a5xd2531f70806ab3be@mail.gmail.com>
References: <30b660a20906271138o186f82a5xd2531f70806ab3be@mail.gmail.com>
Date: Sun, 28 Jun 2009 11:07:40 +0200
X-Google-Sender-Auth: 9747c18c55bdc723
Message-ID: <ba4134970906280207td8dbdd4l8a4860f7ee4de28@mail.gmail.com>
From: Felix Sasaki <felix.sasaki@fh-potsdam.de>
To: Mark Davis ⌛ <mark@macchiato.com>
Content-Type: multipart/alternative; boundary="001636c5a723d798a7046d64e75f"
X-Mailman-Approved-At: Sun, 28 Jun 2009 14:33:20 -0700
Cc: LTRU Working Group <ltru@ietf.org>
Subject: Re: [Ltru] updated demo
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 28 Jun 2009 09:09:45 -0000

Hello Mark,

this looks similar to
http://www.w3.org/2008/05/lta/
my language tag parser currently based on draft 21 of rfc4646bis. lta also
contains some error checking mechanisms, see examples like

http://www.w3.org/2008/05/lta/language-tags/q?input=de-x
http://www.w3.org/2008/05/lta/language-tags/q?input=xa
http://www.w3.org/2008/05/lta/language-tags/q?input=en-latn
http://www.w3.org/2008/05/lta/language-tags/q?input=ja-1901
http://www.w3.org/2008/05/lta/language-tags/q?input=fr-cmn
http://www.w3.org/2008/05/lta/language-tags/q?input=zh-cmn-cmn
http://www.w3.org/2008/05/lta/language-tags/q?input=zh-cmn-a-bbb-a-ccc
http://www.w3.org/2008/05/lta/language-tags/q?input=de-de-1901-1901

Output is available in HTML with German UI and English, and in an XML
format, see e.g.
http://www.w3.org/2008/05/lta/language-tags/q?input=de-de-1901-1901&output=xml

My comment on your tool is that to co-ordinate such efforts it would be
great to have a common machine-readable output format for language tag
parsing, also e.g. to deal with error descriptions like

 <lta:variant>
      <lta:subtag>1901</lta:subtag>
      <lta:registryInfo>
         <lta:var ty="variant" su="1901" ad="2005-10-16">
            <lta:ds>Traditional German orthography

</lta:ds>
            <lta:pref>de</lta:pref>
         </lta:var>
      </lta:registryInfo>
      <lta:matchedPrefix>de</lta:matchedPrefix>
      <lta:error type="e007">
         <lta:errorText>Variant repetition</lta:errorText>
         <lta:errorAddInfo>

            <lta:subtag>1901</lta:subtag>
         </lta:errorAddInfo>
      </lta:error>
   </lta:variant>


Felix

2009/6/27 Mark Davis ⌛ <mark@macchiato.com>

> I updated the demo at http://unicode.org/cldr/utility/languageid.jsp to
> parse extlangs. The samples include official languages and the scripts they
> use (based on CLDR data), and the names have localizations where available.
>
> Comments welcome.
>
> Mark
>
> _______________________________________________
> Ltru mailing list
> Ltru@ietf.org
> https://www.ietf.org/mailman/listinfo/ltru
>
>