Re: [Ltru] updated demo

Felix Sasaki <felix.sasaki@fh-potsdam.de> Sun, 28 June 2009 12:11 UTC

Return-Path: <felix.sasaki@googlemail.com>
X-Original-To: ltru@core3.amsl.com
Delivered-To: ltru@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id C2CA73A6867 for <ltru@core3.amsl.com>; Sun, 28 Jun 2009 05:11:20 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.742
X-Spam-Level:
X-Spam-Status: No, score=-0.742 tagged_above=-999 required=5 tests=[AWL=0.151, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, HTML_MESSAGE=0.001, URIBL_RHS_DOB=1.083]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id nBFXZP0zeIrv for <ltru@core3.amsl.com>; Sun, 28 Jun 2009 05:11:19 -0700 (PDT)
Received: from mail-bw0-f213.google.com (mail-bw0-f213.google.com [209.85.218.213]) by core3.amsl.com (Postfix) with ESMTP id 4C7963A677C for <ltru@ietf.org>; Sun, 28 Jun 2009 05:11:18 -0700 (PDT)
Received: by bwz9 with SMTP id 9so2778282bwz.37 for <ltru@ietf.org>; Sun, 28 Jun 2009 05:10:12 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=domainkey-signature:mime-version:sender:received:in-reply-to :references:date:x-google-sender-auth:message-id:subject:from:to :content-type; bh=Bs41Sry5Bv+qQggssksn9fWfrTuwwn2/QHHgI8XnTq8=; b=slBI3q8nY7J6U2RdBBHNHfoZLQMr/vKkEh/nnovB+gH5Z0XAGADkqDRzsg8jSRQEt5 JGZxYYrWQTC7mMZw7IIFD1NPgtrXIvbuJIB7syO8xsLn84hcc+ttEIrwsibn4ROQaGR3 2fyHXsqhpjpesISgikLwf3UF1/HDYrCNFx4BU=
DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlemail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type; b=i3ZEDIjaxP/fykr0MKItL0szZT2Qu4MGu+L3IABWe6LwD0JNarYKXLnO0/IxyBXiCv eYCbgxNDpuqEA4q7ti+mIre7XJhuWJJgevaFtCHAnkpJRHNYvNKQsHJhWwcbYosyuch7 75vU4Yub33J2Mg5l9h21PDGbovs0GVJ/Mj8/c=
MIME-Version: 1.0
Sender: felix.sasaki@googlemail.com
Received: by 10.223.123.210 with SMTP id q18mr3859681far.60.1246191011849; Sun, 28 Jun 2009 05:10:11 -0700 (PDT)
In-Reply-To: <ba4134970906280207td8dbdd4l8a4860f7ee4de28@mail.gmail.com>
References: <30b660a20906271138o186f82a5xd2531f70806ab3be@mail.gmail.com> <ba4134970906280207td8dbdd4l8a4860f7ee4de28@mail.gmail.com>
Date: Sun, 28 Jun 2009 14:10:11 +0200
X-Google-Sender-Auth: f5d7e1233d9a7dc0
Message-ID: <ba4134970906280510n6b1a4cf9hc6379eacaef86115@mail.gmail.com>
From: Felix Sasaki <felix.sasaki@fh-potsdam.de>
To: LTRU Working Group <ltru@ietf.org>
Content-Type: multipart/alternative; boundary="001636988a0b956eb0046d6774e4"
Subject: Re: [Ltru] updated demo
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 28 Jun 2009 12:11:20 -0000

Resent, now this should reach the list.

Hello Mark,

this looks similar to
http://www.w3.org/2008/05/lta/
my language tag parser currently based on draft 21 of rfc4646bis. lta also
contains some error checking mechanisms, see examples like

http://www.w3.org/2008/05/lta/language-tags/q?input=de-x
http://www.w3.org/2008/05/lta/language-tags/q?input=xa
http://www.w3.org/2008/05/lta/language-tags/q?input=en-latn
http://www.w3.org/2008/05/lta/language-tags/q?input=ja-1901
http://www.w3.org/2008/05/lta/language-tags/q?input=fr-cmn
http://www.w3.org/2008/05/lta/language-tags/q?input=zh-cmn-cmn
http://www.w3.org/2008/05/lta/language-tags/q?input=zh-cmn-a-bbb-a-ccc
http://www.w3.org/2008/05/lta/language-tags/q?input=de-de-1901-1901

Output is available in HTML with German UI and English, and in an XML
format, see e.g.
http://www.w3.org/2008/05/lta/language-tags/q?input=de-de-1901-1901&output=xml

My comment on your tool is that to co-ordinate such efforts it would be
great to have a common machine-readable output format for language tag
parsing, also e.g. to deal with error descriptions like

 <lta:variant>
      <lta:subtag>1901</lta:subtag>
      <lta:registryInfo>

         <lta:var ty="variant" su="1901" ad="2005-10-16">

            <lta:ds>Traditional German orthography

</lta:ds>
            <lta:pref>de</lta:pref>

         </lta:var>
      </lta:registryInfo>
      <lta:matchedPrefix>de</lta:matchedPrefix>

      <lta:error type="e007">
         <lta:errorText>Variant repetition</lta:errorText>

         <lta:errorAddInfo>

            <lta:subtag>1901</lta:subtag>
         </lta:errorAddInfo>

      </lta:error>
   </lta:variant>


Felix

2009/6/27 Mark Davis ⌛ <mark@macchiato.com>
I updated the demo at http://unicode.org/cldr/utility/languageid.jsp to
parse extlangs. The samples include official languages and the scripts they
use (based on CLDR data), and the names have localizations where available.

Comments welcome.

Mark

______________________________
>
> _________________
> Ltru mailing list
> Ltru@ietf.org
> https://www.ietf.org/mailman/listinfo/ltru