[Ltru] Re: Test suite for language tags?

Stephane Bortzmeyer <bortzmeyer@nic.fr> Sun, 17 September 2006 16:52 UTC

Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1GOzsP-0007kY-Er; Sun, 17 Sep 2006 12:52:17 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1GOzsO-0007iH-4J for ltru@lists.ietf.org; Sun, 17 Sep 2006 12:52:16 -0400
Received: from virtual3.netaktiv.com ([80.67.170.53] helo=mail.bortzmeyer.org) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1GOzsM-0006BW-S6 for ltru@lists.ietf.org; Sun, 17 Sep 2006 12:52:16 -0400
Received: by mail.bortzmeyer.org (Postfix, from userid 10) id 0F2F3240814; Sun, 17 Sep 2006 18:52:08 +0200 (CEST)
Received: by mail.sources.org (Postfix, from userid 1000) id C3C6B1185A; Sun, 17 Sep 2006 18:50:50 +0200 (CEST)
Date: Sun, 17 Sep 2006 18:50:50 +0200
From: Stephane Bortzmeyer <bortzmeyer@nic.fr>
To: Mark Davis <mark.davis@icu-project.org>
Message-ID: <20060917165050.GA29413@sources.org>
References: <20060802072709.GA17404@nic.fr> <44D21ACD.4040707@yahoo-inc.com> <20060804165720.GA24037@sources.org> <44D4AC42.79E0@xyzzy.claranet.de> <20060830093000.GA31895@nic.fr> <44F6313D.2070000@yahoo-inc.com> <6.0.0.20.2.20060831201004.101ab8d0@localhost> <44F6EF0E.20602@yahoo-inc.com> <6.0.0.20.2.20060901024806.109a6d90@localhost> <30b660a20609161628t22ab3c4flc81ea92f40800a09@mail.gmail.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <30b660a20609161628t22ab3c4flc81ea92f40800a09@mail.gmail.com>
X-Transport: UUCP rules
X-Operating-System: Debian GNU/Linux 3.1
User-Agent: Mutt/1.5.9i
X-Spam-Score: 0.1 (/)
X-Scan-Signature: e5ba305d0e64821bf3d8bc5d3bb07228
Cc: ltru@lists.ietf.org
Subject: [Ltru] Re: Test suite for language tags?
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Errors-To: ltru-bounces@ietf.org

On Sat, Sep 16, 2006 at 04:28:01PM -0700,
 Mark Davis <mark.davis@icu-project.org> wrote 
 a message of 98 lines which said:

> BTW, I had updated my regex to the final spec for 4646. Here is a
> single Perl or Java regex that does most of the parse:

Isn't it too lax? When testing it in a Perl script, I find it accepts
all my well-formed tags (OK) but also accepts wrongly:

fr-Latn-F is well-formed
en-a-bbb-a-ccc is well-formed
tlh-a-b-foo is well-formed
abcdefghi-012345678 is well-formed
ab-abc-abc-abc-abc is well-formed
ab-abcd-abc is well-formed
ab-ab-abc is well-formed
ab-123-abc is well-formed
ab-abcde-abc is well-formed
ab-1abc-abc is well-formed
ab-ab-abcd is well-formed
ab-123-abcd is well-formed
ab-abcde-abcd is well-formed
ab-1abc-abcd is well-formed
ab-a-b is well-formed
ab-a-x is well-formed
ab--ab is well-formed
ab-abc- is well-formed
ab-c-abc-r-toto-c-abc is well-formed
abcd-efg is well-formed
aabbccddE is well-formed

_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru