Re: Require guidance on Unicode in IETF formats

Stephane Bortzmeyer, Wed, 21 March 2007 09:32 UTC

On Tue, Mar 20, 2007 at 06:11:11PM +0100,
 Michael Richardson wrote
 a message of 16 lines which said:

> Given that ASCII and Latin-1 is a proper subset of Unicode, I don't
> understand how permitting Unicode makes tools for writing SMDL
> harder.

Precisely because Unicode is a superset of ASCII. If you develop with
ASCII-only tools, they may not be able to process all Cosmogol files,
only those written in pure ASCII.

> For me, I'm thinking that this is mostly text editors, and some gui
> tools that would mostly *export* a SMDL.  i.e. that we work mostly
> in latex-style.

No, the typical Cosmogol tools are not editors (Cosmogol is, by
design, a text-based language, so you can use any editor). They are
checkers, translators, etc. Same as with ABNF.
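
To make the point about ASCII-only tools concrete, here is a minimal
sketch in Python of the kind of check such a tool performs. It is not
an actual Cosmogol tool: both identifier rules and the helper names
(is_identifier, can_decode) are assumptions made up for illustration,
not the grammar from the draft.

```python
import re

# Hypothetical identifier rules, for illustration only; the real
# Cosmogol grammar is not reproduced here.
ASCII_IDENTIFIER = re.compile(r"[A-Za-z][A-Za-z0-9_-]*")
# For str patterns, \w is Unicode-aware by default in Python 3, so
# [^\W\d_] means "any Unicode letter".
UNICODE_IDENTIFIER = re.compile(r"[^\W\d_][\w-]*")


def is_identifier(name: str, pattern: re.Pattern) -> bool:
    """Return True if the whole string matches the identifier pattern."""
    return pattern.fullmatch(name) is not None


def can_decode(data: bytes, encoding: str) -> bool:
    """Return True if the raw input can be decoded with the encoding."""
    try:
        data.decode(encoding)
    except UnicodeDecodeError:
        return False
    return True
```

With these definitions, a state name like "Started" passes both rules,
while "Démarré" passes only the Unicode-aware one, and its UTF-8 bytes
cannot even be read by a tool that decodes its input as ASCII.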

> Is that really that much harder?   

I suggested that people post here "implementation reports" of parsers
for Unicode-based languages, together with code examples. So far, we
have received mostly "It should work" reports :-)

> Was it simply the "case-insensitive" nature of things that threw a
> wrench in things?

Cosmogol is case-sensitive.
