Re: Syntax

Stephane Bortzmeyer <bortzmeyer@nic.fr> Tue, 09 January 2007 22:12 UTC

Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1H4PCZ-0007mT-BX; Tue, 09 Jan 2007 17:12:15 -0500
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1H4PCX-0007jf-Qa for cosmogol@ietf.org; Tue, 09 Jan 2007 17:12:13 -0500
Received: from virtual3.netaktiv.com ([80.67.170.53] helo=mail.bortzmeyer.org) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1H4PCW-000567-IP for cosmogol@ietf.org; Tue, 09 Jan 2007 17:12:13 -0500
Received: by mail.bortzmeyer.org (Postfix, from userid 10) id 01CC5240823; Tue, 9 Jan 2007 23:12:10 +0100 (CET)
Received: by mail.sources.org (Postfix, from userid 1000) id 7A87D11B4A; Tue, 9 Jan 2007 23:08:56 +0100 (CET)
Date: Tue, 9 Jan 2007 23:08:56 +0100
From: Stephane Bortzmeyer <bortzmeyer@nic.fr>
To: Julian Reschke <julian.reschke@gmx.de>
Message-ID: <20070109220856.GB28340@sources.org>
References: <45A129E9.50905@gmx.de> <20070107205255.GA14621@sources.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20070107205255.GA14621@sources.org>
X-Transport: UUCP rules
X-Operating-System: Debian GNU/Linux 3.1
User-Agent: Mutt/1.5.9i
X-Spam-Score: 0.1 (/)
X-Scan-Signature: 08170828343bcf1325e4a0fb4584481c
Cc: cosmogol@ietf.org
Subject: Re: Syntax
X-BeenThere: cosmogol@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: DIscussion on state machine specification in IETF protocols <cosmogol.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/cosmogol>, <mailto:cosmogol-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/cosmogol>
List-Post: <mailto:cosmogol@ietf.org>
List-Help: <mailto:cosmogol-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/cosmogol>, <mailto:cosmogol-request@ietf.org?subject=subscribe>
Errors-To: cosmogol-bounces@ietf.org

On Sun, Jan 07, 2007 at 09:52:55PM +0100,
 Stephane Bortzmeyer <bortzmeyer@nic.fr> wrote 
 a message of 30 lines which said:

> Any idea about the support of UTF-8 in typical languages *and*
> parsing tools? For instance, with C and Yacc, I assume it is quite
> difficult. With Haskell, I'm not sure :-)

Searching and asking C gurus, it seems that C programmers using Lex
and Yacc (or ANTLR) are out of luck here (they would have to do
everything themselves, not relying on these useful tools).

These tools rely heavily on tables, so you can imagine what it would
be to port them to the world of 100 000 characters, after they were
written for a world of 256 characters :-)

_______________________________________________
Cosmogol mailing list
Cosmogol@ietf.org
https://www1.ietf.org/mailman/listinfo/cosmogol