Re: Syntax
"Clive D.W. Feather" <clive@demon.net> Tue, 09 January 2007 00:07 UTC
Date: Tue, 09 Jan 2007 00:07:04 +0000
From: "Clive D.W. Feather" <clive@demon.net>
To: Stephane Bortzmeyer <bortzmeyer@nic.fr>
Message-ID: <20070109000704.GB17340@finch-staff-1.thus.net>
References: <45A129E9.50905@gmx.de> <20070107205255.GA14621@sources.org> <45A20F62.9060306@gmx.de> <20070108204618.GA29407@sources.org>
Mime-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable
In-Reply-To: <20070108204618.GA29407@sources.org>
User-Agent: Mutt/1.5.3i
Cc: cosmogol@ietf.org
Subject: Re: Syntax
List-Id: DIscussion on state machine specification in IETF protocols <cosmogol.ietf.org>

Stephane Bortzmeyer said:
> On Mon, Jan 08, 2007 at 10:31:14AM +0100,
>  Julian Reschke <julian.reschke@gmx.de> wrote
>  a message of 48 lines which said:
>
>> Choosing characters for identifiers: again, just borrow from
>> somewhere else, such as <http://www.w3.org/TR/REC-xml/#NT-Name>.
[...]
> Otherwise, what do you think of the solution used in RFC 4646?
>
>    ASCCHAR   = %x21-25 / %x27-7E / UNICHAR ; Note: AMPERSAND is %x26
>    UNICHAR   = "&#x" 2*6HEXDIG ";"

I seem to have missed a message or two. What is the problem you're trying
to solve?

If it's identifiers using characters outside the ASCII regime, then you
want people to be able to write them in their own language, not encode
them. Equally, you want *an* encoding for portable use.

The way we solved this in the C Standard is that you can use such
characters directly, or you can encode them as \u#### or \U######## (where
# represents a hexadecimal digit). So I can write "xán" or "x\u00E1n", and
the two are interchangeable.

>    Characters from outside the US-ASCII [ISO646] repertoire, as well as
>    the AMPERSAND character ("&", %x26) when it occurs in a field-body,
>    are represented by a "Numeric Character Reference" using hexadecimal
>    notation

Here they're trying to encode characters for portable storage and exchange.
Is that what we need in this document?

>> I think inventing a new format, but not taking I18N is very hard to
>> defend. As far as I can tell, there's no real chance to get it
>> published.
> Hmmm, how many IETF formats are in Unicode? (Apart from those based
> only on XML, like Atom in RFC 4287.) ABNF is not, for instance (right,
> it is not a new format, the RFC is recent but it derives from an older
> format.)

For RFC 3977 we only had to use UTF-8 for "content" type things, not for
"control" type things. So, for example, an NNTP command is still ASCII.

-- 
Clive D.W. Feather  | Work:  <clive@demon.net>   | Tel:    +44 20 8495 6138
Internet Expert     | Home:  <clive@davros.org>  | Fax:    +44 870 051 9937
Demon Internet      | WWW: http://www.davros.org | Mobile: +44 7973 377646
THUS plc            |                            |

_______________________________________________
Cosmogol mailing list
Cosmogol@ietf.org
https://www1.ietf.org/mailman/listinfo/cosmogol

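
To make the two notations discussed in the message above concrete, here is
a minimal Python sketch. It is not taken from the C Standard or from
RFC 4646; the function names (decode_ucn, decode_ncr, encode_ucn) are
hypothetical, and it assumes identifiers arrive as ordinary Unicode strings.
It treats a C-style \u#### or \U######## sequence, and an RFC 4646-style
"&#x" 2*6HEXDIG ";" reference, as alternative spellings of the same
character.

```python
import re

# C-style universal character names: \u plus 4 hex digits,
# or \U plus 8 hex digits (assumed syntax, as described in the message).
UCN = re.compile(r"\\u([0-9A-Fa-f]{4})|\\U([0-9A-Fa-f]{8})")

# RFC 4646 registry syntax: UNICHAR = "&#x" 2*6HEXDIG ";"
NCR = re.compile(r"&#x([0-9A-Fa-f]{2,6});")


def decode_ucn(text):
    # Replace each escape with the character it names.
    # chr() raises ValueError for values above 0x10FFFF.
    return UCN.sub(lambda m: chr(int(m.group(1) or m.group(2), 16)), text)


def decode_ncr(text):
    # Replace each &#x...; reference with the character it names.
    return NCR.sub(lambda m: chr(int(m.group(1), 16)), text)


def encode_ucn(text):
    # Portable form: ASCII is kept, everything else is escaped.
    # A literal backslash is NOT escaped here, so text that already
    # contains "\u" would not round-trip; a real design must decide that.
    out = []
    for ch in text:
        cp = ord(ch)
        if cp < 0x80:
            out.append(ch)
        elif cp <= 0xFFFF:
            out.append("\\u%04X" % cp)
        else:
            out.append("\\U%08X" % cp)
    return "".join(out)


direct = "xán"            # written in the author's own language
portable = "x\\u00E1n"    # the same identifier, encoded for exchange
print(decode_ucn(portable) == direct)
print(encode_ucn(direct) == portable)
print(decode_ncr("x&#xE1;n") == direct)
```

The point of the sketch is the one made in the message: the direct and
encoded forms are interchangeable, so a tool can compare identifiers after
decoding and emit the escaped form only when a pure-ASCII file is needed.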
- Syntax Julian Reschke
- Re: Syntax Stephane Bortzmeyer
- Re: Syntax Stephane Bortzmeyer
- Re: Syntax Julian Reschke
- Re: Syntax Julian Reschke
- Re: Syntax Cullen Jennings
- Re: Syntax Stephane Bortzmeyer
- Re: Syntax Clive D.W. Feather
- Re: Syntax Julian Reschke
- Re: Syntax Stephane Bortzmeyer
- Re: Syntax Stephane Bortzmeyer
- Re: Syntax Julian Reschke
- Re: Syntax Frank Ellermann
- Re: Syntax Julian Reschke
- Re: Syntax Stephane Bortzmeyer
- Re: Syntax Clive D.W. Feather
- Re: Syntax Stephane Bortzmeyer
- Re: Syntax Stephane Bortzmeyer
- Re: Syntax Clive D.W. Feather
- Re: Syntax Clive D.W. Feather
- Re: Syntax Clive D.W. Feather
- Re: Syntax Clive D.W. Feather
- Re: Syntax Frank Ellermann
- Re: Syntax Frank Ellermann
- Re: Syntax Julian Reschke
- Re: Syntax Clive D.W. Feather
- Re: Syntax Clive D.W. Feather
- Re: Syntax Stephane Bortzmeyer
- Re: Syntax Julian Reschke
- Re: Syntax Frank Ellermann
- Re: Syntax Frank Ellermann
- Re: Syntax Frank Ellermann
- Re: Syntax Julian Reschke
- Re: Syntax Julian Reschke
- OT: ABNF (was: Syntax) Frank Ellermann
- Re: Syntax Frank Ellermann
- Re: Syntax Frank Ellermann
- Re: OT: ABNF Julian Reschke
- Re: OT: ABNF Frank Ellermann
- Re: OT: ABNF Julian Reschke
- Re: Syntax Clive D.W. Feather
- Re: Syntax Julian Reschke
- Unicode identifiers in other RFCs (Was: Syntax Stephane Bortzmeyer
- Re: Unicode identifiers in other RFCs Frank Ellermann
- Jabber room for BoF in Prague (Was: Unicode ident… Stephane Bortzmeyer
- Re: Jabber room for BoF in Prague (Was: Unicode i… Stephane Bortzmeyer