Re: [abnf-discuss] Mail regarding draft-seantek-constrained-abnf

Sean Leonard <dev+ietf@seantek.com> Sat, 09 July 2016 07:51 UTC

Return-Path: <dev+ietf@seantek.com>
X-Original-To: abnf-discuss@ietfa.amsl.com
Delivered-To: abnf-discuss@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 364B012D0C3; Sat, 9 Jul 2016 00:51:21 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.601
X-Spam-Level:
X-Spam-Status: No, score=-2.601 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_LOW=-0.7, SPF_HELO_PASS=-0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 7jpm9x7bGNzi; Sat, 9 Jul 2016 00:51:19 -0700 (PDT)
Received: from mxout-08.mxes.net (mxout-08.mxes.net [216.86.168.183]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id C65C612D0A4; Sat, 9 Jul 2016 00:51:19 -0700 (PDT)
Received: from [192.168.123.151] (unknown [75.83.2.34]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by smtp.mxes.net (Postfix) with ESMTPSA id 1F4FF509B6; Sat, 9 Jul 2016 03:51:17 -0400 (EDT)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\))
From: Sean Leonard <dev+ietf@seantek.com>
In-Reply-To: <3ce7dbb5-24d5-909e-27a2-f8447f9bf3f5@alum.mit.edu>
Date: Sat, 09 Jul 2016 00:51:16 -0700
Content-Transfer-Encoding: quoted-printable
Message-Id: <ABBDBEE6-C069-46A1-AE28-B05D7EA7A7AE@seantek.com>
References: <20160708121202.A3E5D12D19F@ietfa.amsl.com> <43d3ffea-57de-f7ef-e740-e448564008ed@alum.mit.edu> <bf12cdc4-8d17-d6aa-cc01-5afad19127ac@seantek.com> <92276351-a21c-302c-f0c8-7b4843c9b5f7@alum.mit.edu> <0781109A-9AFB-42AA-8828-DA5CDF38C377@seantek.com> <3ce7dbb5-24d5-909e-27a2-f8447f9bf3f5@alum.mit.edu>
To: Paul Kyzivat <pkyzivat@alum.mit.edu>
X-Mailer: Apple Mail (2.3124)
Archived-At: <https://mailarchive.ietf.org/arch/msg/abnf-discuss/q5cQB51ABHKvzycYvRgtkqM11jE>
Cc: Jonathan Hansford <jonathan@hansfords.net>, "draft-seantek-constrained-abnf@ietf.org" <draft-seantek-constrained-abnf@ietf.org>, "abnf-discuss@ietf.org" <abnf-discuss@ietf.org>
Subject: Re: [abnf-discuss] Mail regarding draft-seantek-constrained-abnf
X-BeenThere: abnf-discuss@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: "General discussion about tools, activities and capabilities involving the ABNF meta-language" <abnf-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/abnf-discuss>, <mailto:abnf-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/abnf-discuss/>
List-Post: <mailto:abnf-discuss@ietf.org>
List-Help: <mailto:abnf-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/abnf-discuss>, <mailto:abnf-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 09 Jul 2016 07:51:21 -0000

> On Jul 8, 2016, at 3:39 PM, Paul Kyzivat <pkyzivat@alum.mit.edu> wrote:
> 
> [...]
> 
> "Defined over X" could be a predicate that a verifier could decide for a given ABNF grammar. [...]
> 
> "Defined over ASCII" and "Defined over Unicode" could also be determined by analysis of the grammar.

Up to this point, that is determined by (human) analysis of the specification text. It’s worked out overall.

As much as I would like to see some more Unicode action, we already have two proposals on the table that I would like to focus on and get adopted & passed, before working on new stuff. The proposals are abnf-more-core-rules and constrained-abnf.

In abnf-more-core-rules, there already are mechanisms for incorporating Unicode, in the productions:
 UNICODE
 BEYONDASCII
 C1
 BEYONDC1 (to be included in next draft)

I suppose the following could be added:
 LATIN1
 BEYONDLATIN1

I submit that these mechanisms are probably “quite enough” for IETF usage. There is a strong bias in ABNF for ASCII, which basically reflects the institutional bias in the IETF for US-English and the history of the Internet’s development. I have yet to see an IETF standard that calls out a specific character beyond the ASCII range and gives it special semantics (other than NEL, and even then, NEL is not really used).


Sean