Re: Registration of media type application/calendar+xml

Keith Moore <moore@cs.utk.edu> Fri, 10 September 2010 05:08 UTC

Subject: Re: Registration of media type application/calendar+xml
From: Keith Moore <moore@cs.utk.edu>
In-Reply-To: <AANLkTinon97N3njcAV=FUj7-_ZJugazVCuaVbySbXr_L@mail.gmail.com>
Date: Fri, 10 Sep 2010 01:09:09 -0400
Message-Id: <193EC4D4-1B6C-4B14-ACD7-3237517566F5@cs.utk.edu>
References: <F842A373EE7E9C439CA07CCB01BBD1D0564C4899@TK5EX14MBXC138.redmond.corp.microsoft.com> <341B449F-7DFE-4A40-84B0-D008658A08DF@cs.utk.edu> <B0EA09C87A5701A94419DB8F@socrates.local> <673F57D3-B2EC-4ABF-B450-EEEA3A4C185A@cs.utk.edu> <AANLkTinon97N3njcAV=FUj7-_ZJugazVCuaVbySbXr_L@mail.gmail.com>
To: Phillip Hallam-Baker <hallam@gmail.com>
Cc: Douglass Mike <douglm@rpi.edu>, Cyrus Daboo <cyrus@daboo.name>, Alexey Melnikov <Alexey.Melnikov@isode.com>, ietf-types@iana.org, Steven Lees <Steven.Lees@microsoft.com>, IETF@ietf.org

On Sep 10, 2010, at 12:34 AM, Phillip Hallam-Baker wrote:

> The objections raised by Keith do not appear to me to fall under any of the requirements for MIME type registration set out in RFC 4288.

I didn't claim that they did.  I'm just taking the opportunity to ask that the proposal be voluntarily withdrawn.

> I disagree with the argument made in any case.
> 
> If you want to have a system in which 95% of your data structures are in XML you probably don't want to have to introduce a separate syntax and you most certainly do not want to deal with a separate data model for dealing with calendaring.

It's not at all clear that trying to coerce 95% of the data models in your system to be compatible with XML is a worthwhile goal.  The XML data model is tortured.   Trying to impose it on network protocols should be regarded as an act of violence.

At any rate, this proposal doesn't change the iCalendar data model - it just makes it harder to use.  If you have a beef with the iCalendar data model, feel free to try to come up with a better one.

> The iCalendar format represents a 1990s style approach to the problem. There is no real separation of syntax from the data model. Software developed in that manner is notoriously difficult to get right for the reasons that Keith describes. 

XML has lots of problems of its own.  I recently had to review a specification that referenced WS-I and WS-Security and a couple of thousand other pages of useless crap that went with them.  All for the sake of being able to transmit about six meaningful name-value pairs from a client to a server.  It was completely ridiculous.

I'm no fan of the iCalendar format, nor of the vCard and vCalendar formats that preceded it.  But for all of its purported generality (and perhaps because of it) XML has turned out to be no better, and in many ways worse.  It's amazing how hard people will work to make a simple idea complex, especially when the simple idea has become a bandwagon, or (in this case) a religion.

In principle, it would make sense for things to have a uniform syntax.  But XML gets this wrong in several different ways.  The most obvious is that XML data structures don't map well onto data structures supported by programming languages.   That's probably because SGML wasn't designed to do that - it was designed to mark up text.  Another problem is that XML confuses typing and naming.   Another problem (especially when mapping other structures to/from XML) is that the distinction between parameters and sub-elements is pretty much arbitrary.
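
A minimal Python sketch of that last point, assuming XML attributes play the role of parameters.  The element and attribute names here are invented for illustration and are not taken from the proposal under discussion.  Both documents carry the same iCalendar-style property (DTSTART;TZID=America/New_York:20100910T010909), yet each needs its own reader:

```python
import xml.etree.ElementTree as ET

# Two hypothetical encodings of one property with one parameter.
# Neither layout is taken from any real specification.
AS_ATTRIBUTE = '<dtstart tzid="America/New_York">20100910T010909</dtstart>'
AS_SUBELEMENT = (
    "<dtstart>"
    "<tzid>America/New_York</tzid>"
    "<value>20100910T010909</value>"
    "</dtstart>"
)


def read_attribute_form(xml_text):
    """Read the encoding that stores the parameter as an XML attribute."""
    elem = ET.fromstring(xml_text)
    return {"tzid": elem.get("tzid"), "value": elem.text}


def read_subelement_form(xml_text):
    """Read the encoding that stores the parameter as a child element."""
    elem = ET.fromstring(xml_text)
    return {"tzid": elem.findtext("tzid"), "value": elem.findtext("value")}


# The two readers recover identical information from different shapes.
same = read_attribute_form(AS_ATTRIBUTE) == read_subelement_form(AS_SUBELEMENT)
print(same)
```

Nothing in XML itself says which of the two shapes is the right one; that choice has to be made, and documented, by whoever defines the mapping.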


> XML is a substantial overhead if you are dealing with a single protocol but when you are dealing with multiple protocols the benefits are substantial and allow something like 70% of your coding effort to be pushed onto the platform layer. That means that you have 70% less new code and new code paths to contend with.

If your programmer is spending 70% of his coding time dealing with a presentation layer, even one as convoluted as iCalendar, you should fire him.  It's not like regular expression parsers are hard to come by these days.   Nor are libraries that can parse standard formats hard to come by.
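
As a rough illustration of how little code is involved, here is a sketch in Python that splits one iCalendar-style content line into name, parameters, and value with a regular expression.  It is deliberately simplified and is an assumption-laden example, not a conforming parser: it ignores line unfolding, quoted parameter values, and multi-valued parameters, and the function name parse_content_line is hypothetical.

```python
import re

# Simplified shape of a content line: NAME *(;PARAM=VALUE) : VALUE
# Assumption: no quoted parameter values and no folded lines.
CONTENT_LINE = re.compile(
    r"^(?P<name>[A-Za-z0-9-]+)"      # property name
    r"(?P<params>(?:;[^;:]+)*)"      # zero or more ;key=value parameters
    r":(?P<value>.*)$"               # everything after the first unparsed colon
)


def parse_content_line(line):
    """Return (name, params, value) for one unfolded content line."""
    match = CONTENT_LINE.match(line)
    if match is None:
        raise ValueError("not a content line: %r" % line)
    params = {}
    # The params group starts with ';', so the first split item is empty.
    for item in match.group("params").split(";")[1:]:
        key, _, val = item.partition("=")
        params[key.upper()] = val
    return match.group("name").upper(), params, match.group("value")


parsed = parse_content_line("DTSTART;TZID=America/New_York:20100910T010909")
print(parsed)
```

A production implementation would more likely use an existing iCalendar library than a hand-written pattern like this one.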

Another of the big problems with the XML religion is that its adherents have the mistaken impression that defining the syntax is most of the work of defining a protocol - so that once you decide to use XML, most of the work is done.  Apparently, semantics don't matter much.

Another problem with XML is that it makes data models so easy to extend (just add more element definitions) that people often don't take due care in defining their data models.

> One of the discoveries of the mid 1990s was that yacc and LR(1) grammars are no more useful for describing computer languages than they are for describing natural languages.

That's a ridiculous statement.  (Okay, maybe strictly speaking LR(1) isn't quite enough, but it's close enough for most computer languages that you can usually make an LR(1) parser work.)  Computer languages don't need the same kind of expressive power that natural languages have.  Natural languages have to allow for a certain amount of ambiguity, but that's a liability in computer languages.

> The most useful feature of a computer grammar is regularity and consistency. XML enforces a high degree of consistency.

It enforces consistency in syntax. Taken by itself, that's a good thing.  But when you parse XML you don't get a very usable data structure.  You get a mess.  And once you do the work of transforming that data structure into an effective internal representation, you've negated whatever advantage you might have found in not having to write a parser/generator for it.  You haven't solved the problem of needing a parser and generator - you've just moved it.  Instead of parsing text you're parsing a DOM structure.  You've added an extra layer or two for no benefit.
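
A small Python sketch of that second pass, under stated assumptions: the vevent, summary, and dtstart element names and the Event class are invented for illustration and do not follow any real XML calendar schema.  The XML library hands back a generic tree, and the application still has to walk it, check its shape, and convert strings into typed fields:

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Event:
    """Hypothetical internal representation the application actually wants."""
    summary: str
    start: datetime


DOC = """<vevent>
  <summary>Review</summary>
  <dtstart>2010-09-10T01:09:09</dtstart>
</vevent>"""


def event_from_tree(root):
    """Second-stage 'parser': turn the generic element tree into an Event."""
    if root.tag != "vevent":
        raise ValueError("unexpected root element: %s" % root.tag)
    summary = root.findtext("summary")
    start_text = root.findtext("dtstart")
    if summary is None or start_text is None:
        raise ValueError("missing required child element")
    # The tree only holds strings; typing is still the application's job.
    return Event(summary=summary.strip(),
                 start=datetime.fromisoformat(start_text.strip()))


event = event_from_tree(ET.fromstring(DOC))
print(event)
```

The validation and type conversion in event_from_tree is the same work a text parser would have done; it has only moved from the character level to the tree level.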

You're essentially arguing that the syntax by which data is represented on the wire - the presentation layer - should constrain how data is represented internally in a system.  And then you're arguing that the particular constraints imposed by XML are appropriate constraints.  That's brain damage.

> Now I would quite prefer to take about 50% or more of the XML spec and discard it. They did a good job of taking out the most insane features of SGML but there is much more cruft that could be cut out. But that does not change the fact that using XML as is produces clearer specifications that are more likely to be implemented without errors than with the 1990s approach.
> 
As is often the case, you're simply and utterly incorrect.

Let's stamp out XML in our lifetime.  Even FORTRAN deserves to live longer.

Keith