Re: [apps-discuss] I-D Action: draft-ietf-appsawg-xml-mediatypes-07.txt

Tim Bray <tbray@textuality.com> Sat, 08 February 2014 19:46 UTC

Return-Path: <tbray@textuality.com>
X-Original-To: apps-discuss@ietfa.amsl.com
Delivered-To: apps-discuss@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B59E21A018C for <apps-discuss@ietfa.amsl.com>; Sat, 8 Feb 2014 11:46:04 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.977
X-Spam-Level:
X-Spam-Status: No, score=-1.977 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, FM_FORGED_GMAIL=0.622, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id xsNQ7hdhCPfa for <apps-discuss@ietfa.amsl.com>; Sat, 8 Feb 2014 11:46:01 -0800 (PST)
Received: from mail-vb0-f45.google.com (mail-vb0-f45.google.com [209.85.212.45]) by ietfa.amsl.com (Postfix) with ESMTP id 4FD671A00FE for <apps-discuss@ietf.org>; Sat, 8 Feb 2014 11:46:01 -0800 (PST)
Received: by mail-vb0-f45.google.com with SMTP id m10so3718985vbh.32 for <apps-discuss@ietf.org>; Sat, 08 Feb 2014 11:46:01 -0800 (PST)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:cc:content-type; bh=6q5B4Igt2tVg2wfR/4VioG7lJCsvprTAA2EMQx2Ned4=; b=IXb1h+EMrwKietuH/3qJbgbIv4yuBhI7ubS+6/aHpzn1vzBFzfIW/gVtoje971jsLW c4Qz9COoD5raEmEs5hw6LrW6n5kkHi9R5LacJPAmT6Ro7jzILSe/A7YeUMduOT7WI4F6 pYSDOl+hKpj9KvtQQ+8ULZKl1CLvCDuCqN29qQ3XiLbDkc0YOVcGDKafQHvuxQC3nu3F enpQclLNlDoZV1zHzBtQkFpCijK1YwV0IH+xN0eSQQTOCYPsYqcj9+0BkergV5a4k+5v whDo8VHnwDQG94awo6w5S9ak2RLBgieZcJAgC+hFxhDiayUGuHLIWG1ZVuMsatVHxzC4 U/wg==
X-Gm-Message-State: ALoCoQlA8xaQaqCJcfsVdyZnFtKMRpIaF6uQIEc1nlh6HqO32YXyU6GoVSjS2X4xWc2BkXco3yy1
MIME-Version: 1.0
X-Received: by 10.58.37.67 with SMTP id w3mr2271555vej.22.1391888761392; Sat, 08 Feb 2014 11:46:01 -0800 (PST)
Received: by 10.220.98.73 with HTTP; Sat, 8 Feb 2014 11:46:01 -0800 (PST)
X-Originating-IP: [24.84.235.32]
In-Reply-To: <f5bsirvjf27.fsf@troutbeck.inf.ed.ac.uk>
References: <20140206183642.28098.24139.idtracker@ietfa.amsl.com> <f5bsirvjf27.fsf@troutbeck.inf.ed.ac.uk>
Date: Sat, 08 Feb 2014 11:46:01 -0800
Message-ID: <CAHBU6iuTLWFDV-2qM-FDMQeK1ONS8x4hUOGg2ssYRXQGnTZNTA@mail.gmail.com>
From: Tim Bray <tbray@textuality.com>
To: "Henry S. Thompson" <ht@inf.ed.ac.uk>
Content-Type: multipart/alternative; boundary="089e013a0d4230ab3d04f1ea5b91"
Cc: "apps-discuss@ietf.org" <apps-discuss@ietf.org>
Subject: Re: [apps-discuss] I-D Action: draft-ietf-appsawg-xml-mediatypes-07.txt
X-BeenThere: apps-discuss@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: General discussion of application-layer protocols <apps-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/apps-discuss>, <mailto:apps-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/apps-discuss/>
List-Post: <mailto:apps-discuss@ietf.org>
List-Help: <mailto:apps-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/apps-discuss>, <mailto:apps-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 08 Feb 2014 19:46:04 -0000

Disclosure: This is the first draft of xml-mediatypes I’ve read in years,
so probably is a little lacking in context.

2.2: “[UNICODE] defines three "encoding forms", which are independent of
serialization” - what does “independent of serialization” mean?  I think
the UTF-* are actually serializations of unicode codepoints.  I suppose
UTF-16 is sort-of semi-independent of serialization, but UTF-8 never is.

2nd last para of 3.1, beginning “XML MIME producers are RECOMMENDED to
provide means for XML MIME entity authors to determine what value” baffles
me.  I just read it 3 times and I don’t get it.   Could we have an example
or something?  I also think I disagree with my guess as what it’s trying to
say.  I tend to think the tools are going to do a better job of figuring
out the right charset labeling than your typical document author.

The crucial “ this specification sets the priority as follows:” indented
para in 3.2.  I think a little more is needed. The crucial corner case is
when you’ve got a MIME-header charset that is just wrong but an XML-aware
receiver can in fact sort things out based on the encoding declaration.
 What does being “authoritative” mean concretely? Is it the RFC’s
recommendation that the receiver SHOULD refuse to parse the the XML even
though it could?  If so, we should say so explicitly.

Then in the example in 9.8, draft says “all processors will treat the
enclosed entity as iso-8859-1 encoded.   That is, the "UTF-8" encoding
declaration will be ignored.”  Is this really true in practice?  I suspect
not; so perhaps you should say “all processors which conform to this
specification will”.  Hm, or perhaps the real issue is that in this case,
you can’t predict what will happen; some implementations will ignore the
MIME header, others will drop-kick the XML because of the inconsistency.

Section 3.3 typo, “thatUTF-16”, space needed, also “entitiesnot” in the
same sentence.

Also some spacing problems in the NOTE: in 8.1









On Fri, Feb 7, 2014 at 4:13 AM, Henry S. Thompson <ht@inf.ed.ac.uk> wrote:

> As usual, a disposition of comments against the previous draft is
> available at
>
>   http://www.w3.org/XML/2012/10/3023bis/06-comments.html
>
> and an author's diff is at
>
>
> http://www.w3.org/XML/2012/10/3023bis/draft-ietf-appsawg-xml-mediatypes-07_diff.html
>
> This draft has few changes from version 6, almost all as a result of a
> review by Larry Masinter, for which thanks.
>
> The only change even close to substantive is a summary RECOMMENDATION
> near the beginning of section 3, Encoding Considerations, which says
>
>   The use of UTF-8, without a BOM, is RECOMMENDED for all XML MIME
> entities.
>
> See the 06-comments document for a summary of the discussion about
> this.
>
> The level of comments on this draft has shifted from substantive to
> rhetorical/editorial, and I think it's now pretty much fully-baked.
> I'd very much welcome some endorsements of its readiness, so that we
> can move it up and out of here.
>
> ht
> --
>        Henry S. Thompson, School of Informatics, University of Edinburgh
>       10 Crichton Street, Edinburgh EH8 9AB, SCOTLAND -- (44) 131 650-4440
>                 Fax: (44) 131 650-4587, e-mail: ht@inf.ed.ac.uk
>                        URL: http://www.ltg.ed.ac.uk/~ht/
>  [mail from me _always_ has a .sig like this -- mail without it is forged
> spam]
> _______________________________________________
> apps-discuss mailing list
> apps-discuss@ietf.org
> https://www.ietf.org/mailman/listinfo/apps-discuss
>