Re: [ietf-types] The application/www-form-urlencoded format

Bjoern Hoehrmann <derhoermi@gmx.net> Sun, 26 September 2010 20:32 UTC

Return-Path: <derhoermi@gmx.net>
X-Original-To: ietf-types@core3.amsl.com
Delivered-To: ietf-types@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id A9FB33A6AE0 for <ietf-types@core3.amsl.com>; Sun, 26 Sep 2010 13:32:12 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.897
X-Spam-Level:
X-Spam-Status: No, score=-2.897 tagged_above=-999 required=5 tests=[AWL=-0.298, BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id QBmQaxi1Adyb for <ietf-types@core3.amsl.com>; Sun, 26 Sep 2010 13:32:10 -0700 (PDT)
Received: from mail.gmx.net (mailout-de.gmx.net [213.165.64.23]) by core3.amsl.com (Postfix) with SMTP id 6FD7E3A6AB3 for <ietf-types@ietf.org>; Sun, 26 Sep 2010 13:32:08 -0700 (PDT)
Received: (qmail invoked by alias); 26 Sep 2010 20:32:44 -0000
Received: from dslb-094-223-218-126.pools.arcor-ip.net (EHLO hive) [94.223.218.126] by mail.gmx.net (mp007) with SMTP; 26 Sep 2010 22:32:44 +0200
X-Authenticated: #723575
X-Provags-ID: V01U2FsdGVkX1/fSz14u2/fA0hdtTMZf0vrl/rZl7sGp/GU+VAq88 tiNz9RFNhUKUtL
From: Bjoern Hoehrmann <derhoermi@gmx.net>
To: "Stephen D. Williams" <sdw@lig.net>
Date: Sun, 26 Sep 2010 22:32:42 +0200
Message-ID: <4nav96tkh50itct882833rbjamou0r2jf0@hive.bjoern.hoehrmann.de>
References: <k1os96p03o78p78490hei104biadpiepit@hive.bjoern.hoehrmann.de> <op.vjmuz10364w2qv@anne-van-kesterens-macbook-pro.local> <4C9FA0BB.8090106@lig.net>
In-Reply-To: <4C9FA0BB.8090106@lig.net>
X-Mailer: Forte Agent 3.3/32.846
MIME-Version: 1.0
Content-Type: text/plain; charset="ISO-8859-1"
Content-Transfer-Encoding: 8bit
X-Y-GMX-Trusted: 0
Cc: ietf-types@ietf.org
Subject: Re: [ietf-types] The application/www-form-urlencoded format
X-BeenThere: ietf-types@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: "Media \(MIME\) type review" <ietf-types.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/ietf-types>, <mailto:ietf-types-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ietf-types>
List-Post: <mailto:ietf-types@ietf.org>
List-Help: <mailto:ietf-types-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-types>, <mailto:ietf-types-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 26 Sep 2010 20:32:13 -0000

* Stephen D. Williams wrote:
>  On 9/26/10 2:22 AM, Anne van Kesteren wrote:
>> On Sat, 25 Sep 2010 23:14:39 +0200, Bjoern Hoehrmann <derhoermi@gmx.net> wrote:
>>>   http://tools.ietf.org/html/draft-hoehrmann-urlencoded -- the draft de-
>>> scribes the application/www-form-urlencoded format, a variant of the
>>> application/x-www-form-urlencoded format first described in RFC 1866.
>
>What the character encoding errors are at the end of section 5 needs to
>be explained, at least for 2 cases.

I believe the first encodes a pair of surrogate code points, the second
a character beyond U+10FFFF, the first is an overlong sequence, then it
is a truncated sequence, and finally you have an illegal starter with
ASCII after it. They are all prohibited by the definition of UTF-8. I do
not think that kind of technical detail would be well-placed there, but
I could rephrase so it refers to UTF-8 directly instead of calling it
"character encoding errors" though that's the same thing here.
-- 
Björn Höhrmann · mailto:bjoern@hoehrmann.de · http://bjoern.hoehrmann.de
Am Badedeich 7 · Telefon: +49(0)160/4415681 · http://www.bjoernsworld.de
25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/