Re: obs-text character encoding and error handling; duplicate parameter names in Content-Type

"Peter Occil" <poccil14@gmail.com> Sat, 25 May 2013 07:32 UTC

Return-Path: <ietf-http-wg-request@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 0087421F9349 for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Sat, 25 May 2013 00:32:03 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -10.171
X-Spam-Level:
X-Spam-Status: No, score=-10.171 tagged_above=-999 required=5 tests=[AWL=0.427, BAYES_00=-2.599, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-8]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Hgy2mKC5O7qE for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Sat, 25 May 2013 00:31:56 -0700 (PDT)
Received: from frink.w3.org (frink.w3.org [128.30.52.56]) by ietfa.amsl.com (Postfix) with ESMTP id 615D121F8425 for <httpbisa-archive-bis2Juki@lists.ietf.org>; Sat, 25 May 2013 00:31:56 -0700 (PDT)
Received: from lists by frink.w3.org with local (Exim 4.72) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1Ug8wL-0003pS-Bb for ietf-http-wg-dist@listhub.w3.org; Sat, 25 May 2013 07:30:57 +0000
Resent-Date: Sat, 25 May 2013 07:30:57 +0000
Resent-Message-Id: <E1Ug8wL-0003pS-Bb@frink.w3.org>
Received: from maggie.w3.org ([128.30.52.39]) by frink.w3.org with esmtp (Exim 4.72) (envelope-from <poccil14@gmail.com>) id 1Ug8w8-0003ns-C9 for ietf-http-wg@listhub.w3.org; Sat, 25 May 2013 07:30:44 +0000
Received: from mail-ye0-f182.google.com ([209.85.213.182]) by maggie.w3.org with esmtps (TLS1.0:RSA_ARCFOUR_SHA1:16) (Exim 4.72) (envelope-from <poccil14@gmail.com>) id 1Ug8w3-0004fh-LX for ietf-http-wg@w3.org; Sat, 25 May 2013 07:30:44 +0000
Received: by mail-ye0-f182.google.com with SMTP id h13so483986yee.13 for <ietf-http-wg@w3.org>; Sat, 25 May 2013 00:30:14 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:from:to:references:in-reply-to:subject:date:mime-version :content-type:x-priority:x-msmail-priority:importance:x-mailer :x-mimeole; bh=wzKHw1Msd32G0zb/s2RUEzyUmBQiAvDhy7Pyh2/+eoI=; b=bZcao8PmJOOZ9mxLFVQXQF6mprb7QN8ouB0lRym9pm2s6t5IJltJrG1TFfZUOzW3jP VtCtvqTW6WK3PJCGFmwEBu/B//wKjZXNXLFmc32ltd+aIYw+Sq4OuEFLEIu+px9FBA8S lSqV7umTZVr1DO0MAsyGHcZLaDfGMha2xIdexSHKtzPLpcLEX6CqR2Pyj7wMZgBOc0dD AoA/9XvSr6OxpSRCNl60hAttm68m1LXry2lJh5Z6/BUDPk0KEvEdiJ3FV4c9/ZoNdB07 Bi9BpXUJnwoC7cY7Yt9XJJKtN8tCjFvdgzPsPPKv+w1WS2EfjATBKX4zOW//vhhaJD/d SL/w==
X-Received: by 10.236.28.226 with SMTP id g62mr11555956yha.10.1369467013943; Sat, 25 May 2013 00:30:13 -0700 (PDT)
Received: from PeterPC (c-76-119-210-197.hsd1.ma.comcast.net. [76.119.210.197]) by mx.google.com with ESMTPSA id d91sm28391334yhq.16.2013.05.25.00.30.12 for <ietf-http-wg@w3.org> (version=TLSv1 cipher=RC4-SHA bits=128/128); Sat, 25 May 2013 00:30:13 -0700 (PDT)
Message-ID: <83535E5464C242B1A0612D300F4F87CF@PeterPC>
From: Peter Occil <poccil14@gmail.com>
To: HTTP Working Group <ietf-http-wg@w3.org>
References: <F2550CB07E9B440F9AC001D3038A634C@PeterPC>
In-Reply-To: <F2550CB07E9B440F9AC001D3038A634C@PeterPC>
Date: Sat, 25 May 2013 03:30:06 -0400
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_NextPart_000_01C3_01CE58F8.26F72D70"
X-Priority: 3
X-MSMail-Priority: Normal
Importance: Normal
X-Mailer: Microsoft Windows Live Mail 15.4.3555.308
X-MimeOLE: Produced By Microsoft MimeOLE V15.4.3555.308
Received-SPF: pass client-ip=209.85.213.182; envelope-from=poccil14@gmail.com; helo=mail-ye0-f182.google.com
X-W3C-Hub-Spam-Status: No, score=-2.8
X-W3C-Hub-Spam-Report: AWL=-2.220, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001
X-W3C-Scan-Sig: maggie.w3.org 1Ug8w3-0004fh-LX 212a2e77092f6d75c0876e08d375448e
X-Original-To: ietf-http-wg@w3.org
Subject: Re: obs-text character encoding and error handling; duplicate parameter names in Content-Type
Archived-At: <http://www.w3.org/mid/83535E5464C242B1A0612D300F4F87CF@PeterPC>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/18087
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

On issue 1, I guess I was only reading the 22 version; in the latest version it says “Recipients SHOULD treat other octets in field content (obs-text) as opaque data”. If that’s the case, it should also say that the behavior of converting field values containing obs-text to Unicode strings (particularly parameter values in Content-Type) is undefined.

Issue 2 still stands.

From: Peter Occil 
Sent: Saturday, May 25, 2013 3:14 AM
To: HTTP Working Group 
Subject: obs-text character encoding and error handling; duplicate parameter names in Content-Type

obs-text character encoding and error handling; duplicate parameter names in Content-Type

I have two issues.

1. obs-text character encoding and error handling

What is the character encoding used when a header field value contains obs-text, and particularly
parameter values in Content-Type?  Is it ISO-8859-1, UTF-8, or something else?  Or is the encoding
  undefined?  Error handling rules for obs-text, unlike for obs-fold, are also absent.

2. Duplicate parameter names in Content-Type

Suppose that the following Content-Type is received:

    text/html; charset=iso-8859-1; charset=utf-8
What is the resulting value of the charset parameter?  Is it iso-8859-1, utf-8, an error, or undefined?
(This issue also applies to Content-Disposition, Accept, and other header fields that use parameters.)
--Peter