Re: [http-state] non-ASCII cookie values (was Re: Closing Ticket 3: Public Suffixes)

Maciej Stachowiak <mjs@apple.com> Tue, 02 February 2010 01:24 UTC

Return-Path: <mjs@apple.com>
X-Original-To: http-state@core3.amsl.com
Delivered-To: http-state@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 6A23E3A693E for <http-state@core3.amsl.com>; Mon, 1 Feb 2010 17:24:30 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -106.599
X-Spam-Level:
X-Spam-Status: No, score=-106.599 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id RB2fgPnG19gl for <http-state@core3.amsl.com>; Mon, 1 Feb 2010 17:24:29 -0800 (PST)
Received: from mail-out4.apple.com (mail-out4.apple.com [17.254.13.23]) by core3.amsl.com (Postfix) with ESMTP id 93A1C3A6767 for <http-state@ietf.org>; Mon, 1 Feb 2010 17:24:29 -0800 (PST)
Received: from relay16.apple.com (relay16.apple.com [17.128.113.55]) by mail-out4.apple.com (Postfix) with ESMTP id E918389B2B66 for <http-state@ietf.org>; Mon, 1 Feb 2010 17:25:05 -0800 (PST)
X-AuditID: 11807137-b7bd4ae000000f0d-af-4b677ef1434a
Received: from elliott.apple.com (elliott.apple.com [17.151.62.13]) by relay16.apple.com (Apple SCV relay) with SMTP id 00.33.03853.1FE776B4; Mon, 1 Feb 2010 17:25:05 -0800 (PST)
MIME-version: 1.0
Content-transfer-encoding: 7bit
Content-type: text/plain; charset="us-ascii"
Received: from [17.246.17.218] by elliott.apple.com (Sun Java(tm) System Messaging Server 6.3-7.04 (built Sep 26 2008; 32bit)) with ESMTPSA id <0KX6001K8X9TKS90@elliott.apple.com> for http-state@ietf.org; Mon, 01 Feb 2010 17:25:05 -0800 (PST)
From: Maciej Stachowiak <mjs@apple.com>
In-reply-to: <7789133a1002011014x5d587436j663a73bc92270a65@mail.gmail.com>
Date: Mon, 01 Feb 2010 17:25:05 -0800
Message-id: <E1E6C8DE-EFB6-4226-93EE-AF20053FF315@apple.com>
References: <7789133a1002011014x5d587436j663a73bc92270a65@mail.gmail.com>
To: Adam Barth <ietf@adambarth.com>
X-Mailer: Apple Mail (2.1077)
X-Brightmail-Tracker: AAAAAQAAAZE=
Cc: eric bianchetti <eric_bianchetti@yahoo.com>, http-state@ietf.org
Subject: Re: [http-state] non-ASCII cookie values (was Re: Closing Ticket 3: Public Suffixes)
X-BeenThere: http-state@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Discuss HTTP State Management Mechanism <http-state.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/http-state>, <mailto:http-state-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/http-state>
List-Post: <mailto:http-state@ietf.org>
List-Help: <mailto:http-state-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/http-state>, <mailto:http-state-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 02 Feb 2010 01:24:30 -0000

On Feb 1, 2010, at 10:14 AM, Adam Barth wrote:

> On Sun, Jan 31, 2010 at 2:37 PM, eric bianchetti
> <eric_bianchetti@yahoo.com> wrote:
>> That part does not please :
>> 
>> The cookie-value is opaque to the user agent and MAY be anything the
>>    origin server chooses to send, possibly in a server-selected
>>    printable ASCII encoding.
>> 
>> Livng and working in a non ASCII country, I tend to think we shall prepare for the coming of the other languages (Thai, Chines, Korean ....), IF a person get a cookie from a Thai server , can we securely suppose that person(computer) went to a thai site, and that person is using Thai on a daily basis? (Replace Thai by any multi bytes languages).
> 
> The part of that sentence after the "possibly" doesn't haven any
> normative force (it's just advice that the server can take or leave).
> I can remove the reference to ASCII here if you like.  Julian please
> correct me if I'm wrong, but I believe that HTTP headers typically
> contain only ASCII characters.

We do have to define what happens if there are non-ASCII characters in a header value. Presumably, if such bytes are set via Set-Cookie, then the Cookie header just echoes them back, so we can treat them as opaque. (That's assuming there is no requirement to reject non-ASCII cookie values.) But there are two ways in which it matters:

1) A header with non-ASCII bytes get set via Set-Cookie, then read through a JS API such as document.cookie. document.cookie gives a UTF-16 encoded string, so at this point the server has to decide how to interpret non-ASCII bytes in the cookie value.

2) If you set a cookie via document.cookie and include non-ASCII characters in the value, what bytes get sent?

Regards,
Maciej