Re: Delta Compression and UTF-8 Header Values

Phillip Hallam-Baker <hallam@gmail.com> Mon, 11 February 2013 01:52 UTC

Return-Path: <ietf-http-wg-request@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C892521F8808 for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Sun, 10 Feb 2013 17:52:03 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -8.868
X-Spam-Level:
X-Spam-Status: No, score=-8.868 tagged_above=-999 required=5 tests=[AWL=1.578, BAYES_00=-2.599, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-8, SARE_SUB_ENC_UTF8=0.152]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id r6392zvXtzpy for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Sun, 10 Feb 2013 17:52:03 -0800 (PST)
Received: from frink.w3.org (frink.w3.org [128.30.52.56]) by ietfa.amsl.com (Postfix) with ESMTP id 52F9521F8804 for <httpbisa-archive-bis2Juki@lists.ietf.org>; Sun, 10 Feb 2013 17:52:01 -0800 (PST)
Received: from lists by frink.w3.org with local (Exim 4.72) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1U4iXI-00057V-QQ for ietf-http-wg-dist@listhub.w3.org; Mon, 11 Feb 2013 01:50:24 +0000
Resent-Date: Mon, 11 Feb 2013 01:50:24 +0000
Resent-Message-Id: <E1U4iXI-00057V-QQ@frink.w3.org>
Received: from lisa.w3.org ([128.30.52.41]) by frink.w3.org with esmtp (Exim 4.72) (envelope-from <hallam@gmail.com>) id 1U4iXB-00056O-Eb for ietf-http-wg@listhub.w3.org; Mon, 11 Feb 2013 01:50:17 +0000
Received: from mail-we0-f169.google.com ([74.125.82.169]) by lisa.w3.org with esmtps (TLS1.0:RSA_ARCFOUR_SHA1:16) (Exim 4.72) (envelope-from <hallam@gmail.com>) id 1U4iXA-0006mT-C9 for ietf-http-wg@w3.org; Mon, 11 Feb 2013 01:50:17 +0000
Received: by mail-we0-f169.google.com with SMTP id t11so4722898wey.28 for <ietf-http-wg@w3.org>; Sun, 10 Feb 2013 17:49:50 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:cc:content-type; bh=C6OTj+Mrc11NBZgAntDmEquPOdeOPH2Jkn6Dhj9ht8s=; b=MU0AcYFitRgSYXVjWsmD/VcNeFD08ZJwM06732ZtgQfvnBmi4ETRnpq2mLnCxn0Qkl nTo6/qqzDIJkoGhNnCz7IHVh4AXicPnTOK373NVKDr62D6nR4H4mOicnWJuCd1CRPMHB ZDzy/hJueiQe9fBOpJidjD+lJeJ3bDIpaixcNyCXIVD+vEYvMYri9P3I2YqVXOxZB7uu EpftD+DqqIPX/zQJRtdAZrspQChQsbptOcvbnQ26g2g5Sd+XE6KnDxMYypxkgDr+B+gS +rC4f2Kj2Ye7k4qX7mV7yg9iLQBQSEyBDjPiI4Lx1QS0ZC0uQ+GDFzpw5H/Pw6GNrWpy 6GYA==
MIME-Version: 1.0
X-Received: by 10.180.81.164 with SMTP id b4mr13077973wiy.34.1360547389963; Sun, 10 Feb 2013 17:49:49 -0800 (PST)
Received: by 10.194.153.104 with HTTP; Sun, 10 Feb 2013 17:49:49 -0800 (PST)
In-Reply-To: <84523.1360541730@critter.freebsd.dk>
References: <CABP7RbfRLXPpL4=wip=FvqD3DM7BM8PXi7uRswHAusXUmPO_xw@mail.gmail.com> <6372.1360352116@critter.freebsd.dk> <51164503.2030709@it.aoyama.ac.jp> <58832.1360414202@critter.freebsd.dk> <511726A5.5030302@it.aoyama.ac.jp> <79576.1360488507@critter.freebsd.dk> <51176C95.1040308@gmx.de> <79780.1360491855@critter.freebsd.dk> <CACuKZqHMQdktfOU3PJC=X-G8R=BQ40bhFJw=ZTfeSpem9L=GEw@mail.gmail.com> <CAK3OfOi+cXMLGsMCpD1cRBxzz46wVYYj8nz021fhqhM7fTDMWA@mail.gmail.com> <84523.1360541730@critter.freebsd.dk>
Date: Sun, 10 Feb 2013 20:49:49 -0500
Message-ID: <CAMm+LwjJ2Z03jsjfZCWsqgi75t5XNRgSgXY9QAudvXz08CPUDQ@mail.gmail.com>
From: Phillip Hallam-Baker <hallam@gmail.com>
To: Poul-Henning Kamp <phk@phk.freebsd.dk>
Cc: Nico Williams <nico@cryptonector.com>, Zhong Yu <zhong.j.yu@gmail.com>, Julian Reschke <julian.reschke@gmx.de>, "\"Martin J. Dürst\"" <duerst@it.aoyama.ac.jp>, James M Snell <jasnell@gmail.com>, "ietf-http-wg@w3.org" <ietf-http-wg@w3.org>
Content-Type: multipart/alternative; boundary="14dae9cc9fbee0fd0504d5691f36"
Received-SPF: pass client-ip=74.125.82.169; envelope-from=hallam@gmail.com; helo=mail-we0-f169.google.com
X-W3C-Hub-Spam-Status: No, score=-3.1
X-W3C-Hub-Spam-Report: AWL=-2.252, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001
X-W3C-Scan-Sig: lisa.w3.org 1U4iXA-0006mT-C9 3d8d79699fc6a07069be8c1e770b1304
X-Original-To: ietf-http-wg@w3.org
Subject: Re: Delta Compression and UTF-8 Header Values
Archived-At: <http://www.w3.org/mid/CAMm+LwjJ2Z03jsjfZCWsqgi75t5XNRgSgXY9QAudvXz08CPUDQ@mail.gmail.com>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/16543
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

On Sun, Feb 10, 2013 at 7:15 PM, Poul-Henning Kamp <phk@phk.freebsd.dk>wrote:

> Content-Type: text/plain; charset=ISO-8859-1
> --------
> In message <
> CAK3OfOi+cXMLGsMCpD1cRBxzz46wVYYj8nz021fhqhM7fTDMWA@mail.gmail.com>
> , Nico Williams writes:
>
> >> But how does the 2 ends agree on which encoding to use? It might be
> >> easier if HTTP just dictate UTF-8.
> >
> >Not might be.  Will be.
>
> Really ?
>
> I have a hard time squaring that with the "HTTP/2 is just a transport
> protocol, we don't change the semantics" credo that was waved around
> rather forcefully previously ?
>
> And if we are going to change semantics, shouldn't we change the
> ones that really matter[1] ?
>
> Poul-Henning
>
> [1] We can probably do much more for transmission efficiency by killing
> cookies and adding client provided session-identifieres, than any
> kind of encoding or compression will ever be able to...[2]
>
> [2] Not to mention the improved privacy and legal compliance that
> would automatically buy everybody...


Changing from US-ASCII encoded URIs to UTF8 encoded IRIs would seem like a
syntax issue to me, not semantics at all.

Since we are looking at coding efficiency it seems to me that we should at
least be open to specifying the length of the URI by means of a length
prefix rather than a space or CRLF separator. So I can't see a reason not
to consider UTF8 encoding of the URL which is only a syntax change when all
is said and done. Giving it a different name does not make it semantic.


Since the world is moving to UTF8, it would seem that getting HTTP fully in
sync with UTF8 would be the main motive for applications to move to HTTP2.
Unless we are looking for a full employment and a HTTP3 to follow.


-- 
Website: http://hallambaker.com/