Re: If not JSON, what then ?

Mark Nottingham <mnot@mnot.net> Tue, 02 August 2016 12:45 UTC

Resent-Date: Tue, 02 Aug 2016 12:41:54 +0000
Resent-Message-Id: <E1bUZ18-0005eo-0r@frink.w3.org>
Content-Type: text/plain; charset="us-ascii"
Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\))
From: Mark Nottingham <mnot@mnot.net>
In-Reply-To: <20160802115355.GD32124@1wt.eu>
Date: Tue, 02 Aug 2016 14:41:19 +0200
Cc: Poul-Henning Kamp <phk@phk.freebsd.dk>, HTTP Working Group <ietf-http-wg@w3.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <ECE83331-ACDD-42E7-B99C-3E4E4C66DD13@mnot.net>
References: <77778.1470037414@critter.freebsd.dk> <12ED69B4-C924-475E-9432-B8FEB4B9DF80@mnot.net> <20160802115355.GD32124@1wt.eu>
To: Willy Tarreau <w@1wt.eu>
Received-SPF: pass client-ip=216.86.168.183; envelope-from=mnot@mnot.net; helo=mxout-08.mxes.net
Subject: Re: If not JSON, what then ?
Archived-At: <http://www.w3.org/mid/ECE83331-ACDD-42E7-B99C-3E4E4C66DD13@mnot.net>
Resent-From: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list

> On 2 Aug 2016, at 1:53 PM, Willy Tarreau <w@1wt.eu> wrote:
> 
> Hi Mark,
> 
> On Tue, Aug 02, 2016 at 01:33:39PM +0200, Mark Nottingham wrote:
>> 1) Using the first character of the field-value as a signal that the encoding
>> is in use is interesting. I was thinking of indicating it with a suffix on
>> the header field name (e.g., Date-J). Either is viable, but I don't think
>> it's a good idea to reuse existing header field names and rely on that signal
>> to differentiate the value type; that seems like it would cause a lot of
>> interop problems to me. Defining a new header field (whether it's Date-J or
>> Date2 or whatever) seems much safer to me.
> 
> I had the same feeling initially but I retracted. I fear that having two
> header fields will result in inconsistencies between the two (possibly
> intentional when that may be used to benefit an attacker). We'd rather
> avoid reproducing the Proxy-Connection vs Connection mess we've been seeing
> for a decade, where both were sent "just in case".

I know, I don't like it either. I'm just concerned that if we keep the name the same, it's much more likely it's going to not be properly converted, and that could enable attacks too.

Stepping back, I think we're talking about a set of rules something like this;

A. For a newly defined header field that explicitly uses the new format, send it in the new format
B. For existing header fields, if their expression in the new format is defined:
  1. If you have evidence that your peer can accept the new header format, send them in the new format
  2. Otherwise, send them in the original format.
C. All other fields are always sent in the original, HTTP/1 format.

I.e., having both versions of a single header's semantics the wire at the same time is an error.

This means that the format of those headers is effectively a hop-by-hop attribute; you might have a situation where a non-format-aware node forces the hops surrounding it back to the original format (for headers with two different ways to express those semantics).

This gives me pause. Converting from new to old and back to new is very likely to tickle a lot of bugs and cause a lot of interop problems. So, we could say that conversion only happens as a downgrade; i.e., if the next hop doesn't support the encoding, you can downgrade, but you never upgrade it again to the new encoding.

Presumably, the last "hop" might be inside the origin server, when it converts those header fields into the old format for backwards compatibility with existing applications that aren't aware of the new format.

Applications that *are* aware of the new format will still need to handle the original format, because there will be clients / hops generating it for the foreseeable future. 

This kind of seems like a mess to me, and leads me to think that the only time we should attempt this is during a major protocol revision (i.e., h3), and even then, with great trepidation.

If we get that far, deciding how to signal which headers are encoded seems more manageable :)

> However if we enumerate certain header fields that would deserve being
> encoded differently and find a way to group them, we may think about
> sending a composite, compact header field for transport/routing, another
> one for the entity where available information are grouped when relevant.
> Then maybe it could be decided that when one agent consumes such a field,
> before passing the message it must delete occurences of the other ones,
> and/or rebuild them from the composite one, in order to avoid inconsistency
> issues.
> 
> We have more or less this regarding Transfer-Encoding which voids
> Content-Length, and the Host header field which must always match the
> authority part of the URI if present.
> 
> These are just thoughts, maybe they are stupid.

Not stupid at all, but I am concerned about adding too much "magic"; if implementations are doing too much on your behalf, issues will arise (see above).

Cheers,

--
Mark Nottingham   https://www.mnot.net/

Re: If not JSON, what then ? Sam Johnston
Re: If not JSON, what then ? Martin J. Dürst
Re: If not JSON, what then ? Alcides Viamontes E
Re: If not JSON, what then ? Mark Nottingham
Re: If not JSON, what then ? Willy Tarreau
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Martin Thomson
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Mark Nottingham
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Martin Thomson
Re: If not JSON, what then ? Willy Tarreau
Re: If not JSON, what then ? Kari hurtta
Re: If not JSON, what then ? Willy Tarreau
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Kari hurtta
Re: If not JSON, what then ? Martin Thomson
Re: If not JSON, what then ? Carsten Bormann
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Mark Nottingham
Re: If not JSON, what then ? Carsten Bormann
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Carsten Bormann
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Stefan Eissing
Re: If not JSON, what then ? Willy Tarreau
Re: If not JSON, what then ? nicolas.mailhot
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Mark Nottingham
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Mark Nottingham
Re: If not JSON, what then ? Mark Nottingham
Re: If not JSON, what then ? nicolas.mailhot
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Stefan Eissing
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Willy Tarreau
Re: If not JSON, what then ? Mark Nottingham
Re: If not JSON, what then ? James M Snell
Re: If not JSON, what then ? Cory Benfield
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Willy Tarreau
Re: If not JSON, what then ? Nicolas Mailhot
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Poul-Henning Kamp
Re: If not JSON, what then ? Willy Tarreau
Re: If not JSON, what then ? Carsten Bormann
If not JSON, what then ? Poul-Henning Kamp