Re: Binary packet protocol rethink

Bryan Ford <brynosaurus@gmail.com> Mon, 30 November 2015 21:45 UTC

Return-Path: <bounces-ietf-ssh-owner-secsh-tyoxbijeg7-archive=lists.ietf.org@NetBSD.org>
X-Original-To: ietfarch-secsh-tyoxbijeg7-archive@ietfa.amsl.com
Delivered-To: ietfarch-secsh-tyoxbijeg7-archive@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id BF8CF1B344E for <ietfarch-secsh-tyoxbijeg7-archive@ietfa.amsl.com>; Mon, 30 Nov 2015 13:45:29 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.109
X-Spam-Level:
X-Spam-Status: No, score=-1.109 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, J_CHICKENPOX_22=0.6, MIME_8BIT_HEADER=0.3, T_RP_MATCHES_RCVD=-0.01] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id PWknXFk3MGfy for <ietfarch-secsh-tyoxbijeg7-archive@ietfa.amsl.com>; Mon, 30 Nov 2015 13:45:27 -0800 (PST)
Received: from mail.netbsd.org (mail.NetBSD.org [IPv6:2001:4f8:3:7::25]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id EE82D1AD1A6 for <secsh-tyoxbijeg7-archive@lists.ietf.org>; Mon, 30 Nov 2015 13:45:26 -0800 (PST)
Received: by mail.netbsd.org (Postfix, from userid 605) id 8911014A3A2; Mon, 30 Nov 2015 21:45:26 +0000 (UTC)
Delivered-To: ietf-ssh@netbsd.org
Received: by mail.netbsd.org (Postfix, from userid 1347) id 070EA14A379; Mon, 30 Nov 2015 21:45:26 +0000 (UTC)
Received: from localhost (localhost [127.0.0.1]) by mail.netbsd.org (Postfix) with ESMTP id 8212A14A426 for <ietf-ssh@netbsd.org>; Mon, 30 Nov 2015 13:58:28 +0000 (UTC)
X-Virus-Scanned: amavisd-new at NetBSD.org
Authentication-Results: mail.NetBSD.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.netbsd.org ([127.0.0.1]) by localhost (mail.NetBSD.org [127.0.0.1]) (amavisd-new, port 10025) with ESMTP id EtaWtf8J1HWQ for <ietf-ssh@netbsd.org>; Mon, 30 Nov 2015 13:58:27 +0000 (UTC)
Received: from mail-wm0-x229.google.com (mail-wm0-x229.google.com [IPv6:2a00:1450:400c:c09::229]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.netbsd.org (Postfix) with ESMTPS id C30C014A3A2 for <ietf-ssh@netbsd.org>; Mon, 30 Nov 2015 13:58:26 +0000 (UTC)
Received: by wmww144 with SMTP id w144so138718950wmw.0 for <ietf-ssh@netbsd.org>; Mon, 30 Nov 2015 05:58:25 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=content-type:mime-version:subject:from:in-reply-to:date:cc :message-id:references:to; bh=1a20k+AWMyXfZXyuALH4WaJ16e/DUgbLrIRIus+224g=; b=HmUe0M6XRshk7fRfXJWu/xCEH9KzH3grOELWb0ssPIbPeQd6BI+K4/cZWBTkwPDn2l /eJjVi8qURe+NPy+XK5QxVBlzVGgsp5LG1XEHI2LcFgbyIPidEOu62Buh8BNgg9mgDcz q7HTLAUTDtLzCOjOdBo6fX6KPYBmktZFezJ6cKidVFOCecBpcNlj9XH8TSjOQ4M2ycVJ R/O2YVUoeNSIW+BTzExwkE56L4O5LRkHbz7+xqSd3WuoI7XijYqJ2VxJ0ZEWPy7zEmvg 50hjxDlDL23eVW7BzgD7jv6Iy/t6m1LFFaCaRcnJA+EHlbGksXFePt/P4sI86Au+9NjB bIhg==
X-Received: by 10.28.60.11 with SMTP id j11mr27175335wma.57.1448891905120; Mon, 30 Nov 2015 05:58:25 -0800 (PST)
Received: from icsil1noteb193.epfl.ch (icsil1noteb193.epfl.ch. [128.178.151.41]) by smtp.gmail.com with ESMTPSA id bk2sm47109686wjc.3.2015.11.30.05.58.22 (version=TLS1 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Mon, 30 Nov 2015 05:58:23 -0800 (PST)
Content-Type: multipart/signed; boundary="Apple-Mail=_E9D7C51F-4574-44C0-8CAA-F1151CD998B7"; protocol="application/pkcs7-signature"; micalg="sha1"
Mime-Version: 1.0 (Mac OS X Mail 9.1 \(3096.5\))
Subject: Re: Binary packet protocol rethink
From: Bryan Ford <brynosaurus@gmail.com>
In-Reply-To: <nnpoyrra9p.fsf@armitage.lysator.liu.se>
Date: Mon, 30 Nov 2015 14:58:22 +0100
Cc: Simon Tatham <anakin@pobox.com>, Peter Gutmann <pgut001@cs.auckland.ac.nz>, Damien Miller <djm@mindrot.org>, Simon Josefsson <simon@josefsson.org>, "ietf-ssh@netbsd.org" <ietf-ssh@netbsd.org>
Message-Id: <55D41D88-55E6-4FC7-890B-62B5E075B3B7@gmail.com>
References: <87egfdxebo.fsf@latte.josefsson.org> <nny4dksr3i.fsf@armitage.lysator.liu.se> <1448554180-sup-7145@atreus.tartarus.org> <9A043F3CF02CD34C8E74AC1594475C73F4B857C7@uxcn10-5.UoA.auckland.ac.nz> <alpine.BSO.2.20.1511292228450.12629@natsu.mindrot.org> <9A043F3CF02CD34C8E74AC1594475C73F4B92EF0@uxcn10-5.UoA.auckland.ac.nz> <nn37vnsyoi.fsf@armitage.lysator.liu.se> <9A043F3CF02CD34C8E74AC1594475C73F4B9321A@uxcn10-5.UoA.auckland.ac.nz> <1448874084-sup-4376@atreus.tartarus.org> <nnpoyrra9p.fsf@armitage.lysator.liu.se>
To: Niels Möller <nisse@lysator.liu.se>
X-Mailer: Apple Mail (2.3096.5)
Sender: ietf-ssh-owner@NetBSD.org
List-Id: ietf-ssh.NetBSD.org
Precedence: list

I learned about this discussion thread from Peter’s mention of it on the TLS list, where as he mentioned a parallel debate is going on…  (Thanks Peter; hope you don’t mind me stalk…er, following your hint over here. ;) )

On 30 Nov 2015, at 12:34, Niels Möller <nisse@lysator.liu.se> wrote:
> Simon Tatham <anakin@pobox.com> writes:
> 
>> an attacker either guesses the true length by correlating to the
>> TCP headers, or probes it by means of the byte-at-a-time dribbling
>> attack, or actively corrupts the cipher block containing the length and
>> waits to see when the resulting MAC failure is reported, 
> 
> Would you be happier if the length field were independently
> authenticated? I'm not sure how strong an authenticator we need, it
> seems a bit silly to use an authentication tag which is much larger than
> the message, but maybe it's really needed.

It’s hard for me to see how separately authenticating the length field would be a benefit; in fact I would worry about whether it could introduce a weakness, e.g., where an attacker could somehow contrive “mix-and-match” attacks that could get a receiver to use one (correctly-MAC’d) length field with a different (correctly but independently MAC’d) payload to cause Bad Things of whatever kind to happen.  I also don’t see any essential reason or even important advantage of doing so.

However, I do (think) I see the tension that might seem to suggest treating the length independently from the payload: in a protocol like SSH that encrypts both header and payload, you would like to read and decrypt the length first, then separately read and decrypt the payload (perhaps at a different location in memory).  Especially if the protocol moves to using AEAD ciphers, which has been suggested and I agree is probably generally a worthwhile idea, the standard AEAD API wants to assume the length is already known at decryption/integrity-check time, e.g., by being carried in the normally-unencrypted Additional Data (AD) field.

On the TLS list I posted a proposal for one way (of many) to resolve this seeming conflict between a desire to move to AEAD and a desire to continue encrypting headers (which I strongly support, whether in TLS or SSH or any other encrypted protocol).  A copy of my proposal is attached below, FWIW.  In summary, you still only need one MAC, and you integrity-check the header (including length) together with the payload in the “main” AEAD decryption, but you separately encrypt (without authentication) the header using either a stream cipher or an AEAD used as a stream cipher.  The fundamental argument for why the unauthenticated encryption of the header is safe is because it just changes the encoding of the header without affecting how the header is authenticated (separately, while authenticating the body).  Thus, changing the header encoding while leaving the authentication and body-encryption unchanged from standard AEAD practice cannot make security any worse than just transmitting the header (AD part) in cleartext, and can (sometimes) be a security benefit.

>> by making sure that the encrypted block boundaries do not also
>> reveal the length or position of any actually important data, such as a
>> particular SSH_MSG_anything.
> 
> Can we do that with the current protocol? If so, guidance is
> appreciated. What I object to is removing a feature (encrypted message
> lengths) which enables known counter measures to traffic analysis, and
> replace it by nothing.

Strongly agreed on this.  The fact that a particular security measure (like encrypting headers) isn’t by itself an end-all one-stop solution to all the world’s traffic analysis problems doesn’t make it not worthwhile.

Cheers
Bryan

————————————————(snip)————————————————

The idea of encrypting TLS record headers has come up before, the most
important purpose being to hide record lengths and boundaries and make
fingerprinting and traffic analysis harder.  I had convinced myself that
goal this would be "too hard" to accomplish in TLS 1.3, but after
further thought I'm not so sure.  So I would like to request comment on
one approach that strikes me as a practical and requires only a rather
minor change to the current spec.

The quick summary:

* To encrypt a record, we first AEAD-encrypt the record's payload,
protecting the header fields via the additional_data, exactly as
currently specified.  But then we XOR-encrypt the 5-byte TLS header just
before transmission, using a (separate) stream cipher indexed by a nonce
that depends on record sequence number and *_write_iv, in exactly the
same way the AEAD is already nonce-indexed.

* To decrypt a record, we simply do the reverse: first use the stream
cipher with the appropriate nonce to XOR-decrypt the 5-byte TLS header,
then sanity-check it as usual to determine its length, read the rest of
the record, and submit it to AEAD for decryption and full integrity
checking as before.

That's it, in a nutshell.  Two likely concerns immediately arise,
discussed below, but feel free to TL;DR the rest if you don't share
these concerns.

---

Concern #1: What if an active attacker messes with the TLS header,
especially the length field, since stream ciphers don't protect
integrity?  The simple answer is that *exactly* the same thing happens
as now: the AEAD decryption attempt fails, because the
(stream-decrypted) header is AEAD-protected as additional_data.  Nothing
is gained or lost.

SSH, which did something like this, ran into trouble with attackers
being able to twiddle the record length field to make the record length
look big, causing the receiver to try to receive a very large record,
and hence appear to the user to hang, instead of immediately detecting
the modification and terminating the connection.  But there are three
mitigating factors here: (1) TLS is not usually used for interactive
terminal traffic like SSH is; (2) TLS's 2-byte record length field
imposes a pretty reasonable upper-bound on the maximum size an attacker
could maliciously make a record appear to be; and (3) if this risk of
length-twiddling is at all a problem in this proposed encrypted-header
protocol, then it's already a problem for the current TLS 1.3 spec
without encrypted headers, because active attackers can twiddle the bits
of a cleartext length field just as easily (and even be *certain* they
are making the length appear large!).  So I can't see any way this
length-twiddling vulnerability becomes any worse, and maybe it gets a
bit better (because the attacker can no longer be entirely certain
whether he's setting a 1 bit to 0 or a 0 bit to 1).

---

Concern #2: Do we want to have to go to the trouble of adding a stream
cipher to every TLS 1.3-compatible ciphersuite?  Answer: maybe not, but
we don't necessarily need to.  We could instead just specify a generic
method of using the ciphersuite's main AEAD as a stream cipher for
header encryption/decryption purposes.

The conceptually simplest approach I can think of: In the specification
of how AEAD nonces are generated (section 5.2.2 of
draft-ietf-tls-tls13-07), reserve the least-significant bit of the
record sequence number, so that sequence numbers increment by 2 rather
than 1 each record.  Thus, we get two unique nonces per record from the
same set of symmetric keys.  We first use the nonce with a '0'
least-significant bit to perform the regular AEAD-encryption of the
record with the header info as additional_data.

Then for the same record we use the nonce with a '1' least-significant
to AEAD-encrypt a sequence of five zero bytes ("\0\0\0\0\0"), and use
the first five bytes of result as the cipherstream to XOR the 5-byte TLS
header with before transmitting.  The AEAD will of course uselessly
append some kind of authenticator to this ciphertext that we won't end
up using, but that's OK.  The receiver will just use AEAD-encrypt
(again) on the same five-zero-byte message to reproduce the 5
cipherstream bytes with which to decrypt the TLS header.  Thus, senders
perform two AEAD-encrypts per record, and receivers do one AEAD-encrypt
and one AEAD-decrypt per record.

This approach seems pretty conceptually clean and simple, but has the
performance downside that we always need to invoke the AEAD twice per
record rather than once, which might be (a bit) costly especially when
records are small.  So a simple refinement is to amortize this cost
across records: e.g., once every 256 records (every sequence number
ending in 0x00) we AEAD-encrypt a sequence of 5*256=1280 zero bytes, and
the result in 5-byte chunks as the cipherstream with which to encrypt
and decrypt 256 consecutive TLS record headers.  Thus, we're only adding
one additional AEAD-encryption of a "normal-packet-sized" 1280-byte blob
once every 256 records, which seems likely to be a pretty
inconsequential performance cost.

---

Comments?

Thanks
Bryan