Re: Dictionary Compression for HTTP (at Facebook)

Benjamin Kaduk <bkaduk@akamai.com> Mon, 03 September 2018 08:26 UTC

Date: Sat, 01 Sep 2018 23:05:13 -0500
From: Benjamin Kaduk <bkaduk@akamai.com>
To: Felix Handte <felixh@fb.com>
Cc: Mark Nottingham <mnot@mnot.net>, Jyrki Alakuijala <jyrki@google.com>, Charles McCathie-Neville <chaals@yandex-team.ru>, Evgenii Kliuchnikov <eustas@google.com>, Vlad Krasnov <vlad@cloudflare.com>, Nick Terrell <terrelln@fb.com>, Yann Collet <cyan@fb.com>, HTTP Working Group <ietf-http-wg@w3.org>
Message-ID: <20180902040513.GV5819@akamai.com>
References: <18eb0343-640c-8b95-1cc2-273bc72ec134@fb.com> <CAPapA7RLncAsHH5pr5RJSYjvPiNk8JvgBJ8T-tKebnC1C5ptHw@mail.gmail.com> <ED51E194-503A-4339-B564-A6543F42D0A1@mnot.net> <652edc11-2d19-aef9-e3fd-ecb77ab47c1a@fb.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <652edc11-2d19-aef9-e3fd-ecb77ab47c1a@fb.com>
Archived-At: <https://www.w3.org/mid/20180902040513.GV5819@akamai.com>

On Fri, Aug 24, 2018 at 10:24:23AM +0000, Felix Handte wrote:
> Jyrki,
> 
> Glad to hear it! We're excited too!
> 
> We look forward to playing with Brotli's new capabilities when they're available.
> 
> Yes, Zstd can accept unformatted buffers as dictionaries, which it uses as you describe. Additionally though, Zstd describes a format for structured dictionaries[1], which includes metadata in addition to an LZ77 prefix.

One topic that came up during IESG review of draft-kucherawy-dispatch-zstd was
whether/when third-party or standard dictionaries would become available and how
dictionary IDs would be assigned in those cases (at present, IIUC, dictionary
IDs need to be pre-negotiated between the two parties).  No IANA registry was
created at that time, but with a 4-byte dictionary identifier space to work with,
there seems to be room to create a registry for dictionary IDs (including a
private-use range, of course) and simply publish well-known dictionaries.
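
To make the ID-space point concrete, here is a minimal sketch of how a receiver
might pull the Dictionary_ID out of a Zstandard frame header and decide which
part of the space it falls in.  The header layout follows the Zstandard frame
format; the registry/private-use boundary (PRIVATE_USE_START) and the function
names are purely hypothetical, since no such registry or split exists.

```python
import struct

ZSTD_MAGIC = 0xFD2FB528  # Zstandard frame magic number, stored little-endian

# Hypothetical boundary, for illustration only: IDs at or above this value
# are treated as private use, IDs below it as registry-assigned.
PRIVATE_USE_START = 0x80000000


def read_dictionary_id(frame: bytes):
    """Return the Dictionary_ID from a Zstandard frame header, or None if absent."""
    if len(frame) < 5:
        raise ValueError("frame too short for a Zstandard header")
    (magic,) = struct.unpack_from("<I", frame, 0)
    if magic != ZSTD_MAGIC:
        raise ValueError("not a Zstandard frame")

    descriptor = frame[4]
    # The two low bits select the Dictionary_ID field size: 0, 1, 2 or 4 bytes.
    did_size = (0, 1, 2, 4)[descriptor & 0x03]
    if did_size == 0:
        return None

    # The Window_Descriptor byte is present only when Single_Segment_flag is 0.
    single_segment = (descriptor >> 5) & 0x01
    offset = 5 + (0 if single_segment else 1)

    field = frame[offset:offset + did_size]
    if len(field) < did_size:
        raise ValueError("truncated Dictionary_ID field")
    return int.from_bytes(field, "little")


def classify(dict_id):
    """Map an ID onto the hypothetical registry / private-use split."""
    if not dict_id:
        # Absent or zero: treated here as "no specific dictionary named".
        return "unspecified"
    return "private-use" if dict_id >= PRIVATE_USE_START else "registry"
```

With a split like this, a decoder could look up "registry" IDs in a published
set of well-known dictionaries and fall back to pre-negotiated state for
"private-use" IDs.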

-Ben

> [1] https://github.com/facebook/zstd/blob/dev/doc/zstd_compression_format.md#dictionary-format
> [2] https://tools.ietf.org/html/draft-vandevenne-shared-brotli-format-01#section-9
>