Re: QCRAM Review Request: PR #1141

Martin Thomson <martin.thomson@gmail.com> Tue, 13 March 2018 02:40 UTC

MIME-Version: 1.0
In-Reply-To: <CAKcm_gOTLFBO5+rZMa98eXiWB1avh=-x+p90-dMFHbmg4X+PEA@mail.gmail.com>
References: <MWHPR08MB24325CADE88C887A869F65F1DAC60@MWHPR08MB2432.namprd08.prod.outlook.com> <CABkgnnWGWA7+e1DgmWT+MBDDSScoOyOg_RghhXgCNLaAOtTtPw@mail.gmail.com> <8FDB7B3D-E206-435F-9D74-22A18DC516F8@fb.com> <CAKcm_gOTLFBO5+rZMa98eXiWB1avh=-x+p90-dMFHbmg4X+PEA@mail.gmail.com>
From: Martin Thomson <martin.thomson@gmail.com>
Date: Tue, 13 Mar 2018 02:40:45 +0000
Message-ID: <CABkgnnV4Z_XkVAuiW8mfTRmVPOmEnRDOUAcB_xNTN8gsz6AJaw@mail.gmail.com>
Subject: Re: QCRAM Review Request: PR #1141
To: Ian Swett <ianswett@google.com>
Cc: Alan Frindell <afrind@fb.com>, Mike Bishop <mbishop@evequefou.be>, IETF QUIC WG <quic@ietf.org>
Content-Type: multipart/alternative; boundary="001a113d2c728845d00567422f9d"
Archived-At: <https://mailarchive.ietf.org/arch/msg/quic/XgwLulE6smiauz2fFbYydfGW5xY>
Precedence: list

Efficiency gains are not especially relevant here. I think that the primary
concern is which choice is best for those implementing the spec.

FWIW, I disagree with Alan, I think that the impact of a change is minimal.
We might discuss ways to wind back some of the changes Mike suggests, but I
would be sad if we didn't at least fix the but where you have to discard
invalid instructions. That is where this started. I also very much like
using the bit to separate static and dynamic. That will let us fix that
rotten static table.  I care much less about the changes to strings though.

(I expect the gains to be miniscule.)

On 12 Mar. 2018 4:23 pm, "Ian Swett" <ianswett@google.com> wrote:

> The table above is great, but is it also possible to estimate 'small
> efficiency gains' in terms of %?
>
> On Mon, Mar 12, 2018 at 11:58 AM Alan Frindell <afrind@fb.com> wrote:
>
>> I mentioned this on issue #1123, but will repeat it on the list.  I think
>> redoing the instructions and string literal representation will limit code
>> re-use between HPACK and QPACK (formerly called QCRAM).  Initial simulation
>> of QPACK shows that it is pretty close to HPACK compression as is.  I’m
>> more swayed by eliminating the possibility of sending an instruction in an
>> invalid context (eg: a literal on the control stream), but on balance I
>> prefer to keep the same instructions as HPACK.
>>
>>
>>
>> It would be great to hear from other implementers (eg folks not on the
>> compression design committee) how they value code-reuse versus small
>> efficiency gains and fewer error checks.
>>
>>
>>
>> -Alan
>>
>>
>>
>> *From: *QUIC <quic-bounces@ietf.org> on behalf of Martin Thomson <
>> martin.thomson@gmail.com>
>> *Date: *Thursday, March 1, 2018 at 8:15 PM
>> *To: *Mike Bishop <mbishop@evequefou.be>
>> *Cc: *QUIC WG <quic@ietf.org>
>> *Subject: *Re: QCRAM Review Request: PR #1141
>>
>>
>>
>> I was a little surprised to see the savings here, modest though they
>> are.  Overall, this is overwhelmingly in the right direction.
>>
>>
>>
>> We might quibble on some of the details, but I would prefer to see this
>> larger change made and then iterate where the details are less good (for
>> instance, we might revisit the notion of octet alignment in other ways, or
>> go back to strings starting on octet boundaries).
>>
>>
>>
>> On Thu, Mar 1, 2018 at 11:39 AM, Mike Bishop <mbishop@evequefou.be>
>> wrote:
>>
>> https://github.com/quicwg/base-drafts/pull/1141
>>
>>
>>
>> One suggestion from Martin’s implementation feedback that has been
>> mentioned before is reworking the QCRAM instruction space.  The QCRAM draft
>> uses the HPACK instructions, except that it cannibalizes table size changes
>> for the Duplicate instruction.  That leads to some inefficiency with the
>> separate streams, since you have to validate that instructions aren’t used
>> on the wrong stream and some of the opcode space is wasted on each stream.
>>
>>
>>
>> This PR observes that each stream has a fully disjoint set of commands
>> from the other:
>>
>>    - Table updates
>>
>>
>>    - Insert with Name Reference from Static Table
>>       - Insert with Name Reference from Dynamic Table
>>       - Insert without Name Reference
>>       - Duplicate
>>       - Change table size
>>
>>
>>    - Header blocks
>>
>>
>>    - Indexed entry from Static
>>       - Indexed entry from Dynamic
>>       - Literal with Name Reference from Static Table
>>       - Literal with Name Reference from Static Table, Never Indexed
>>       - Literal with Name Reference from Dynamic Table
>>       - Literal with Name Reference from Dynamic Table, Never Indexed
>>       - Literal without Name Reference
>>       - Literal without Name Reference, Never Indexed
>>
>>
>>
>> The PR takes advantage of the disjoint space to rework the opcodes in
>> hopes of saving bytes on common instructions.  It does this in two ways:
>>
>>    - Uses a bit in the opcode to differentiate static versus dynamic
>>    tables, instead of concatenating them
>>
>>
>>    - Gives us the future option to make the static table longer without
>>       hurting the dynamic table performance
>>
>>
>>    - Uses a bit in the opcode to identify string literals, instead of
>>    using a sentinel value of zero
>>
>>
>>    - Requires modifying the HPACK string definition to not require byte
>>       alignment at the beginning
>>       - Gives us the future option to make the tables zero-based instead
>>       of one-based
>>       - Makes indexing easier to explain in a future PR
>>
>>
>>
>> Here’s the impact:
>>
>>
>>
>> *Operation*
>>
>> *HPACK*
>>
>> *QCRAM*
>>
>> *PR#1141*
>>
>> *Impact*
>>
>> *Table updates*
>>
>>
>>
>>
>>
>>
>>
>>
>>
>> *Insert with Name Reference from Static Table*
>>
>> “01” + index < 62
>>
>> “01” + index < 62
>>
>> “11” + index
>>
>> Same
>>
>> *Insert with Name Reference from Dynamic Table*
>>
>> “01” + index >= 62
>>
>> (one byte for first 2 entries, then 2 bytes)
>>
>> “01” + index >= 62
>>
>> (one byte for first 2 entries, then 2 bytes)
>>
>> “10” + index
>>
>> (one byte for first 64 entries)
>>
>> One byte saved for 62 of 64 most recent entries
>>
>> *Insert without Name Reference*
>>
>> “01000000”
>>
>> “01000000”
>>
>> “01”
>>
>>
>>
>> One byte saved for header names < 32 octets
>>
>> *Duplicate*
>>
>> Not supported
>>
>> “001” + index >= 62
>>
>> “000” + index
>>
>> Moot; you’re unlikely to duplicate recent entries
>>
>> *Change table size*
>>
>> “001”
>>
>> Not supported
>>
>> “001”
>>
>> Same
>>
>> *Header blocks*
>>
>>
>>
>>
>>
>>
>>
>>
>>
>> *Indexed entry from Static*
>>
>> “1” + index < 62
>>
>> “1” + index < 62
>>
>> “11” + index
>>
>> Same
>>
>> *Indexed entry from Dynamic*
>>
>> “1” + index >= 62
>>
>> “1” + index >= 62
>>
>> “10” + index
>>
>> One byte longer for indices 62-63
>>
>> *Literal with Name Reference from Static Table*
>>
>> “0000” + index < 62
>>
>> (two bytes for index > 15)
>>
>> “0000” + index < 62
>>
>> (two bytes for index > 15)
>>
>> “0001” + index
>>
>> Same
>>
>> *Literal with Name Reference from Static Table, Never Indexed*
>>
>> “0001” + index < 62
>>
>> (two bytes for index > 15)
>>
>> “0001” + index < 62
>>
>> (two bytes for index > 15)
>>
>> “0011” + index
>>
>> Same
>>
>> *Literal with Name Reference from Dynamic Table*
>>
>> “0000” + index >= 62
>>
>> (always 2+ bytes)
>>
>> “0000” + index >= 62
>>
>> (always 2+ bytes)
>>
>> “0000” + index
>>
>> One byte saved for first 15 entries
>>
>> *Literal with Name Reference from Dynamic Table, Never Indexed*
>>
>> “0001” + index > 62
>>
>> (always 2+ bytes)
>>
>> “0001” + index > 62
>>
>> (always 2+ bytes)
>>
>> “0010” + index
>>
>> One byte saved for first 15 entries
>>
>> *Literal without Name Reference*
>>
>> “00000000”
>>
>> “00000000”
>>
>> “010”
>>
>> One byte saved for header names < 16 octets
>>
>> *Literal without Name Reference, Never Indexed*
>>
>> “00010000”
>>
>> “00010000”
>>
>> “011”
>>
>> One byte saved for header names < 16 octets
>>
>>
>>
>> There’s been some churn on the PR today as Martin and I worked some of
>> this out, so please give it a read (or another read) now and provide
>> feedback.
>>
>>
>>
>> Thanks!
>>
>>
>>
>

QCRAM Review Request: PR #1141 Mike Bishop
Re: QCRAM Review Request: PR #1141 Martin Thomson
Re: QCRAM Review Request: PR #1141 Alan Frindell
Re: QCRAM Review Request: PR #1141 Ian Swett
Re: QCRAM Review Request: PR #1141 Martin Thomson