Re: [MLS] Optimizing Unmerged Leaves (or: bringing the MLS tree size back to O(n))

Théophile Wallez <theophile.wallez@inria.fr> Mon, 21 June 2021 11:14 UTC

Return-Path: <theophile.wallez@inria.fr>
X-Original-To: mls@ietfa.amsl.com
Delivered-To: mls@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 69DBB3A0417 for <mls@ietfa.amsl.com>; Mon, 21 Jun 2021 04:14:17 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.333
X-Spam-Level:
X-Spam-Status: No, score=-0.333 tagged_above=-999 required=5 tests=[HTML_MESSAGE=0.001, NICE_REPLY_A=-0.338, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id k2bxccYXOeol for <mls@ietfa.amsl.com>; Mon, 21 Jun 2021 04:14:12 -0700 (PDT)
Received: from mail3-relais-sop.national.inria.fr (mail3-relais-sop.national.inria.fr [192.134.164.104]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 78D5F3A040C for <mls@ietf.org>; Mon, 21 Jun 2021 04:14:11 -0700 (PDT)
IronPort-HdrOrdr: A9a23:ttii+6E6Q6XnTL74pLqE2ceALOsnbusQ8zAXPo5KKCC9E/bo8PxG+c5w6faaslossR0b9uxofZPwIk80lqQFhbX5X43DYOCOggLBR+xfBMnZsl/d8kbFh4tgPNJbAtND4arLfCBHZKjBjjVQa+xQueVvp5rY49vj8w==
X-IronPort-AV: E=Sophos;i="5.83,289,1616454000"; d="scan'208,217";a="385669634"
Received: from unknown (HELO [192.168.42.138]) ([37.171.158.62]) by mail3-relais-sop.national.inria.fr with ESMTP/TLS/AES256-GCM-SHA384; 21 Jun 2021 13:14:09 +0200
To: Raphael Robert <ietf=40raphaelrobert.com@dmarc.ietf.org>
Cc: MLS List <mls@ietf.org>
References: <162040556024.10515.18434867074453969330@ietfa.amsl.com> <F94C2B20-DC04-4028-937E-698D598E319D@sn3rd.com> <81ED88F6-16A8-45C4-A9B9-FD6659CB68FB@sn3rd.com> <4614A3C1-8461-48F8-AD53-C1399BC5F673@sn3rd.com> <6B8101F5-A54A-429A-82B0-5386E321F525@inria.fr> <8E5EC435-7B97-4852-AB90-F4E501ACDEDA@raphaelrobert.com>
From: Théophile Wallez <theophile.wallez@inria.fr>
Message-ID: <42346e03-df57-57c1-c781-249ebe13e798@inria.fr>
Date: Mon, 21 Jun 2021 13:14:09 +0200
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.11.0
MIME-Version: 1.0
In-Reply-To: <8E5EC435-7B97-4852-AB90-F4E501ACDEDA@raphaelrobert.com>
Content-Type: multipart/alternative; boundary="------------99814DA93854DDF8DE8746EF"
Content-Language: en-US
Archived-At: <https://mailarchive.ietf.org/arch/msg/mls/AKrZ2EUXRREUNmClPQi_Sc1O2Dk>
Subject: Re: [MLS] Optimizing Unmerged Leaves (or: bringing the MLS tree size back to O(n))
X-BeenThere: mls@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Messaging Layer Security <mls.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/mls>, <mailto:mls-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/mls/>
List-Post: <mailto:mls@ietf.org>
List-Help: <mailto:mls-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/mls>, <mailto:mls-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 21 Jun 2021 11:14:17 -0000

Hi and sorry for the late answer,

I'll also respond inline:

On 22/05/2021 12:14, Raphael Robert wrote:
>
>> On 21. May 2021, at 16:59, Karthik Bhargavan 
>> <karthikeyan.bhargavan@inria.fr 
>> <mailto:karthikeyan.bhargavan@inria.fr>> wrote:
>>
>> Solution 1
>> —————
>>
>> One possibility, close to the current MLS spec would be to not store 
>> the unmerged leaf at a node if it is already stored at the parent.
>> This way, every leaf is present at most once in some 
>> `unmerged_leaves` array.
>> In the worst case, this uses 4*n + 4*(n/2) = 6*n bytes (4*n for the 
>> `unmerged_leaves` array size, 4*(n/2) because at most n/2 leaves are 
>> stored in the `unmerged_leaves` arrays).
>> In the best case, this uses 4*n bytes.
>> The tree resolution algorithm will now have to do a little more 
>> calculation to re-compute the unmerged leaves at each node.
>
> This sounds rather straight forward on a logical level. It clearly 
> reduces the stored/transmitted payload size, but it introduces a new 
> cost: previously only O(log n) nodes needed to be modified during an 
> Update/Remove. Now we might have to “push down” the book keeping 
> information from parents to children, in the worst case this affects 
> n/2 nodes, so the cost of modifying nodes goes up to O(n).
> If we consider a very large group, there would be a measurable 
> difference when clients load/store nodes from/to disk.
> My current intuition is that currently I/O operations can be done in 
> O(log n) with clever parent hash caching. The worst case here would be 
> the following: A client is a member of a large group and has been 
> offline for a while (which is not a far fetched scenario). The client 
> now comes online and has to process a large number of handshake 
> messages. For robustness, the client might be inclined to frequently 
> persist the intermediary state. This is where the I/O cost kicks in.

Indeed, when "pushing down" the unmerged leaves, if it is pushed down to 
a blank node, we might have to modify more than O(log n) nodes. This 
could be solved by storing unmerged leaves in blank nodes, but it is 
starting to get quite ugly.

>> Solution 2+
>> —————
>>
>> However, if we wish to compress further, we can do so as well, using 
>> two facts:
>> - if a node `n` has two children `l` and `r`, then 
>> n.last_update_epoch is equal to either l.last_update_epoch or 
>> r.last_update_epoch,
>> - if `l` is unmerged for some node then l.creation_epoch = 
>> l.last_update_epoch (because in this case, the leaf never generated 
>> an UpdatePath) (the converse is not true, for example in the case 
>> when its sibling generates an UpdatePath).
>>
>> So at each node, we can store the direction from which the node was 
>> last updated (for example with a boolean flag), then given any node 
>> `n` we can "follow the path" down the tree to obtain a leaf `l`, and 
>> we have n.last_update_epoch = l.last_update_epoch.
>> Furthermore, we don't need to store the creation epoch for leaves:
>> - if `l` is unmerged for `n`, then n.last_update_epoch < 
>> l.creation_epoch = l.last_update_epoch,
>> - if `l` is not unmerged for `n`, then n.last_update_epoch >= 
>> l.last_update_epoch (because when `l` is updated, it updates all the 
>> nodes above it).
>> This uses 8*n + 1*n = 9*n bytes for the whole tree.
>
> I didn’t check all the ramifications here tbh. I think it is similar 
> to solution 1 in terms of increased I/O cost.

Actually, there is no "pushing down" here, when modifying nodes on a 
path from leaf to the root, only those nodes are modified (we update the 
direction of the parent nodes in the path, and update the last update 
epoch of the leaf), so the I/O cost is actually O(log n).

Thank you for your comments,
Théophile.