Re: [Tzdist-bis] [calsify] tzdist and IANA -- estimating the operational parameters

Paul Eggert <eggert@cs.ucla.edu> Thu, 18 July 2019 20:15 UTC

Return-Path: <eggert@cs.ucla.edu>
X-Original-To: tzdist-bis@ietfa.amsl.com
Delivered-To: tzdist-bis@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F27801200EF; Thu, 18 Jul 2019 13:15:58 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.2
X-Spam-Level:
X-Spam-Status: No, score=-4.2 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id IS8P1NsT0Qid; Thu, 18 Jul 2019 13:15:57 -0700 (PDT)
Received: from zimbra.cs.ucla.edu (zimbra.cs.ucla.edu [131.179.128.68]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 28EA1120074; Thu, 18 Jul 2019 13:15:57 -0700 (PDT)
Received: from localhost (localhost [127.0.0.1]) by zimbra.cs.ucla.edu (Postfix) with ESMTP id 7632E1626E8; Thu, 18 Jul 2019 13:15:56 -0700 (PDT)
Received: from zimbra.cs.ucla.edu ([127.0.0.1]) by localhost (zimbra.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10032) with ESMTP id axPdfgpBU37K; Thu, 18 Jul 2019 13:15:55 -0700 (PDT)
Received: from localhost (localhost [127.0.0.1]) by zimbra.cs.ucla.edu (Postfix) with ESMTP id 366411626E9; Thu, 18 Jul 2019 13:15:55 -0700 (PDT)
X-Virus-Scanned: amavisd-new at zimbra.cs.ucla.edu
Received: from zimbra.cs.ucla.edu ([127.0.0.1]) by localhost (zimbra.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id WImwP9RbMV-M; Thu, 18 Jul 2019 13:15:55 -0700 (PDT)
Received: from [192.168.1.9] (cpe-23-242-74-103.socal.res.rr.com [23.242.74.103]) by zimbra.cs.ucla.edu (Postfix) with ESMTPSA id 022BA1626E8; Thu, 18 Jul 2019 13:15:54 -0700 (PDT)
To: Steve Crocker <steve@shinkuro.com>, Time zone mailing list <tz@iana.org>, IETF-Calsify <calsify@ietf.org>
Cc: Martin Burnicki <martin.burnicki@burnicki.net>, tzdist-bis@ietf.org, Daniel Migault <daniel.migault@ericsson.com>
References: <CADZyTkkNuenOTx77cB8vFzWaSjp_fqtEgaYd00t5e2+Kk6MtRA@mail.gmail.com> <47038bd0-210a-4dbc-173c-15bfa89ac54f@cs.ucla.edu> <6a50708d-5fac-1da9-56b3-ed1cea793752@fastmail.com> <6b1616e3-bfc0-a22a-2aa9-11033c525ff3@cs.ucla.edu> <CADZyTkms=skOgqfoDxTF6TSZBsWbfaCmfPBWCG0E9qb4-qA0Uw@mail.gmail.com> <6e243cf0-a3ff-82a1-3cae-70fbf699f1f7@gmail.com> <98A352CD-3386-49FB-B6C1-D4EC61BB79EC@cisco.com> <caf6c792-3bb0-c9ec-9ac1-1e7b7c1b6bfd@burnicki.net> <B5E14F98-5CD1-4FFE-92FF-082301BC60F2@cisco.com> <e80eea72-04fa-fe94-c7aa-3c17daedbe9d@burnicki.net> <CADZyTknTp212k7ZLFsw51r=KZcz==UGKw0Z+BWT0fVs4-cYMNw@mail.gmail.com> <CABf5zvK=KcW0YMtevmR3A01xMNbxF1nwE5ib2jj3482FFt75UA@mail.gmail.com>
From: Paul Eggert <eggert@cs.ucla.edu>
Organization: UCLA Computer Science Department
Message-ID: <118a452c-25ba-8427-4401-5465270da738@cs.ucla.edu>
Date: Thu, 18 Jul 2019 13:15:54 -0700
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0
MIME-Version: 1.0
In-Reply-To: <CABf5zvK=KcW0YMtevmR3A01xMNbxF1nwE5ib2jj3482FFt75UA@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Language: en-US
Content-Transfer-Encoding: 7bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/tzdist-bis/NZuZoQfmZGseAYmwRIgTiWsqmkQ>
Subject: Re: [Tzdist-bis] [calsify] tzdist and IANA -- estimating the operational parameters
X-BeenThere: tzdist-bis@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Extensions to Time Zone Data Distribution Service <tzdist-bis.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tzdist-bis>, <mailto:tzdist-bis-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tzdist-bis/>
List-Post: <mailto:tzdist-bis@ietf.org>
List-Help: <mailto:tzdist-bis-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tzdist-bis>, <mailto:tzdist-bis-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 18 Jul 2019 20:15:59 -0000

Steve Crocker wrote:
> Early in this thread it was mentioned the the time zone database should be
> served in a fashion similar to DNS.  My first thought was the numbers are
> wildly different.  I jotted down a first cut at identifying the relevant
> operational parameters.  Perhaps the people proposing this service can
> flesh out the quantitative aspects.

Here are some estimates. There is no central TZDIST server now, so all the 
non-size estimates are for if/when that happens. The estimated rate of change 
for these numbers is all zero, except for the database size where I'd estimate a 
growth of 5%/year (this is just off the top of my head).


Frequency of database update: 18 times per year has been the worst case since 
the project started in the 1980s. Recently updates have occurred three to ten 
times per year.

Frequency of access: If we're assuming a pull model like TZDIST with primary and 
secondary servers, I would expect queries once every hour-or-so from each 
downstream server that wants to be up-to-date. Usually the response will be 
"nothing has changed".

Required response time: Most clients and servers can be expected to have a 
(possibly-stale) copy of the data already, which they can fall back on. So I 
would say "minutes".

Required uptime: Again, not crucial. I would say 90% would be enough.


For sizes, it depends on data format and whether you want all the data.

Size of entire tzdb database (including all time zone history, version info, and 
legal notices), in minimal source-code (text) form: 111 kB uncompressed, 27 kB 
compressed via gzip, 22 kB compressed via lzip.

Size of entire tzdb database in binary (TZif) form in traditional form: 456 kB 
uncompressed, 152 kB tar+gzip, 68 kB tar+lzip. This includes all time zone 
history but omits version info and legal notices.

Same size, but with zic's new '-b slim' option that relies on Internet RFC 8536 
instead of attempting to work around bugs in older applications: 216 kB 
uncompressed, 77 kB tar+gzip, 43 kB tar+lzip.

I don't know how to compute the size of tzdb converted into iCalendar form, but 
my guess is that it'd be the same order-of-magnitude as the TZif files.

All sizes are for ordinary (POSIX) tzdb, not for the rarely-used leapsecond variant.