Re: [81attendees] What is it at the bottom of restaurant receipts?

"Worley, Dale R (Dale)" <dworley@avaya.com> Sat, 20 August 2011 03:47 UTC

Return-Path: <dworley@avaya.com>
X-Original-To: 81attendees@ietfa.amsl.com
Delivered-To: 81attendees@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5984B21F85EC for <81attendees@ietfa.amsl.com>; Fri, 19 Aug 2011 20:47:49 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -103.446
X-Spam-Level:
X-Spam-Status: No, score=-103.446 tagged_above=-999 required=5 tests=[AWL=0.153, BAYES_00=-2.599, RCVD_IN_DNSWL_LOW=-1, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 6o9CfwYLe033 for <81attendees@ietfa.amsl.com>; Fri, 19 Aug 2011 20:47:48 -0700 (PDT)
Received: from de307622-de-outbound.net.avaya.com (de307622-de-outbound.net.avaya.com [198.152.71.100]) by ietfa.amsl.com (Postfix) with ESMTP id 9022021F85F2 for <81attendees@ietf.org>; Fri, 19 Aug 2011 20:47:48 -0700 (PDT)
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: Av4EAO0tT07GmAcF/2dsb2JhbABCqA53gUABAQEBAxIoPxACAQgNKRAyJQEBBAENDRqkDAKbd4VpXwSYPotn
X-IronPort-AV: E=Sophos;i="4.68,253,1312171200"; d="scan'208";a="263249044"
Received: from unknown (HELO co300216-co-erhwest.avaya.com) ([198.152.7.5]) by de307622-de-outbound.net.avaya.com with ESMTP; 19 Aug 2011 23:48:44 -0400
Received: from dc-us1hcex1.us1.avaya.com (HELO DC-US1HCEX1.global.avaya.com) ([135.11.52.20]) by co300216-co-erhwest-out.avaya.com with ESMTP; 19 Aug 2011 23:45:00 -0400
Received: from DC-US1MBEX4.global.avaya.com ([169.254.2.172]) by DC-US1HCEX1.global.avaya.com ([2002:870b:3414::870b:3414]) with mapi; Fri, 19 Aug 2011 23:48:43 -0400
From: "Worley, Dale R (Dale)" <dworley@avaya.com>
To: Richard Barnes <richard.barnes@gmail.com>, "John R. Levine" <johnl@iecc.com>
Date: Fri, 19 Aug 2011 23:48:41 -0400
Thread-Topic: [81attendees] What is it at the bottom of restaurant receipts?
Thread-Index: AcxWyDTZ83t77v3QTLeaZMBrGcvdIAIIHVI9
Message-ID: <CD5674C3CD99574EBA7432465FC13C1B222B1F583F@DC-US1MBEX4.global.avaya.com>
References: <CD5674C3CD99574EBA7432465FC13C1B222B1F57D4@DC-US1MBEX4.global.avaya.com> <CD5674C3CD99574EBA7432465FC13C1B222B1F5801@DC-US1MBEX4.global.avaya.com> <9B3D0397-56CB-4270-9787-DCB67544D857@estacado.net> <alpine.BSF.2.00.1108091333400.781@joyce.lan>, <CACB24MufX1Pb2PusTn1BquqMxiGUR_E7skz8rn_AiFAd6UBtOw@mail.gmail.com>
In-Reply-To: <CACB24MufX1Pb2PusTn1BquqMxiGUR_E7skz8rn_AiFAd6UBtOw@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
acceptlanguage: en-US
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
Cc: "81attendees@ietf.org" <81attendees@ietf.org>
Subject: Re: [81attendees] What is it at the bottom of restaurant receipts?
X-BeenThere: 81attendees@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: IETF 81 Attendee List <81attendees.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/81attendees>, <mailto:81attendees-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/81attendees>
List-Post: <mailto:81attendees@ietf.org>
List-Help: <mailto:81attendees-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/81attendees>, <mailto:81attendees-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 20 Aug 2011 03:47:49 -0000

I went through the math, and it looks like there are 256 symbols in
the set of dingbats.

I did a maximum-likelihood calculation, or rather, a Bayesian analysis
assuming that all alphabet sizes that are powers of 2 are a-priori
equally likely, then looking at a "large" set of dingbats, seeing
the pattern of duplications among them, and computing a-posteriori
probabilities of the alphabet size.  (This assumes that the dingbats
are statistically random.)

I had to reconstruct the formula for "If you have N symbols, and draw
from them randomly n times, allowing duplicates, the resulting
multiset will be a partition of n.  Given that partition, what is the
probability of that partition resulting, as a function of N?"

Throwing away some factors which are independent of N, which don't
affect the Bayes Rule calculation, the resulting probability is a
function only of N, n, and the number of parts in the partition, k:
P = descendingfactorial(N, k)/N**n.

Taking 5 slips that were perfectly readable, there were 60 symbols,
which included 3 triplets, one duplicate, and 49 unique symbols:

60 = 3 + 3 + 3 + 2 + (49)*1

Plugging this into the formula gives:

         n         ln(p)
        64   -61.867
       128   -46.607
       256   -44.610
       512   -46.457
      1024   -49.890
      2048   -54.051
      4096   -58.562
      8192   -63.245
     16384   -68.013

So it looks like there are 256 symbols, and they carry 8 bits of
information each, which isn't surprising.

Checking the Unicode charts, almost all of the symbols are on the page
U+22xx, "Mathematical symbols".  I haven't tracked down the rest, some
of which are seriously obscure, but some seem to be sans-serif Hebrew.
I am guessing that the symbols not in U+22xx are to replace some
symbols on that page that are too much like others.

So the dingbats contain 96 bits.

I still favor the idea that they are some sort of keyed hash of the
data in the barcode, but I can't figure out how it would be used
operationally, since a Revenue Quebec bar-code reader could easily
read a keyed hash and verify the signature.  It has to be compared by
eye with another display in the same format.

And then there's a receipt whose 12 symbols are two repetitions of a
sequence of 6 symbols.  This is so incredibly unlikely that there must
be something wrong with it.

Dale