Re: [Json] JSON Schema Language

Rob Sayre <sayrer@gmail.com> Wed, 29 May 2019 03:54 UTC

Return-Path: <sayrer@gmail.com>
X-Original-To: json@ietfa.amsl.com
Delivered-To: json@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 221D61200FE for <json@ietfa.amsl.com>; Tue, 28 May 2019 20:54:47 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.998
X-Spam-Level:
X-Spam-Status: No, score=-1.998 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id G79AVeYvinNK for <json@ietfa.amsl.com>; Tue, 28 May 2019 20:54:43 -0700 (PDT)
Received: from mail-it1-x12c.google.com (mail-it1-x12c.google.com [IPv6:2607:f8b0:4864:20::12c]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1F52B120090 for <json@ietf.org>; Tue, 28 May 2019 20:54:43 -0700 (PDT)
Received: by mail-it1-x12c.google.com with SMTP id m3so1413675itl.1 for <json@ietf.org>; Tue, 28 May 2019 20:54:43 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=p57yF5BM+YdWIOlY3HJLumYcio4YXBobgMW+U6nXBWc=; b=MJP2tPEEysSoyZ0SLCm8AcTaz1R2a4WI/Img+gYEOG9pFXtRP3bvAi2lwWOK9DovaH Np87ZCodUwkgv0s/dQzxFJ8dvMBLhFgRKMQ/NGhmmAFM5uTnikwJ6Nr/qsPMaH51JWAC v0Y+XxtPedkM9nKPSTtO/j7BbqcV0tLo09nNgxWaLU7Cfh0vdyRx5+wmOm3SFyN6dhrB R+xTkz2SWStOgfOpgNtgeRyHQomN6/P/wG3m0CepqbbmQ04fVXt4dnqLDDmvFQtXp15C n7UiXIdZuqi109HUfsflOhEAwvqv9Pw3sCWglPe+BfuPYmBCHgpXf/9tFLFWLUUtPyNx RMGw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=p57yF5BM+YdWIOlY3HJLumYcio4YXBobgMW+U6nXBWc=; b=oWPM8uKHR6LV7PqqxFTgktqaMN+qAVlF8qrwtL5dRvNZkLEnRbWRFtgi85/dD3Zbqm RubfTNhqBKVLvCtLT2S9ZCszr4GQHMw7TcmCMLEKoWucBhLqVI3nrwvvRh0BMYOlkGFx K+hf2PHrtBbXIvRF+7nS4v2jECs66mR4nHKMJ5FC1rMgptEM/1dW71E5LWq0ngmXGJvI cnBB2GoqbhTSY9y+WnzJC34pPaBza/x4nkikUIKwLsBKa5mb3CIsGolMvo3lkcW1Gtjp FpnXYFAzlUmrhCN83H1a+lsb3SNt3/o6PvN9JAn8lg+sgOCPOHpTiS20YqC3xBA9Fmyh eexw==
X-Gm-Message-State: APjAAAXZAl21a8TlH3GILoDMDymTPqLfXz7CRiukFLCnKcpn+v8xFaUg clb0dNxqD3ql09269fgG56K8E4LyAEVtRgczjhM=
X-Google-Smtp-Source: APXvYqyD+wMPPI2J8fuh2Q/ZQa2Wp9cwqK0wKREW8hsmh5+fd1tKULhsB2w0+a8Ke17WWyLnkV6WTn8TXSM3+USkOcM=
X-Received: by 2002:a05:660c:392:: with SMTP id x18mr5439294itj.89.1559102082245; Tue, 28 May 2019 20:54:42 -0700 (PDT)
MIME-Version: 1.0
References: <CAJK=1RjV1uv0eOdtFZ8cKn-FfCwCiGP5r2hOz1UamiM6YV4H1A@mail.gmail.com> <CAHBU6itE8kub1qtdRoW8BqxaOmzMv=vUo1aDeuAr3HX141NUGg@mail.gmail.com> <77994bdb-a400-be90-5893-b846a8e13899@gmail.com> <20190507154201.GP21049@localhost> <CEF72901-5077-4305-BA68-60624DCE952D@bzfx.net> <69ea0c99-e983-5972-c0aa-824ddeecb7c4@dret.net> <CAMm+LwjyVjnJuWE4+a9Ea=_X1uuEGuK+O4KojzN3uVQ+s+HqUQ@mail.gmail.com> <058f58a3-dd27-998e-5f54-4874aff5f2f0@dret.net> <20190507221726.GR21049@localhost> <CAJK=1Rj7PBD-bbwvsqgjQQzp4Aoidb-W2q5Lj6asMHHDHaTVYQ@mail.gmail.com> <646abf11-496b-c120-45d6-2a1aeab051a8@codalogic.com> <8224451C-F21B-41E5-A834-A9005050CB1F@tzi.org> <CAJK=1RjdYD6TZCNrw=H3d9ZLKLxZZOwVCOYYPwfbP+1ETDDz1Q@mail.gmail.com> <11CDA7F6-30BB-40E4-8926-2EDCBCFD785B@tzi.org> <CAHBU6iv8ZsFM5yco5gi+gcyU8d=u3bOSgiKaF6-hv-GARgNh9w@mail.gmail.com> <CAChr6SwNvG4Z7TKUxAVeH7HMVWiPsEBNb12K9zVkjaGt2_v0fw@mail.gmail.com> <CAHBU6ivTD_v7L-wQ+P9TmSfBY=5N+k-caaZ0TZhg6yZ_SWR_aA@mail.gmail.com> <CAChr6SzD8qdETafQKKU41BcYayTWf+C4GENd9FNzy5JYOv5jRQ@mail.gmail.com> <CAHBU6isx5aB94U-vn_t6GGoQ9W+ATDNYR6_+CtXgOhFho5Qh-g@mail.gmail.com>
In-Reply-To: <CAHBU6isx5aB94U-vn_t6GGoQ9W+ATDNYR6_+CtXgOhFho5Qh-g@mail.gmail.com>
From: Rob Sayre <sayrer@gmail.com>
Date: Tue, 28 May 2019 20:54:30 -0700
Message-ID: <CAChr6SzkAV5MPGt1px=FCb=j0egWabNZDvu7=07m1iW3+RQyLw@mail.gmail.com>
To: Tim Bray <tbray@textuality.com>
Cc: Carsten Bormann <cabo@tzi.org>, JSON WG <json@ietf.org>, Ulysse Carion <ulysse@segment.com>
Content-Type: multipart/alternative; boundary="000000000000c7f60d0589febd92"
Archived-At: <https://mailarchive.ietf.org/arch/msg/json/6IX89zwx66sxHvELKYEl6t6CR-0>
Subject: Re: [Json] JSON Schema Language
X-BeenThere: json@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "JavaScript Object Notation \(JSON\) WG mailing list" <json.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/json>, <mailto:json-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/json/>
List-Post: <mailto:json@ietf.org>
List-Help: <mailto:json-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/json>, <mailto:json-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 29 May 2019 03:54:47 -0000

On Tue, May 28, 2019 at 8:47 PM Tim Bray <tbray@textuality.com> wrote:

> TBH, what I want from a schema system is (a) useful error messages and (b)
> ability to drive code generation, classes and serializer/deserializers and
> so on.
>

No problem. If it can drive serializer/deserializers, let's spec out some
conformance tests.

- Rob


> On Tue, May 28, 2019 at 8:40 PM Rob Sayre <sayrer@gmail.com> wrote:
>
>> On Tue, May 28, 2019 at 8:23 PM Tim Bray <tbray@textuality.com> wrote:
>>
>>> What do you mean by “Output specification”?
>>>
>>
>> Fair question. I'd say a good example might be the objects transferable
>> across postMessage() in browser JavaScript.
>>
>>
>> https://html.spec.whatwg.org/multipage/structured-data.html#transferable-objects
>>
>> It seems like a deep problem at first, but it isn't. Those message
>> formats add Date objects and a few other things, but not that much.
>>
>> - Rob
>>
>>
>>
>>
>>
>>> On Tue, May 28, 2019 at 8:01 PM Rob Sayre <sayrer@gmail.com> wrote:
>>>
>>>> I'm not sure this schema language will be productive as a source format
>>>> in the long term, but I don't think it will be harmful. So, it's not worth
>>>> an objection. The worst case would be that servers claim to conform to this
>>>> schema language, but fail to serve schema-conformant JSON in practice. This
>>>> seems like the likely outcome, in the absence of an output specification.
>>>>
>>>> I have to say, I think I am uniquely qualified to raise this point. I
>>>> objected to the original JSON Stringify specification, which was something
>>>> pretty close to string concatenation, and got it corrected to something
>>>> close to its current form.
>>>>
>>>> If a schema language doesn't provide an output specification, it's
>>>> probably not going to work in practice. Best case, it's a standards
>>>> checkbox.
>>>>
>>>> - Rob
>>>>
>>>>
>>>> On Tue, May 28, 2019 at 7:41 PM Tim Bray <tbray@textuality.com> wrote:
>>>>
>>>>> So Carsten, I am a person who would benefit from a super-simple JSON
>>>>> schema langauge; in effect, something like JSL + enums + timestamps.
>>>>> Furthermore, as I've said, I'm willing to invest some IETF-work cycles to
>>>>> get such a thing, in the event we could interest the community.  Do you
>>>>> think that doing such a JSL and defining it using established CDDL
>>>>> semantics might be a good idea?
>>>>>
>>>>> Alternately, might we retain CDDL syntax and define a profile/subset
>>>>> which would be appropriate for us JSON-only simpletons?
>>>>>
>>>>> I'm not terribly fussy about mechanisms.
>>>>>
>>>>>
>>>>>
>>>>> On Fri, May 10, 2019 at 12:16 AM Carsten Bormann <cabo@tzi.org> wrote:
>>>>>
>>>>>> On May 10, 2019, at 05:55, Ulysse Carion <ulysse@segment.com> wrote:
>>>>>> >
>>>>>> > Carsten,
>>>>>> >
>>>>>> > Do you think that our difference in opinion on CDDL vs JSON Schema
>>>>>> > Language may be attributable to a difference in requirements?
>>>>>>
>>>>>> Hi Ulysse,
>>>>>>
>>>>>> I’m not even sure we disagree.  That was why I became interested in
>>>>>> converting between JSL and CDDL.  As I was trying to show, CDDL might make
>>>>>> a fine presentation language for JSL, and JSL might be a nice “profile” for
>>>>>> CDDL that is very simple to process.
>>>>>>
>>>>>> > It seems to me that your use-case is centered around defining
>>>>>> > standards and other complex data requirements. CDDL is, in my view,
>>>>>> > unquestionably a better choice for this use-case. In my mind, CDDL
>>>>>> is
>>>>>> > ABNF for CBOR, and that is undeniably what standards dealing with
>>>>>> CBOR
>>>>>> > or its near-equivalents require. The existing references, from RFCs,
>>>>>> > to CDDL are testament to this.
>>>>>>
>>>>>> Yes, you are describing the intention correctly.  I would add that
>>>>>> CDDL has proven as useful for describing pure JSON protocols as for CBOR.
>>>>>>
>>>>>> Not all JSON/CBOR protocols need the full capabilities of CDDL.  For
>>>>>> instance, the example in the CDDL spec for RFC 7071 could easily be
>>>>>> expressed in JSL, except for two details: reputation-object is not meant to
>>>>>> be extensible (reputon is), and there are some value constraints (some
>>>>>> values are integers).
>>>>>>
>>>>>> > But I (and I suspect Tim) am more preoccupied solely with defining
>>>>>> the
>>>>>> > mundane sorts of messages that come out of JSON event processing and
>>>>>> > repetitive JSON APIs. Tim has blogged (see link in my original
>>>>>> email)
>>>>>> > about dealing with AWS's CloudWatch events. That's messages that
>>>>>> look
>>>>>> > like this:
>>>>>> >
>>>>>> >
>>>>>> https://docs.aws.amazon.com/AmazonCloudWatch/latest/events/EventTypes.html
>>>>>> >
>>>>>> > Tons of messages, and frequently being added and updated. But none
>>>>>> of
>>>>>> > these messages are particularly exciting from a schema perspective.
>>>>>>
>>>>>> Well, I just had a look at (randomly selected)
>>>>>>
>>>>>> https://docs.aws.amazon.com/AmazonCloudWatch/latest/events/EventTypes.html#health-event-types
>>>>>> You’d need to add enumerations to JSL.  There are also timestamps in
>>>>>> the object (ironically in two different formats).  There is a table that
>>>>>> maps language tags to messages in that language.  (And the second and third
>>>>>> example have a missing bracket.)  But I can’t really say, because the
>>>>>> description by example only just begins to expose the actual intention.
>>>>>>
>>>>>> > CDDL can do much more than is necessary for merely representing
>>>>>> > CloudWatch events. This may seem like a good thing, but such excess
>>>>>> > capability reduces the suitability of the solution. JSON Schema
>>>>>> > Language is intentionally small and scuttled in scope, so as to
>>>>>> > simplify code and UI generation. By being so limited in scope, JSON
>>>>>> > Schema Language fits more easily into the architecture of a system
>>>>>> > that would like to integrate it.
>>>>>>
>>>>>> I’ve seen my share of developments that start simple.  How much
>>>>>> functionality will be added to JSL before it becomes a standard?  Also, the
>>>>>> law of extensibility tells us it will be extended even after becoming a
>>>>>> standard.
>>>>>>
>>>>>> Of course, in its domain, CDDL is incredibly simple.  Compare to JCR:
>>>>>> JCR is about three times as complex as CDDL.  This is because CDDL was
>>>>>> built from a few very simple building blocks, which combine nicely to
>>>>>> provide its functionality.  JCR is more of an accretion of features, which
>>>>>> in sum can do most of what CDDL can do.
>>>>>>
>>>>>> But back to JSL and CDDL:
>>>>>> What I’m interested in is what are the sweet spots on this
>>>>>> functionality vs. complexity continuum.  I think we have found two of these
>>>>>> sweet spots (at least maybe after a little more calibration).  Now how do
>>>>>> we handle the onslaught of applications that don’t quite fit the sweet
>>>>>> spots?
>>>>>>
>>>>>> The question that intrigues me: Is it possible to define something
>>>>>> that is as simple as JSL when you need just that, but allows dipping into
>>>>>> the capabilities of something like CDDL where needed?
>>>>>>
>>>>>> By the way: You may not be aware of the WISHI activity we have in the
>>>>>> T2TRG (thing-to-thing research group) of the IRTF.  Here we look at
>>>>>> modeling (not just for data) and at translating between different modeling
>>>>>> approaches.  http://wishi.space if you want to have a look.
>>>>>>
>>>>>> Grüße, Carsten
>>>>>>
>>>>>> _______________________________________________
>>>>>> json mailing list
>>>>>> json@ietf.org
>>>>>> https://www.ietf.org/mailman/listinfo/json
>>>>>>
>>>>> _______________________________________________
>>>>> json mailing list
>>>>> json@ietf.org
>>>>> https://www.ietf.org/mailman/listinfo/json
>>>>>
>>>>