Re: [Json] Schema Requirements (Was: Re: Nudging the English-language vs. formalisms discussion forward)

Phillip Hallam-Baker <hallam@gmail.com> Thu, 20 February 2014 18:31 UTC

MIME-Version: 1.0
In-Reply-To: <CAK3OfOiogM36fR9oobh3D61ybsV6ZVbTb+WGjD8OZ71ALey5Qw@mail.gmail.com>
References: <C87F9B96-E028-4F0E-A950-B39D3F68FFE7@vpnc.org> <CAMm+LwhUh_yN-hzaoDWfrO_H2iGvYvj99BCE4EcYmgqCPqXoVQ@mail.gmail.com> <CAHBU6itpttXBfVQGKw=u==k_XSdrht81+m_YDNZP6RM+=9CNow@mail.gmail.com> <CAK3OfOjHkBFOzJSx=bhhoQJ8Z2bWyEXK52dNyYGWVb9FAj99ow@mail.gmail.com> <CAHBU6itzQ0rzU3EUYUqzm2qhx03qk1mpx2sehS_zeiw1ypcEgw@mail.gmail.com> <CAK3OfOhfjkbq6eREkt=MBVL1C9ubh-6My3Lvg-mnOxD0+cpN1Q@mail.gmail.com> <CAHBU6isZbew8O1HJ+XcFsMCR42iDoO_uemPXVwa3=vM5A=MngA@mail.gmail.com> <CAK3OfOgmVsNJqrqCfsD7h37axssOoaX3DGHqO=bTn5bWrA+MFA@mail.gmail.com> <A4B53816-6FBF-4A37-8BC9-F0A9D0867BCD@tzi.org> <357740A8AA0F4316BE630917321FAB4D@codalogic> <B1EBE05A69362F001777F807@cyrus.local> <47BB9131737D42218A6382DEF45BBE2C@codalogic> <CAMm+LwgmHjoLu2=zTOERN8LO74hWpp45yy2epd2JzqDRM9oFfg@mail.gmail.com> <AF211B67DB3D453D9DE8F8FA53886F73@codalogic> <CAMm+LwguTBkGQBHN+e2kU6XxECsic9Kcvda+7X6KDNe0TQxq4w@mail.gmail.com> <FE06CD427A4044B995F57C4926A1C8C2@codalogic> <CAK3OfOiogM36fR9oobh3D61ybsV6ZVbTb+WGjD8OZ71ALey5Qw@mail.gmail.com>
Date: Thu, 20 Feb 2014 13:30:59 -0500
Message-ID: <CAMm+LwhTprkCHBhK=xxZrqKLR+b3zE3K71MbZt+gTgAC9OxvBA@mail.gmail.com>
From: Phillip Hallam-Baker <hallam@gmail.com>
To: Nico Williams <nico@cryptonector.com>
Content-Type: multipart/alternative; boundary="001a113470d2f4ce3f04f2dab481"
Archived-At: http://mailarchive.ietf.org/arch/msg/json/kfELKdN3qXfFB2lMkwN_xPP_brA
Cc: Carsten Bormann <cabo@tzi.org>, Pete Cordell <petejson@codalogic.com>, JSON WG <json@ietf.org>
Subject: Re: [Json] Schema Requirements (Was: Re: Nudging the English-language vs. formalisms discussion forward)
Precedence: list

On Thu, Feb 20, 2014 at 12:22 PM, Nico Williams <nico@cryptonector.com>wrote:

> On Thu, Feb 20, 2014 at 10:55 AM, Pete Cordell <petejson@codalogic.com>
> wrote:
> > My position is that, having recognised that Dates represent a case where
> > microformats are useful, perhaps we should not assume that these are the
> > only cases.  IP addresses?  Crypto OIDs?  Dates on Mars?
>
> There are two ways to deal with alternative representations:
>
>  - convert to/from a canonical representation and use that one for
> interchange
>  - use a discriminated union (XDR) / CHOICE (ASN.1, same thing)
>
> I think any decent schema will need to allow for the latter, and not
> just because of types that have multiple possible representations.
>

In the code as it is right now it is possible to specify different formats
as different tags:

ConvolutedTime Structure
    DateTime                Date
    Format RFC1123     Date1
    Format RFC822       Date2

Would allow for any of the following:
    {"Date":"2002-10-02T10:00:00Z"}
    {"Date1":"Thursday, 20 Feb 2014 18:14:16 GMT"}
    {"Date2":"Thu, 20 Feb 2014 18:14:16 GMT"}

What I don't support at the moment is a constraint that says that only one
of the "Date", "Date1" and "Date2" tags is permitted. This could be
specified as:

ConvolutedTime Choice
    DateTime                Date
    Format RFC1123     Date1
    Format RFC822       Date2

[Protogen does allow braces to be used instead of indentation to denote
blocks but does not need end of statement or statement separators. If
people really insist on semicolons I can add a production to the lexer so
they are just treated like whitespace.]

 - which grammar parsing algorithms we want to support: LR, LALR(1),
> LALR(k), GLR, ...
>

I would hope we make the syntax so simple that we don't need the power of
LR parsers. They are models of human languages rather than computer
languages.

>  - the basic metaphor:
>     - types! (a-la ASN.1, only without the tags, no IOS, ...)
>     - pattern matching rules! (something like collections of
> XPath-like expressions)
>     - something else
>

Since the target languages are likely to be C#, JS and Java, I suggest that
we use dot separated tags and array indexes as path extractors.

Given:

{"first" : {"second" : {"third" : [{ "fourth": 1}, { "fourth": 42}, {
"fourth": 666}]}}}

first.second.third[1].fourth = 42

> The metaphor thing gets to, in part, the purpose of the schema:
>
>  - documentation for sure, validation no doubt, but, code generation
> (into C, Java, JS, C#, ...)?  (IMO: yes)
>

I have code generation for C# and C. I can add Java without difficulty.

It is not clear to me that code generation is necessary or particularly
useful for scripting languages with late binding like JS. In those
situations you can just say

X = JSON_Parse (text)

Answer = first.second.third[1].fourth

> Finally:
>
>  - extensibility (meta: to have it or not; if yes, how)
>

Hopefully formats are enough.

 - modularity (meta: to have it or not; if yes, how)
>

I am adding a Using directive right now. It seems to do the trick.

> Oh, and:
>
>  - one schema language, or more?  (IMO: inevitably we'll end up with more)
>

There will be multiple schemas. I think the progression will work like it
did for JSON itself. People will choose the one they like and there will
eventually be a convergence.

-- 
Website: http://hallambaker.com/

[Json] Nudging the English-language vs. formalism… Paul Hoffman
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… Paul Hoffman
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
[Json] Nudging the English-language vs. formalism… Paul Hoffman
Re: [Json] Nudging the English-language vs. forma… Tim Bray
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
Re: [Json] Nudging the English-language vs. forma… Tim Bray
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… John Cowan
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
Re: [Json] Nudging the English-language vs. forma… Tim Bray
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… Tim Bray
Re: [Json] Nudging the English-language vs. forma… John Cowan
Re: [Json] Nudging the English-language vs. forma… Tim Bray
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… Pete Cordell
Re: [Json] Nudging the English-language vs. forma… Pete Cordell
Re: [Json] Nudging the English-language vs. forma… Tim Bray
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
Re: [Json] Nudging the English-language vs. forma… Tatu Saloranta
Re: [Json] Nudging the English-language vs. forma… Carsten Bormann
Re: [Json] Nudging the English-language vs. forma… John Cowan
Re: [Json] Nudging the English-language vs. forma… John Cowan
Re: [Json] Nudging the English-language vs. forma… Barry Leiba
Re: [Json] Nudging the English-language vs. forma… Mark Nottingham
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
Re: [Json] Nudging the English-language vs. forma… Andrew Newton
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
Re: [Json] Nudging the English-language vs. forma… Pete Cordell
Re: [Json] Nudging the English-language vs. forma… Barry Leiba
Re: [Json] Nudging the English-language vs. forma… Bjoern Hoehrmann
Re: [Json] Nudging the English-language vs. forma… John Cowan
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… Manger, James
Re: [Json] Nudging the English-language vs. forma… Tim Bray
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
Re: [Json] Nudging the English-language vs. forma… Mark Nottingham
Re: [Json] Nudging the English-language vs. forma… Nico Williams
Re: [Json] Nudging the English-language vs. forma… Cyrus Daboo
Re: [Json] Nudging the English-language vs. forma… Andrew Newton
Re: [Json] Nudging the English-language vs. forma… Paul Hoffman
Re: [Json] Nudging the English-language vs. forma… Pete Cordell
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
Re: [Json] Nudging the English-language vs. forma… John Cowan
Re: [Json] Nudging the English-language vs. forma… Pete Cordell
Re: [Json] Nudging the English-language vs. forma… Phillip Hallam-Baker
[Json] Schema Requirements (Was: Re: Nudging the … Pete Cordell
Re: [Json] Schema Requirements (Was: Re: Nudging … Phillip Hallam-Baker
Re: [Json] Schema Requirements (Was: Re: Nudging … Nico Williams
Re: [Json] Schema Requirements (Was: Re: Nudging … Nico Williams
Re: [Json] Schema Requirements (Was: Re: Nudging … Phillip Hallam-Baker
Re: [Json] Schema Requirements (Was: Re: Nudging … Nico Williams
Re: [Json] Schema Requirements (Was: Re: Nudging … Pete Cordell
Re: [Json] Schema Requirements (Was: Re: Nudging … Phillip Hallam-Baker
Re: [Json] Schema Requirements (Was: Re: Nudging … Pete Cordell
Re: [Json] Schema Requirements (Was: Re: Nudging … Nico Williams
Re: [Json] Schema Requirements (Was: Re: Nudging … Pete Cordell
Re: [Json] Schema Requirements (Was: Re: Nudging … Nico Williams
Re: [Json] Schema Requirements (Was: Re: Nudging … Pete Cordell