Re: [Json] Proposed minimal change for duplicate names in objects

Tim Bray <tbray@textuality.com> Sun, 07 July 2013 04:58 UTC

From: Tim Bray <tbray@textuality.com>
To: Tatu Saloranta <tsaloranta@gmail.com>
Cc: Nico Williams <nico@cryptonector.com>, Jim Schaad <ietf@augustcellars.com>, "json@ietf.org" <json@ietf.org>

I'll assume you're right when you say dupe detection has been measured as
expensive at run time, but if I were writing a reader I'd implement a hash
table with a test-and-set method, so I admit I'm surprised by the finding.
I think I'd need to see a little more research before I'd accept that as a
given.  -T
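For what it's worth, the kind of test-and-set check I have in mind can be
sketched in Python using the standard library's `object_pairs_hook` (the
hook name `reject_duplicates` is mine, not from any spec):

```python
import json

def reject_duplicates(pairs):
    """object_pairs_hook: build the object, failing fast on duplicate keys."""
    seen = set()
    obj = {}
    for key, value in pairs:
        if key in seen:        # test: one hash lookup per key
            raise ValueError("duplicate key: %r" % key)
        seen.add(key)          # set: record the key as used
        obj[key] = value
    return obj

# Unique keys parse normally; a duplicate raises ValueError.
print(json.loads('{"a": 1, "b": 2}', object_pairs_hook=reject_duplicates))
```

With this hook, `json.loads('{"a": 1, "a": 2}', ...)` raises ValueError
instead of silently keeping the last value, at the cost of one extra hash
lookup per member.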


On Sat, Jul 6, 2013 at 9:37 PM, Tatu Saloranta <tsaloranta@gmail.com> wrote:

> On Sat, Jul 6, 2013 at 7:57 PM, Nico Williams <nico@cryptonector.com> wrote:
>
>> On Sat, Jul 6, 2013 at 8:44 PM, Tim Bray <tbray@textuality.com> wrote:
>> > This feels like a no-brainer to me, but that's probably because (as I've
>> > said before) I'm an API guy, and the only use for JSON objects in my
>> > world is to transfer a hash table or database record or whatever from
>> > here to there, and in such a situation dupes can never be useful or
>> > intended and can only be a symptom of breakage (or, in the JOSE case, a
>> > symptom of a malicious attack on my crypto).
>>
>> I agree.  As a security guy I would prefer that one way or another we
>> end up with no dup names, but as an "API guy" myself I think of the
>> streaming parsers (they offer an API, after all).  Just say the magic
>> words -- "to hell with minimal-state streaming parsers" -- or say
>> something to the effect that *some* component of a layered application
>> MUST reject objects with dup names.  It's one or the other; let's choose.
>>
>> I'm happy with "some component of a layered application MUST reject
>> objects with duplicate names" -- I prefer this to the "no minimal
>> state streaming parsers" alternative.
>>
>> I will assume that in general objects rarely have lots of names, so
>> that parsers need not keep that much state in order to check for dups.
>> Requiring parsers to reject objects with dup names is my second
>> choice.
>>
>
> Just to make sure: I also do not have any use for duplicates, and consider
> them flaws in processing. I have never used duplicates for anything, nor do
> I find that an interesting approach.
> My only real concern is that of mandating (or not) detection and/or
> prevention in the lowest-level components of commonly used processing
> stacks (low-level push/pull parsers, higher-level libraries, or app code
> that builds full representations), since this carries significant cost,
> based on extensive profiling I have done at this level.
>
> The case of application code directly using streaming parsers/generators
> is not nearly as common as that of frameworks using them to produce
> higher-level abstractions.
> These higher-level abstractions (JSON tree representations, binding to
> native objects) either report errors such as duplicates or at the very
> least can detect them and apply consistent handling. They can do this much
> more efficiently than low-level components, since they already have to
> build representations that then serve as the data structures for
> detecting/preventing duplicates.
>
> My specific concern is this: if the specification mandates detection
> and/or prevention for parsers and generators, without any mention that
> 'parser' and 'generator' are logical concepts (the thing that reads JSON,
> the thing that writes JSON), I will have lots of users who promptly demand
> that the low-level components enforce the checks.
> And then I get to spend lots of time explaining why such checks can (and
> IMO should) still be pushed to a higher level of processing. It is amazing
> how much FUD can be generated from a cursory reading of specifications.
>
> This concern extends to the suggested "Internet JSON messages" specification.
>
> I would like a simple statement that some component of the processing
> should/must detect and report duplicates, and prevent producing them; or,
> failing that, a lengthier explanation of what "parser" and "generator"
> mean ("parser" is such a horrible misnomer -- there is very, very little
> parsing involved; it's just a lexer plus an optional object builder -- but
> I digress).
>
> -+ Tatu +-
>
>