Re: [netmod] instance file parsing

Robert Wilton <rwilton@cisco.com> Tue, 04 December 2018 17:14 UTC

Return-Path: <rwilton@cisco.com>
X-Original-To: netmod@ietfa.amsl.com
Delivered-To: netmod@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F2481130F65 for <netmod@ietfa.amsl.com>; Tue, 4 Dec 2018 09:14:58 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -15.959
X-Spam-Level:
X-Spam-Status: No, score=-15.959 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_MED=-1.459, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_HI=-5, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, USER_IN_DEF_DKIM_WL=-7.5] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=cisco.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id LvmgmhEmFAwe for <netmod@ietfa.amsl.com>; Tue, 4 Dec 2018 09:14:57 -0800 (PST)
Received: from aer-iport-2.cisco.com (aer-iport-2.cisco.com [173.38.203.52]) (using TLSv1.2 with cipher DHE-RSA-SEED-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CEB01130EC9 for <netmod@ietf.org>; Tue, 4 Dec 2018 09:14:56 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=5968; q=dns/txt; s=iport; t=1543943697; x=1545153297; h=subject:to:references:from:message-id:date:mime-version: in-reply-to:content-transfer-encoding; bh=mAD/Hnl6fpF3hOco4ybtkg3RiYcd7C35FCNyT8FswL0=; b=H71GlhhfcKEPDtWpn+TPt6igmJntni6a3QxSFKEQEef2xVnC/3WqplZn 2NXb37EzLiKcAM4a1iWvzo5o4/AqQhmDO6zt+b8wOtq6h//RQb4WC6dUV lHGWOa5wLh/mxy0uD/6hkTrmnUx8R4A+71mDCcQeEViny6c0Z2hAEAuNH U=;
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: A0AHAAC1tAZc/xbLJq1kGQEBAQEBAQEBAQEBAQcBAQEBAQGBUwIBAQEBAQsBg1kSJ4N5iHiNEAgll0mBeg2EbAKDLDYHDQEDAQECAQECbSiFPQEFIw8BBVELEAgCAiYCAlcGAQwGAgEBF4MGggKlHYEvhUCEb4ELiyqBQD+BOAyCX4RrgxqCNSICiR8+hVWRFwmROwYYgVuIECaHFYkGiQiGaYFNCCmBVTMaCBsVO4JskFs/AzCJHiuCIAEB
X-IronPort-AV: E=Sophos;i="5.56,314,1539648000"; d="scan'208";a="8576701"
Received: from aer-iport-nat.cisco.com (HELO aer-core-3.cisco.com) ([173.38.203.22]) by aer-iport-2.cisco.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 04 Dec 2018 17:14:54 +0000
Received: from [10.63.23.68] (dhcp-ensft1-uk-vla370-10-63-23-68.cisco.com [10.63.23.68]) by aer-core-3.cisco.com (8.15.2/8.15.2) with ESMTP id wB4HEs85000913; Tue, 4 Dec 2018 17:14:54 GMT
To: Andy Bierman <andy@yumaworks.com>, NetMod WG <netmod@ietf.org>
References: <CABCOCHQYR_iY7Lp=0m8D-GQ8+Rzuijaa8_41bJw6ZvWPc+Cw_w@mail.gmail.com> <b2ee0537-465a-fbc0-1b94-000da5993b05@cisco.com> <20181130192830.g2622izshwn4f55z@anna.jacobs.jacobs-university.de> <6281fd79-2284-46c0-350c-770dcdade29d@cisco.com> <20181203113633.v3nfuen66ngjjp6l@anna.jacobs.jacobs-university.de> <CABCOCHRC_Z7QJhV-sMrUyX_d49UM8C=x9rgOtTc9iP2Wz52+VQ@mail.gmail.com> <20181203181323.dfbtlkumwp7ui5m6@anna.jacobs.jacobs-university.de>
From: Robert Wilton <rwilton@cisco.com>
Message-ID: <357e815d-cf6e-c093-0212-947fc8c7344d@cisco.com>
Date: Tue, 04 Dec 2018 17:14:54 +0000
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.3.2
MIME-Version: 1.0
In-Reply-To: <20181203181323.dfbtlkumwp7ui5m6@anna.jacobs.jacobs-university.de>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: 8bit
Content-Language: en-US
X-Outbound-SMTP-Client: 10.63.23.68, dhcp-ensft1-uk-vla370-10-63-23-68.cisco.com
X-Outbound-Node: aer-core-3.cisco.com
Archived-At: <https://mailarchive.ietf.org/arch/msg/netmod/w3zw5ibXm49EkyJeUFMMr1swLVs>
Subject: Re: [netmod] instance file parsing
X-BeenThere: netmod@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: NETMOD WG list <netmod.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/netmod>, <mailto:netmod-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/netmod/>
List-Post: <mailto:netmod@ietf.org>
List-Help: <mailto:netmod-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/netmod>, <mailto:netmod-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 04 Dec 2018 17:14:59 -0000

On 03/12/2018 18:13, Juergen Schoenwaelder wrote:
> On Mon, Dec 03, 2018 at 09:56:48AM -0800, Andy Bierman wrote:
>> On Mon, Dec 3, 2018 at 3:36 AM Juergen Schoenwaelder <
>> j.schoenwaelder@jacobs-university.de> wrote:
>>
>>> On Mon, Dec 03, 2018 at 10:21:20AM +0000, Robert Wilton wrote:
>>>> On 30/11/2018 19:28, Juergen Schoenwaelder wrote:
>>>>> I doubt it makes sense to carry an entire yang library schema with the
>>>>> instance data. The modules are actually known via the namespaces.
>>>> Knowing the module name or namespace isn't sufficient to be able to parse
>>>> the data:
>>>>   - E.g. if the contents are in the format of RFC7895bis, but the client
>>>> tries to parse it using the schema from RFC7895, then it will fail.
>>>>   - Once NBC changes are allowed via YANG versioning it will become more
>>>> necessary to know the revision or version of the modules to be able to
>>> parse
>>>> the data.  E.g. not even just the latest module revision will necessarily
>>>> work.   It is entirely plausible that different server revisions may be
>>>> generating instance data using slightly different schema.
>>> Perhaps this is a problem of the versioning solution then. ;-)
>>>
>>>> If the server has deviations then the client may also need to know those
>>>> deviations to correctly parse the file.
>>>>
>>>> So, I think that it is pretty clear that knowing the right schema is
>>>> required to be able to correctly parse and interpret the instance data.
>>>>
>>>> I think that there are 4 ways that this can be achieved:
>>>>   1) embedded the necessary schema information into the YANG instance
>>> data.
>>>>   2) put the necessary schema information online somewhere and have a URI
>>>> reference to it.
>>>>   3) some combination of (1) and (2), e.g. packages defined centrally,
>>> with
>>>> deviations listed in the file.
>>>>   4) the schema is determined using some out of bound mechanism, or
>>> possibly
>>>> it is hard coded.
>>> Perhaps we need to define first what "parse" means. I am not sure
>>> parsing requires to know all schema details. Anyway, if all schema
>>> details are needed, then the simplest solution seems to be to read the
>>> schema from another instance file. This then only requires to
>>> bootstrap the schema model version.
>>>
>>>
>> I agree with Rob.
>> The solution has to support accurate parsing of the instance data.
>> This means that the parser has the correct schema tree to use for
>> validating an instance document against the schema.
>>
>> Since the new yang-library can have a different schema tree for each
>> datastore,
>> clearly the option of specifying the datastore is needed.
> We need to get the terminology straight. I can parse XML and JSON
> files without the need to have schema information available. So I
> think the word 'parsing' is quite misleading here. And I can extract
> data out of XML and JSON files that I find interesting without schema
> information as well. So its a certain type of tools that may take
> advantage of being schema aware but not all tools need to be schema
> aware.
>
>>>> I don't think that there is a one size fits all answer here.  I think
>>> that
>>>> that it depends on the use case.  Certainly, I think that facilitating 1
>>> -3
>>>> is useful, but they should be optional rather than mandatory.  I.e.
>>> defining
>>>> nodes for these doesn't seem to cost much if a server isn't obliged to
>>>> populate them.
>>>>
>>>> I do think that YANG packages (themselves defined in instance data
>>>> documents) could be very helpful here.  I.e. rather than listing all the
>>>> modules, instead list the packages + any deviations from that.  I'm
>>>> presuming here that the definitions of the packages are available via a
>>> URI.
>>>
>>>
>> Yes -- this has been my goal for YANG Packages since the start.
>> By using nested packages and offline caching, the entire YANG library for a
>> device
>> could be recognized in a few URIs.
> An instance file storing the schema tree is an offline cache. It falls
> out of the instance file format for free. Yes, packages are yet
> something entirely different but so far no work in this direction is
> chartered so we should get instance file formats defined without
> solving another bigger problem first.

In the IETF 103 discussion on YANG versioning, it seemed to be the 
consensus in the room that the versioning design team need to consider 
also versioning of sets of modules, rather than just individual 
modules.  I.e. something along the lines of YANG packaging.

Thanks,
Rob


>
>
>>>>>    If
>>>>> you want to capture the schema, dump the relevant yang library into
>>>>> another instance file.
>>>> That just means another file to carry around and manage.
>>> Yes, but compared to solutions that require new and/or much more
>>> elaborate data formats, this is very cheap and efficient, in
>>> particular for systems where the schema itself is relatively
>>> static. Also in terms of engineering time this is a rather cheap
>>> solution since you do not need to invent a new way to document and
>>> communicate a schema.
>>>
>>>
>> I agree the cost of an extra instance document is not a problem, especially
>> since
>> it would be optional and only used if the defaults are not sufficient:
>>     - latest revision of module will be OK
>>     - assume all features present will be OK
>>     - assume no deviations will be OK
>>
>> If the defaults are not OK, then the parser will incorrectly accept or
>> reject certain instance data,
>> unless the correct schema tree is obtained somehow.
> Or the tool processing instance file content just does not care about
> the schema and it simply assumes that a certain (module,path)
> combination has fixed semantics. I am big fan of being able to use
> generic tools to filter and process XML or JSON encoded data.
>
> /js
>