Re: [core] [Last-Call] Last Call: <draft-ietf-core-senml-more-units-02.txt> (Additional Units for SenML) to Proposed Standard

Ari Keränen <ari.keranen@ericsson.com> Wed, 30 October 2019 21:19 UTC

Return-Path: <ari.keranen@ericsson.com>
X-Original-To: core@ietfa.amsl.com
Delivered-To: core@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 2F1F112083C; Wed, 30 Oct 2019 14:19:52 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.002
X-Spam-Level:
X-Spam-Status: No, score=-2.002 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=ericsson.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id HrdI82wwh6i9; Wed, 30 Oct 2019 14:19:48 -0700 (PDT)
Received: from EUR03-DB5-obe.outbound.protection.outlook.com (mail-eopbgr40084.outbound.protection.outlook.com [40.107.4.84]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 9DA65120103; Wed, 30 Oct 2019 14:19:47 -0700 (PDT)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=kSc09Jtq79jf6fVmmpM/qfgXZ+O9tsWcgFkqa8eIN16l1TLcq1j7Fr2EUugZjvFQvzCwGypL3XAfmQk6k6HQ3c6RjjQfdfGVC8KR1zYenmseBiVx99s757JnW1I1ewqcojRfxNEIrC791FpJIaKaNjGtPNvw3E2VCMSN30b7No95JCjmap/PKPSnpCtlUWfy/3HMaIXmjQ0rFcr3WaJS0FPbzKwhjOP8gPPTJpknnJEzzZETJ0jFGn/SJ1ytSlg/U+4jwuGqujTzwNWvJ2ZKXyAvww+8sSvDwbgIOn/qukA2Acqzua5nYdaCFwqHsVFVxYi1qwuL6ZweGO/dVH2wHA==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=Uta+bigLjch8R6b/tvPxi2bMJ9+Agt+2lpaa6jszgJk=; b=mXrmFfJFFlxuvGCpqcREV0uJeknxFW7auA4WWhPQTDBG+dZzCK4Rk/2HQc3zGkaJlMp7WDPkNmA7DStnrvbFcctj7oFr1mWMZCHoK6z5fKR6IZOdcaVRSXtK74dS3bsXXEloak0csiBAy90be5O13KXc8M+9B8/ckH8/m9eVPSMT8cFM8a2vdGyZYrQb3QReU9v64gJMDRt8tWIIsmQ/vwx1W5rPUHDkmZ2+zuFK9kYHtBmupRiXqVogmqk7Yh2+l3MYY1/7LNx09aUOZVH4CwDRZk08MvkvsuW9hiPq/jEOYYZ5hFbclJkRHxgfWWYU62DzFO/KFKhBg/0a5vmVWg==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=ericsson.com; dmarc=pass action=none header.from=ericsson.com; dkim=pass header.d=ericsson.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ericsson.com; s=selector2; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=Uta+bigLjch8R6b/tvPxi2bMJ9+Agt+2lpaa6jszgJk=; b=EJerW86hnqtJNe9HDOvpCyZhhRGOpp6iBJsyHTpQQLa45lM7vbx+g9cEu7ul62E94BKtfdCKmoQjFerHHJYxlZyiuWxM/6mrFypb5ZqxS+8wkOGi9isIhBOIiCRuhOV2WIpEorVfIAA2nr2tYxXnlGOnO4ZKm5Ojs1Rgdgm3gW0=
Received: from HE1PR07MB4236.eurprd07.prod.outlook.com (20.176.166.145) by HE1PR07MB4332.eurprd07.prod.outlook.com (20.176.168.148) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.2408.13; Wed, 30 Oct 2019 21:19:44 +0000
Received: from HE1PR07MB4236.eurprd07.prod.outlook.com ([fe80::3840:ef4a:a3da:a6a6]) by HE1PR07MB4236.eurprd07.prod.outlook.com ([fe80::3840:ef4a:a3da:a6a6%6]) with mapi id 15.20.2408.014; Wed, 30 Oct 2019 21:19:44 +0000
From: =?utf-8?B?QXJpIEtlcsOkbmVu?= <ari.keranen@ericsson.com>
To: Cullen Jennings <fluffy@iii.ca>
CC: "draft-ietf-core-senml-more-units@ietf.org" <draft-ietf-core-senml-more-units@ietf.org>, IETF Crazy <ietf@ietf.org>, "core-chairs@ietf.org" <core-chairs@ietf.org>, "last-call@ietf.org" <last-call@ietf.org>, core <core@ietf.org>, Pete Resnick <resnick@episteme.net>
Thread-Topic: [core] [Last-Call] Last Call: <draft-ietf-core-senml-more-units-02.txt> (Additional Units for SenML) to Proposed Standard
Thread-Index: AQHVjzW5BLR+HRN1u0680p2MrqdK0KdzqiWA
Date: Wed, 30 Oct 2019 21:19:44 +0000
Message-ID: <F098D8A8-171D-41D1-B12E-5C44F82DF519@ericsson.com>
References: <DF78FB17-0DDA-471E-A28A-15AC0718F653@iii.ca>
Accept-Language: en-US
Content-Language: en-GB
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
user-agent: Microsoft-MacOutlook/10.1d.0.190908
authentication-results: spf=none (sender IP is ) smtp.mailfrom=ari.keranen@ericsson.com;
x-originating-ip: [2001:14bb:150:480b:b5ad:79ea:7a65:3809]
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: bdd9de24-b34f-4d09-1f51-08d75d7ee2d3
x-ms-traffictypediagnostic: HE1PR07MB4332:
x-ms-exchange-purlcount: 2
x-microsoft-antispam-prvs: <HE1PR07MB43328C51B43AE6947C0FFB6085600@HE1PR07MB4332.eurprd07.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:8882;
x-forefront-prvs: 02065A9E77
x-forefront-antispam-report: SFV:NSPM; SFS:(10009020)(4636009)(376002)(346002)(366004)(396003)(136003)(39860400002)(189003)(199004)(13464003)(6116002)(8676002)(2906002)(85182001)(76116006)(8936002)(66946007)(86362001)(81166006)(81156014)(6506007)(85202003)(6246003)(33656002)(14444005)(256004)(53546011)(6306002)(25786009)(186003)(6916009)(478600001)(4326008)(30864003)(6512007)(99286004)(46003)(486006)(5660300002)(6436002)(966005)(4001150100001)(36756003)(66574012)(229853002)(6486002)(54906003)(58126008)(102836004)(14454004)(66556008)(64756008)(66446008)(316002)(71200400001)(7736002)(305945005)(76176011)(2616005)(446003)(71190400001)(476003)(66476007); DIR:OUT; SFP:1101; SCL:1; SRVR:HE1PR07MB4332; H:HE1PR07MB4236.eurprd07.prod.outlook.com; FPR:; SPF:None; LANG:en; PTR:InfoNoRecords; MX:1; A:1;
received-spf: None (protection.outlook.com: ericsson.com does not designate permitted sender hosts)
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: evlTt7AFZXN3zsQV/AGAgUWJX1qWzpa+TNSm95Yjo+5rzuhQ/CHUJDPEA2WDgPL8rlkfEYwEb8JnIj6czdLhPnRoKTgftddONmiNV/8xNVT4Vn7ytrROv9k/G6YtKMMYXal+sJIPLV9s3/ltbbl64XVeiyaz7xNI3HcxzoOpQx0VO2UZBorRTj3qgwfwYHfUF2bX8XO+BMrVBOMwdKmHFEmkYmJY5a1G3msDkoCXBfLpwR36eGpz2/Ef7yEch0dbM7U/A6ccQF/DpeM4SVdqervYlN637ObPIWJkQdTPP+QJGZ/MozR5F4Pjde9tb4ErFhmVCR5mxeul67WQAlukhjvKZg0xNgLH3WGIA3PtNLOwn1AfcRimbxUfojzh1dxqLjSlggQ1cZC7FPgp6T+SENjhsr3GIO5K7EGujYR4RA47wwRPfsiFBSx2xq6Xpt/tWqvtJcBJrAf3zo24nti7ZVsbX02/Rad8QqESIYXSTzA=
x-ms-exchange-transport-forked: True
Content-Type: text/plain; charset="utf-8"
Content-ID: <3CA9CBAA8D28A64AB7ABDA48E3F15F99@eurprd07.prod.outlook.com>
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-OriginatorOrg: ericsson.com
X-MS-Exchange-CrossTenant-Network-Message-Id: bdd9de24-b34f-4d09-1f51-08d75d7ee2d3
X-MS-Exchange-CrossTenant-originalarrivaltime: 30 Oct 2019 21:19:44.6995 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 92e84ceb-fbfd-47ab-be52-080c6b87953f
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: wqQp9cSQK8NV4uwP9bo2ALQThVTj7RMMkupuwyLWxniyI2Mgqe9XFyD2y4ds3KF4B29QeVBXVP2DvlciFlSwAZBpJ4Mwkt8H2Ef49S5tVxg=
X-MS-Exchange-Transport-CrossTenantHeadersStamped: HE1PR07MB4332
Archived-At: <https://mailarchive.ietf.org/arch/msg/core/SRI36tMHMsuhso8qAnlXjZJgk88>
Subject: Re: [core] [Last-Call] Last Call: <draft-ietf-core-senml-more-units-02.txt> (Additional Units for SenML) to Proposed Standard
X-BeenThere: core@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Constrained RESTful Environments \(CoRE\) Working Group list" <core.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/core>, <mailto:core-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/core/>
List-Post: <mailto:core@ietf.org>
List-Help: <mailto:core-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/core>, <mailto:core-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 30 Oct 2019 21:19:52 -0000

Hi Cullen,

The key thing the secondary table enables is the use of legacy models with the units that they are already using (where changing the existing models is too high cost). New models and uses of SenML can avoid the problems you described below by sticking to the units in the primary registry. Perhaps the right solution here is to write more guidance into the draft to make this more clear.

And yes, also more guidance for the designated expert is very welcome. And I would argue that what is being measured belongs to another tag, not to the unit.


Cheers,
Ari

> On 30. Oct 2019, at 17.21, Cullen Jennings <fluffy@iii.ca> wrote:
> 
> If we are going to have add theses secondary scaled units, I do agree with Ari & Carten’s arguments that  having them in a secondary registry that is machine readable is the right way to do it. In fact would be better if IANA also made them available over git but that is a different topic. But here is the problems I see:
> 
> Currently interoperability in IoT is a mess and the 20+ SDOs in this space does not help. Us trying to meet the conflicting needs of all theses SDOs will also not help. So let me try and clarity some of the problems with this draft as it stands.
> 
> Imagine that today the secondary table has mm but not cm. I write my software to process mm and ship it. Next week cm is added, my software can’t process that, and in many IoT cases, it will take years to be upgraded. At this point, standardizing the table does nothing for anyone and is harmful in that it suggests that things will work together since they are senml but they will not. It’s true that a new unit nothing to do with distance could be added to main table but the way that table is done, that would be rare, and very unlikely my software needs it if my software deals with distance. However, if my software knows and uses m and mm, very likely the cm makes sense for it to use but it can’t. 
> 
> The argument for having this second table seems to be an argument to avoiding a single floating point multiple on the IoT decide. Keep in mind this is a device that probably supports EC crypto to send the data. I have not heard any argument that this single multiple is too high a burden on the device. 
> 
> The expert review advice for this new table is woefully inadequate for thd broad range of questions the expert will need to answer. For example, will we add “furlongs per fortnight”? why not? Will we add micro-fortnights as a unit? Several RFC use fortnights  (Thank you Dave Oran)
> 
> The expert review advise does not cover what is probably the single biggest unit issue with uses. That is dose the type of thing being measured belong in the unit in some cases. For example, is parts per million of oxygen a different unit that parts per million of carbon dioxide? I realize the unit purist would say those are both simply a ratio but if you look at how practitioners use this stuff, theses are totally different units. This draft does not cover how to deal with that. On a side note, I would see way more value of this secondary table if we were clear that we were going to allow both ppm_O2 and ppm_CO2 . However in the past, we argued that was better placed in the name in SENML or in a secondary tag value. 
> 
> As a trivial side note  …. the draft is just wrong on how it modifies the expert guidance from the 2nd registry. It says remove rules 4 and 5 from  RFC8428. However the examples we are talking about for this will also probably violate rules 1, 2, 3, 7, 8. I think this would have to be addressed before approving this draft. 
> 
> So let's get back to the fundamental issue. Why are we doing this? Some other SDO wants it is not really an answer. Understanding why the other SDO wants it would be a good answer. Do the reasons cause more good than interoperability problems that come out of it. I tried to get info from a few other SDO about this but I failed. I’d like to hear more about why we should do this. I’m not seeing big value in the saving the floating point calc on the device but perhaps there is a compelling use case where that matters that I am not thinking of. 
> 
> I think if I understood why people want this, it might change my mind. 
> 
> 
>> On Oct 29, 2019, at 2:58 PM, Ari Keränen <ari.keranen=40ericsson.com@dmarc.ietf.org> wrote:
>> 
>> Hi Pete,
>> 
>> Carsten replied to many of your questions/comments below, but perhaps one more point to mention is that the additional registry makes it simpler to build a system specification for SenML use where you can simply point that all implementations in that system should use the units from the first registry. Since all the units in the second registry have an equivalent non-scaled-and-shifted version in the first registry, that is rather feasible. Also many implementations are likely to "implement" only a small subset of the units since they would use them to simply check that a value is the kind of value they expected to receive. 
>> 
>> That said, the most important thing is to get these new units registered as soon as possible, in one or two registries, so that we don't hold back our stakeholders using SenML.
>> 
>> 
>> Cheers,
>> Ari
>> 
>>>> On 29. Oct 2019, at 16.16, Pete Resnick <resnick@episteme.net> wrote:
>>> 
>>> Hi Ari,
>>> 
>>> What is the upside to having a minimal set of unscaled units when all implementations are going to have to be able to interpret all of the entries from both registries? An implementation can't keep comparisons simple, because it's going to have to deal with all of the new entries anyway. If an implementation is allowed to not implement the ones in the second registry, then that does lead to the interoperability problems Cullen is worried about. Even if if the document says that you MUST be able to interpret values from both registries, the downside is the possibility that implementations might simply ignore the second registry. And once you have the second registry, I don't see that there's any reason to prefer any of the entries in the original registry: You have to be able to handle both, and since everyone has to handle both, there's not reason not to send them either.
>>> 
>>> Add the scale column to the main registry and use one registry. There's no upside to keeping two, as far as I can tell, and there's definitely the potential for interoperability problems.
>>> 
>>> pr
>>> 
>>>> On 28 Oct 2019, at 14:26, Ari Keränen wrote:
>>>> 
>>>> Hi Pete,
>>>> 
>>>> I don't have a strong objection for using the same registry for all units, but I think the second registry is a better approach since that allows us to keep the first registry in its original purpose: a minimal set of unscaled units where you would use the numeric part of the value to express the scale. This is still the best approach for many use cases / systems since it allows you to get by with a small amount of identifiers and keeps comparisons across values simple (you don't need to normalize values based on unit's scale before doing comparisons etc.). If we put all the units in the same registry, systems (or their specifications) that would like to have this property would have to state somehow which of the units in the single registry one should use.
>>>> 
>>>> Also the second registry has the "scale" column that indicates the relation (and enables automatic translation) to the units in the first registry. I guess we could get the same effect by extending the first registry with the scaling field, but at least to me the second registry feels like a cleaner option. And I don't see big downsides for having separate registries for these; both contain still valid SenML units.
>>>> 
>>>> 
>>>> Cheers,
>>>> Ari
>>>> 
>>>>>> On 28. Oct 2019, at 20.58, Pete Resnick <resnick@episteme.net> wrote:
>>>>> 
>>>>> Matt, Ari, and Harri,
>>>>> 
>>>>> While you give good support for adding these units, and I am OK with that idea, do you have any objection to adding them to the main registry, which was the other suggestion that Cullen made? I still have not seen a convincing argument not to make these part of the main registry.
>>>>> 
>>>>> pr
>>>>> 
>>>>>> On 28 Oct 2019, at 10:40, Gillmore, Matthew wrote:
>>>>>> 
>>>>>> Hi Cullen,
>>>>>> 
>>>>>> I am the chair of the IPSO working group in OMA and also support what Ari wrote.  During the last IETF in Montreal, stakeholders from the IPSO working group along with the Core working group met and it was consensus amongst both working groups that the second registry is how we would like to move forward.
>>>>>> 
>>>>>> Cheers,
>>>>>> 
>>>>>> Matt
>>>>>> 
>>>>>> -----Original Message-----
>>>>>> From: ietf <ietf-bounces@ietf.org> On Behalf Of Ari Keränen
>>>>>> Sent: Tuesday, October 22, 2019 6:37 PM
>>>>>> To: Cullen Jennings <fluffy@iii.ca>
>>>>>> Cc: core-chairs@ietf.org; IETF Crazy <ietf@ietf.org>rg>; core <core@ietf.org>rg>; draft-ietf-core-senml-more-units@ietf.org
>>>>>> Subject: Re: [core] Last Call: <draft-ietf-core-senml-more-units-02.txt> (Additional Units for SenML) to Proposed Standard
>>>>>> 
>>>>>> Hi Cullen,
>>>>>> 
>>>>>> Carsten already replied to some of these issues later in the thread, but let me add some more viewpoints.
>>>>>> 
>>>>>>>> On 22. Oct 2019, at 17.06, Cullen Jennings <fluffy@iii.ca> wrote:
>>>>>>> I am strongly opposed to the secondary registry - it will significantly harm interoperability. This could be resolved by not adding the items in the secondary repository or by adding them to the main repository.
>>>>>>> 
>>>>>>> Having a registry of stuff that may or may not work simply means that we will have less interoperability because what you can not decide what is valid SenML or not.  Sure some of the device vendors making things that produce SenML have thought this was a good idea in the past but I’d like to hear from a bunch of the people that have to process the SenML data that is produced and see if they think it is a good idea.
>>>>>> 
>>>>>> Both registries would work and their use would result in valid SenML. In a sense the second registry is almost the same as "adding them to the main repository", but having the second registry enables us to give better guidance which units to use/prefer and also provide the translation to the "main registry units".
>>>>>> 
>>>>>>> The working group discussed this for a long time and was against the items that are being added to the secondary repository.
>>>>>>> If there is a large group of people that agree the consensus has changed, then this should simply be put in the main registry. Many people believe that since SenML is not meant to be human readable, having a canonical form of just meters with a floating point number is better than having both km, mm, and god only knows what else.
>>>>>> 
>>>>>> Yes, the canonical form of "just meters" is still the recommended way in general, but that approach did not work as well as we hoped for all use cases. For example when your sensor value is always in integers of mm/h, it's more convenient and efficient to send integer values than multiples of 2.777e-7. Also in many use cases there is a well-established scaled unit that is used in the industry and forcing to use a different one didn't turn out to be very productive (we can still recommend, though, and that's one of the reasons why we are proposing an additional registry). Finally, legacy models that use non-scaled units but would like to use SenML as the wire format and units registry would be expensive to update to use only non-scaled units.
>>>>>> 
>>>>>>> I note the secondary repository add mm but not cm. How is this insanity going to work out? How will analytics code trying to process this data know what it can use or not use. If we are going down this path, this is the wrong way to do it. The right way is to define a set of all SI prefixes and say they can be used with any unit.
>>>>>> 
>>>>>> Any implementation should not try to use anything that is not in the IANA registry. We did also consider to enable all SI prefixes, but concluded that it is still useful to have the smallest reasonable set of units to facilitate interoperability. If there's a real use case that uses a unit with SI prefix that is missing, it can be simply registered. Also not all units in the additional registry would work with just different SI prefix (e.g., min or dBm).
>>>>>> 
>>>>>> 
>>>>>> Cheers,
>>>>>> Ari
>>>>>> 
>>>>>>> 
>>>>>>>> On Oct 16, 2019, at 11:42 AM, The IESG <iesg-secretary@ietf.org> wrote:
>>>>>>>> 
>>>>>>>> 
>>>>>>>> The IESG has received a request from the Constrained RESTful
>>>>>>>> Environments WG
>>>>>>>> (core) to consider the following document: - 'Additional Units for SenML'
>>>>>>>> <draft-ietf-core-senml-more-units-02.txt> as Proposed Standard
>>>>>>>> 
>>>>>>>> The IESG plans to make a decision in the next few weeks, and solicits
>>>>>>>> final comments on this action. Please send substantive comments to
>>>>>>>> the ietf@ietf.org mailing lists by 2019-10-30. Exceptionally,
>>>>>>>> comments may be sent to iesg@ietf.org instead. In either case, please
>>>>>>>> retain the beginning of the Subject line to allow automated sorting.
>>>>>>>> 
>>>>>>>> Abstract
>>>>>>>> 
>>>>>>>> 
>>>>>>>> The Sensor Measurement Lists (SenML) media type supports the
>>>>>>>> indication of units for a quantity represented.  This short document
>>>>>>>> registers a number of additional unit names in the IANA registry for
>>>>>>>> Units in SenML.  It also defines a registry for secondary units that
>>>>>>>> cannot be in SenML's main registry as they are derived by linear
>>>>>>>> transformation from units already in that registry; RFC 8428 is
>>>>>>>> updated to also accept these units.
>> 
>> -- 
>> last-call mailing list
>> last-call@ietf.org
>> https://www.ietf.org/mailman/listinfo/last-call
> 
> _______________________________________________
> core mailing list
> core@ietf.org
> https://www.ietf.org/mailman/listinfo/core