[Cbor] Line terminators in diagnostic notation strings

Anders Rundgren <anders.rundgren.net@gmail.com> Wed, 24 July 2024 20:55 UTC

Return-Path: <anders.rundgren.net@gmail.com>
X-Original-To: cbor@ietfa.amsl.com
Delivered-To: cbor@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id A4A41C17C8B6 for <cbor@ietfa.amsl.com>; Wed, 24 Jul 2024 13:55:53 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.106
X-Spam-Level:
X-Spam-Status: No, score=-2.106 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id N6i7NzCfFlj6 for <cbor@ietfa.amsl.com>; Wed, 24 Jul 2024 13:55:49 -0700 (PDT)
Received: from mail-wr1-x431.google.com (mail-wr1-x431.google.com [IPv6:2a00:1450:4864:20::431]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id A0FE9C169412 for <cbor@ietf.org>; Wed, 24 Jul 2024 13:55:49 -0700 (PDT)
Received: by mail-wr1-x431.google.com with SMTP id ffacd0b85a97d-3684e8220f9so124966f8f.1 for <cbor@ietf.org>; Wed, 24 Jul 2024 13:55:49 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1721854548; x=1722459348; darn=ietf.org; h=content-transfer-encoding:subject:from:to:content-language :user-agent:mime-version:date:message-id:from:to:cc:subject:date :message-id:reply-to; bh=h6ZZINAObpvW4Bud7J15lWXWNQz+T3C5GswwTxOY6gw=; b=RBmMOFwp5vik4viD8TsAycmtk/uoKxsvEv1dVf3PjttgWc8tfumTM5BgJROo/Bdz/E CelPTPXh3qxigJFwumOaBu/XBob24j81HCSk3AfXewB1qmjtGqKuBqoFVv6EtADOXrvn fVeGkC79SEMw+3Mrnd72VSEnxTgiBmWU6UqIYmn5HV+w64kQ4WWR9HUeJYLCCT1Wm2jW FjBhxxvNDQr58E0ihRzm/OzBxQdoCulp365bV/5loCftTCx4mHxx11Gu16sQkzeNo7E3 cGVaxUygBHqM4tCB5WpGRntOT0QnWJ0rbwhs0WVIcyqacBD03mHKQh6rzCcuTPm+Zni1 t8Tg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1721854548; x=1722459348; h=content-transfer-encoding:subject:from:to:content-language :user-agent:mime-version:date:message-id:x-gm-message-state:from:to :cc:subject:date:message-id:reply-to; bh=h6ZZINAObpvW4Bud7J15lWXWNQz+T3C5GswwTxOY6gw=; b=E93eELwUwvIM6AN7ludi5HhX46lVT3/gR738WtB2sU5o2LK3c33a0oMDx5hA1hdIyA RtmtsCplOMEo669m3nzHBqgaitKwiHoPRVXSF5YkLWp3VeNBSD1OLANaXgX7pX8bly+1 G6Q1s05UzAzfOQpJVDmxjDcSOlciuRc0kIyIHjJ1gaee6NLl+ZQ4gNsNuKfwHVjk16Vw 6z0JxGz7IB2qiMNRULpQ0LSkz1r10dolWNUUB1C4xai7KqXm1z9WGl1CsQ6q11U8ktR5 xSDiAbXJCcBiAeCLdVFsoqtn3b56jN3bacUWWonm7Z43HE1fvLZ5LErnU11hWV7IRNlV TLGg==
X-Gm-Message-State: AOJu0YyfRYs3RMsZzfzgY6LMoXkYI4I+ahmoSVgp6Jyt/52mDGRzIM7U TIUHEjXWFda+ky1bZu0rLrHGqWo3ZwgYRJHwGNq2s5vUguEMUJvs7rVx2Q==
X-Google-Smtp-Source: AGHT+IERTD6hwZ08PslmHj1hMMDOz/zAU7ot0fdRvs55CYWisGORCtJfhnt5SuMsG9fCK2pf2AH+9w==
X-Received: by 2002:a05:6000:c07:b0:367:938f:550 with SMTP id ffacd0b85a97d-369f6706afcmr2584291f8f.25.1721854547435; Wed, 24 Jul 2024 13:55:47 -0700 (PDT)
Received: from ?IPV6:2a01:e0a:e1b:64b0:85d3:bb06:4cc4:d62e? ([2a01:e0a:e1b:64b0:85d3:bb06:4cc4:d62e]) by smtp.googlemail.com with ESMTPSA id ffacd0b85a97d-36878695165sm15224364f8f.62.2024.07.24.13.55.46 for <cbor@ietf.org> (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 24 Jul 2024 13:55:47 -0700 (PDT)
Message-ID: <d8763090-37b5-417f-b6b8-4c7569f86f2c@gmail.com>
Date: Wed, 24 Jul 2024 22:55:46 +0200
MIME-Version: 1.0
User-Agent: Mozilla Thunderbird
Content-Language: en-US
To: "cbor@ietf.org" <cbor@ietf.org>
From: Anders Rundgren <anders.rundgren.net@gmail.com>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 7bit
Message-ID-Hash: ZDRKA7S65VWKSYRRQWEIMNF2YEHYT2U4
X-Message-ID-Hash: ZDRKA7S65VWKSYRRQWEIMNF2YEHYT2U4
X-MailFrom: anders.rundgren.net@gmail.com
X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; header-match-cbor.ietf.org-0; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header
X-Mailman-Version: 3.3.9rc4
Precedence: list
Subject: [Cbor] Line terminators in diagnostic notation strings
List-Id: "Concise Binary Object Representation (CBOR)" <cbor.ietf.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/cbor/KUHa-OBdjF5clx_QHkPw3OzK0I8>
List-Archive: <https://mailarchive.ietf.org/arch/browse/cbor>
List-Help: <mailto:cbor-request@ietf.org?subject=help>
List-Owner: <mailto:cbor-owner@ietf.org>
List-Post: <mailto:cbor@ietf.org>
List-Subscribe: <mailto:cbor-join@ietf.org>
List-Unsubscribe: <mailto:cbor-leave@ietf.org>

By accident I found that https://cbor.me and https://cyberphone.github.io/CBOR.js/doc/playground.html treat the CBOR DN item

"first
next"

slightly different.

The JDK folks identified the problem: https://openjdk.org/jeps/378
I would consider adopting their line termination concept (=normalizing CR and CRLF to \n) as well as their line continuation concept "\" (adopted for RFCs as well)

There is more, but personally, I think this solves the core problems:
- line-oriented text
- huge text input

Anders