[media-types] Fwd: Thoughts on suffixes, single and multiple

Brian Campbell <bcampbell@pingidentity.com> Wed, 03 April 2024 16:07 UTC

Return-Path: <bcampbell@pingidentity.com>
X-Original-To: media-types@ietfa.amsl.com
Delivered-To: media-types@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5DFE6C14F682 for <media-types@ietfa.amsl.com>; Wed, 3 Apr 2024 09:07:45 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.092
X-Spam-Level:
X-Spam-Status: No, score=-2.092 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_FONT_LOW_CONTRAST=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_NONE=0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=pingidentity.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id x30DhNuSMxZg for <media-types@ietfa.amsl.com>; Wed, 3 Apr 2024 09:07:40 -0700 (PDT)
Received: from mail-il1-x12b.google.com (mail-il1-x12b.google.com [IPv6:2607:f8b0:4864:20::12b]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id AB6BFC14F61C for <media-types@ietf.org>; Wed, 3 Apr 2024 09:07:40 -0700 (PDT)
Received: by mail-il1-x12b.google.com with SMTP id e9e14a558f8ab-369ec1fbadfso5568595ab.2 for <media-types@ietf.org>; Wed, 03 Apr 2024 09:07:40 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=pingidentity.com; s=google; t=1712160459; x=1712765259; darn=ietf.org; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :from:to:cc:subject:date:message-id:reply-to; bh=mAosbO7K0CiTsOCoZsIce+NBL5VHZZhDEQe0twaJwb0=; b=Jc1QJ8k1pm1KL0lLni6dK5D0FHCGxSpYSQCxbd+f067dpZ1sbjXGriFyarPU2phIUH 0U+/39kkKjujko+aTJYDbhX+NDMW9+R5eohZNxmNki0XfkuUe9RTWZu6SmXxwFQpzk1m qM/i8x83oXybg8hyobjaAkg+1ZFf1eEYfhtrNuEEJvAyfzqRXVaW9XLucP+il2BgAj1e wUK2CiagKNyPx4e93rydcubHfYpxHhm+3S0Dz4SeHqxiWDdMVa5EzxcYWpYFbRR80gz3 zfgPho52uecTdvbqESrn3Vb2b6Ix3BFMYh3qMFQ5WZ+nB0PWJkEgWvjUf6kI1I79QXdu w0QQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1712160459; x=1712765259; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=mAosbO7K0CiTsOCoZsIce+NBL5VHZZhDEQe0twaJwb0=; b=ZgWKnj6uBtHdLTq9HXLJZFpwTxXokyFzi3KJ4VLdhffupIOpudWJ+FQSf+Me3Qj5xQ fIAMe31QsIE0biQDIV3Uf1wJVzd2lp1G2YltdeRwgzPp0XwDER0ewg4jq6nP6MBxhZ77 k0muSQ7MenEYNS0PzcKTmv4b1CcUDUVXmV4Zi9KXx5gw+WKzVCu+35HTs1Rv3KiUUn/z 466ZZ7PnyAdw9t+QDtdNEJbFyVobAirncBkD+U5o6xEQA15vxcPAEm+Qm9t4/YjkoiFz tQIHrR1PDd0T/wjsHhTIXwQUPlyUPgReGtMgUtnqSQj2i/Xt3K3hJkr8SkbfVV1ntlGR cM4Q==
X-Gm-Message-State: AOJu0YwG5gIe9WIw+CcvP6jbKP5Tfv4lCyObedJD4/7YGlbMdgpaU9r1 w4pr+j8hZIovC8TSyidmuctbcErRAc8POOo2+yjOvToNJ99ggEv0tz+iUwRRqD+c3NHKMoEKH3s 7uFgcBXv6zFFbKRT+ge7/GVPAUzrhZ0bKGLISdtvgKwj0TaY9wJ3HbgSPRkmTbZAx5dyoeG+Spy 6Hs5Kf0JC5eWWskomFAUdWVnOSLXs2Gx8liw==
X-Google-Smtp-Source: AGHT+IHInaaXVv0r9L0pdJeKQ9TGw4L31kzFU6o443MkZrDFz4yzgDnpNPJ61f+ZE+N0V/36yKEsf7iCtE2Hk1wd8n0=
X-Received: by 2002:a05:6e02:1c05:b0:368:a261:5275 with SMTP id l5-20020a056e021c0500b00368a2615275mr10834ilh.1.1712160459375; Wed, 03 Apr 2024 09:07:39 -0700 (PDT)
MIME-Version: 1.0
References: <2E20FEDE-C766-43EE-A6E2-1FB63E79CF0B@mnot.net> <CAN8C-_KzQOPhv3Tep8gsnLqDxO7EnAo0qUkVUg1E6COttBTrfA@mail.gmail.com>
In-Reply-To: <CAN8C-_KzQOPhv3Tep8gsnLqDxO7EnAo0qUkVUg1E6COttBTrfA@mail.gmail.com>
From: Brian Campbell <bcampbell@pingidentity.com>
Date: Wed, 03 Apr 2024 10:07:13 -0600
Message-ID: <CA+k3eCSAh1Mbx8S1-Vnn2oGZT8ik1R5JOi-=Oc4Z5OaJyG=Rsg@mail.gmail.com>
To: media-types@ietf.org, Orie Steele <orie@transmute.industries>, Kristina Yasuda <yasudakristina@gmail.com>, Daniel Fett <fett@danielfett.de>, Oliver Terbu <oliver.terbu@mattr.global>
Content-Type: multipart/alternative; boundary="000000000000fae40f0615336bef"
Archived-At: <https://mailarchive.ietf.org/arch/msg/media-types/IO_m_DCTGqbZcgJcW7qhnL_r_n8>
Subject: [media-types] Fwd: Thoughts on suffixes, single and multiple
X-BeenThere: media-types@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: "IANA mailing list for reviewing Media Type \(MIME Type, Content Type\) registration requests." <media-types.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/media-types>, <mailto:media-types-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/media-types/>
List-Post: <mailto:media-types@ietf.org>
List-Help: <mailto:media-types-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/media-types>, <mailto:media-types-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 03 Apr 2024 16:07:45 -0000

Indeed there is work in progress that plans on requesting registration of
"+sd-jwt"
https://datatracker.ietf.org/doc/html/draft-ietf-oauth-selective-disclosure-jwt#name-structured-syntax-suffix-re
based party on anticipated/suggested usage
https://datatracker.ietf.org/doc/html/draft-ietf-oauth-selective-disclosure-jwt#name-explicit-typing
similar to "+jwt" that Mike mentioned here
https://mailarchive.ietf.org/arch/msg/media-types/WgnX1lyhUMR2M82HRlsegEGg8j0/.
And other work in progress that plans on requesting registration of
"vc+sd-jwt"
https://datatracker.ietf.org/doc/html/draft-ietf-oauth-sd-jwt-vc#name-media-types-registry.
I suppose "+jwt" and the nascent "+sd-jwt" fall into the "seldom used"
category mentioned by Mark (and those he's suspicious of) but they are used
in practice and expected to be used going forward so deprecating them or
the use of suffixes in media types would have ramifications with respect to
that.




---------- Forwarded message ---------
From: Orie Steele <orie@transmute.industries>
Date: Wed, Apr 3, 2024 at 7:43 AM
Subject: Fwd: [media-types] Thoughts on suffixes, single and multiple
To: Kristina Yasuda <Kristina.Yasuda@microsoft.com>, Daniel Fett <
fett@danielfett.de>, bcampbell <bcampbell@pingidentity.com>


Please remind this group that you are about to register +sd-jwt, and give
any comments that might support arriving at some kind of consensus here.

OS

---------- Forwarded message ---------
From: Mark Nottingham <mnot=40mnot.net@dmarc.ietf.org>
Date: Wed, Apr 3, 2024 at 1:30 AM
Subject: [media-types] Thoughts on suffixes, single and multiple
To: IETF Media Types <media-types@ietf.org>


After the meeting in Brisbane, some of us went aside to continue to the
multiple suffixes discussion. There, we quickly came to the conclusion that
we should deprecate the concept of suffixes in media subtypes -- i.e., they
would still be syntactically allowed, but would have no meaning or
registry. Martin Thomson and I took an action to write something down about
this.

Once I was home, I started to think more carefully about this and do
research. One thing that I haven't yet seen is a summary of how suffixes
are currently used (apologies if I missed someone else's effort there).
These are the counts for each suffix in the registry that I came up with
about a week ago:

+xml = 439
+json = 145
+ber = 0
+cbor = 16
+der = 1
+fastinfoset = 1
+wbxml = 7
+zip = 24
+tlv = 1
+json-seq = 2
+sqlite = 1
+jwt = 6
+gzip = 2
+cbor-seq = 4
+zstd = 0
+yaml = 2
+cose = 0

As you can see, we have a few very widely used suffixes (in a registry of
1,588 entries as of that survey), and many very seldom used ones - with a
few not used at all.

The widespread use of +xml and +json in particular made me more cautious
about deprecating suffixes altogether -- especially since we still sort-of
believe that they are indeed used by (or at least potentially useful to)
things like editors to hint syntactic conventions.

So, that leaves a few different options, considering the constraints we
have:

1) Disallow more than one "+" sign in media subtypes, as floated at the
meeting. This would put a fair amount of pressure on the registry's ability
to reflect reality, depending on how widely deployed some things get
(although we could grandfather some types in to ease the pressure here).

2) Syntactically allow suffixes before the last one, but not assign them
any meaning or register them; e.g., application/foo+bar+xml would be an XML
format, but who knows what bar is; effectively, it's just part of
"foo+bar". This would allow people to define suffix-like things, but
wouldn't give them any recognition or coordination -- potentially leading
to the need to formalise things more down the road, just as we did in the
first round of suffixes.

3) Consider multiple suffixes, when they occur, to be unrelated hints as to
the syntax of the format -- i.e., there is no processing model, there is no
ordering (although a registrant would have to choose an order;
registrations with different orderings should be refused). Effectively,
suffixes would just be a 'bag of hints' about the format being used.

I'd be interested in hearing people's reactions to these.

Separately, I think we need to settle a few other matters to make progress:


### Defining What Suffixes Are For (no matter how many there are)

After the discussion in Brisbane, I strongly believe that suffixes should
ONLY be for hinting about the syntax or format convention in use, as an aid
eg to editors, syntax highlighters, etc. This is the proven use case for
media type suffixes. Suffixes should not be used to hint semantics; only
syntax. We should have strong language about the dangers of using suffixes
to hint particular kinds of processing; cf the previous discussion on the
'polyglot problem' and the potential security issues around performing
processing based upon suffixes.

The suffix registration process should be designed to assure that only such
suffixes are registered.

Note that in this view, "+ld" is very likely unregistrable.


### Cleaning Up Existing Suffixes

+gzip and +zstd are problematic; the former should be disallowed for new
registrations, and the latter should be removed or obsoleted in the
registry. Likewise, I am highly suspicious of +jwt and +cose. +zip _is_ a
format convention, so I suppose it's OK?


Cheers,

--
Mark Nottingham   https://www.mnot.net/

_______________________________________________
media-types mailing list
media-types@ietf.org
https://www.ietf.org/mailman/listinfo/media-types

ᐧ

-- 
_CONFIDENTIALITY NOTICE: This email may contain confidential and privileged 
material for the sole use of the intended recipient(s). Any review, use, 
distribution or disclosure by others is strictly prohibited.  If you have 
received this communication in error, please notify the sender immediately 
by e-mail and delete the message and any file attachments from your 
computer. Thank you._