Re: [media-types] Thoughts on suffixes, single and multiple

Mark Nottingham <mnot@mnot.net> Fri, 05 April 2024 07:16 UTC

Return-Path: <mnot@mnot.net>
X-Original-To: media-types@ietfa.amsl.com
Delivered-To: media-types@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 04F35C14F615 for <media-types@ietfa.amsl.com>; Fri, 5 Apr 2024 00:16:30 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.797
X-Spam-Level:
X-Spam-Status: No, score=-2.797 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=mnot.net header.b="Tlz6DiGO"; dkim=pass (2048-bit key) header.d=messagingengine.com header.b="qEBniaAd"
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id okOAhELR1Nq5 for <media-types@ietfa.amsl.com>; Fri, 5 Apr 2024 00:16:25 -0700 (PDT)
Received: from fhigh6-smtp.messagingengine.com (fhigh6-smtp.messagingengine.com [103.168.172.157]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 53223C14F680 for <media-types@ietf.org>; Fri, 5 Apr 2024 00:16:25 -0700 (PDT)
Received: from compute4.internal (compute4.nyi.internal [10.202.2.44]) by mailfhigh.nyi.internal (Postfix) with ESMTP id 2DF4F11400E1; Fri, 5 Apr 2024 03:09:54 -0400 (EDT)
Received: from mailfrontend1 ([10.202.2.162]) by compute4.internal (MEProxy); Fri, 05 Apr 2024 03:09:54 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=mnot.net; h=cc :cc:content-transfer-encoding:content-type:content-type:date :date:from:from:in-reply-to:in-reply-to:message-id:mime-version :references:reply-to:subject:subject:to:to; s=fm3; t=1712300994; x=1712387394; bh=7Mvg0N0oGmedSHn/cZmdin4KagYqPdtgm9iy7rmsNKk=; b= Tlz6DiGON39QqW7qlsnM/Hq2YLybRf77koPIgWH3jyJFw1NlQKdWEvlm5JzdsRGB PVVCDmap7yPAhFDrA1d5Q3YhtHAl5ratUprvNqzeZBXXUxUP3b65ug+VIC1LtMWU GAa9cdFBASPUPiApXKa3reVDSDf70Xku3ofQKhJj/eOKUXRIpE0BlwyzK3E7xc4Y KzBpVkBZF0HoV7QZkrPWbsvh0aVElr+u9HavZNMWTVAv945lQYxMLaRNJ7UFgP8K QLWi8ukzwX84DEFkWOYfXYX1Yz6LpayM51NieQ/xLYqyV5eO5kUcR5DCAK11UW51 uwcpP2yPpT3ChrdrVXm43w==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:cc:content-transfer-encoding :content-type:content-type:date:date:feedback-id:feedback-id :from:from:in-reply-to:in-reply-to:message-id:mime-version :references:reply-to:subject:subject:to:to:x-me-proxy:x-me-proxy :x-me-sender:x-me-sender:x-sasl-enc; s=fm2; t=1712300994; x= 1712387394; bh=7Mvg0N0oGmedSHn/cZmdin4KagYqPdtgm9iy7rmsNKk=; b=q EBniaAdjMwyBL0cTvWTmORUSwD8LW/CEogzUwODb5f/YxzVsyCaHCBLuYlM5E1HP 936k6gtA8tbChbewyWCAw8BixV9Ta8DQT+D77wqdtlV/veZb3l3y2MybYive5exO lp4Jh416WisjsWMXz75oTrB8oFhv4+cn3B3qJY2oS0VB6bcZYIz5C3QNl0apifrd iaEanL0FaT9MIxSILzgC4zQAoPXD3Ho9XXl6yR+QDL7FTeWpXrfLrUKl+lc2/g5C XwYSgezTYAjYQeWHh6H7MvLmgHuDz1h1ZHl/Np2QebgTJlzEuSJDboDXUuUPvZr+ 599S+ztoPGDuCJL1KgAXg==
X-ME-Sender: <xms:waMPZrtkAQgbix7N7_H1ui2tjvAMoPfDpUK5Nni_8Cgs4z6c1OZ2Xg> <xme:waMPZscBcvf7tfltBlP6vu1aUTaiStdipNSCGFrt0phLNCAEE0hmWTtadTwzcdOf7 A8QJIN-8Q_cwWGTgQ>
X-ME-Received: <xmr:waMPZuzj9_X2WXk2haRSpfeWK7giv6Ho3c2Qlyby9Es7ZO4svpqTIrE8mJtAp7lebzIv>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgedvledrudefledguddukecutefuodetggdotefrod ftvfcurfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfgh necuuegrihhlohhuthemuceftddtnecunecujfgurheptggguffhjgffvefgkfhfvffose htqhhmtdhhtdejnecuhfhrohhmpeforghrkhcupfhothhtihhnghhhrghmuceomhhnohht sehmnhhothdrnhgvtheqnecuggftrfgrthhtvghrnhepfefhhfelleejjeejieekhfejfe eiheetgeejgffhudegveeigeehgefftdetudetnecuffhomhgrihhnpehmnhhothdrnhgv thenucevlhhushhtvghrufhiiigvpedtnecurfgrrhgrmhepmhgrihhlfhhrohhmpehmnh hothesmhhnohhtrdhnvght
X-ME-Proxy: <xmx:waMPZqN_xXungHRbDkWC_h7vIC-ZLD5BYoG7q6lz61B8c-M_rUWH3A> <xmx:waMPZr-OS2QPkIoJBVEUCFrKUsybqyIj-OuYN5XjyJRVxijrywu6Lg> <xmx:waMPZqX9ZVENTxVDZPxceSgT2tmzzebtBd-85TJdrTaLqCCMDjB6mg> <xmx:waMPZsfOCKcOwHaRajkpY1CSokQ0lGXukH6EiFvZ55uEoF3l_O3Q7w> <xmx:wqMPZnbPRNE7cOlTKioO2hnRJKCnMBRnhsflIFrkjB1OiNwdBsbHBDZP>
Feedback-ID: ie6694242:Fastmail
Received: by mail.messagingengine.com (Postfix) with ESMTPA; Fri, 5 Apr 2024 03:09:51 -0400 (EDT)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3774.500.171.1.1\))
From: Mark Nottingham <mnot@mnot.net>
In-Reply-To: <E83E80FF-5810-4A53-85D8-E5095F9C1C1C@openlinksw.com>
Date: Fri, 05 Apr 2024 15:09:51 +0800
Cc: media-types@ietf.org
Content-Transfer-Encoding: quoted-printable
Message-Id: <837B503B-B9F9-40F7-8078-7D1BCD66D076@mnot.net>
References: <2E20FEDE-C766-43EE-A6E2-1FB63E79CF0B@mnot.net> <1c404c4d-437c-464a-b414-4e0d39c1d8ea@alvestrand.no> <E83E80FF-5810-4A53-85D8-E5095F9C1C1C@openlinksw.com>
To: Ted Thibodeau Jr <tthibodeau=40openlinksw.com@dmarc.ietf.org>
X-Mailer: Apple Mail (2.3774.500.171.1.1)
Archived-At: <https://mailarchive.ietf.org/arch/msg/media-types/v9XuzVZGfTTxvZrESz32kWwdS7s>
Subject: Re: [media-types] Thoughts on suffixes, single and multiple
X-BeenThere: media-types@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: "IANA mailing list for reviewing Media Type \(MIME Type, Content Type\) registration requests." <media-types.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/media-types>, <mailto:media-types-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/media-types/>
List-Post: <mailto:media-types@ietf.org>
List-Help: <mailto:media-types-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/media-types>, <mailto:media-types-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 05 Apr 2024 07:16:30 -0000

Ted,

> On 5 Apr 2024, at 07:26, Ted Thibodeau Jr <tthibodeau=40openlinksw.com@dmarc.ietf.org> wrote:
> 
> I wanted to put some *usage* statistics here, drawing on Google or other search engine for data about the media types they've encountered, but I cannot find a way to search on `Content-type` headers. The nearest I found was Google's `filetype:<type>` search term which is focused on filename extension, which is minimally helpful.
> 
> I think some analysis of what's out there on the (public) web, in actual *use*, would be far more informative than counting what is in the IANA registry, at present. (It would be great to also get some analysis of what's out there in various LANs, VPNs, and other non-public web, but this seems beyond hope.)

I gave registry information to inform a decision about suffixes' use _with relation to the registry_ -- anything we do has to consider current registry entries.

I suspect the data you seek will not be as useful as you might think; a media type being sent on a message only means that someone followed the specification of the format that minted it -- whether or not the suffix was actually used by software to do something useful is entirely separable, and as we've seen the most common suffixes are ignored by the most obvious software to use them (browsers).

> I would be strongly against this idea.
[...]
> I would be similarly strongly against this idea.

Statements like this aren't helpful in the IETF process. You need to explain why you're against them to convince others.

> I am disturbed by your suspiciousness, which doesn't seem to be based on anything more than goosebumps. Do your suspicions have any technical basis?

As I've explained a few times now, they don't appear to have any plausible use by generic software that doesn't recognise the full media type. If there are, it'd be great to hear about them. Absent that motivation, the obvious question is why it's necessary to surface this information.

> Structured suffixes are useful, and even necessary. When properly specified, they tell the consumer how to parse the document for internal fragment identifiers, vital for processing of Linked Data (including but not *only* JSON-LD), among other things.

When there are multiple structured suffixes, which determines what the fragment identifier is? The obvious answer here is 'the last one', but that begs the question about what the other suffixes are for, still.

> In clearer English, we could suppose —
> 
>    shape/quadrilateral
> 
>    shape/rectangle+quadrilateral
> 
>    shape/square+rectangle+quadrilateral

How does this square with 

   application/svg+xml+gzip

or even

   application/ld+json

... once there's another serialisation format for Linked Data?

> There have been some (to my mind) regrettable media type registrations, such as `image/svg+xml`, which degrades to `image/xml`, but which I think really ought to degrade to `application/xml`, meaning that its primary registration should be `application/svg+xml` *even though its desirably rendered content is an image*.
> 
> More regrettable, is the pattern of appending a compression or archive structured suffix on another media type. Consumption of these documents requires handling of the compression/archive structure — but requires no handling of the contents of the compression/archive document!
[...]
> (Further, it appears to me that the archive media types should probably all be converted from, e.g., `application/zip`, to `multipart/zip`, as they are indeed multipart!)

I'll be interested to see people's reactions to these proposals.

Cheers,


--
Mark Nottingham   https://www.mnot.net/