[Cellar] Re: Éric Vyncke's Discuss on draft-ietf-cellar-tags-20: (with DISCUSS and COMMENT)

Steve Lhomme <slhomme@matroska.org> Sun, 25 January 2026 10:41 UTC

Return-Path: <slhomme@matroska.org>
X-Original-To: cellar@mail2.ietf.org
Delivered-To: cellar@mail2.ietf.org
Received: from localhost (localhost [127.0.0.1]) by mail2.ietf.org (Postfix) with ESMTP id 90EE4ACAABD7 for <cellar@mail2.ietf.org>; Sun, 25 Jan 2026 02:41:57 -0800 (PST)
X-Virus-Scanned: amavisd-new at ietf.org
X-Spam-Flag: NO
X-Spam-Score: -1.897
X-Spam-Level:
X-Spam-Status: No, score=-1.897 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_NONE=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: mail2.ietf.org (amavisd-new); dkim=pass (2048-bit key) header.d=matroska-org.20230601.gappssmtp.com
Received: from mail2.ietf.org ([166.84.6.31]) by localhost (mail2.ietf.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id cCqLZX7rkAZ2 for <cellar@mail2.ietf.org>; Sun, 25 Jan 2026 02:41:56 -0800 (PST)
Received: from mail-wm1-x32b.google.com (mail-wm1-x32b.google.com [IPv6:2a00:1450:4864:20::32b]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature ECDSA (P-256) server-digest SHA256) (No client certificate requested) by mail2.ietf.org (Postfix) with ESMTPS id 850AAACAABB5 for <cellar@ietf.org>; Sun, 25 Jan 2026 02:41:56 -0800 (PST)
Received: by mail-wm1-x32b.google.com with SMTP id 5b1f17b1804b1-47ee66dab14so3044705e9.3 for <cellar@ietf.org>; Sun, 25 Jan 2026 02:41:56 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=matroska-org.20230601.gappssmtp.com; s=20230601; t=1769337710; x=1769942510; darn=ietf.org; h=references:to:cc:in-reply-to:date:subject:mime-version:message-id :from:from:to:cc:subject:date:message-id:reply-to; bh=+1r3VoOwihUwcntXOZebyyD6FAozv+/zQfMc9Vb88UI=; b=wLS2tetJaM41o0w4o3azz5UiSxIdrG5l5lYgQGpmaIExE2z6n895yjRwAOyxon8klJ TjpP6jsjulBoeFtsWeLzIcFS54nHEJMtYqIIQTxKvylrrPzhXOAQ9yV1z+8HhmHRU+fu lk7EXvWCaaINs6tuwjHjAsAjO7E40XKQfBC6d+jhzKxn67Mjer4l7OwTVOC/SlhmsBth HO9NUiIQRmdKoiETPqtntkkcuzQmTJ1ASQgoL6fB/yNW1A8cX0UhoCdgXNCMqSBpBLT2 E1Oy0HfSkPIwfQVzMapHI8owIAuiukDS6reP4uYdzSAmPZL/aZxBLb0LsDxPAYfsf0k9 ZlxA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1769337710; x=1769942510; h=references:to:cc:in-reply-to:date:subject:mime-version:message-id :from:x-gm-gg:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=+1r3VoOwihUwcntXOZebyyD6FAozv+/zQfMc9Vb88UI=; b=g98sGB+QEvWHG8Kbt3DvaFmiPPXpD1pfOqv00Cw/pfkfuLPH2jf5w+RpyX21CQy83B +7AQP92Ly+QaSykZsSEizIPeMDFX9YXlRSUMeb/bVK3sDNuqvaOgHlSmqaEHX3cBIpvN 6Layp0270wrJDI+n97U0/Egf+ZgoCi1QzQwKKUYh1t3paUnpPQAtc9AeihX1Sw8jDzoA qMnq0efcC/hPn1EjJBEH2V66/7ougyawpD16s2a+1ybRlL7NrnNa3SQKnDPSQo7GTz+i J1VRaeQfRy6sLiKmQPCsTYqwHZFontElx5hDr8sb6AdRFCpa2nBZjn9z7bgywhvl8Xi9 yZcg==
X-Forwarded-Encrypted: i=1; AJvYcCUpWvQga/rOfMwSMyAwUfbRydP0Aplv9raCO1zCxTVrVQc5p7A1yZkUZiRHtDGcQSNBDId+o88=@ietf.org
X-Gm-Message-State: AOJu0Yxyh9Rblhw21rOcOC0X/II0VWBAh1AJ1GiQkmj9oO+iFuCbE+1H DCWpfhy5oJWR5zTW0OZkjjf7hdE/So/siS0NPxcDVrvuOST7I10yIYqKuAsM+ehXnA==
X-Gm-Gg: AZuq6aIJJ40gDiuwpNC9YubOS9abJLvOSb+0ou9UYkeEqK+Y0+yKdsdTdzDBorlTHAi x0mW5ww+phehsVnBm0MmDajulk0t7FjmZY7QAzPhA9CvngbtPIS3tcxlUYl1tCnrUO2LaKUzyih ll5BswqcVHe6W48bdTfbPPUOAJpvdBtQpfpuY2mhI5on+gIXQarZJwNNnKoaPrHgZaOFO3pB2hz DpMmCo+LZf6y4sIoCxCySuj5SKaJ7MT5cFVcwPYd3rP1ePBBu3EMpLe0JE81Zn/xv7g3OWeuYSt F90l16lCWWlbQOV+8/6z4+vDKFhDeNiDXy/FH4wsIXce9PzhQDIReH5ofVNv0MmieHaMOEuxByk VJptkb0v3ORVuAvJlwanr9g//5OQ+W7i4fdF+LAipcIuj8mR8X93sWITYBGc++VPXaz85Wzn2iv /oQSkw0jrXf42OZhjLDelH2DF/ucnGyTZ9bBg=
X-Received: by 2002:a05:6000:186d:b0:429:bde0:1da8 with SMTP id ffacd0b85a97d-435ca16e5afmr1197497f8f.7.1769337709669; Sun, 25 Jan 2026 02:41:49 -0800 (PST)
Received: from smtpclient.apple ([2001:861:34c4:290:d1be:1187:540b:bbf]) by smtp.gmail.com with ESMTPSA id ffacd0b85a97d-435b1f745dbsm21860957f8f.34.2026.01.25.02.41.49 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Sun, 25 Jan 2026 02:41:49 -0800 (PST)
From: Steve Lhomme <slhomme@matroska.org>
Message-Id: <E06B9C63-1E75-4AAD-BFE9-5675040974EA@matroska.org>
Content-Type: multipart/alternative; boundary="Apple-Mail=_F8582529-44F2-42E8-9200-AF859A9FDDD0"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3826.700.81.1.4\))
Date: Sun, 25 Jan 2026 11:41:38 +0100
In-Reply-To: <8A3B61FB-D25A-464B-A60A-57EC79C0D636@matroska.org>
To: Éric Vyncke <evyncke@cisco.com>
References: <176760281350.3304339.9509637392974117767@dt-datatracker-5656579b89-p6k4r> <8A3B61FB-D25A-464B-A60A-57EC79C0D636@matroska.org>
X-Mailer: Apple Mail (2.3826.700.81.1.4)
Message-ID-Hash: HRVMAIAWPGORA74LL6CW363VWT7EO6DA
X-Message-ID-Hash: HRVMAIAWPGORA74LL6CW363VWT7EO6DA
X-MailFrom: slhomme@matroska.org
X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; header-match-cellar.ietf.org-0; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header
CC: The IESG <iesg@ietf.org>, cellar-chairs@ietf.org, cellar@ietf.org, draft-ietf-cellar-tags@ietf.org, spencerdawkins.ietf@gmail.com
X-Mailman-Version: 3.3.9rc6
Precedence: list
Subject: [Cellar] Re: Éric Vyncke's Discuss on draft-ietf-cellar-tags-20: (with DISCUSS and COMMENT)
List-Id: Codec Encoding for LossLess Archiving and Realtime transmission <cellar.ietf.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/cellar/ByQ2ULy2aAF6Cse1LNrY-El0-Vw>
List-Archive: <https://mailarchive.ietf.org/arch/browse/cellar>
List-Help: <mailto:cellar-request@ietf.org?subject=help>
List-Owner: <mailto:cellar-owner@ietf.org>
List-Post: <mailto:cellar@ietf.org>
List-Subscribe: <mailto:cellar-join@ietf.org>
List-Unsubscribe: <mailto:cellar-leave@ietf.org>

Hi Éric and everyone,

I added a new Pull Request to restrict the range of characters in TagName even further. Rather than “UTF-8 capital letters” which is vague and doesn’t really exist as such, it’s now A-Z only letters.

This capital rule, that turned into a UTF-8 capital rule, was really meant to limit to a simple set anyway, back when emojis didn’t exist (I think).

I also added an ABNF notation to make sure there is no confusion and can be easily parsed:
https://github.com/ietf-wg-cellar/matroska-specification/pull/1065

Best regards,
Steve

> On 11 Jan 2026, at 11:44, Steve Lhomme <slhomme@matroska.org> wrote:
> 
> Hi Eric and thanks for your review,
> 
> My comments are inline below.
> 
>> On 5 Jan 2026, at 09:46, Éric Vyncke via Datatracker <noreply@ietf.org> wrote:
>> 
>> Éric Vyncke has entered the following ballot position for
>> draft-ietf-cellar-tags-20: Discuss
>> 
>> When responding, please keep the subject line intact and reply to all
>> email addresses included in the To and CC lines. (Feel free to cut this
>> introductory paragraph, however.)
>> 
>> 
>> Please refer to https://www.ietf.org/about/groups/iesg/statements/handling-ballot-positions/ 
>> for more information about how to handle DISCUSS and COMMENT positions.
>> 
>> 
>> The document, along with other ballot positions, can be found here:
>> https://datatracker.ietf.org/doc/draft-ietf-cellar-tags/
>> 
>> 
>> 
>> ----------------------------------------------------------------------
>> DISCUSS:
>> ----------------------------------------------------------------------
>> 
>> 
>> # Éric Vyncke INT AD comments for draft-ietf-cellar-tags-20
>> CC @evyncke
>> 
>> Thank you for the work put into this document even if I find a little weird to
>> have tag definitions of a container done in a codec WG, but it fits the CELLAR
>> WG charter, so all is good.
> 
> What is now RFC9559 originally had the tags and codec definition into a single document (and even some other things). We split in the document so we can work on the main one faster.
> 
>> Please find below some blocking DISCUSS points (easy to address), some
>> non-blocking COMMENT points/nits (replies would be appreciated even if only for
>> my own education).
>> 
>> Special thanks to Spencer Dawkins for the shepherd's detailed write-up
>> including the WG consensus (albeit a small WG it seems) and the justification
>> of the intended status.
>> 
>> I hope that this review helps to improve the document,
>> 
>> Regards,
>> 
>> -éric
>> 
>> Note: this ballot comments follow the Markdown syntax of
>> https://github.com/mnot/ietf-comments/tree/main, i.e., they can be processed by
>> a tool to create github issues.
>> 
>> ## DISCUSS (blocking)
>> 
>> As noted in
>> https://datatracker.ietf.org/doc/statement-iesg-handling-ballot-positions-20220121/,
>> a DISCUSS ballot is a request to have a discussion on the points below; I
>> really think that the document would be improved with a change here, but can be
>> convinced otherwise.
>> 
>> ### Missing version
>> 
>> The CELLAR charter includes `Standards Track specification for Matroska
>> container format version 4` but this I-D, while proposed standard, does not
>> mention at all the version 4. So, it is really unclear.
>> 
>> Note: RFC 9559 section 7 is about versioning, but this I-D should really
>> mention version 4.
> 
> The tags document define the values set in Matroska elements \Segment\Tags\Tag\+SimpleTag\TagName, \Segment\Tags\Tag\+SimpleTag\TagString and \Segment\Tags\Tag\+SimpleTag\TagBinary.
> The elements don’t have a “minver” attribute, which means they are usable since Matroska v1. That’s why there is no versioning mentioned. 
> All the tags, even the ones that don’t exist yet and even “unofficial” ones, can be used in files compatible with Matroska v1. For apps that interpret tags, they only need to handle the names they care about and ignore the (infinite) rest.
> 
>> ### Section 3.2.1
>> 
>> Happy to be corrected as I am not a UTF-8 expert, but what is the exact
>> definition of `any space`? again thinking about UTF-8 and
>> https://en.wikipedia.org/wiki/Whitespace_character
> 
> A similar question was raised about capital letters as there’s no proper rule to define what is UTF-8/Unicode capital letter (I can find the post anymore).
> I can add a link to the wikipedia page if that helps. Now that these recommendations are a MUST we could also remove this line altogether, since it’s not part of the allowed characters that MUST be used.
> 
>> ### Section 4.11
>> 
>> `Library of Congress Control Number` but from which country? This definition is
>> ambiguous.
> 
> Addressed here https://github.com/ietf-wg-cellar/matroska-specification/pull/1051 
> 
>> ----------------------------------------------------------------------
>> COMMENT:
>> ----------------------------------------------------------------------
>> 
>> 
>> ## COMMENTS (non-blocking)
>> 
>> ### Support for others DISCUSS points
>> 
>> I second (hence won't repeat) the following DISCUSS points by Med:
>> - use of BCP14 in IANA considerations
>> - other issues mentioned in the IANA considerations
>> 
>> I note though that IANA tagged (punt intended) this I-D as "IANA OK"
>> 
>> ### Abstract
>> 
>> Abstracts need to be concise but also useful and descriptive. I.e., at least
>> mention the version (per my DISCUSS point) and be more descriptive.
>> 
>> The title should also include "version 4".
> 
> No, these tags are usable in all Matroska versions. There is no versioning of tag names and values.
> 
>> ### Section 1
>> 
>> Please expand `EBML` at first use.
> 
> What do you mean by “expand” ? There is a link to the "Section 7.7 of [RFC8794]” in the first sentence where it’s used.
> 
>> ### Use of personal pronouns
>> 
>> Let's avoid the use of personal pronouns such as `if you wanted to store`
>> (section 3) or `We also need an official list` (section 3.1) or even `Our
>> recommendations are in between` and `put any tag in your file` (section 3.1
>> albeit not pronouns ;-) ). I.e., who is the "we" ? The authors ? The WG ? The
>> IETF ?
> 
> Addressed by https://github.com/ietf-wg-cellar/matroska-specification/pull/1052 
> 
>> ### Use of 'official'
>> 
>> As noted in other ballots, please refrain using `official`, the IETF issues
>> standards and not laws; so, s/official/standard/ would be much better.
> 
> Addressed by https://github.com/ietf-wg-cellar/matroska-specification/pull/1048 
> 
>> ### Section 3.2.1
>> 
>> `UTF-8 capital letters`, I am not a UTF-8 or internationalisation expert, but
>> do all alphabets have "capital letters" (and should it be "character" rather
>> than `letter`)
> 
> There is no clear definition of UTF-8 capital letters nor a nice rule to apply.
> 
> That could be problem if someone proposes tag names in some language that don’t have capital letters (chinese ? japanese ?). No sure what to do here. We cannot change the “TagName" type from UTF-8 to ASCII anymore. Maybe we could say only latin capital letters ? A lot of languages have a latinized version.
> 
>> ### Section 3.2.2
>> 
>> Why not a MUST in `Multiple items SHOULD NOT`? See also
>> https://datatracker.ietf.org/doc/statement-iesg-statement-on-clarifying-the-use-of-bcp-14-key-words/
> 
> This used to be that way but turned back in https://github.com/ietf-wg-cellar/matroska-specification/pull/1030
> After the AD review from Orie https://mailarchive.ietf.org/arch/msg/cellar/4ebLFttRb_I8SFu5yMQSIDPuk2E/ 
> 
>> ### Section 3.2.2.2
>> 
>> A long discussion about the decimal '.' vs. ',', but nothing is written in
>> which base the "UTF-8 number" is represented. I guess base 10, but let's be
>> specific. I also wonder why not using a IEEE binary format was not used
>> (possibly for backward compatibility).
> 
> These tag values often correspond to tags found in other systems like ID3 where they are stored as strings. So for compatibility with other systems we use strings as well. Newer binary formats like the EBU R128 are stored as binary.
> 
> I added some text to mention it’s in base 10 in https://github.com/ietf-wg-cellar/matroska-specification/pull/1053 
> 
>> ### Section 3.3.1
>> 
>> Should there be a definition or reference for `TagChapterUID`? (also for other
>> *UID terms).
> 
> I added some text mentioning the ChapterUID which value is used in https://github.com/ietf-wg-cellar/matroska-specification/pull/1054 
> 
> Thanks again for your review.
> Steve