Re: [Last-Call] Artart last call review of draft-ietf-jmap-contacts-06

Carsten Bormann <cabo@tzi.org> Fri, 22 March 2024 19:02 UTC

Return-Path: <cabo@tzi.org>
X-Original-To: last-call@ietfa.amsl.com
Delivered-To: last-call@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5F49DC180B6C; Fri, 22 Mar 2024 12:02:32 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.905
X-Spam-Level:
X-Spam-Status: No, score=-1.905 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id vCldAS6SXVmx; Fri, 22 Mar 2024 12:02:28 -0700 (PDT)
Received: from smtp.zfn.uni-bremen.de (smtp.zfn.uni-bremen.de [IPv6:2001:638:708:32::21]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id A4647C180B47; Fri, 22 Mar 2024 12:00:38 -0700 (PDT)
Received: from smtpclient.apple (unknown [202.4.29.207]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.zfn.uni-bremen.de (Postfix) with ESMTPSA id 4V1Wqj4S8DzDCcx; Fri, 22 Mar 2024 20:00:33 +0100 (CET)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3774.500.171.1.1\))
From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <171112316193.8644.5801107423421446407@ietfa.amsl.com>
Date: Sat, 23 Mar 2024 05:00:19 +1000
Cc: art@ietf.org, draft-ietf-jmap-contacts.all@ietf.org, jmap@ietf.org, last-call@ietf.org
Content-Transfer-Encoding: quoted-printable
Message-Id: <1C02FE5D-624B-4BBE-A7F3-91EDF54CDE4F@tzi.org>
References: <171112316193.8644.5801107423421446407@ietfa.amsl.com>
To: Tim Bray <tbray@textuality.com>
X-Mailer: Apple Mail (2.3774.500.171.1.1)
Archived-At: <https://mailarchive.ietf.org/arch/msg/last-call/nzt3lF6TOvGOt-1dLn7171_6GxA>
Subject: Re: [Last-Call] Artart last call review of draft-ietf-jmap-contacts-06
X-BeenThere: last-call@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: IETF Last Calls <last-call.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/last-call>, <mailto:last-call-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/last-call/>
List-Post: <mailto:last-call@ietf.org>
List-Help: <mailto:last-call-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/last-call>, <mailto:last-call-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 22 Mar 2024 19:02:32 -0000

This review is informed by a discussion we are having on the art@ list about what to recommend to protocol designers with respect to the use of Unicode and specifically UTF-8.

Unicode is complex, and it has become the norm that designers of protocols like the one being discussed here essentially leave all the details to the implementer.  It is entirely impossible that each protocol designer can answer all the questions coming up (to many of which even Unicode experts have no ready answer).
Trying to cram some version of a specific understanding into each protocol definition is likely to fail and lead to a myriad of incompatible restrictions between which data is now hard to interchange.

So generally, protocol designers have not attempted to import all details of Unicode into their specification, generally not even the low-hanging fruit such as excluding non-characters.
(What is the point?  These never cause interoperability problems.)

PRECIS was an attempt to provide a referenceable standard in this space, but it is dependent on the Unicode version and has stopped following Unicode at 6.3.0.  Its status is pretty much an expression of the problems listed above, except there was an attempt to register a small number of variants, thwarted by the need for constant maintenance.

I do believe there are a few items that a protocol designer has to address (e.g., at the representation level: are BOMs allowed?  They are discouraged, but not disallowed by STD 63 (RFC 3629).  At the semantic level: What about RTL?).  But the current noise about character repertoires is not conducive to making progress on these.

What will happen in practice is that implementers will use whatever Unicode library they have conveniently available to them, which creates large islands of interoperability, but no perfect result.  Some form of soup (RFC 9413 “protocol decay”) will ensue, but the situation will not be much worse than the ecosystem into which these protocol definitions go.

Obviously, the specific editorial problems identified here should be addressed, but this effort should not be conflated with a PRECIS++ attempt.

Grüße, Carsten



[1]: https://www.rfc-editor.org/rfc/rfc9413.html#name-protocol-decay