Re: [I18ndir] [art] New Version Notification for draft-bray-unichars-06.txt

Carsten Bormann <cabo@tzi.org> Tue, 26 September 2023 08:32 UTC

Return-Path: <cabo@tzi.org>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 65A2EC15153F; Tue, 26 Sep 2023 01:32:50 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.906
X-Spam-Level:
X-Spam-Status: No, score=-1.906 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2fF6j-vbinlI; Tue, 26 Sep 2023 01:32:46 -0700 (PDT)
Received: from smtp.zfn.uni-bremen.de (smtp.zfn.uni-bremen.de [IPv6:2001:638:708:32::21]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 09F62C151532; Tue, 26 Sep 2023 01:32:43 -0700 (PDT)
Received: from smtpclient.apple (p548dc15c.dip0.t-ipconnect.de [84.141.193.92]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.zfn.uni-bremen.de (Postfix) with ESMTPSA id 4RvtKL3kS9zDCdH; Tue, 26 Sep 2023 10:32:38 +0200 (CEST)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3731.700.6\))
From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <CAChr6SyuLc6-fLsThQJie2G_K4-vZtPK_emnFyA7NWoakBowiA@mail.gmail.com>
Date: Tue, 26 Sep 2023 10:32:27 +0200
Cc: Tim Bray <tbray@textuality.com>, i18ndir@ietf.org, ART Area <art@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <BB810B3D-C328-40F9-9526-6C732D925561@tzi.org>
References: <169566019635.41806.9804796677919971070@ietfa.amsl.com> <CAHBU6is-wU2NLXNWL56nSJ4=nKvDzGv_Aw4qJN6N2O8CuM4-yw@mail.gmail.com> <CAChr6SwM9re+0X8V9YkFLxkuxhSnu0chW9ecKq1JuNuo4fAEWw@mail.gmail.com> <CAHBU6ivSkEv0AcT52BWrYadmutdYNFx0D0MYR3Sv62a2LXckJw@mail.gmail.com> <CAChr6SyuLc6-fLsThQJie2G_K4-vZtPK_emnFyA7NWoakBowiA@mail.gmail.com>
To: Rob Sayre <sayrer@gmail.com>
X-Mailer: Apple Mail (2.3731.700.6)
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/66QDxyKd3d5F88t2BUD8TlbOwlI>
Subject: Re: [I18ndir] [art] New Version Notification for draft-bray-unichars-06.txt
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 26 Sep 2023 08:32:50 -0000

On 26. Sep 2023, at 03:59, Rob Sayre <sayrer@gmail.com> wrote:
> 
> The aim here is to point out that the Web (and Java, and Windows) can accommodate ill-formed Unicode. Is it possible to transmit any Windows path name via conforming JSON in UTF-8? Yes. Is it a good idea to naively design that into a protocol? No. But you might have to accept these things to be sufficiently compatible with the Web.
> 
> One could call some of this input "toxic waste", but there is a flipside. It would be something like "I'm sorry ACME Basic JSON Parser Version 1.0 can't handle web content". The other formats in the document make this intent explicit, and I support that.

There are certain industrial processes that generate toxic waste.

I would like to point out that the river next to the factory can accommodate toxic waste just fine.
For centuries, this was the way we worked, and it made a lot of business sense to do so.

But then people found out that putting the toxic waste into the river may impact other people.
People who don’t benefit from dumping that toxic waste into the river, but who now have to carry the costs caused by it (building expensive special cleanup into tap water generation, or usually just getting sick).

We call such a situation an externality [1].

Dumping ill-formed data into what should be Unicode is as convenient as dumping toxic waste in the river.
But I don’t think we should force the externality created by this on others, even if you think it makes a lot of business sense to do so.

Any document that treats dumping toxic waste into protocols by playing down this externality (e.g., by putting it at the same level as sending control characters or other “problematic” (but valid) data) is unacceptable.
The IETF should not be in the business of furthering such behavior.

Grüße, Carsten

[1]: https://en.wikipedia.org/wiki/Externality