Re: [rfc-i] Unicode names in RFCs and xml2rfc

Julian Reschke <julian.reschke@gmx.de> Wed, 04 December 2019 05:53 UTC

Return-Path: <rfc-interest-bounces@rfc-editor.org>
X-Original-To: ietfarch-rfc-interest-archive@ietfa.amsl.com
Delivered-To: ietfarch-rfc-interest-archive@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1E65E1200EB for <ietfarch-rfc-interest-archive@ietfa.amsl.com>; Tue, 3 Dec 2019 21:53:52 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.499
X-Spam-Level:
X-Spam-Status: No, score=-4.499 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_INVALID=0.1, DKIM_SIGNED=0.1, FREEMAIL_FORGED_FROMDOMAIN=0.25, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.25, MAILING_LIST_MULTI=-1, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=fail (1024-bit key) reason="fail (body has been altered)" header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id O7kKH92N3GQO for <ietfarch-rfc-interest-archive@ietfa.amsl.com>; Tue, 3 Dec 2019 21:53:49 -0800 (PST)
Received: from rfc-editor.org (rfc-editor.org [4.31.198.49]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CA466120059 for <rfc-interest-archive-eekabaiReiB1@ietf.org>; Tue, 3 Dec 2019 21:53:49 -0800 (PST)
Received: from rfcpa.amsl.com (localhost [IPv6:::1]) by rfc-editor.org (Postfix) with ESMTP id 91BE9F40727; Tue, 3 Dec 2019 21:53:47 -0800 (PST)
X-Original-To: rfc-interest@rfc-editor.org
Delivered-To: rfc-interest@rfc-editor.org
Received: from localhost (localhost [127.0.0.1]) by rfc-editor.org (Postfix) with ESMTP id D3077F40727 for <rfc-interest@rfc-editor.org>; Tue, 3 Dec 2019 21:53:45 -0800 (PST)
X-Virus-Scanned: amavisd-new at rfc-editor.org
Authentication-Results: rfcpa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from rfc-editor.org ([127.0.0.1]) by localhost (rfcpa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Wy7oCfFcIaIq for <rfc-interest@rfc-editor.org>; Tue, 3 Dec 2019 21:53:44 -0800 (PST)
Received: from mout.gmx.net (mout.gmx.net [212.227.15.19]) by rfc-editor.org (Postfix) with ESMTPS id 4E175F40723 for <rfc-interest@rfc-editor.org>; Tue, 3 Dec 2019 21:53:44 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1575438824; bh=VTsrroFesI5X2ADbGD5oR1tdQzd0drE7Beo5L1549qc=; h=X-UI-Sender-Class:Subject:To:References:From:Date:In-Reply-To; b=MWm6ehh9d2J/X31zzgamWIERswKsy0iJZmfowNjcn3MKl/x5aSwdi1bH1jRlo90s5 83kch32eiiDgZfLApBXb960Yr51PPDwaQtoM5s5buC3YFchc81e6QhGzG/7QjSVnvY QRqVYRqdDmwlJY1sz+wu6j3lKUBxaIDxJaqodRs8=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [192.168.178.124] ([217.251.139.244]) by mail.gmx.com (mrgmx004 [212.227.17.190]) with ESMTPSA (Nemesis) id 1MJVDW-1iMu0M3iB6-00JpoP; Wed, 04 Dec 2019 06:48:30 +0100
To: Martin Thomson <mt@lowentropy.net>, rfc-interest@rfc-editor.org
References: <76d730cb-9fe1-4572-acbe-8db5bc0bd598@www.fastmail.com>
From: Julian Reschke <julian.reschke@gmx.de>
Message-ID: <7abf6aee-5b50-8821-62ba-1ee339e629cc@gmx.de>
Date: Wed, 04 Dec 2019 06:48:28 +0100
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.9.1
MIME-Version: 1.0
In-Reply-To: <76d730cb-9fe1-4572-acbe-8db5bc0bd598@www.fastmail.com>
Content-Language: en-US
X-Provags-ID: V03:K1:igjYJcaOfZaQep5BTaIbvvIC1pG5NwAyMWY2PTiUErN1HkfRkgr 4tilMCnmsI9ntGrOPcyYHzDBMEMJQdK5NacvCOsDFM+IY+zhHobgreprMGDJs6oqYXNPGoL BxGYlPHukBNhTt0n3EK894RsjgD+1VHcXfB//Wk+6tiDU7Ac/UdeSHNRpEiNm8hWKy27i5w tn5API+FOiaesdKNiTQag==
X-UI-Out-Filterresults: notjunk:1;V03:K0:1FuWj1f0RJ0=:27nvg2XPwrHCJk9G+8J4xx wXeqiu0nuw0n/U4CGL+z07/9L9Phh1tb3TXR4DZ7IKfGiypYVhKOmn+bF7VTm98210TggKkxs 2Bb8LhiKiEUUJfiYiuDVdN1AaefU5+3wagHrFv9B11x883Nw4gRIxxq7Ut41sQsl9EBZQZaEF tNzpGJzgXXwn2mhvJsQMyF+Rqbq2TJGhG/H2qnY7Ns0NgNC2B9M4GdDD05haW6oay0RLIXQZP ms7q2tSFipSB4ali03mbLIYFgr5bIGOgcBlxbePWPcwSHOAr/9U1ORG42itYwNXPgTR7qkT7a 0lPMXn8mgzPiHENrcY5MbPwWHVfC3QgSLHkEhRCxKzt1mllONhmJeSLjvx9X8Yx4D2Y5+JiBG mit2sJ+pulFotsRTgM6ZJeSwC/xX4kYhonxsXrhSGpzMc0mLEl/wZ5iYJNm2nkMI5xI0gxmvg HV33aOAVxTd+ht2oV4DNTQi5PC3twCkmh4dF6EIEfDxQhvQrBO/6jhgraXa6bt0/XY/Iaxr+g bsekFgBNDHj4983Y+A10sp/IP54DJE/tCcMKi6nx2SLjsd19tKTz7PknH/flP7U6f9qJVSmcz 0PfoQrdL1UcxQM4imyasl2XA9Hy/ahv7MljungafzUXP0RKlA3Fp0X+B2Lgj5wbSHqjV3CWun XwXXuxYNXy3fhb/WOoni6Jip+zH4ncymdcUl7/jW8dM/AQ+98mAmLFw8VdN4SYyc8VZMDSdpg XxAuRBVn1rdboqO/bYWOXUzrfmjHQOTNfXySCWzArm2jMs0q/yCXu1MqgX0TEn9zFr6uME3oO qbLi+IkOkcm2mzu2dKfqNzR2TQHWBbaAWNj7Qw+CARU7I3K3Y0sTYojkjl/3YfHygdYnsShWs 6UfmtmGgIbaDXeHr6leNb0EiElHQp42o40OJNDznPn+K82UB+JXU61T5OD5zPAylPN6g23EIu MHcBJ7JAqtaVxSg8zhJ9GYSsK90bdMwkK/y4xi5QOWlOxAkZJvcwcTnFqlPQdziaShVnJ1dMx A8PVf9tTQqFJsP/n7s/8uBL0J8Yv4ODvkMfTwfr0C3KfK1YL7b1p1qCfTM1CecHuHM56xRYlx EE7pOx5wENIcBE6J923jJZXZit7irXwjR67xCEmo3Y3Df55Ce1lRSGifuG04lKHPol4W2pyYo RTeuuWUwPvfqF2I2qzmJyEksTV5FnP8dFDh/t2VGOrKHgX54+9zCvu0x0zFg2Mg0EH+Sokl6z 27KW5+HSmoGTvj6kByhcGC9jxZVx6pseXqZ0FrncfqdryqgdMMHbf9250ci8=
Subject: Re: [rfc-i] Unicode names in RFCs and xml2rfc
X-BeenThere: rfc-interest@rfc-editor.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "A list for discussion of the RFC series and RFC Editor functions." <rfc-interest.rfc-editor.org>
List-Unsubscribe: <https://www.rfc-editor.org/mailman/options/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=unsubscribe>
List-Archive: <http://www.rfc-editor.org/pipermail/rfc-interest/>
List-Post: <mailto:rfc-interest@rfc-editor.org>
List-Help: <mailto:rfc-interest-request@rfc-editor.org?subject=help>
List-Subscribe: <https://www.rfc-editor.org/mailman/listinfo/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=subscribe>
Content-Transfer-Encoding: base64
Content-Type: text/plain; charset="utf-8"; Format="flowed"
Errors-To: rfc-interest-bounces@rfc-editor.org
Sender: rfc-interest <rfc-interest-bounces@rfc-editor.org>

On 04.12.2019 03:31, Martin Thomson wrote:
> I'm reading the code in xml2rfc to work out how it is intended to work and finding it extraordinarily difficult to achieve a relatively modest goal: putting a person's name into the document.
>
> My requirements are simple: acknowledge contributions using a person's preferred name.  More concretely, I see no value in expanding ø or ü, but I would however like to provide ASCII analogues of the Japanese names in the list.   This goal seems consistent with the text in RFC 7997:
>
>     Person names may appear in several places within an RFC (e.g., the
>     header, Acknowledgements, and References).  When a script outside the
>     Unicode Latin blocks [UNICODE-CHART] is used for an individual name,
>     an author-provided, ASCII-only identifier will appear immediately
>     after the non-Latin characters, surrounded by parentheses.  This will
>     improve general readability of the text.
>
> I'm talking about acknowledgments, so the list appears in a <t> element.  The intent is to render the list of names in an ordinary paragraph, with commas separating each.
>
> None of the elements that permit Unicode text fit in this context.  I realize that I could use <artwork> for this, but that's clearly an abuse of that element; more so because it renders very differently depending on context (I could probably do something with SVG, now that I think of it...).
>
> <u> is singularly unsuitable for this purpose.  It insists on - at a minimum - including the U+NNNN notation for every character.  If I could use format="char" or format="char-ascii" it might be acceptable.  Assuming that I have properly understood the code.  The <u> element is not documented in RFC 7991.
>
> I appreciate the value in having a clear signal from the author that a block of text is intended to include Unicode.  Unicode tends to lead to all sorts of inconvenient inconsistencies, like multiple different dash and hyphen styles, quoting variations, and other such oddities.  I can (grudgingly) accept that some sort of indication is appropriate so that what should be relatively uncommon text usage can be scrutinized additionally.
>
> It shouldn't be this difficult to acknowledge someone using their name.

This has been raised as
<https://trac.tools.ietf.org/tools/xml2rfc/trac/ticket/416> (end of August).

Best regards, Julian

PS: it seems that the issue tracker actually is unmaintained; tons of
tickets are in the "New" state, although some of the issues apparently
have been addressed. Is there a better place to raise issues?
_______________________________________________
rfc-interest mailing list
rfc-interest@rfc-editor.org
https://www.rfc-editor.org/mailman/listinfo/rfc-interest