Re: [xml2rfc-dev] When is @ascii required?

Carsten Bormann <cabo@tzi.org> Sun, 27 October 2019 19:14 UTC

Return-Path: <cabo@tzi.org>
X-Original-To: xml2rfc-dev@ietfa.amsl.com
Delivered-To: xml2rfc-dev@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E11F21200C1 for <xml2rfc-dev@ietfa.amsl.com>; Sun, 27 Oct 2019 12:14:29 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.199
X-Spam-Level:
X-Spam-Status: No, score=-4.199 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id rRRColo9jDPp for <xml2rfc-dev@ietfa.amsl.com>; Sun, 27 Oct 2019 12:14:27 -0700 (PDT)
Received: from gabriel-vm-2.zfn.uni-bremen.de (gabriel-vm-2.zfn.uni-bremen.de [134.102.50.17]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 9B765120024 for <xml2rfc-dev@ietf.org>; Sun, 27 Oct 2019 12:14:27 -0700 (PDT)
Received: from [192.168.217.110] (p5089AE1C.dip0.t-ipconnect.de [80.137.174.28]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by gabriel-vm-2.zfn.uni-bremen.de (Postfix) with ESMTPSA id 471SFs5g7czyPp; Sun, 27 Oct 2019 20:14:25 +0100 (CET)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 11.5 \(3445.9.1\))
From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <7e60b01c-664d-9242-1070-309085b25d16@levkowetz.com>
Date: Sun, 27 Oct 2019 20:14:25 +0100
Cc: xml2rfc-dev@ietf.org
X-Mao-Original-Outgoing-Id: 593896463.1323839-2322858b2e166608051bbcdc158d46ac
Content-Transfer-Encoding: quoted-printable
Message-Id: <3CF0F610-E862-4851-8F08-4894F03C7962@tzi.org>
References: <37D9DCA7-A262-46A6-88C7-369127959164@tzi.org> <7e60b01c-664d-9242-1070-309085b25d16@levkowetz.com>
To: Henrik Levkowetz <henrik@levkowetz.com>
X-Mailer: Apple Mail (2.3445.9.1)
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc-dev/KbA8V6yKOh-DptUS7cEnxxOkEaY>
Subject: Re: [xml2rfc-dev] When is @ascii required?
X-BeenThere: xml2rfc-dev@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Discussion about particulars of xml2rfc V3 design, development and code." <xml2rfc-dev.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/xml2rfc-dev>, <mailto:xml2rfc-dev-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/xml2rfc-dev/>
List-Post: <mailto:xml2rfc-dev@ietf.org>
List-Help: <mailto:xml2rfc-dev-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/xml2rfc-dev>, <mailto:xml2rfc-dev-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 27 Oct 2019 19:14:30 -0000

On Oct 27, 2019, at 19:40, Henrik Levkowetz <henrik@levkowetz.com> wrote:
> 
> Hi Carsten,
> 
> On 2019-10-27 18:24, Carsten Bormann wrote:
>> I’m not quite sure I understand when I have to provide the ascii=“” attribute.
>> 
>> I can have authors that have umlauts without further ado.
>> If I have organizations with umlauts, I need to put in ascii=“”, or I get:
>> 
>> t2trg-sworn.xml(20): Error: Found non-ascii content without matching ascii attribute in <organization>: Universit?t Bremen TZI
> 
> This is inconsistent.

Thanks.  So I’ll continue those experiments with the next xml2rfc version.

(I think that the general direction to allow Latin input but require manual ASCII transliteration for non-Latin is right — most English-speaking people should be able to approximately render Latin outside ßðÐþÞŋƣƦ and maybe İıĸ; there are transliteration libraries for other scripts as well but employing them is probably not a very stable approach.)

>> (Note also that the error message has some mojibake.)
> 
> Is this from the web service or on the command line?

Command line.  I notice that I had a Python 2.7 install running; with 3.7 (I know) I get:

t2trg-sworn.xml(20): Error: Found non-ascii content without matching ascii attribute in <organization>: b'Universit?t Bremen TZI’

which is interesting.

Grüße, Carsten