Re: [Tools-discuss] UTF-8 in PDF on datatracker

Henrik Levkowetz <henrik@levkowetz.com> Mon, 05 August 2019 20:16 UTC

Return-Path: <henrik@levkowetz.com>
X-Original-To: tools-discuss@ietfa.amsl.com
Delivered-To: tools-discuss@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B2F3312012A for <tools-discuss@ietfa.amsl.com>; Mon, 5 Aug 2019 13:16:41 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.899
X-Spam-Level:
X-Spam-Status: No, score=-1.899 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id p-b7plF46uPb for <tools-discuss@ietfa.amsl.com>; Mon, 5 Aug 2019 13:16:40 -0700 (PDT)
Received: from zinfandel.tools.ietf.org (zinfandel.tools.ietf.org [IPv6:2001:1890:126c::1:2a]) (using TLSv1.2 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 40E2F1200D8 for <tools-discuss@ietf.org>; Mon, 5 Aug 2019 13:16:40 -0700 (PDT)
Received: from h-202-242.a357.priv.bahnhof.se ([158.174.202.242]:51941 helo=tannat.localdomain) by zinfandel.tools.ietf.org with esmtpsa (TLS1.2:DHE_RSA_AES_128_CBC_SHA1:128) (Exim 4.80) (envelope-from <henrik@levkowetz.com>) id 1hujPI-0000Li-ON; Mon, 05 Aug 2019 13:16:38 -0700
To: Tom Pusateri <pusateri=40bangj.com@dmarc.ietf.org>, Carsten Bormann <cabo@tzi.org>
References: <AE88CB27-2A5A-47D7-9CAB-3C2D08CDCE33@bangj.com> <96DA1C37-0268-4D29-B472-3BB38AE1ED5A@tzi.org> <B7BFFE9C-C96E-45F2-B380-F9B94A18404C@bangj.com>
Cc: tools-discuss@ietf.org
From: Henrik Levkowetz <henrik@levkowetz.com>
Message-ID: <499800be-6624-8b70-3516-3d9c07392b2f@levkowetz.com>
Date: Mon, 05 Aug 2019 22:16:27 +0200
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.11; rv:45.0) Gecko/20100101 Thunderbird/45.8.0
MIME-Version: 1.0
In-Reply-To: <B7BFFE9C-C96E-45F2-B380-F9B94A18404C@bangj.com>
Content-Type: multipart/signed; micalg="pgp-sha256"; protocol="application/pgp-signature"; boundary="dmjqm9wvWIC7lWgF47rfXrHhpTF2qNEnS"
X-SA-Exim-Connect-IP: 158.174.202.242
X-SA-Exim-Rcpt-To: tools-discuss@ietf.org, cabo@tzi.org, pusateri=40bangj.com@dmarc.ietf.org
X-SA-Exim-Mail-From: henrik@levkowetz.com
X-SA-Exim-Version: 4.2.1 (built Mon, 26 Dec 2011 16:24:06 +0000)
X-SA-Exim-Scanned: Yes (on zinfandel.tools.ietf.org)
X-Clacks-Overhead: GNU Terry Pratchett
Archived-At: <https://mailarchive.ietf.org/arch/msg/tools-discuss/6tbD2ojRF9PQuNBnAfkoXWMKA0Y>
Subject: Re: [Tools-discuss] UTF-8 in PDF on datatracker
X-BeenThere: tools-discuss@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF Tools Discussion <tools-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tools-discuss/>
List-Post: <mailto:tools-discuss@ietf.org>
List-Help: <mailto:tools-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 05 Aug 2019 20:16:42 -0000

Hi Tom,

On 2019-08-05 20:01, Tom Pusateri wrote:
> 
> 
>> On Aug 5, 2019, at 1:54 PM, Carsten Bormann <cabo@tzi.org> wrote:
>> 
>> On Aug 5, 2019, at 19:29, Tom Pusateri <pusateri=40bangj.com@dmarc.ietf.org> wrote:
>>> 
>>> “On page 21, HTML and text version has “TTL⩾0", which gets mangled in PDF version.”
>> 
>> Well, you have a 
>> 
>> U+2A7E GREATER-THAN OR SLANTED EQUAL TO
>> 
>> there.  I think you want a
>> 
>> U+2265 GREATER-THAN OR EQUAL TO
>> 
>> (I have no comment on the PDF situation.)
>> 
>> Grüße, Carsten
> 
> Thanks. My co-author inserted it and also added the —utf8 switch but I see that is now deprecated:
> 
> xml2rfc --utf8 draft-ietf-dnssd-push.xml -o draft-ietf-dnssd-push.html --html
> Warning: The --utf8 switch is deprecated.  Use the new unicode insertion element <u>

In this case (since the PDF is created from the htmlized text), this is
irrelevant.

However, even if --utf8 (for v2 xml) is deprecated, it still works for some
uses.  Going forward, it will eventually be removed.  But the bug here is in
the htmlized-text to pdf processing, not in whether --utf8 is used or not.


	Henrik