Re: [Tools-discuss] UTF-8 in PDF on datatracker

Tom Pusateri <pusateri@bangj.com> Mon, 05 August 2019 21:05 UTC

Return-Path: <pusateri@bangj.com>
X-Original-To: tools-discuss@ietfa.amsl.com
Delivered-To: tools-discuss@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 46D761202AA for <tools-discuss@ietfa.amsl.com>; Mon, 5 Aug 2019 14:05:01 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.999
X-Spam-Level:
X-Spam-Status: No, score=-1.999 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, MIME_QP_LONG_LINE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=bangj.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id tr86RsNM6ONK for <tools-discuss@ietfa.amsl.com>; Mon, 5 Aug 2019 14:04:59 -0700 (PDT)
Received: from oj.bangj.com (69-77-154-174.static.skybest.com [69.77.154.174]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 2B5191202A5 for <tools-discuss@ietf.org>; Mon, 5 Aug 2019 14:04:59 -0700 (PDT)
Received: from [172.16.25.117] (69-77-155-155.static.skybest.com [69.77.155.155]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by oj.bangj.com (Postfix) with ESMTPSA id 4364431834; Mon, 5 Aug 2019 17:04:58 -0400 (EDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=bangj.com; s=201907; t=1565039098; bh=qVsYzJfli/bdMkRVig4ssH8R/O/sBesxj7QurXzvuq4=; h=Subject:From:In-Reply-To:Date:Cc:References:To:From; b=o/xv8k+HL9cCK06ehoo3/M/hr78CBEq1hjoXFLyqgOfFGUYBmZ6oHbux/+309J/jK 0y5CmS0Vykx0mbV6b6ujBOyEE5ivrueK0G5AARDPpZPH1DN7rAqvuLyBEO7hNluHwa mCGwTy7Ug3AcAeHsLcU2IT8he1oKiKrqEVfo169h/mmrMbboxhturmsjdedjW8Z00q s+hvc54ccvKarJ/G8KW4Klo0g5R5sQDmZujvFv4UeHnsdVHWxvFUPncsxadSSMqaNE LXoZP4kajmQu/R3yjfbzZ0vbGSOKKNgg9IzQl12dzO06/s34/hgfOUtSpWROiDETxC 1REPSEHBB9JeA==
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (1.0)
From: Tom Pusateri <pusateri@bangj.com>
X-Mailer: iPad Mail (16G77)
In-Reply-To: <499800be-6624-8b70-3516-3d9c07392b2f@levkowetz.com>
Date: Mon, 05 Aug 2019 17:04:57 -0400
Cc: Tom Pusateri <pusateri=40bangj.com@dmarc.ietf.org>, Carsten Bormann <cabo@tzi.org>, tools-discuss@ietf.org
Content-Transfer-Encoding: quoted-printable
Message-Id: <0E31A757-CECF-4F6E-B7F9-B1BF075F3404@bangj.com>
References: <AE88CB27-2A5A-47D7-9CAB-3C2D08CDCE33@bangj.com> <96DA1C37-0268-4D29-B472-3BB38AE1ED5A@tzi.org> <B7BFFE9C-C96E-45F2-B380-F9B94A18404C@bangj.com> <499800be-6624-8b70-3516-3d9c07392b2f@levkowetz.com>
To: Henrik Levkowetz <henrik@levkowetz.com>
Archived-At: <https://mailarchive.ietf.org/arch/msg/tools-discuss/InRQ85Q7uPhs1C-uEpfLUkRgjL0>
Subject: Re: [Tools-discuss] UTF-8 in PDF on datatracker
X-BeenThere: tools-discuss@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF Tools Discussion <tools-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tools-discuss/>
List-Post: <mailto:tools-discuss@ietf.org>
List-Help: <mailto:tools-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 05 Aug 2019 21:05:02 -0000


> On Aug 5, 2019, at 4:16 PM, Henrik Levkowetz <henrik@levkowetz.com> wrote:
> 
> Hi Tom,
> 
>> On 2019-08-05 20:01, Tom Pusateri wrote:
>> 
>> 
>>> On Aug 5, 2019, at 1:54 PM, Carsten Bormann <cabo@tzi.org> wrote:
>>> 
>>> On Aug 5, 2019, at 19:29, Tom Pusateri <pusateri=40bangj.com@dmarc.ietf.org> wrote:
>>>> 
>>>> “On page 21, HTML and text version has “TTL⩾0", which gets mangled in PDF version.”
>>> 
>>> Well, you have a 
>>> 
>>> U+2A7E GREATER-THAN OR SLANTED EQUAL TO
>>> 
>>> there.  I think you want a
>>> 
>>> U+2265 GREATER-THAN OR EQUAL TO
>>> 
>>> (I have no comment on the PDF situation.)
>>> 
>>> Grüße, Carsten
>> 
>> Thanks. My co-author inserted it and also added the —utf8 switch but I see that is now deprecated:
>> 
>> xml2rfc --utf8 draft-ietf-dnssd-push.xml -o draft-ietf-dnssd-push.html --html
>> Warning: The --utf8 switch is deprecated.  Use the new unicode insertion element <u>
> 
> In this case (since the PDF is created from the htmlized text), this is
> irrelevant.
> 
> However, even if --utf8 (for v2 xml) is deprecated, it still works for some
> uses.  Going forward, it will eventually be removed.  But the bug here is in
> the htmlized-text to pdf processing, not in whether --utf8 is used or not.
> 
> 
>    Henrik

In this case, for v2 or v3, I don’t think these two options (—utf8 and <u>) are equivalent.
In the current xml2rfc using v2 input, the —utf8 option is necessary to get the correct TXT and HTML output.

The <u> element will not do this as currently defined and seems to be for a different purpose.

Thanks,
Tom