Re: [rfc-i] character sets, Bug in RFC Search page...

"John Levine" <johnl@taugh.com> Sat, 30 September 2017 15:42 UTC

Return-Path: <rfc-interest-bounces@rfc-editor.org>
X-Original-To: ietfarch-rfc-interest-archive@ietfa.amsl.com
Delivered-To: ietfarch-rfc-interest-archive@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D51C4134290 for <ietfarch-rfc-interest-archive@ietfa.amsl.com>; Sat, 30 Sep 2017 08:42:48 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.299
X-Spam-Level:
X-Spam-Status: No, score=-2.299 tagged_above=-999 required=5 tests=[HEADER_FROM_DIFFERENT_DOMAINS=0.001, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id BlgDC_xL5-ZL for <ietfarch-rfc-interest-archive@ietfa.amsl.com>; Sat, 30 Sep 2017 08:42:47 -0700 (PDT)
Received: from rfc-editor.org (rfc-editor.org [4.31.198.49]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 860A71330B1 for <rfc-interest-archive-eekabaiReiB1@ietf.org>; Sat, 30 Sep 2017 08:42:47 -0700 (PDT)
Received: from rfcpa.amsl.com (localhost [IPv6:::1]) by rfc-editor.org (Postfix) with ESMTP id D47A3B81030; Sat, 30 Sep 2017 08:42:45 -0700 (PDT)
X-Original-To: rfc-interest@rfc-editor.org
Delivered-To: rfc-interest@rfc-editor.org
Received: from localhost (localhost [127.0.0.1]) by rfc-editor.org (Postfix) with ESMTP id 2E25CB80DA0 for <rfc-interest@rfc-editor.org>; Wed, 27 Sep 2017 10:53:47 -0700 (PDT)
X-Virus-Scanned: amavisd-new at rfc-editor.org
Received: from rfc-editor.org ([127.0.0.1]) by localhost (rfcpa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id gHg5CBS9X1Sg for <rfc-interest@rfc-editor.org>; Wed, 27 Sep 2017 10:53:46 -0700 (PDT)
Received: from gal.iecc.com (abusenet-1-pt.tunnel.tserv4.nyc4.ipv6.he.net [IPv6:2001:470:1f06:1126::2]) by rfc-editor.org (Postfix) with ESMTPS id DA217B80D9D for <rfc-interest@rfc-editor.org>; Wed, 27 Sep 2017 10:53:45 -0700 (PDT)
Received: (qmail 93809 invoked from network); 27 Sep 2017 17:54:10 -0000
Received: from unknown (64.57.183.53) by gal.iecc.com with QMQP; 27 Sep 2017 17:54:10 -0000
Date: Wed, 27 Sep 2017 17:53:48 -0000
Message-ID: <20170927175348.15593.qmail@ary.lan>
From: John Levine <johnl@taugh.com>
To: rfc-interest@rfc-editor.org
In-Reply-To: <20170926215743.GM31007@faui40p.informatik.uni-erlangen.de>
Organization:
X-Headerized: yes
Mime-Version: 1.0
X-Mailman-Approved-At: Sat, 30 Sep 2017 08:42:44 -0700
Subject: Re: [rfc-i] character sets, Bug in RFC Search page...
X-BeenThere: rfc-interest@rfc-editor.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: "A list for discussion of the RFC series and RFC Editor functions." <rfc-interest.rfc-editor.org>
List-Unsubscribe: <https://www.rfc-editor.org/mailman/options/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=unsubscribe>
List-Archive: <http://www.rfc-editor.org/pipermail/rfc-interest/>
List-Post: <mailto:rfc-interest@rfc-editor.org>
List-Help: <mailto:rfc-interest-request@rfc-editor.org?subject=help>
List-Subscribe: <https://www.rfc-editor.org/mailman/listinfo/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=subscribe>
Cc: tte@cs.fau.de
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Errors-To: rfc-interest-bounces@rfc-editor.org
Sender: rfc-interest <rfc-interest-bounces@rfc-editor.org>

In article <20170926215743.GM31007@faui40p.informatik.uni-erlangen.de> you write:
>Given how browsers are quite inconsistent in the characer sets they load, it would be nice to have explicit indication
>of the character sets required to render a document. I've seen PDF documents where i had to go to page 100 before some
>crucial text was not rendered because it used some character set not available to me.

Those are buggy PDFs.  A PDF is supposed to contain all the fonts it
uses beyond a small standard set, but there is a lot of PDF crudware
that wrongly assumes that whatever fonts it finds on the machine where
it's running will exist everywhere.  

>There is a lot of tool chains out there that will not render UTF-8 correctly,
>therefore it is IMHO extremely viable to have a clear indication of the
>minimum toolchain features required to render a document correctly (in the first
>page header of an RFC wold be great).

Hey, maybe we could put a couple of bytes at the begining that say
this is a UTF-8 document.

R's,
John
_______________________________________________
rfc-interest mailing list
rfc-interest@rfc-editor.org
https://www.rfc-editor.org/mailman/listinfo/rfc-interest