Re: Reference for UTF8 in SSH UTF8 terminal mode

Ron Frederick <ronf@timeheart.net> Sun, 11 December 2016 06:49 UTC

Return-Path: <bounces-ietf-ssh-owner-secsh-tyoxbijeg7-archive=lists.ietf.org@NetBSD.org>
X-Original-To: ietfarch-secsh-tyoxbijeg7-archive@ietfa.amsl.com
Delivered-To: ietfarch-secsh-tyoxbijeg7-archive@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 32AD712956C for <ietfarch-secsh-tyoxbijeg7-archive@ietfa.amsl.com>; Sat, 10 Dec 2016 22:49:42 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.195
X-Spam-Level:
X-Spam-Status: No, score=-7.195 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_MED=-2.3, RP_MATCHES_RCVD=-2.896, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=timeheart.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 49p95eloD0oh for <ietfarch-secsh-tyoxbijeg7-archive@ietfa.amsl.com>; Sat, 10 Dec 2016 22:49:39 -0800 (PST)
Received: from mail.netbsd.org (mail.NetBSD.org [IPv6:2001:470:a085:999::25]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 8041112957F for <secsh-tyoxbijeg7-archive@lists.ietf.org>; Sat, 10 Dec 2016 22:49:39 -0800 (PST)
Received: by mail.netbsd.org (Postfix, from userid 605) id 32B6C855FD; Sun, 11 Dec 2016 06:49:39 +0000 (UTC)
Delivered-To: ietf-ssh@netbsd.org
Received: by mail.netbsd.org (Postfix, from userid 1347) id CDD98855A4; Sun, 11 Dec 2016 06:49:38 +0000 (UTC)
Received: from localhost (localhost [127.0.0.1]) by mail.netbsd.org (Postfix) with ESMTP id B3F4885586 for <ietf-ssh@netbsd.org>; Sat, 10 Dec 2016 15:06:06 +0000 (UTC)
X-Virus-Scanned: amavisd-new at netbsd.org
Authentication-Results: mail.netbsd.org (amavisd-new); dkim=pass (1024-bit key) header.d=timeheart.net
Received: from mail.netbsd.org ([IPv6:::1]) by localhost (mail.netbsd.org [IPv6:::1]) (amavisd-new, port 10025) with ESMTP id pb3Qv4h1vnbZ for <ietf-ssh@netbsd.org>; Sat, 10 Dec 2016 15:06:05 +0000 (UTC)
Received: from mail-pg0-x230.google.com (mail-pg0-x230.google.com [IPv6:2607:f8b0:400e:c05::230]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.netbsd.org (Postfix) with ESMTPS id 78EE584CEE for <ietf-ssh@netbsd.org>; Sat, 10 Dec 2016 15:06:04 +0000 (UTC)
Received: by mail-pg0-x230.google.com with SMTP id x23so18514033pgx.1 for <ietf-ssh@netbsd.org>; Sat, 10 Dec 2016 07:06:04 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=timeheart.net; s=mail; h=from:message-id:mime-version:subject:date:in-reply-to:cc:to :references; bh=pCdFlDHPCHzR+G7cafb/NfrpukibA2s/GbK0uaOorzg=; b=ZVTdJM2mAZ2EN3LjyYWXdWiP5Tvznn5JS94sXAgBiEe/Rh2Se2jeuIc+IQaCAegdbT CE+12eSz6UuOgdAhxtuDPPDmtB2q0lBBRetQ8lE8wWT9sRwU3NQjQB0/zxqeGR9WxBb2 gwATyF16kw8FSRtsrZRgLfEvv/OwAPmOLCNNA=
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:message-id:mime-version:subject:date :in-reply-to:cc:to:references; bh=pCdFlDHPCHzR+G7cafb/NfrpukibA2s/GbK0uaOorzg=; b=KDyARHrW7/9vrFjSGBlBZ8Xqfv+gNIgkmARe3xJeVZJzunhuv7TBDJ1bdGKGGh2CrV NB8rhKmlCu3tAJWBtM6+l5kMbW97NQSgZIhTQAHULhxQTJDRWbmVdPQ6fpd3RWZzz4pz N1XdhKGPuDBT5VoNQSko001d+KnOl89GRtAFlf+7tbT1VeVZ4VkArY65RhTIOH0Gg5WD AR+xBy9nzTB7ox46y+o1VOxxIhXHmrRTDxc8/qchiURNxs+2g6ZNUo1vu2kZBZgT302M LyIbtoLwRRCGRGbNUWrGdqAkqvwuUgmVPNNtYzsquKZKpRs6dvlqj25MaV/eCZ3pgxu6 OE1Q==
X-Gm-Message-State: AKaTC00JdRNCUzp5xqsEbXizP7VwZAgcpBs3Qg9aWrzbv2jY5MCVZDZoskW3np0pFlUGdg==
X-Received: by 10.98.192.148 with SMTP id g20mr88086771pfk.36.1481382364155; Sat, 10 Dec 2016 07:06:04 -0800 (PST)
Received: from ?IPv6:2601:647:4282:2200:cfa:5497:d692:7eeb? ([2601:647:4282:2200:cfa:5497:d692:7eeb]) by smtp.gmail.com with ESMTPSA id p14sm32618911pfl.74.2016.12.10.07.06.01 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sat, 10 Dec 2016 07:06:02 -0800 (PST)
From: Ron Frederick <ronf@timeheart.net>
Message-Id: <B5D11E46-3FF4-48D7-98CE-328A18C6137D@timeheart.net>
Content-Type: multipart/alternative; boundary="Apple-Mail=_2784F5CC-49C1-4E0E-8229-945DFA91F089"
Mime-Version: 1.0 (Mac OS X Mail 10.1 \(3251\))
Subject: Re: Reference for UTF8 in SSH UTF8 terminal mode
Date: Sat, 10 Dec 2016 07:06:00 -0800
In-Reply-To: <2DD56D786E600F45AC6BDE7DA4E8A8C117FE79EF@eusaamb107.ericsson.se>
Cc: "Mark D. Baushke" <mdb@juniper.net>, "ietf-ssh@netbsd.org" <ietf-ssh@netbsd.org>
To: Daniel Migault <daniel.migault@ericsson.com>
References: <2DD56D786E600F45AC6BDE7DA4E8A8C117FE79EF@eusaamb107.ericsson.se>
X-Mailer: Apple Mail (2.3251)
Sender: ietf-ssh-owner@NetBSD.org
List-Id: ietf-ssh.NetBSD.org
Precedence: list

On Dec 9, 2016, at 6:53 PM, Daniel Migault <daniel.migault@ericsson.com> wrote:
> We are looking at which reference to UTF8 we should mention into the SSH UTF8 terminal mode.
>  
> [1] mentions that  RFC3629 is slightly out of date and that a reference to ISO/IEC 10646:2014 may also be useful.
>  
> Is anyone aware of any deficiencies in RFC3629 fixed in ISO/IEC 10646:2014 ?
>  
> The question is whether we should have one reference or both in the draft. Unless RFC 3629 has some deficiencies fixed in ISO/IEC 10646:2014, I am incline to have only RFC3629. Is that something that sounds reasonable to everyone ?  


According to RFC 3629:

   NOTE -- The authoritative definition of UTF-8 is in [UNICODE].  This
   grammar is believed to describe the same thing Unicode describes, but
   does not claim to be authoritative.  Implementors are urged to rely
   on the authoritative source, rather than on this ABNF.

The actual link in the RFC for [UNICODE] is somewhat out of date as you said, but wouldn’t the best reference here be to http://www.unicode.org/versions/latest/ <http://www.unicode.org/versions/latest/> listed on the page that the RFC links to? According to that page:
The documentation for the latest version of the Unicode Standard is always found at the stable link: http://www.unicode.org/versions/latest/ <http://www.unicode.org/versions/latest/>
This reference also has an excellent appendix which describes the relationship between the Unicode standard and ISO/IEC 10646 at:

http://www.unicode.org/versions/Unicode9.0.0/appC.pdf <http://www.unicode.org/versions/Unicode9.0.0/appC.pdf>

The definition of UTF-8 can be found in section 3.9 of the Unicode specification, and the appendix states:
The ISO/IEC 10646 definition of UTF-8 is identical to UTF-8 as described under Definition D92 in Section 3.9, Unicode Encoding Forms. 

-- 
Ron Frederick
ronf@timeheart.net