Re: [art] Modern Network Unicode — –02 submitted

Larry Masinter <masinter@gmail.com> Tue, 09 July 2019 03:17 UTC

Return-Path: <masinter@gmail.com>
X-Original-To: art@ietfa.amsl.com
Delivered-To: art@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 293E91200FD for <art@ietfa.amsl.com>; Mon, 8 Jul 2019 20:17:46 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.701
X-Spam-Level:
X-Spam-Status: No, score=-0.701 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, MIME_QP_LONG_LINE=0.001, PDS_NO_HELO_DNS=1.295, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id PaVwzXuwEW-1 for <art@ietfa.amsl.com>; Mon, 8 Jul 2019 20:17:44 -0700 (PDT)
Received: from mail-pf1-x42e.google.com (mail-pf1-x42e.google.com [IPv6:2607:f8b0:4864:20::42e]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E29EC1200EF for <art@ietf.org>; Mon, 8 Jul 2019 20:17:43 -0700 (PDT)
Received: by mail-pf1-x42e.google.com with SMTP id g2so3773791pfq.0 for <art@ietf.org>; Mon, 08 Jul 2019 20:17:43 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=0bdmqAgZs2+qE2fDZ/g5No5aVhVsoTW3L6R1t/1b1eQ=; b=msio975q3DYWafEup2iS/rj9xdUqDsCghyA3Z2ihSD0Z+2xIAL4gLZtQ2eOgWm+7zi notzZ/SUs77ojyqBiRZDLlcmzqwtpea6peVsfdSIXPVEkH2aTGsPaa61G2UrTxIa8fgi ZuYkmifTJi2cZzFEexKd4CI/MTgF2OLCJZ/+dNVscK85orMHBE/kR+XzHAt4Jy5iQvJ8 g25yVxl/nNfjZ6Kb83Hab/Yzgw8paHPtHtMdcNZSkM9Xxip4StwhdR5Kw0HTGQ5Z2qMd mCSofHtRtMpg1yu6npRNA/7SoSvkIqPxqRrFma0dOnJFTuNBQV7j56uUi/KB/zAlCa2D PTWw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=0bdmqAgZs2+qE2fDZ/g5No5aVhVsoTW3L6R1t/1b1eQ=; b=cQ9Kg5Ihg5l9C+BCa4mGaulEEGoPG13L8f8FK8gtXGI+Kfk+09ZvV0m4HIP4waCaym VkUqPsle+kV/+/Hf9Jyw3H/F5tLbOts/3znPEvluCPcOjeCxW+N7h123RnbwrTWcTLcO big5lfCNtP1z7GtXxaETHMs9TsTQNkOYyB8ZuhONraNUyUcjZESYmgPM1EwCTAMfsDo8 Hi+MWqVS9TJvl+LHPGxPiHiu0S4jCSO6v8YsHd6oDfza7IzVhPFxZdoQqrhe9fkRiULG 76k8N3wHPemNhJRYlXbjLjGRFeSs2cjUMbVfR7vlFQ/iWrMp/6uGHSpfOO2pDgfe/ufB +/oQ==
X-Gm-Message-State: APjAAAW3NBRXmBDOkyGknWU7U67jIrDZHMi/ioBSRWjn/BKvLoaMk+z9 kjbJuCXGStEJt4Zn55OuCj0=
X-Google-Smtp-Source: APXvYqytfZJuKZjQ6OmZOPn0Geg5ZiP3ShT2VnKzMuXQF+YCk63AcB6T92zv1M2W7CdO+J9ocxB7hw==
X-Received: by 2002:a63:1908:: with SMTP id z8mr20155118pgl.433.1562642262103; Mon, 08 Jul 2019 20:17:42 -0700 (PDT)
Received: from [10.54.205.172] ([65.121.125.202]) by smtp.gmail.com with ESMTPSA id a10sm11441356pfc.162.2019.07.08.20.17.40 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 08 Jul 2019 20:17:40 -0700 (PDT)
Content-Type: multipart/alternative; boundary="Apple-Mail-955623CA-41B4-4B55-BBB5-5D1F7557B484"
Mime-Version: 1.0 (1.0)
From: Larry Masinter <masinter@gmail.com>
X-Mailer: iPhone Mail (16F203)
In-Reply-To: <5d23ea83.1c69fb81.23b36.b4fb@mx.google.com>
Date: Mon, 08 Jul 2019 21:17:39 -0600
Cc: Carsten Bormann <cabo@tzi.org>, "Manger, James" <James.H.Manger@team.telstra.com>, "art@ietf.org" <art@ietf.org>
Content-Transfer-Encoding: 7bit
Message-Id: <BAC5091D-415B-403F-BAED-56D4A081CB32@gmail.com>
References: <CE3AD543-5847-4CAA-9B37-B293BF74C7D8@tzi.org> <SY2PR01MB2764141CB5B9863D0B358C39E5F60@SY2PR01MB2764.ausprd01.prod.outlook.com> <790BA1EF-7C0C-4E14-8BB2-4AD421ACAFB5@tzi.org> <5d23ea83.1c69fb81.23b36.b4fb@mx.google.com>
To: Peter Occil <poccil14@gmail.com>
Archived-At: <https://mailarchive.ietf.org/arch/msg/art/xemVtMgK_qmYBU7QI9fhQ3Jd0nw>
Subject: Re: [art] Modern Network Unicode — –02 submitted
X-BeenThere: art@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Applications and Real-Time Area Discussion <art.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/art>, <mailto:art-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/art/>
List-Post: <mailto:art@ietf.org>
List-Help: <mailto:art-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/art>, <mailto:art-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 09 Jul 2019 03:17:46 -0000

I am confused how this could be to stand-alone without mentioning the https://developers.google.com/search/reference/robots_meta_tag robots html meta tag??? Which seems to me a better design
--
LarryMasinter.net

> On Jul 8, 2019, at 7:14 PM, Peter Occil <poccil14@gmail.com> wrote:
> 
> Saying U+FFFE and U+FFFF (or any other noncharacters) “MUST NOT be used” merely “as per the Unicode specification” ought to be guided by looking at what the intent of Unicode actually is.
>  
> See corrigendum 9 of the Unicode Standard for that intent: http://www.unicode.org/versions/corrigendum9.html
>  
> “Noncharacters in the Unicode Standard are intended for internal use and have no standard interpretation when exchanged outside the context of internal use. However, they are not illegal in interchange nor do they cause ill-formed Unicode text. This has always been the intent of the standard, as expressed by the Unicode Technical Committee.”
>  
> Since noncharacters have no standard meaning outside of internal use, however, they may be even more problematic than the Unicode paragraph and line separators (which do have standard meaning but are forbidden in the draft for Modern Network Unicode).  Many kinds of protocol strings, such as URIs, IRIs, and strings complying with the PRECIS framework, do not allow noncharacter code points, while other kinds of UTF-8 text, such as JSON, do allow noncharacters.  XML allows all noncharacters except for U+FFFE and U+FFFF.
>  
> --Peter
>  
> From: Carsten Bormann
> Sent: Monday, July 8, 2019 4:24 PM
> To: Manger, James
> Cc: art@ietf.org
> Subject: Re: [art]Modern Network Unicode — –02 submitted
>  
> Hi James, Tim, Martin,
>  
> thank you for the quick feedback!  As suggested, I have started turning this into a standalone document (of course, still requiring RFC 3629, the UTF-8 definition):
>  
> Status:         https://datatracker.ietf.org/doc/draft-bormann-dispatch-modern-network-unicode/
> Htmlized:       https://tools.ietf.org/html/draft-bormann-dispatch-modern-network-unicode-02
> Diff:           https://tools.ietf.org/rfcdiff?url2=draft-bormann-dispatch-modern-network-unicode-02
>  
> I hope I haven’t missed anything important from RFC 5198 that I wanted to keep. 
> Maybe time to involve the authors of that RFC…
>  
> Grüße, Carsten
>  
>  
> > On Jul 8, 2019, at 04:18, Manger, James <James.H.Manger@team.telstra.com> wrote:
> >
> > Would be nicer if it wasn't written as a diff from RFC5198 (Network Unicode). That is, if you could get all the rules directly from this doc. For instance, I assume Clean Modern Network Unicode must/should be NFC. Keep the RFC5198 comparisons for an informative annex.
>  
> _______________________________________________
> art mailing list
> art@ietf.org
> https://www.ietf.org/mailman/listinfo/art
>  
> _______________________________________________
> art mailing list
> art@ietf.org
> https://www.ietf.org/mailman/listinfo/art