Re: Last Call: draft-klensin-net-utf8 (Unicode Format for Network Interchange) to Proposed Standard
"Frank Ellermann" <nobody@xyzzy.claranet.de> Fri, 11 January 2008 06:52 UTC
Return-path: <ietf-bounces@ietf.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1JDDl9-0000uA-GH; Fri, 11 Jan 2008 01:52:55 -0500
Received: from [10.90.34.44] (helo=chiedprmail1.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1JDDl7-0000u5-Tl for ietf@ietf.org; Fri, 11 Jan 2008 01:52:53 -0500
Received: from main.gmane.org ([80.91.229.2] helo=ciao.gmane.org) by chiedprmail1.ietf.org with esmtp (Exim 4.43) id 1JDDl7-0007VZ-7w for ietf@ietf.org; Fri, 11 Jan 2008 01:52:53 -0500
Received: from list by ciao.gmane.org with local (Exim 4.43) id 1JDDl1-0001XK-9L for ietf@ietf.org; Fri, 11 Jan 2008 06:52:47 +0000
Received: from c-134-88-1.hh.dial.de.ignite.net ([62.134.88.1]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for <ietf@ietf.org>; Fri, 11 Jan 2008 06:52:47 +0000
Received: from nobody by c-134-88-1.hh.dial.de.ignite.net with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for <ietf@ietf.org>; Fri, 11 Jan 2008 06:52:47 +0000
X-Injected-Via-Gmane: http://gmane.org/
To: ietf@ietf.org
From: Frank Ellermann <nobody@xyzzy.claranet.de>
Date: Fri, 11 Jan 2008 07:52:50 +0100
Organization: <http://purl.net/xyzzy>
Lines: 45
Message-ID: <fm73nj$t2$1@ger.gmane.org>
References: <E1JByfQ-0002Xd-Oh@stiedprstage1.ietf.org><20080110103311.GA19519@nic.fr> <fm59k5$eqf$1@ger.gmane.org> <27B4E7C374D57C2B18252533@p3.JCK.COM>
Mime-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-Complaints-To: usenet@ger.gmane.org
X-Gmane-NNTP-Posting-Host: c-134-88-1.hh.dial.de.ignite.net
X-MSMail-Priority: Normal
X-Newsreader: Microsoft Outlook Express 6.00.2800.1914
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1914
X-Spam-Score: 0.0 (/)
X-Scan-Signature: 4adaf050708fb13be3316a9eee889caa
Subject: Re: Last Call: draft-klensin-net-utf8 (Unicode Format for Network Interchange) to Proposed Standard
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
Reply-To: Frank Ellermann <hmdmhdfmhdjmzdtjmzdtzktdkztdjz@gmail.com>
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
Errors-To: ietf-bounces@ietf.org
John C Klensin wrote:

> It is ambiguous for HT.

Yes, but we typically don't care about this in protocols as long
as it behaves like one or more spaces. I think that's the idea
of "WSP = SP / HTAB ; white space" in RFC 4234bis, waiting for
its STD number.

We talked about the 4234bis issue of "trailing white space", which
could cause havoc when it is silently removed, and a "really empty
line" is not the same as an "apparently empty line" (i.e. CRLF CRLF
vs. CRLF 1*WSP CRLF).

A similar robustness principle would support accepting old
"HTAB-compression" or "HTAB-beautification" (e.g. HTAB as the
first character in a folded line). In other words WSP, not only SP.

It is clear that the outcome is ambiguous, but in some protocols I
care about (headers in MIME, mail, and news) *WSP or 1*WSP are
acceptable. Admittedly it is a pain when signatures need white
space canonicalization. But replacing *WSP by *SP would only
simplify this step, not get rid of it.

[About CRLF]

> Unicode 5.0, Section 5.8, provides significant insight into
> the complexity of this problem and probably should have
> been referenced. It would be even more helpful had Table
> 5-2 included identifying CRLF as a standard Internet "wire"
> form of NLF, not just binding that form to Windows.

Indeed, this chapter offers significantly *broken* insight for our
purposes. What they found was a horrible mess, then they introduced
wannabe-unambiguous LS + PS, and what they arrived at was messier
than before.

Claiming that CRLF is "windows" is odd for DOS + OS/2 users, and it
is also at odds with numerous Internet standards - precisely the
reason why we need your draft.

The chapter talks about line and paragraph separators without
mentioning relevant ASCII controls such as RS. On the other hand it
mentions MS Word internals, which are nobody's business outside of
MS Word. It is interesting, but IMO unusable for net-utf8.
Frank
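
To illustrate the difference mentioned above between a "really empty
line" (CRLF CRLF) and an "apparently empty line" (CRLF 1*WSP CRLF),
and why white space canonicalization is a separate step, here is a
minimal Python sketch. The function names classify_line and
collapse_wsp are hypothetical, and the collapse rule (each WSP run
becomes one SP, trailing WSP is dropped) is only an assumption for
illustration, not the rule of any particular signature scheme.

```python
import re

# ABNF: WSP = SP / HTAB
WSP = " \t"


def classify_line(line: str) -> str:
    """Classify one line, given without its terminating CRLF.

    'empty' : nothing between two CRLFs        (CRLF CRLF)
    'blank' : only white space between CRLFs   (CRLF 1*WSP CRLF)
    'text'  : anything else
    """
    if line == "":
        return "empty"
    if line.strip(WSP) == "":
        return "blank"
    return "text"


def collapse_wsp(line: str) -> str:
    """Assumed canonicalization: each run of SP/HTAB becomes a single SP,
    and trailing white space is removed."""
    return re.sub(r"[ \t]+", " ", line).rstrip(" ")


# Split on CRLF only, so a bare HTAB or SP never ends a line.
lines = "Subject: x\r\n \t\r\n\r\nbody".split("\r\n")
kinds = [classify_line(l) for l in lines]
```

Here kinds distinguishes the second line (blank) from the third
(empty), which is exactly the information lost if trailing white
space is silently removed. Restricting input to SP instead of WSP
would simplify the regular expression in collapse_wsp, but the
collapsing step itself would still be needed.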