Re: New Version Notification for draft-kamp-httpbis-structure-00.txt (fwd)

Kari Hurtta <hurtta-ietf@elmme-mailer.org> Thu, 13 October 2016 03:39 UTC

In-Reply-To: <9480.1475675376@critter.freebsd.dk>
References: <9480.1475675376@critter.freebsd.dk>
To: Poul-Henning Kamp <phk@phk.freebsd.dk>
Date: Thu, 13 Oct 2016 06:34:30 +0300 (EEST)
Sender: hurtta@hurtta09lk.keh.iki.fi
From: Kari Hurtta <hurtta-ietf@elmme-mailer.org>
CC: HTTP working group mailing list <ietf-http-wg@w3.org>, Kari Hurtta <hurtta-ietf@elmme-mailer.org>

> Htmlized:       https://tools.ietf.org/html/draft-kamp-httpbis-structure-00

3.  HTTP/1 serialization of HTTP header Common Structure
https://tools.ietf.org/html/draft-kamp-httpbis-structure-00#section-3

|       h1_unicode_string = DQUOTE *(
|                       ( "\" DQUOTE )
|                       ( "\" "\" ) /
|                       ( "\" "u" 4*HEXDIG ) /
|                       0x20-21 /
|                       0x23-5B /
|                       0x5D-7E /
|                       0x80-F7
|                       ) DQUOTE
|               # XXX: how to say/import "UTF-8 encoding" ?
|               # HTTP1 unfriendly codepoints (00-1f, 7f) must be
|               # encoded with \uXXXX escapes

How about referencing the ABNF from

RFC 3629: UTF-8, a transformation format of ISO 10646
https://tools.ietf.org/html/rfc3629

4.  Syntax of UTF-8 Byte Sequences
https://tools.ietf.org/html/rfc3629#section-4

|   UTF8-octets = *( UTF8-char )
|   UTF8-char   = UTF8-1 / UTF8-2 / UTF8-3 / UTF8-4
|   UTF8-1      = %x00-7F
|   UTF8-2      = %xC2-DF UTF8-tail
|   UTF8-3      = %xE0 %xA0-BF UTF8-tail / %xE1-EC 2( UTF8-tail ) /
|                 %xED %x80-9F UTF8-tail / %xEE-EF 2( UTF8-tail )
|   UTF8-4      = %xF0 %x90-BF 2( UTF8-tail ) / %xF1-F3 3( UTF8-tail ) /
|                 %xF4 %x80-8F 2( UTF8-tail )
|   UTF8-tail   = %x80-BF
|
|   NOTE -- The authoritative definition of UTF-8 is in [UNICODE].  This
|   grammar is believed to describe the same thing Unicode describes, but
|   does not claim to be authoritative.  Implementors are urged to rely
|   on the authoritative source, rather than on this ABNF.

This

|               # HTTP1 unfriendly codepoints (00-1f, 7f) must be
|               # encoded with \uXXXX escapes

means that you cannot use UTF8-1 as is, however.
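
To make the gap concrete, here is a small sketch that lists which UTF8-1 bytes (%x00-7F in RFC 3629) fall outside the unescaped ranges of the quoted h1_unicode_string rule (0x20-21, 0x23-5B, 0x5D-7E). The variable names are my own, not from either document.

```python
# UTF8-1 as defined in RFC 3629, Section 4: %x00-7F
utf8_1 = set(range(0x00, 0x80))

# Bytes the quoted h1_unicode_string rule allows unescaped:
# 0x20-21, 0x23-5B, 0x5D-7E
allowed_raw = (
    set(range(0x20, 0x22))
    | set(range(0x23, 0x5C))
    | set(range(0x5D, 0x7F))
)

# Bytes that are valid UTF8-1 but cannot appear unescaped
must_escape = sorted(utf8_1 - allowed_raw)

print(" ".join(f"{b:02X}" for b in must_escape))
```

Besides the control characters named in the comment (00-1f, 7f), the difference also contains 0x22 (DQUOTE) and 0x5C (backslash), which the rule handles through the "\" escapes.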

Do you mean the following:

h1_unicode_utf8 = h1_utf8_1 / UTF8-2 / UTF8-3 / UTF8-4
h1_utf8_1 = ( "\" "\" ) /
            ( "\" "u" 4*HEXDIG ) /
            0x20-21 / 
            0x23-5B / 
            0x5D-7E /
            0x80-F7
UTF8-2 = <UTF8-2, defined in RFC 3629, Section 4>
UTF8-3 = <UTF8-3, defined in RFC 3629, Section 4>
UTF8-4 = <UTF8-4, defined in RFC 3629, Section 4>
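
As a rough check of what such a grammar would accept, the rules above can be transcribed into a byte-level regular expression. This is only a sketch under several assumptions of mine: it validates the content between the DQUOTEs; it reads "4*HEXDIG" as exactly four hex digits; it carries over the ( "\" DQUOTE ) escape from the draft's h1_unicode_string; and it leaves out the bare 0x80-F7 range, on the guess that non-ASCII bytes are meant to be covered by UTF8-2, UTF8-3 and UTF8-4. The name is_h1_unicode_content is hypothetical.

```python
import re

# UTF8-tail from RFC 3629, Section 4
TAIL = rb"[\x80-\xBF]"

ALTERNATIVES = [
    rb'\\"',                            # "\" DQUOTE (assumed, from the draft)
    rb"\\\\",                           # "\" "\"
    rb"\\u[0-9A-Fa-f]{4}",              # "\" "u" 4HEXDIG (assumed exactly 4)
    rb"[\x20-\x21\x23-\x5B\x5D-\x7E]",  # unescaped ASCII ranges
    rb"[\xC2-\xDF]" + TAIL,             # UTF8-2
    rb"\xE0[\xA0-\xBF]" + TAIL,         # UTF8-3, first alternative
    rb"[\xE1-\xEC]" + TAIL + rb"{2}",   # UTF8-3, second alternative
    rb"\xED[\x80-\x9F]" + TAIL,         # UTF8-3, third (excludes surrogates)
    rb"[\xEE-\xEF]" + TAIL + rb"{2}",   # UTF8-3, fourth alternative
    rb"\xF0[\x90-\xBF]" + TAIL + rb"{2}",  # UTF8-4, first alternative
    rb"[\xF1-\xF3]" + TAIL + rb"{3}",      # UTF8-4, second alternative
    rb"\xF4[\x80-\x8F]" + TAIL + rb"{2}",  # UTF8-4, third alternative
]

H1_CONTENT = re.compile(rb"(?:" + rb"|".join(ALTERNATIVES) + rb")*")


def is_h1_unicode_content(data: bytes) -> bool:
    """Return True if data matches the sketched grammar from start to end."""
    return H1_CONTENT.fullmatch(data) is not None
```

With this reading, a raw TAB or DEL byte is rejected while the escaped form \u0009 is accepted, and overlong or surrogate byte sequences fail because the RFC 3629 productions exclude them.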

/ Kari Hurtta