Re: Unicode escape sequence | Re: draft-ietf-httpbis-header-structure-00, unicode range

Daurnimator <quae@daurnimator.com> Wed, 21 December 2016 06:31 UTC

Return-Path: <ietf-http-wg-request+bounce-httpbisa-archive-bis2juki=lists.ie@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id BE14C1294AE for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Tue, 20 Dec 2016 22:31:40 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -9.391
X-Spam-Level:
X-Spam-Status: No, score=-9.391 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.001, RCVD_IN_DNSWL_HI=-5, RCVD_IN_SORBS_SPAM=0.5, RP_MATCHES_RCVD=-3.1, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001, T_DKIM_INVALID=0.01] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=fail (1024-bit key) reason="fail (body has been altered)" header.d=daurnimator.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id YqThtyMNjYTd for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Tue, 20 Dec 2016 22:31:39 -0800 (PST)
Received: from frink.w3.org (frink.w3.org [128.30.52.56]) (using TLSv1.2 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 25500127058 for <httpbisa-archive-bis2Juki@lists.ietf.org>; Tue, 20 Dec 2016 22:31:38 -0800 (PST)
Received: from lists by frink.w3.org with local (Exim 4.80) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1cJaN7-0005eu-B9 for ietf-http-wg-dist@listhub.w3.org; Wed, 21 Dec 2016 06:27:29 +0000
Resent-Date: Wed, 21 Dec 2016 06:27:29 +0000
Resent-Message-Id: <E1cJaN7-0005eu-B9@frink.w3.org>
Received: from mimas.w3.org ([128.30.52.79]) by frink.w3.org with esmtps (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.80) (envelope-from <quae@daurnimator.com>) id 1cJaMu-0005dQ-Tl for ietf-http-wg@listhub.w3.org; Wed, 21 Dec 2016 06:27:16 +0000
Received: from mail-lf0-f41.google.com ([209.85.215.41]) by mimas.w3.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.84_2) (envelope-from <quae@daurnimator.com>) id 1cJaMg-00032h-Ep for ietf-http-wg@w3.org; Wed, 21 Dec 2016 06:27:11 +0000
Received: by mail-lf0-f41.google.com with SMTP id t196so91219229lff.3 for <ietf-http-wg@w3.org>; Tue, 20 Dec 2016 22:26:41 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=daurnimator.com; s=daurnimator; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=Ene21lBhWI5JJTQHmenNrsrI8JGQ/SuFJGWbuNMAWLc=; b=Q2EenulyexwgdzXvjVNGGCw8Kek8FdE30rqyBaJnQZrWKp9VqWrP09nbGzMNNOHMNh X5fHEOlSAa1vNHuEaJ+waUyJtGVXO6BizA1GTcK1V8HDdaFtOi3lfAVlHCPvhYSEz083 OAC6pfiB/c+UzClANVmxflpaUMm9zee1EGmwA=
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=Ene21lBhWI5JJTQHmenNrsrI8JGQ/SuFJGWbuNMAWLc=; b=mHXffNnirrdwi3rjGGljbvAKUIUfQx8pGBIVGrEl7lbm2Si4Kqa2lU2Rf/F7iOo7uX xHLFWXJwENvTg6+JuSpEK+SUwGs9kTp/xfpKOiH7UtSCuMdH48+7hKTRSQ+1cqslLqE5 70dB2Oi+pmZGijEuoIhuMTwTvDqsPP+UuzkN9e/pOZXD95I8YlCMYJqkKanuECx3ZTvN ZPBQNfvr+ClgCPk+M/K+2cmXYSUtwB3BG08ml6Y1nJMfF0kOnYOn+9doCu2NRvWNFx6j BHkTCh35oHkJZO7IlQwppaZCYrJRyPyn5/ncMmExBvk6Qb3BPwM4ovoUeLM/b8uEN97N iofg==
X-Gm-Message-State: AIkVDXKchh2PIdBCOdbHVoLfA/16YaNlHqBlJICxxOtK+f//gaOG7M1CWtDWdKE6RbLG9A==
X-Received: by 10.25.199.198 with SMTP id x189mr1143970lff.164.1482301594913; Tue, 20 Dec 2016 22:26:34 -0800 (PST)
Received: from mail-lf0-f51.google.com (mail-lf0-f51.google.com. [209.85.215.51]) by smtp.gmail.com with ESMTPSA id t15sm5321403lfe.13.2016.12.20.22.26.33 for <ietf-http-wg@w3.org> (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 20 Dec 2016 22:26:33 -0800 (PST)
Received: by mail-lf0-f51.google.com with SMTP id c13so91219023lfg.0 for <ietf-http-wg@w3.org>; Tue, 20 Dec 2016 22:26:33 -0800 (PST)
X-Received: by 10.25.27.145 with SMTP id b139mr898434lfb.114.1482301593241; Tue, 20 Dec 2016 22:26:33 -0800 (PST)
MIME-Version: 1.0
Received: by 10.25.16.90 with HTTP; Tue, 20 Dec 2016 22:26:32 -0800 (PST)
In-Reply-To: <CACweHNDbv9dDXqjpU61HvfpgZ6Dt4S-CG=GjwOZcwaZh6LEirQ@mail.gmail.com>
References: <20161213173327.C1F7D1714B@welho-filter2.welho.com> <20161213175419.GA7943@LK-Perkele-V2.elisa-laajakaista.fi> <25434.1481665395@critter.freebsd.dk> <201612140628.uBE6SO3L025885@shell.siilo.fmi.fi> <36792.1481701328@critter.freebsd.dk> <CACweHNDKgWQewZHb=Kz3_2=41M58sY5472Q5OwpqPLxorvkzHQ@mail.gmail.com> <37223.1481707288@critter.freebsd.dk> <3a65ca44-f652-3b14-6d64-46f35b32df57@isode.com> <725824b9-de61-2650-4007-fb5b026bc7a6@gmx.de> <87f1efaf-74c5-f02b-d09e-a721afa86032@isode.com> <0cce5fdf-5f1a-4fd3-2e3a-e810a34baccb@gmx.de> <CACweHNBYf-UuxsKNxYakt22rgku9xEP4YK4yL2R+=vMf_uB2Vg@mail.gmail.com> <201612141739.uBEHdwiq024972@shell.siilo.fmi.fi> <CACweHNDbv9dDXqjpU61HvfpgZ6Dt4S-CG=GjwOZcwaZh6LEirQ@mail.gmail.com>
From: Daurnimator <quae@daurnimator.com>
Date: Wed, 21 Dec 2016 17:26:32 +1100
X-Gmail-Original-Message-ID: <CAEnbY+dL6gbFHe=h_xyEwCCbra8NgELpPmqeyK+mrnU40TVACQ@mail.gmail.com>
Message-ID: <CAEnbY+dL6gbFHe=h_xyEwCCbra8NgELpPmqeyK+mrnU40TVACQ@mail.gmail.com>
To: Matthew Kerwin <matthew@kerwin.net.au>
Cc: Kari Hurtta <hurtta-ietf@elmme-mailer.org>, Julian Reschke <julian.reschke@gmx.de>, Alexey Melnikov <alexey.melnikov@isode.com>, Poul-Henning Kamp <phk@phk.freebsd.dk>, Ilari Liusvaara <ilariliusvaara@welho.com>, HTTP working group mailing list <ietf-http-wg@w3.org>, Poul-Henning Kamp <phk@varnish-cache.org>
Content-Type: text/plain; charset="UTF-8"
Received-SPF: none client-ip=209.85.215.41; envelope-from=quae@daurnimator.com; helo=mail-lf0-f41.google.com
X-W3C-Hub-Spam-Status: No, score=-5.1
X-W3C-Hub-Spam-Report: AWL=-1.552, BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, URIBL_BLOCKED=0.001, W3C_AA=-1, W3C_WL=-1
X-W3C-Scan-Sig: mimas.w3.org 1cJaMg-00032h-Ep ca8c9a43d9727599cf71a8b6fc381b4a
X-Original-To: ietf-http-wg@w3.org
Subject: Re: Unicode escape sequence | Re: draft-ietf-httpbis-header-structure-00, unicode range
Archived-At: <http://www.w3.org/mid/CAEnbY+dL6gbFHe=h_xyEwCCbra8NgELpPmqeyK+mrnU40TVACQ@mail.gmail.com>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/33209
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

On 15 December 2016 at 12:57, Matthew Kerwin <matthew@kerwin.net.au> wrote:
>
>
> On 15 December 2016 at 03:39, Kari Hurtta <hurtta-ietf@elmme-mailer.org>
> wrote:
>>
>> Matthew Kerwin <matthew@kerwin.net.au>: (Wed Dec 14 13:53:45 2016)
>> > It says that "forms that use explicit string delimiters are generally
>> > preferred over other alternatives. In many contexts, symmetric paired
>> > delimiters are easier to recognize and understand than visually
>> > unrelated
>> > ones." So brackets are good.
>> >
>> > And while it advises against using Perl's \x{NNNN...} syntax (because of
>> > potential ambiguities with two-digit hex codes), it doesn't say anything
>> > at
>> > all about \u{N...}
>> >
>
>
> I have should noted here that Ruby uses this \u{N...} syntax, including the
> lower limit of one hexadecimal digit.  This is a valid string literal in
> Ruby:
>
> "\u{df}\u{9}\u{1f602}"

Lua also has this style of unicode codepoint escaping using curly braces.

>From lua reference manual:
> The UTF-8 encoding of a Unicode character can be inserted in a literal string with the escape sequence \u{XXX} (note the mandatory enclosing brackets), where XXX is a sequence of one or more hexadecimal digits representing the character code point.