[Uri-review] about: scheme; Simplified Encoding Considerations

Joseph A Holsten <joseph@josephholsten.com> Mon, 21 September 2009 05:51 UTC

URI people:

I intend to replace the current about: scheme Encoding Considerations[1]:

    Because many characters are not permitted with this syntax, the
    "segment" and "query" elements may contain characters from the
    Unicode Character Set [UCS] as suggested by URI [RFC3986], by first
    encoding those characters as octets to the UTF-8 character encoding
    [RFC3629]; then only those octets that do not correspond to
    characters in the unreserved set should be percent-encoded.

    By using UTF-8 encoding, there are no known compatibility issues with
    mapping Internationalized Resource Identifiers to about URIs according
    to [RFC3987].  Since about URIs do not use domain names, "ireg-name"
    conversion is unnecessary.

with the following (adapted from hixie's ws: scheme[2]):

    Characters in the "segment" or "query" parts that are excluded by the
    syntax defined above must be converted from Unicode to ASCII by first
    encoding the characters as UTF-8 and then replacing the corresponding
    bytes using their percent-encoded form as defined in the URI and IRI
    specifications. [RFC3986] [RFC3987]
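
For concreteness, here is a rough Python sketch of the conversion the
proposed text describes: encode each disallowed character as UTF-8, then
percent-encode the resulting bytes. The function name percent_encode and
the "allowed" parameter are hypothetical, not from either draft. The set
of characters the about: syntax actually permits is defined in the draft
and not restated here, so as an assumption the sketch defaults to the
RFC 3986 unreserved set (ALPHA / DIGIT / "-" / "." / "_" / "~").

```python
import string

# Assumption: stand-in for "characters permitted by the syntax".
# This is the RFC 3986 unreserved set, not the draft's actual ABNF.
UNRESERVED = frozenset(string.ascii_letters + string.digits + "-._~")


def percent_encode(text, allowed=UNRESERVED):
    """Percent-encode every character of text that is not in allowed.

    Each disallowed character is first encoded to UTF-8 octets, and
    each octet is then written as "%" followed by two uppercase hex
    digits.
    """
    out = []
    for ch in text:
        if ch in allowed:
            out.append(ch)
        else:
            out.extend("%{:02X}".format(octet) for octet in ch.encode("utf-8"))
    return "".join(out)


if __name__ == "__main__":
    print(percent_encode("café"))
    print(percent_encode("a b"))
```

With the default set, a literal "%" in the input is itself encoded as
"%25", so the function is meant for raw Unicode text, not for strings
that are already percent-encoded.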

Any objections or issues?


1: http://tools.ietf.org/html/draft-holsten-about-uri-scheme-02#section-4
2: http://tools.ietf.org/html/draft-hixie-thewebsocketprotocol-41#section-8.1

--
Joseph Holsten
http://josephholsten.com
mailto:joseph@josephholsten.com
tel:+1-918-948-6747