Re: [Uri-review] review requested for the YANG definition of a URI datatype

"Martin J. Dürst" <duerst@it.aoyama.ac.jp> Thu, 02 April 2009 02:19 UTC

Return-Path: <duerst@it.aoyama.ac.jp>
X-Original-To: uri-review@core3.amsl.com
Delivered-To: uri-review@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 5926F3A6A0E for <uri-review@core3.amsl.com>; Wed, 1 Apr 2009 19:19:51 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.048
X-Spam-Level:
X-Spam-Status: No, score=-0.048 tagged_above=-999 required=5 tests=[AWL=-0.258, BAYES_00=-2.599, HELO_EQ_JP=1.244, HOST_EQ_JP=1.265, MIME_8BIT_HEADER=0.3]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id VYJOA+9KAeCw for <uri-review@core3.amsl.com>; Wed, 1 Apr 2009 19:19:50 -0700 (PDT)
Received: from scmailgw1.scop.aoyama.ac.jp (scmailgw1.scop.aoyama.ac.jp [133.2.251.194]) by core3.amsl.com (Postfix) with ESMTP id C96E83A67C0 for <uri-review@ietf.org>; Wed, 1 Apr 2009 19:19:47 -0700 (PDT)
Received: from scmse3.scbb.aoyama.ac.jp ([133.2.253.23]) by scmailgw1.scop.aoyama.ac.jp (secret/secret) with SMTP id n322KlnA004015 for <uri-review@ietf.org>; Thu, 2 Apr 2009 11:20:47 +0900 (JST)
Received: from (unknown [133.2.206.133]) by scmse3.scbb.aoyama.ac.jp with smtp id 1e0a_e090a6bc_1f2c_11de_802a_001d0969ab06; Thu, 02 Apr 2009 11:20:47 +0900
Received: from [IPv6:::1] ([133.2.210.1]:36285) by itmail.it.aoyama.ac.jp with [XMail 1.22 ESMTP Server] id <SC00CBC> for <uri-review@ietf.org> from <duerst@it.aoyama.ac.jp>; Thu, 2 Apr 2009 11:19:56 +0900
Message-ID: <49D420EA.5090906@it.aoyama.ac.jp>
Date: Thu, 02 Apr 2009 11:20:26 +0900
From: "\"Martin J. Dürst\"" <duerst@it.aoyama.ac.jp>
Organization: Aoyama Gakuin University
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.9.1b3pre) Gecko/20090108 Eudora/3.0b1pre
MIME-Version: 1.0
To: Juergen Schoenwaelder <j.schoenwaelder@jacobs-university.de>
References: <20090401091257.GC15362@elstar.local>
In-Reply-To: <20090401091257.GC15362@elstar.local>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 8bit
Cc: uri-review@ietf.org, Martin Bjorklund <mbj@tail-f.com>, "uri@w3.org" <uri@w3.org>, David Partain <david@partain.se>
Subject: Re: [Uri-review] review requested for the YANG definition of a URI datatype
X-BeenThere: uri-review@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Proposed URI Schemes <uri-review.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/uri-review>, <mailto:uri-review-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/uri-review>
List-Post: <mailto:uri-review@ietf.org>
List-Help: <mailto:uri-review-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/uri-review>, <mailto:uri-review-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 02 Apr 2009 02:19:51 -0000

Hello Juergen,

I think this is a question about generic URI syntax, and therefore
should go to uri@w3.org (which I have cc'ed).

Did you also consider having a datatype for IRIs?

Regards,    Martin.

P.S.: Please remove uri-review@ietf.org from the cc list when replying.

On 2009/04/01 18:12, Juergen Schoenwaelder wrote:
> Hi,
>
> I am editor of a document defining among other things a URI data type
> for the YANG data modeling language (NETMOD working group). Right now,
> our definition looks as follows (<draft-ietf-netmod-yang-types-02>):
>
>     typedef uri {
>       type string;    // [TODO] add the regex from RFC 3986 here?
>       description
>        "The uri type represents a Uniform Resource Identifier
>         (URI) as defined by STD 66.
>
>         Objects using the uri type must be in US-ASCII encoding,
>         and MUST be normalized as described by RFC 3986 Sections
>         6.2.1, 6.2.2.1, and 6.2.2.2.  All unnecessary
>         percent-encoding is removed, and all case-insensitive
>         characters are set to lowercase except for hexadecimal
>         digits, which are normalized to uppercase as described in
>         Section 6.2.2.1.
>
>         The purpose of this normalization is to help provide
>         unique URIs.  Note that this normalization is not
>         sufficient to provide uniqueness.  Two URIs that are
>         textually distinct after this normalization may still be
>         equivalent.
>
>         Objects using the uri type may restrict the schemes that
>         they permit.  For example, 'data:' and 'urn:' schemes
>         might not be appropriate.
>
>         A zero-length URI is not a valid URI.  This can be used to
>         express 'URI absent' where required
>
>         This type is in the value set and its semantics equivalent
>         to the Uri textual convention of the SMIv2.";
>       reference
>        "RFC 3986: Uniform Resource Identifier (URI): Generic Syntax
>         RFC 3305: Report from the Joint W3C/IETF URI Planning Interest
>                   Group: Uniform Resource Identifiers (URIs), URLs,
>                   and Uniform Resource Names (URNs): Clarifications
>                   and Recommendations
>         RFC 5017: MIB Textual Conventions for Uniform Resource
>                   Identifiers (URIs)";
>     }
>
> One particular question is whether it is safe to add the following
> pattern restriction (XSD regular expression syntax):
>
>    type string {
>      pattern '(([^:/?#]+):)?(//([^/?#]*))?([^?#]*)(\?([^#]*))?(#(.*))?';
>    }
>
> The regular expression is taken from appendix B of RFC 3986.
>
> /js
>

-- 
#-# Martin J.Dürst, Professor, Aoyama Gakuin University
#-# http://www.sw.it.aoyama.ac.jp   mailto:duerst@it.aoyama.ac.jp