Re: [urn] [apps-discuss] URNs are not URIs (another look at RFC 3986)

worley@ariadne.com (Dale R. Worley) Fri, 25 April 2014 21:10 UTC

Return-Path: <worley@ariadne.com>
X-Original-To: urn@ietfa.amsl.com
Delivered-To: urn@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 064D21A03E3 for <urn@ietfa.amsl.com>; Fri, 25 Apr 2014 14:10:41 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.3
X-Spam-Level:
X-Spam-Status: No, score=-1.3 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, J_CHICKENPOX_21=0.6] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 8ZP9_2ta3u0d for <urn@ietfa.amsl.com>; Fri, 25 Apr 2014 14:10:38 -0700 (PDT)
Received: from qmta03.westchester.pa.mail.comcast.net (qmta03.westchester.pa.mail.comcast.net [IPv6:2001:558:fe14:43:76:96:62:32]) by ietfa.amsl.com (Postfix) with ESMTP id 5E2FD1A03D9 for <urn@ietf.org>; Fri, 25 Apr 2014 14:10:38 -0700 (PDT)
Received: from omta09.westchester.pa.mail.comcast.net ([76.96.62.20]) by qmta03.westchester.pa.mail.comcast.net with comcast id uC5o1n0050SCNGk53MAXhA; Fri, 25 Apr 2014 21:10:31 +0000
Received: from hobgoblin.ariadne.com ([24.34.72.61]) by omta09.westchester.pa.mail.comcast.net with comcast id uMAW1n0041KKtkw3VMAWhZ; Fri, 25 Apr 2014 21:10:31 +0000
Received: from hobgoblin.ariadne.com (hobgoblin.ariadne.com [127.0.0.1]) by hobgoblin.ariadne.com (8.14.7/8.14.7) with ESMTP id s3PLATBT031474; Fri, 25 Apr 2014 17:10:29 -0400
Received: (from worley@localhost) by hobgoblin.ariadne.com (8.14.7/8.14.7/Submit) id s3PLAPM1031471; Fri, 25 Apr 2014 17:10:25 -0400
Date: Fri, 25 Apr 2014 17:10:25 -0400
Message-Id: <201404252110.s3PLAPM1031471@hobgoblin.ariadne.com>
From: worley@ariadne.com
Sender: worley@ariadne.com
To: John C Klensin <john-ietf@jck.com>
In-reply-to: <11B3A42537CE2D687E206A34@JcK-HP8200.jck.com> (john-ietf@jck.com)
References: <C93A34DBE97565AD96CEC321@JcK-HP8200.jck.com> <534BED18.9090009@gmx.de> <3D39F1AA700A179F3C051DE2@JcK-HP8200.jck.com> <534D3410.50607@ninebynine.org> <54ecc96adba240159cf624c54c507136@BL2PR02MB307.namprd02.prod.outlook.com> <952E89C207E59D25CD5953D6@JCK-EEE10> <358467E0-F2C0-4468-A099-BBAA4F5438D2@mnot.net> <FAB32F8D-4BE4-4E49-AE8E-022D322C3BCC@pobox.com> <11B3A42537CE2D687E206A34@JcK-HP8200.jck.com>
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=comcast.net; s=q20140121; t=1398460231; bh=aSC14b24kwCTn+NtoS9PdjOwR77BPn76whviCWy84SI=; h=Received:Received:Received:Received:Date:Message-Id:From:To: Subject; b=EA4Ppsp5WS4t/IMjMjCgP0vcuqyAwgycn7ANrI5ShjWXeo9jS1QzpSjXV+xQ96fZY L1ULjKMKMk9sE6a4IHDaXTg64iRtvo5nWcXluJaM0e5fr1iKRRTQy8p7NZOdnvO8yF 411/LJ+HwPT6B4lbFuIdz0pklt5jxNzfjj6UqUzoIfJpSWIjbcoRJYf++6y9JbNd81 +Se/+xsra79/VrRLEW45sID8FoHceorEpQLUUYFTdryy7w1YY1gLpGQ7CHgRH0/79C lA/L9ulTxBEHTelYvgHRIgxfSTK0LeXwbXaJFo6MwXabP6AV2U4El+gJbVYRV44kEp zS02bdv+aDIqQ==
Archived-At: http://mailarchive.ietf.org/arch/msg/urn/afARVqKgmeAAL0H_V9E925UL8uU
Cc: urn@ietf.org
Subject: Re: [urn] [apps-discuss] URNs are not URIs (another look at RFC 3986)
X-BeenThere: urn@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Revisions to URN RFCs <urn.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/urn>, <mailto:urn-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/urn/>
List-Post: <mailto:urn@ietf.org>
List-Help: <mailto:urn-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/urn>, <mailto:urn-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 25 Apr 2014 21:10:41 -0000

>From the point of view of this analysis, the items that I have a
strong opinion are few, but I think that they bear heavily on the
practical engineering of the situation, and so need to be resolved one
way or the other.  The general idea is that being conservative
regarding syntax will avoid large costs, because much existing
software incorporates syntax assumptions.  But extending semantics is
not likely to be so expensive, because little software depends on the
semantic assumptions of a subset of UR* unless the software performs
detailed processing on those particular UR*.

> From: John C Klensin <john-ietf@jck.com>

> Perhaps
> as a corollary to the latter, perhaps independently, there have
> been efforts to create a Grand Unified Identifier syntax,
> sometimes in the very restricted "scheme:<stuff>" form that
> Phillip proposed, sometimes with a lot of reserved characters
> and associated semantics, and sometimes in between.

I have a strong sense that it's valuable to have and maintain a syntax
within which all the UR* fit, because processors of UR* enforce and
implement such syntaxes.  Currently the umbrella syntax is RFC 3986.
The cost of expanding the UR* syntax beyond what 3986 permits is going
to be relatively high, and will show up in systems that try to allow
all possible UR*s in certain situations.

Unfortunately, 3986 also specifies some semantics.  I believe that the
semantics defined for '#' (fragment) is sufficiently precise that
there is a risk that a considerable number of processors may be
incompatible with any other use for '#'.  That is, there will be a
significant cost to changing its semantics.

My current opinion is that the semantics of '?' (query) is
sufficiently broad that it can be used for any purpose that the UR*
scheme wishes define.

URNs of the "urn" scheme are currently constrained to the subset of
the syntax that is specified in RFC 2141.  2141 says that '#' and '?'
are reserved for expansion, but unfortunately the BNF given does not
specify their use.  This suggests that adding fragment and query to
"urn" URNs will be expensive, as software is likely to be designed to
the BNF.  However, defining additional URN schemes that do not have
that restriction should not have such a cost.

> (i) There is a community, [...] that have found the syntax and
> semantic constraints unreasonably restrictive given their perceived
> needs.

Can you give us pointers to this?  In particular, regarding the
syntax.  It seems to me that this is a deeply important point, but
short of reading the entire URNBIS archive, I don't see any
algorithmic way to obtain the supporting information for this
assertion.  Surely, if there are large communities with this opinion,
someone somewhere has made a clear statement and defense if this
conclusion.

Dale