Re: [apps-discuss] [link-relations] Fwd: I-D Action: draft-ohye-canonical-link-relation-00.txt

Joachim Kupke <joachim@kupke.za.net> Thu, 21 July 2011 03:48 UTC

Return-Path: <joe.kupke@gmail.com>
X-Original-To: apps-discuss@ietfa.amsl.com
Delivered-To: apps-discuss@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 6FD9821F8554; Wed, 20 Jul 2011 20:48:30 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.48
X-Spam-Level:
X-Spam-Status: No, score=-2.48 tagged_above=-999 required=5 tests=[AWL=-0.103, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, J_CHICKENPOX_39=0.6, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id OsuYMYoyI0rZ; Wed, 20 Jul 2011 20:48:29 -0700 (PDT)
Received: from mail-pz0-f53.google.com (mail-pz0-f53.google.com [209.85.210.53]) by ietfa.amsl.com (Postfix) with ESMTP id C60C521F8552; Wed, 20 Jul 2011 20:48:29 -0700 (PDT)
Received: by pzk6 with SMTP id 6so1201356pzk.26 for <multiple recipients>; Wed, 20 Jul 2011 20:48:29 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=qXEApSuCWHz3zEeHMXb/NxG79j8+TwFCtTedYz/NTx4=; b=iplVuioPmxE56I/GIm337HM5SWZYtcumA5JIVUtqzpYT+dY8W3Y5gRbHqqOBE6W2t/ UrIL1agjlQtlToTlSrgIsSqjW+aqOKQ+lLXvyb5C+bYycdqlF6RIYpssIN49WytB9fPf v6bsmZkT3kIFz3dxJH2uJcVQhkIpqzBF5h1Zg=
MIME-Version: 1.0
Received: by 10.68.19.230 with SMTP id i6mr2581809pbe.271.1311220109472; Wed, 20 Jul 2011 20:48:29 -0700 (PDT)
Sender: joe.kupke@gmail.com
Received: by 10.68.59.197 with HTTP; Wed, 20 Jul 2011 20:48:29 -0700 (PDT)
In-Reply-To: <CAGKau1HzJAtLwPxjSGJ8rmJy+pVNKOuOHtbR=Ox-93CeG-M_cA@mail.gmail.com>
References: <4E083D3F.6030200@gmx.de> <4E0D3EA5.7010803@gmail.com> <4E0DCFEF.20206@gmx.de> <4E0DEA77.3050007@gmail.com> <4E0E0E76.2080007@gmail.com> <4E0E995A.7060800@gmail.com> <4E0F1058.3050201@gmail.com> <1309613470.2807.17.camel@mackerel> <4E0F1F2F.8020504@gmail.com> <CAGKau1GyaxpgZsZmUcqZp1iUG6wrvSG3LHM3Pq52AjXfZz900Q@mail.gmail.com> <4E10208C.6090209@gmx.de> <CAKACZovTrCEkFRvN94BW4NChko3_J=FzsAmc37jAJ6YnnjeOeg@mail.gmail.com> <4E1818B9.8030804@gmx.de> <CAGKau1HzJAtLwPxjSGJ8rmJy+pVNKOuOHtbR=Ox-93CeG-M_cA@mail.gmail.com>
Date: Wed, 20 Jul 2011 20:48:29 -0700
X-Google-Sender-Auth: KtuXYO3lSZQuJWQuQ3hEC3v8Ock
Message-ID: <CAKACZov5--m7ZbOyomTpRtTps_VvwCD_fLRMA3KeqfzRWV_MNg@mail.gmail.com>
From: Joachim Kupke <joachim@kupke.za.net>
To: Maile Ohye <maileko@gmail.com>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Mailman-Approved-At: Thu, 21 Jul 2011 09:26:34 -0700
Cc: "link-relations@ietf.org" <link-relations@ietf.org>, IETF Apps Discuss <apps-discuss@ietf.org>, Bjartur Thorlacius <svartman95@gmail.com>
Subject: Re: [apps-discuss] [link-relations] Fwd: I-D Action: draft-ohye-canonical-link-relation-00.txt
X-BeenThere: apps-discuss@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: General discussion of application-layer protocols <apps-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/apps-discuss>, <mailto:apps-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/apps-discuss>
List-Post: <mailto:apps-discuss@ietf.org>
List-Help: <mailto:apps-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/apps-discuss>, <mailto:apps-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 21 Jul 2011 03:48:30 -0000

Julian Reschke <julian.reschke@gmx.de> wrote:
> Joachim Kupke <joachim@kupke.za.net> wrote:
>> Reading paragraph 5 of RFC 2119, I don't see
>> anything wrong with
>> ‘MAY.’  How is ‘can’ better? It's indeed not a ‘SHOULD’ since a typical
>> implementation, say in a search engine, will want to go to some length to
>> find evidence for misguided usage of the relationship and in the presence of
>> such evidence decide to ignore it.  The point of ‘MAY’ is to make clear that
>> in the absence of such evidence, the responsibility not to advertise an
>> erroneous ‘canonical’ relationship lies with the author of the context URI.”
>
> See RFC 2119, Section 6:
[...]

It's true that the imperatives of RFC 2119 (including
"MAY"/"OPTIONAL") are supposed to pertain to the format of data that
is exchanged (e.g., this and that field MAY be omitted, or this and
that proxy MAY have rewritten something), but interoperability goes
beyond that; hence the desire to give the two points where MAY is used
in the draft some more formal meaning.

Specifically, a <link rel="canonical"> supposedly gives carte blanche
to, say, an application to organize bookmarks, to rewrite a bookmarked
URI to its rel=canonical variant, if any (and non-self-referential).
Or, say, Wikipedia could crawl its external links and automatically
update those that now have a "canonical" relationship with another URI
(or, for that matter, have become an HTTP 301 redirect).  In order not
to break such applications, it is fairly important for implementations
of rel="canonical" not to designate URIs that are meaningless
substitutions for links to the context URI.

Speaking of HTTP redirects, RFC 2616 uses the wording that "[c]lients
with link editing capabilities *ought to* automatically re-link
references" to URIs that have become 301 redirects; and for a 300
"multiple choices" URI, "user agents *MAY* use the Location field
value for automatic redirection" (both emphases added).  It appears
the latter point is no more required for interoperability than what
the occurrences of "MAY" in this draft allude to.

With that said, it's closer in spirit to the provisions of 301
redirects, so I'd be happy to see the draft changed as follows:

diff before after
index 5f01f55..4ed47b8 100644
--- before
+++ after
@@ -66,10 +66,9 @@ Internet-Draft         The Canonical Link Relation
 1.  Introduction

    The canonical link relation specifies the preferred URI from a set of
-   identical or vastly similar content accessible on multiple URIs.
-   This designation MAY be used for future references to this resource,
-   and clients with link editing capabilities MAY automatically re-link
-   references to the context URI to the designated URI.
+   identical or vastly similar content accessible on multiple URIs, to
+   make it possible for references to the context URI to be updated to
+   reference the designated URI.

    The most common application of the canonical link relation includes
    specifying the preferred version of a URI from duplicate content
@@ -86,8 +85,8 @@ Internet-Draft         The Canonical Link Relation

    The canonical (target) URI MUST identify content that duplicates, is
    extremely similar, or is a superset of the content at the context
-   (referring) URI.  Presence of the canonical link relation indicates
-   to applications, such as search engines, that they MAY:
+   (referring) URI.  Authors who declare the canonical link relation
+   ought to anticipate that applications such as search engines can:

    o  Index content only from the canonical target (i.e. content from
       the context URIs will be likely disregarded as duplicative).

--Joachim