Re: [urn] URN:DOI namespace registration request

Paul Jessop <paul@countyanalytics.com> Tue, 19 January 2021 11:29 UTC

Return-Path: <paul@countyanalytics.com>
X-Original-To: urn@ietfa.amsl.com
Delivered-To: urn@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 842D83A144A for <urn@ietfa.amsl.com>; Tue, 19 Jan 2021 03:29:37 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.598
X-Spam-Level:
X-Spam-Status: No, score=-2.598 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_NONE=0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 5GCJ7tfmpphn for <urn@ietfa.amsl.com>; Tue, 19 Jan 2021 03:29:35 -0800 (PST)
Received: from exch-smtp-out.livemail.co.uk (exch-smtp-out.livemail.co.uk [213.171.216.29]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 4A0DF3A1446 for <urn@ietf.org>; Tue, 19 Jan 2021 03:29:35 -0800 (PST)
Received: from localhost (unknown [127.0.0.1]) by exch-smtp-out.livemail.co.uk (Postfix) with ESMTP id D8979CBE4C for <urn@ietf.org>; Tue, 19 Jan 2021 11:29:33 +0000 (UTC)
X-Virus-Scanned: amavisd-new at exch-smtp-out-07.livemail.co.uk
Received: from exch-smtp-out.livemail.co.uk ([127.0.0.1]) by localhost (exch-smtp-out-07.livemail.co.uk [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id PONWXL4o4tq1 for <urn@ietf.org>; Tue, 19 Jan 2021 11:29:32 +0000 (GMT)
Received: from WINHEXFEEU5.win.mail (unknown [217.160.154.164]) by exch-smtp-out.livemail.co.uk (Postfix) with ESMTPS id 85707CBE73 for <urn@ietf.org>; Tue, 19 Jan 2021 11:29:32 +0000 (GMT)
Received: from WINHEXBEEU113.win.mail (10.72.15.46) by WINHEXBEEU105.win.mail (10.72.15.42) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Tue, 19 Jan 2021 12:29:31 +0100
Received: from WINHEXBEEU113.win.mail ([fe80::516d:9387:664a:7563]) by WINHEXBEEU113.win.mail ([fe80::516d:9387:664a:7563%13]) with mapi id 15.00.1497.010; Tue, 19 Jan 2021 12:29:31 +0100
From: Paul Jessop <paul@countyanalytics.com>
To: "Henry S. Thompson" <ht@inf.ed.ac.uk>, =?utf-8?B?TWFydGluIEouIETDvHJzdA==?= <duerst@it.aoyama.ac.jp>
CC: "Dale R. Worley" <worley@ariadne.com>, "Hakala, Juha E" <juha.hakala@helsinki.fi>, "llannom@cnri.reston.va.us" <llannom@cnri.reston.va.us>, "urn@ietf.org" <urn@ietf.org>, "john@jck.com" <john@jck.com>, "jonathanmtclark@gmail.com" <jonathanmtclark@gmail.com>
Thread-Topic: [urn] URN:DOI namespace registration request
Thread-Index: AQHW7k91dBAqygkVVEaauUDP7Cfgeaouv1KA
Date: Tue, 19 Jan 2021 11:29:31 +0000
Message-ID: <4A3C1714-31AD-4CBB-A7DA-E82BF3249F9D@countyanalytics.com>
References: <HE1PR07MB3196DBADE6019EF3794C90ADFA310@HE1PR07MB3196.eurprd07.prod.outlook.com> <87ft2ygofe.fsf@hobgoblin.ariadne.com> <A4354843-F680-44A6-AE49-11FFA28C3462@countyanalytics.com> <eee21c2c-94a9-5aaf-a20d-4528b383c6e5@it.aoyama.ac.jp> <f5br1mhdxbp.fsf@ecclerig.inf.ed.ac.uk>
In-Reply-To: <f5br1mhdxbp.fsf@ecclerig.inf.ed.ac.uk>
Accept-Language: en-GB, de-DE, en-US
Content-Language: en-GB
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
user-agent: Microsoft-MacOutlook/16.45.21011103
x-ms-exchange-messagesentrepresentingtype: 1
x-ms-exchange-transport-fromentityheader: Hosted
x-originating-ip: [90.254.242.57]
x-1and1-spam-score: 1/100
x-1and1-spam-level: None
x-1and1-expurgate-category: clean
x-provags-id: V02::b2huztsGdBQoO1Pi75pfwbBbqfSO5PVVm+qxf1KusN5WF 7TOnzD/2lg1TIvZLUKtX41/2lj8/OGqzLvMvj6V7vzjYoQwGtL KDPBvS486tiSWlglOSO/BftbNclU8DWar2igwGmuxO/fSyKZFo nHbjZY0qlT+frWV7cqjwjFkzEEbJ1oUe4vDijd5PP0djKOA
x-routing-0be3562e-11e2-4fc7-b5a6-c7ea0e0bf210: 1.0.0.0
Content-Type: text/plain; charset="utf-8"
Content-ID: <C1DE62B61108214994A38F7B57D5FDBA@win.mail>
Content-Transfer-Encoding: base64
MIME-Version: 1.0
Archived-At: <https://mailarchive.ietf.org/arch/msg/urn/5IiKyGx9sGYsw76vjHvjwC2oELA>
Subject: Re: [urn] URN:DOI namespace registration request
X-BeenThere: urn@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Revisions to URN RFCs <urn.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/urn>, <mailto:urn-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/urn/>
List-Post: <mailto:urn@ietf.org>
List-Help: <mailto:urn-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/urn>, <mailto:urn-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 19 Jan 2021 11:29:38 -0000

Hi,

Let me give some context:

1. The canonical form of a DOI name as defined in ISO 26324 is the string starting "10dot".

2. The ISO standard says that when " displayed on screen or in print" is should be preceded by "doi:" as a label (rather than as a scheme name). Most of the family of ISO identifiers have similar text in their standards which defines separately the identifier string (to be used in machine to machine activity) and the way to present that string for human consumption. Most use a space as a delimiter and why DOI uses a colon I do not know. The standard notes that the format complies with RFC 3986 but does not claim that the resulting string is a URI - just I suppose that it could become one as we have discovered from the provisional registration.

The DOI Handbook (that I inherited) says the same thing, noting in passing that a browser might in future be able to resolve the doi: form. Of course this has never happened.

3. The ISO standard also notes that the DOI name can be made actionable by prepending the web address of a proxy.
 
4. There was a perceived need for a URI representation of a DOI name and a DOI registration was made in the "info" URI scheme. However this project did not achieve much traction and is now closed to new registrations (though existing instances remain valid).

5. Most of the applications using DOIs recommend the web address approach. For example Crossref says (https://www.crossref.org/education/metadata/persistent-identifiers/doi-display-guidelines/) 

> ... always be displayed as a full URL link in the form https://doi.org/10.xxxx/xxxxx 

So the “doi:” form is a normative format in the ISO standard but it isn’t formally a URI – it just shares the syntax.

There is a limited revision of the DOI standard under way but it won’t address this issue – though it will collate additional requirements. and resolving any ambiguity resulting from the above is on my list for subsequent work - so we can clarify that when Crossref (and others) recommends the HTTP URI form, they are formally compliant. 

I’m hoping we can then also add the urn:doi:10.xxxx/xxxxx format at the same time as a further normative format.

Thanks as ever to all for giving this attention.

Best regards,

Paul Jessop
Technology Adviser, The DOI Foundation

Paul Jessop              county analytics ltd 
--------------------------------------------- 
rights - technology - markets - music - media 
--------------------------------------------- 
paul@countyanalytics.com      +44 7850 685378
 
 
 

On 19/01/2021, 10:40, "Henry S. Thompson" <ht@inf.ed.ac.uk> wrote:

    Martin J. Dürst writes:

    > ...
    > What I think is important is to do some research on how DOIs are
    > currently used in the wild. In a very quick search, I found the
    > following two at least:
    >   IETF RFCs: DOI 10.17487/RFC3986
    >   ACM Digital Library: DOI:https://doi.org/10.1145/2594291.2594299

    Here's another relevant study:

      https://doi.org/10.1145/3184558.3191636

    > I'm very sure I have seen DOIs in URI form (i.e. doi:...), too.

    Indeed.  There are some numbers bearing this out in the above study,
    showing hundreds of thousands of uses of doi: URIs in HTML pages taken
    from Common Crawl, almost entirely in HTML <head> metadata.

    ht
    -- 
           Henry S. Thompson, School of Informatics, University of Edinburgh
          10 Crichton Street, Edinburgh EH8 9AB, SCOTLAND -- (44) 131 650-4440
                    Fax: (44) 131 650-4587, e-mail: ht@inf.ed.ac.uk
                           URL: http://www.ltg.ed.ac.uk/~ht/
     [mail from me _always_ has a .sig like this -- mail without it is forged spam]

    The University of Edinburgh is a charitable body, registered in
    Scotland, with registration number SC005336.