Re: text in atom:category?

"Peter Keane" <pkeane@mail.utexas.edu> Mon, 08 December 2008 17:11 UTC

Message-ID: <8158ad750812080859p5ff7bb81qd15ab4ca88d8e1e9@mail.gmail.com>
Date: Mon, 08 Dec 2008 10:59:54 -0600
From: Peter Keane <pkeane@mail.utexas.edu>
To: Eric Scheid <eric.scheid@ironclad.net.au>
Subject: Re: text in atom:category?
Cc: Atom Syntax <atom-syntax@imc.org>
In-Reply-To: <C5638F3C.B475%eric.scheid@ironclad.net.au>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_Part_70202_4816625.1228755595020"
References: <8158ad750812072319r762d1d1h37142d15649efb84@mail.gmail.com> <C5638F3C.B475%eric.scheid@ironclad.net.au>
X-Google-Sender-Auth: caa7ff0f6db30f47
Sender: owner-atom-syntax@mail.imc.org
Precedence: bulk
List-Archive: <http://www.imc.org/atom-syntax/mail-archive/>
List-Unsubscribe: <mailto:atom-syntax-request@imc.org?body=unsubscribe>
List-ID: <atom-syntax.imc.org>

On Mon, Dec 8, 2008 at 9:58 AM, Eric Scheid <eric.scheid@ironclad.net.au> wrote:

>
> On 8/12/08 6:19 PM, "Peter Keane" <pkeane@mail.utexas.edu> wrote:
>
> > This would be great in an ideal world, but the key-value pairs are all
> > arbitrary (created by users, serialized to atom for syndication & atompub
> > manipulation -- like in a spreadsheet -- column name/cell value).
>
> Sounds very much like you are trying to shoe horn non-categorical metadata
> into the atom:category construct. Please don't do this ;-)


Well, I guess it depends on what you mean by categorizing.  Is asserting an
RDF triple in a sense "categorizing"?  While I could imagine lots of things
that are categorical metadata, I'd have a tough time figuring out a good set
of criteria to define "non-categorical" metadata.  Now "best practices" is
another matter -- it's a hack, for sure.


> > Since I now use a simple extension element for these key-value pairs, I was
> > thinking about using category instead -- the Balisage paper I cited in my
> > original message piqued my interest there. (Oh, BTW my example left out the
> > scheme, which would be something like "http://example.com/metadata").
>
> What would using the <atom:category/> syntax buy you that a
> <pkeane:key-value/> extension wouldn't? Surely you're not hoping for
> interoperability of your key-value data with the wider publishing world?
>

Perhaps not much -- I'd refer you to the paper I cited in my original post
[1] for a description of one possible benefit.  I do use the extension that
you describe and it indeed works fine.  There's been discussion in the past
about atom:category as a possible "RDFish extension point" [2], and efforts
to embed triples in Atom [3] -- I am just curious to know if other folks are
looking at things similarly vis-a-vis atom:category (sounds like not).

For me the bottom line is that I have a large (~200K items) digital
repository used on a large university campus (UT Austin -- daseproject.org)
that uses Atom/AtomPub extensively.  Users from many disciplines use it in a
variety of ways, but the central need is the same: we have "collections" of
items, each of which has a digital object attached (image, mp3, pdf, etc.)
plus derivatives, an arbitrary set of key-value pairs, and often a large
chunk of textual content.  Just by meeting those criteria we've replaced all
sorts of one-off collections of digital stuff in spreadsheets, Access,
Filemaker, php/mysql apps, etc. with a very standardized way to deal with
them.  New projects take a matter of days rather than weeks to get off the
ground, and archiving/preservation becomes much simpler.

I don't really feel all that strongly about how we might serialize the
key/vals in Atom.  It's basically a two-line change in my code for
serializing/unserializing.  That said, I have tried as much as possible to
hew closely to Atom/AtomPub and have indeed found that other tools
(MarsEdit, say, or the Picasa Image uploader) can start to provide useful
functionality for this app. Google Spreadsheets is the only obvious example
I have found of a similar need and they use a custom extension.
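
To illustrate how small that serialization switch could be, here is a
minimal Python sketch (not the actual DASE code).  It assumes the key goes
in the "term" attribute and the value in the text content of atom:category,
as the subject of this thread suggests; RFC 4287 allows content there but
assigns it no meaning.  The extension namespace, the "pair" element name,
and the helper function names are hypothetical; the scheme URI is the
example one quoted above.

```python
import xml.etree.ElementTree as ET

ATOM_NS = "http://www.w3.org/2005/Atom"
EXT_NS = "http://example.com/ns/keyval"   # hypothetical extension namespace
SCHEME = "http://example.com/metadata"    # example scheme URI from this thread

ET.register_namespace("atom", ATOM_NS)
ET.register_namespace("kv", EXT_NS)


def add_as_category(entry, key, value):
    """Variant A: key in @term, value as text content of atom:category."""
    cat = ET.SubElement(
        entry, f"{{{ATOM_NS}}}category", {"scheme": SCHEME, "term": key}
    )
    cat.text = value
    return cat


def add_as_extension(entry, key, value):
    """Variant B: key/value in a (hypothetical) extension element."""
    el = ET.SubElement(entry, f"{{{EXT_NS}}}pair", {"name": key})
    el.text = value
    return el


def read_pairs(entry):
    """Collect key-value pairs from either serialization."""
    pairs = []
    # Only categories carrying our scheme are treated as key-value metadata.
    for cat in entry.findall(f"{{{ATOM_NS}}}category"):
        if cat.get("scheme") == SCHEME:
            pairs.append((cat.get("term"), cat.text or ""))
    for el in entry.findall(f"{{{EXT_NS}}}pair"):
        pairs.append((el.get("name"), el.text or ""))
    return pairs


entry = ET.Element(f"{{{ATOM_NS}}}entry")
add_as_category(entry, "photographer", "Jane Doe")
add_as_extension(entry, "medium", "gelatin silver print")

print(ET.tostring(entry, encoding="unicode"))
print(read_pairs(entry))
```

Filtering on the scheme keeps ordinary categories (tags, subject headings)
separate from the key-value data, which is the main thing a reader would
need to know to consume either form.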

I have begun looking at using category in another unusual way, as
essentially the inverse of a "link" [4], which would allow me to use
AtomPub to create "links" between resources. But that's another (not
entirely unrelated) kettle of fish....

> >>  <category
> >>      scheme='http://www.loc.gov/catdir/cpso/lcco/'
> >>      term="SF429.S65"
> >>      label="siberian husky" />
> >>  <!-- note: term may not be entirely accurate, IANAL -->
> >>  <!-- (where L = Librarian, of course ;-) -->
> >>
> > This would certainly not work for me since I am, in fact, a librarian ;-)!
>
> (just curious btw - was I close with the LOC term for siberian husky, or did
> I clumsily misappropriate the wrong meta-data value from www.loc.gov?)
>

Actually, I don't really know.  Although I have an MLIS, for all intents &
purposes I might as well be "NAL" ;-).

--peter

[1] http://www.balisage.net/Proceedings/html/2008/Milowski01/Balisage2008-Milowski01.html
[2] http://torrez.us/archives/2006/05/25/447/
[3] http://www.ietf.org/internet-drafts/draft-nottingham-atomtriples-00.txt
[4] http://www.imc.org/atom-syntax/mail-archive/msg20770.html



>
>
> e.
>
>