Re: File metadata format draft

Trace Bond <tbond@ctv.ca> Thu, 15 November 2007 00:09 UTC

Return-path: <discuss-bounces@apps.ietf.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1IsSIb-0007O5-CO; Wed, 14 Nov 2007 19:09:37 -0500
Received: from discuss by megatron.ietf.org with local (Exim 4.43) id 1IsSIa-0007Md-2o for discuss-confirm+ok@megatron.ietf.org; Wed, 14 Nov 2007 19:09:36 -0500
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1IsSIZ-0007MT-PH for discuss@apps.ietf.org; Wed, 14 Nov 2007 19:09:35 -0500
Received: from mail26-priv.ctv.ca ([199.85.69.70]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1IsSIX-00060A-3S for discuss@apps.ietf.org; Wed, 14 Nov 2007 19:09:35 -0500
X-IronPort-AV: E=Sophos;i="4.21,418,1188792000"; d="scan'208";a="55314228"
Received: by agcoms01.corp.ctv.ca with Internet Mail Service (5.5.2658.3) id <WN5WJLWW>; Wed, 14 Nov 2007 19:09:29 -0500
Message-ID: <F8EBAC189452844C808799CDF8B1F7F707C52C4E@vacoms04.corp.ctv.ca>
From: Trace Bond <tbond@ctv.ca>
To: discuss@apps.ietf.org
Subject: Re: File metadata format draft
Date: Wed, 14 Nov 2007 19:09:27 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2658.3)
Content-Type: text/plain
X-Spam-Score: 0.0 (/)
X-Scan-Signature: 6ffdee8af20de249c24731d8414917d3
Cc: METS@LOC.gov
X-BeenThere: discuss@apps.ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: general discussion of application-layer protocols <discuss.apps.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=unsubscribe>
List-Post: <mailto:discuss@apps.ietf.org>
List-Help: <mailto:discuss-request@apps.ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=subscribe>
Errors-To: discuss-bounces@apps.ietf.org

Hi Miguel,
Thanks for this draft. I think this is a very important topic!

For the past two years, I've been searching for an XML format for describing
computer files that I could use for creating catalogs of files for backup
purposes.
Such a format could be a very useful tool for making, duplicating and
sharing vendor-independent file systems, as well as cataloging.

There seem to be two barriers to the embrace of some such standard:
1) The tendency of the community to focus on "resources" which may be
misguided since most of the "resources" are files.
2) There are already several file metadata XML formats and the
community/communities cannot focus on one.
	Examples:
	a) the WebDAV format mentioned below
	b) FLUTE/RFC3926's file description format
	c) <file><fileGrp> schema from the library community
(http://www.loc.gov/standards/mets/)
	d) the DublinCore/RDF format from the resource-centric SemanticWeb
community

One of my many questions is.. could this draft be the core of a
general-purpose XML format for describing computer files and file systems?
Is such a question or format meaningful or possible? 

Some questions on this draft
http://www.ietf.org/internet-drafts/draft-garcia-app-area-file-data-format-0
0.txt:
1) From page 5:  
"Each <instance> element provides information that is related to a
particular instance of the file, rather than the file itself." 
This is unclear to me.. isn't each instance of the file an actual physical
copy/clone?
2) From page 6:
"The <modification-date> element indicates the date and time at which the
file was last modified."
If this is the case, then this element should be in <identity> (not
<instance>) since it is my understanding that all instances/copies of a file
must be identical.

Please forgive me for posting to two email forums. 
	Best regards,
	Trace Bond
	Vancouver

@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
At the moment the integration with the WebDAV properties is not done. I
believe this is a topic for discussion. I have here more questions than
answers. For example:

Do we want to propose a single data format for describing file metadata? The
WebDAV properties is already standardized and deployed, whereas the
file-data-format draft is not. However, the requirements of both
applications are not totally the same. For example, the file data format
does not care about the creator, but needs to signal different endpoints
(instances) that host the same file.

Or can we integrate the WebDAV properties into the file data format? This
will effectively create a superset of the WebDAV properties, bt I am not
sure it will have a footprint on the implementations, due to the advance
state of deployment of WebDAV.

Or do we want to continue with two separate paths, perhaps with this mapping
that you mentioned?

BR,
Miguel


Cyrus Daboo wrote: 

	Hi Miguel,

	--On November 12, 2007 9:58:50 PM +0200 Miguel Garcia <Miguel.Garcia
at nsn.com> wrote:
	
	

		I have submitted a draft that defines an XML schema for
describing files
		and associated metadata. I am referring to this draft:

	
http://www.ietf.org/internet-drafts/draft-garcia-app-area-file-data-forma
		t-00.txt

		The background of this draft is as follows: We first
submitted a SIP
		event package for subscribing to changes in files stored in
a remote
		endpoint. The idea is to use SIP to emulate the "shared
folder"
		functionality that is available in common instant messaging
and presence
		clients. A part of the overall function is a format for
describing files
		and associated metadata.

		At the last IETF meeting, I got a few comments indicating
that the data
		format could be reused by other protocols, namely HTTP, and
thus, it
		should be split out from the draft and discussed in the Apss
area. So
		this is what I am doing.

		So, the authors would like to get comments and questions
about the
		document.
		

	
	Had a quick look. In the HTTP world, meta-data about files is
already covered by WebDAV properties. So the question I have is what, if
any, thought has been given to integration with WebDAV. e.g. a lot of
elements in your schema have exact equivalents in WebDAV. It might be
useful, at a minimum, to define a mapping between your schema and what
WebDAV provides.
	
	


--
Miguel A. Garcia           tel:+358-50-4804586
Nokia Siemens Networks     Espoo, Finland