TopNode data elements
percival@bronze.ucs.indiana.edu Sat, 14 November 1992 00:51 UTC
Message-Id: <9211132029.AA29854@kona.cc.mcgill.ca>
Date: Fri, 13 Nov 1992 15:37:08 -0500
To: pacs-l@uhupvm1.bitnet, cni-directories@cni.org, nir@cc.mcgill.ca
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: percival@bronze.ucs.indiana.edu
Subject: TopNode data elements
TopNode Data Elements

The data elements currently defined for the BRS TopNode database are described below. Please send any comments to percival@indiana.edu.

The data elements necessary to fully describe an information resource or service differ depending on how the resource is accessed. As a result, separate data element templates were created for different resource types or access methods. We chose separate templates rather than attempting to create a universal (and lengthy) template of data elements that could be applied to any network information resource. With separate templates, only data elements that are relevant to a specific resource type are present on any given template. Each template contains a set of common data elements which can generally be applied to any resource type (e.g. title, contact person, etc.), followed by a set of elements that are specific to the particular resource type.

Currently, templates (and the corresponding data elements) have been developed for the following resource types: FTP-accessible resources, resources accessible via terminal emulation (Telnet & TN3270), e-mail accessible resources (e.g. e-journals and listservers), conventional print resources (those concerned with networks and network information retrieval) and service centers (e.g. NICS, NOCS, supercomputer centers, etc.). Additional templates for other resource types such as WAIS and Gopher servers are planned.

*Internal fields - Certain fields described below do not originate with the authors during a registration process, but will need to be included by the TopNode staff who catalog each information source, or will be created automatically by the database application itself for management purposes. These "internal" elements are marked with an asterisk in the text below.

-Non-displayed fields - Some of the fields below will not be displayed to users. These fields are marked with a dash (-).
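The marker convention just described can be modeled in a few lines. What follows is a minimal Python sketch for illustration only, not part of the TopNode or BRS software. The names FieldSpec and parse_field_heading are hypothetical, and the sketch assumes that every field heading has the form "MARKERS NAME (..Tag:)" used in the field listings of this message.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldSpec:
    name: str        # element name, e.g. "DOCN"
    tag: str         # BRS load tag, e.g. "..Document-Number:"
    internal: bool   # True when marked "*": supplied by staff or the database
    displayed: bool  # False when marked "-": never shown to users


# Optional "-" and "*" markers, the element name, then the load tag in
# parentheses. The heading layout is an assumption taken from this message.
HEADING_RE = re.compile(
    r"^(?P<markers>[-*]*)\s*(?P<name>.+?)\s*\((?P<tag>\.\.[^)]*:)\)"
)


def parse_field_heading(heading: str) -> FieldSpec:
    """Split one field heading into its name, load tag, and marker flags."""
    match = HEADING_RE.match(heading.strip())
    if match is None:
        raise ValueError(f"not a field heading: {heading!r}")
    markers = match.group("markers")
    return FieldSpec(
        name=match.group("name"),
        tag=match.group("tag"),
        internal="*" in markers,
        displayed="-" not in markers,
    )


print(parse_field_heading("-*DOCN (..Document-Number:) - An internal document number"))
print(parse_field_heading("*RECORD QUALITY (..RQ:) - Quality rating"))
```

Keeping the two markers as independent flags reflects the text: a field can be internal and still displayed (RECORD QUALITY), or hidden without being internal (the private contact fields).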
Note: maximum line length for all fields is 74 characters unless noted otherwise.

------------------------------------------------------------------------------
The following data elements are those which can generally be applied to all information resource types. Values in parentheses are the BRS load tags for the element. Note that not all fields are required, and some may not be appropriate for a given resource.
------------------------------------------------------------------------------

-*DOCN (..Document-Number:) - An internal document number created by BRS.
-*UNIQUE RECORD IDENTIFIER (..ID:) - TopNode unique record identifier, created internally.
-*RECORD COMPLETE/INCOMPLETE (..RC:) - An internal field which describes whether the record has been cataloged and is ready to be released.
*RECORD QUALITY (..RQ:) - The quality of records will vary even after cataloging, based on the amount of information that is available for a resource. Some records may result from a self-registration process and will not be "cataloged" at all. This field will indicate to users the overall quality of the record (e.g. High quality, Questionable, etc.). The cataloging staff will assign a quality rating. The exact categorization scheme has yet to be determined.
PROTOCOL TYPE/ACCESS METHOD (..Protocol:) - How a resource is accessed. Currently this field may be one of: Terminal, FTP, E-mail, Print, Service Center, WAIS, or Gopher. Other types will be added as needed. Specific elements for each type are discussed below.
LABEL (..Label:) - A field which will be used to label information sources (e.g. short title) in various user applications. The maximum length of this field is 60 characters.
TITLE (..Title:) - A complete title for the resource (potentially multiple lines).
AUTHOR (..Author:) - The personal author.
CORPORATE AUTHOR (..Corpauthor:) - The corporate author, if appropriate.
ORGANIZATION (..Organization:) - Organizational affiliation or organization supplying the service, if appropriate.
RESOURCE LOCATION (..Location:) - City, State, Country
TIME ZONE (..Timezone:) - Time zone relative to GMT
ACCESS TIME RESTRICTIONS (..Accesstime:) - Text field describing when the resource is not available
KEYWORDS (..Keywords:) - Any keywords which might be applied to the record
SUBJECT HEADINGS (..Subjects:) - Author-supplied and from an evolving TopNode thesaurus
ABSTRACT (..Abstract:) - Description of the information source/resource (not access information)

The public contact information fields listed below will be displayed to users upon request.

PUBLIC CONTACT NAME (..Pubname:)
PUBLIC CONTACT MAILING ADDRESS (..Pubaddress:)
PUBLIC CONTACT PHONE NUMBER (..Pubphone:)
PUBLIC CONTACT FAX NUMBER (..Pubfax:)
PUBLIC CONTACT E-MAIL ADDRESS (..Pubemail:)

Often contacts may not want their names and/or e-mail addresses made public. The private contact information fields listed below will never be given to users but will be used as needed by the database maintenance staff.

-PRIVATE CONTACT NAME (..Privname:)
-PRIVATE CONTACT MAILING ADDRESS (..Privaddress:)
-PRIVATE CONTACT PHONE NUMBER (..Privphone:)
-PRIVATE CONTACT FAX NUMBER (..Privfax:)
-PRIVATE CONTACT E-MAIL ADDRESS (..Privemail:)

Often, the TopNode staff will be responsible for maintaining the database content. In some cases, however, others may choose to take responsibility for maintaining certain entries. The following maintainer fields will indicate who is responsible (and gets the credit) for each record.

MAINTAINER NAME (..Mainname:)
MAINTAINER MAILING ADDRESS (..Mainaddress:)
MAINTAINER PHONE NUMBER (..Mainphone:)
MAINTAINER FAX NUMBER (..Mainfax:)
MAINTAINER E-MAIL ADDRESS (..Mainemail:)
FREQUENCY OF UPDATES/PUBLICATION (..Frequency:) - (irregular, never, #days) Hopefully this field will allow some automation of the maintenance effort (e.g. generating a report of entries that are out of date and need follow-up, or mailing authors their entries when it is time to verify that the information is still current). Exact values for this field have not been determined.
*LAST VERIFIED (..Verified:) - The last date the entry was verified by the maintainer, in yymmdd or yyyymmdd format
*LAST UPDATED (..Updated:) - Date stamped automatically by the database
*CREATION DATE (..Created:) - Date stamped automatically by the database

What follows are the additional fields which are required for each specific resource type. A complete TopNode record is made up of the general fields listed above and the additional fields for the specific resource type.

------------------------------------------------------------------------------
Additional fields for Type: Terminal resource
------------------------------------------------------------------------------

COST/VALIDATION (..Cost:) - Text field describing costs or special validation required to gain access
TERMINAL TYPE (..Terminal:) - One of: Telnet, Telnet/VT100, or TN3270
DOMAIN (CANONICAL) NAME (..Domainname:) - e.g. ucs.indiana.edu
IP ADDRESS (..IP:) - e.g. 129.79.1.16
SOCKET/PORT (..Port:) - Supplied if other than the default
LOGIN ID (..LoginID:) - If a username is required at login
PASSWORD (..Password:) - Any required password
CONNECTION INFORMATION (..Connectinfo:) - Text describing any additional information necessary for a connection. Any special logoff instructions should also be included here as a second paragraph.
LOGIN SCRIPT (..Script:) - To-be-determined script which could be used to automate the connection. A variety of scripting standards may be included here.

------------------------------------------------------------------------------
Additional fields for Type: anonymous FTP file access (a subset could describe the anonymous FTP site itself, i.e. no specific file information; site/directory/files flag?)
------------------------------------------------------------------------------

*UNIQUE DOCUMENT IDENTIFIER (..UDI:) - Any given file available via anonymous FTP likely resides on more than one server and may be available in more than one format (e.g. ASCII & Postscript). The current thinking is that each different form of the file (different sites and different formats) should be entered into the database as a separate record. The problem is that users searching the database will receive multiple hits for the same file. Ultimately the development of a standard unique document identifier (i.e. the IETF work in this area) will help alleviate this problem. In the interim, we may try to use this field for this purpose in the TopNode database so that users can view information about the primary source file and then look up secondary copies. We may, however, choose to list only a single copy of the file and then have users access Archie to locate alternate copies.
DOMAIN (CANONICAL) NAME (..Domainname:) - e.g. ucs.indiana.edu
IP ADDRESS (..IP:) - e.g. 129.79.1.16
LOGIN ID (..LoginID:) - If a username other than ANONYMOUS is required at login
PASSWORD (..Password:) - Any required password
MACHINE/OPERATING SYSTEM (..Opsystem:) - e.g. UNIX, IBM VM, VAX/VMS
FILENAME (..Filename:) - File name
PATH (..Path:) - Full path name to the file
FILE SIZE (..Filesize:) - In bytes
FILE TYPE (..Filetype:) - Document, application, or data. Other types will likely need to be added.
FILE FORMAT (..Fileformat:) - e.g. postscript, GIF, TIFF, ASCII, Wordperfect, tar, tar.Z, ZIP, hqx, ...
SYSTEM REQUIREMENTS (..Requirements:) - Text field describing any special requirements. Probably most appropriate for applications, but might be used to indicate if a special application is required to process a data file, etc.

------------------------------------------------------------------------------
Additional fields for Type: Service Centers (e.g. NICS, supercomputer centers, etc.)
------------------------------------------------------------------------------

No additional fields beyond the general fields are required.

------------------------------------------------------------------------------
Additional fields for Type: e-mail distribution lists
------------------------------------------------------------------------------

INTERNET LIST ADDRESS (..Interlist:)
BITNET LIST ADDRESS (..Bitlist:)
INTERNET SUBSCRIPTION ADDRESS (..Intersubs:)
BITNET SUBSCRIPTION ADDRESS (..Bitsubs:)
SUBSCRIPTION CONVENTION (..Subscribe:) - bitnet "subscribe", plain English to,...

------------------------------------------------------------------------------
Additional fields for Type: Publication/Print
------------------------------------------------------------------------------

PUBLISHER (..Publisher:)
PLACE OF PUBLICATION (..Pubplace:) - City and state
PUBLICATION DATE (..Date:)
ISBN (..ISBN:)
ISSN (..ISSN:)
SUBSCRIPTION ADDRESS (..Address:) - For serials
PUBLICATION TYPE (..Pubtype:) - A non-displayed flag which allows books and serials to be differentiated
COLLATION INFORMATION (..Pagination:)

Below is a sample record complete with the BRS load tags. Note that only those data elements which are necessary or appropriate have been included.

.Protocol: Terminal
.Label: CARL
.Title: Colorado Alliance of Research Libraries (CARL)
.Corpauthor: CARL Systems
.Location: Denver, Colorado, USA
.Keywords: Internet-accessible library catalog; OPAC; Online Public Access
  Catalog; Online bibliographic database; Journal Index
.Abstract: Internet-accessible card catalog to 50 major libraries in
  Colorado and a database of current article information taken from
  10,000 multidisciplinary journals. Articles can be sent directly to
  you or your fax machine. Costs of article delivery may vary.
.Pubname: CARL Systems, Inc
.Pubaddress: 3801 East Florida Street
  Suite 300
  Denver, CO 80210
.Pubphone: 303-758-3030
.Pubfax: 303-758-0606
.Pubemail: help@carl.org
.Privname: Robert Shafer
.Privaddress: Colorado Alliance of Research Libraries
  777 Grant, Suite 304
  Denver, CO 80203
.Privphone: 303-758-3030
.Privemail: bshafer@SERVER.CARL.ORG
/~ Additional fields for Type: Terminal
.Cost: Free
.Terminal: VT100
.Domainname: pac.carl.org
.IP: 192.54.81.128
.Connectinfo: To log off type //exit.
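
A record in this load format could be read with a short parser. The Python sketch below is an illustration only and is not the BRS loader. It assumes that each element starts on a line beginning with one or two dots followed by the tag name and a colon, that any other non-empty line continues the previous element, and that lines beginning with "/~" are comments. It also assumes the 74-character and 60-character limits apply to each value line rather than to the whole line including the tag. The function names and the line breaks in the embedded sample are hypothetical.

```python
import re

MAX_LINE = 74    # general line-length limit stated in the text
MAX_LABEL = 60   # limit stated for the LABEL field

# One or two leading dots, a tag name, a colon, then the value (assumed syntax).
TAG_RE = re.compile(r"^\.{1,2}(?P<tag>[A-Za-z][A-Za-z0-9-]*):\s*(?P<value>.*)$")


def parse_record(text):
    """Map each tag to the list of value lines that belong to it."""
    record = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("/~"):  # assumed comment marker
            continue
        match = TAG_RE.match(line)
        if match:
            current = match.group("tag")
            record.setdefault(current, []).append(match.group("value"))
        elif current is not None:
            record[current].append(line)  # continuation of the previous element
    return record


def check_lengths(record):
    """Return (tag, line) pairs whose value line exceeds the stated limit."""
    problems = []
    for tag, lines in record.items():
        limit = MAX_LABEL if tag == "Label" else MAX_LINE
        problems.extend((tag, line) for line in lines if len(line) > limit)
    return problems


SAMPLE = """\
.Protocol: Terminal
.Label: CARL
.Pubaddress: 3801 East Florida Street
  Suite 300
  Denver, CO 80210
/~ Additional fields for Type: Terminal
.Terminal: VT100
.Connectinfo: To log off type //exit.
"""

record = parse_record(SAMPLE)
print(record["Pubaddress"])
print(check_lengths(record))
```

Storing each element as a list of lines keeps multi-line values such as addresses and abstracts intact, and lets the length check run per line.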