Harvest
Jill Foster <Jill.Foster@newcastle.ac.uk> Sun, 12 March 1995 22:14 UTC
Received: from ietf.nri.reston.va.us by IETF.CNRI.Reston.VA.US id aa06329; 12 Mar 95 17:14 EST
Received: from CNRI.Reston.VA.US by IETF.CNRI.Reston.VA.US id aa06325; 12 Mar 95 17:14 EST
Received: from norn.ncl.ac.uk by CNRI.Reston.VA.US id aa10530; 12 Mar 95 17:14 EST
Received: by norn.mailbase.ac.uk id <VAA08674@norn.mailbase.ac.uk> (8.6.8.1/ for mailbase.ac.uk); Sun, 12 Mar 1995 21:51:37 GMT
Received: from cheviot.ncl.ac.uk by norn.mailbase.ac.uk id <VAA08664@norn.mailbase.ac.uk> (8.6.8.1/ for mailbase.ac.uk) with ESMTP; Sun, 12 Mar 1995 21:51:35 GMT
Received: from burnmoor.ncl.ac.uk by cheviot.ncl.ac.uk id <VAA04828@cheviot.ncl.ac.uk> (8.6.10/ for ncl.ac.uk) with SMTP; Sun, 12 Mar 1995 21:51:34 GMT
Received: from tuda.ncl.ac.uk (tuda.ncl.ac.uk [128.240.2.1]) by burnmoor.ncl.ac.uk (8.6.10/8.6.10-cf revision 2 for Solaris 2.x) with ESMTP id VAA02197; Sun, 12 Mar 1995 21:51:33 GMT
Received: from [128.240.2.23] (knott3.ncl.ac.uk [128.240.2.23]) by tuda.ncl.ac.uk (8.6.10/8.6.10-cf revision 1 for SunOS 4.1.x) with SMTP id VAA02058; Sun, 12 Mar 1995 21:51:27 GMT
Date: Sun, 12 Mar 1995 21:51:27 +0000
X-Sender: njf@burnmoor.ncl.ac.uk
Message-Id: <v02110106ab88bb03fd54@[128.240.3.154]>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
To: nir@mailbase.ac.uk
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: Jill Foster <Jill.Foster@newcastle.ac.uk>
Subject: Harvest
X-List: nir@mailbase.ac.uk
Reply-To: Jill Foster <Jill.Foster@newcastle.ac.uk>
X-Orig-Sender: nir-request@mailbase.ac.uk
Precedence: list
An update on Harvest from the Internet Monthly report. Jill Internet Monthly Report February 1995 INTERNET RESEARCH REPORTS ------------------------- RESOURCE DISCOVERY AND DIRECTORY SERVICE ---------------------------------------- The Internet Research Task Force research group on Resource Discovery has been developing and experimenting with the Harvest system for the past 1.5 years. Harvest provides an integrated set of tools to gather, extract, organize, search, cache, and replicate relevant information across the Internet. With modest effort users can tailor Harvest to digest information in many different formats, and offer custom search services on the Internet. Moreover, Harvest makes very efficient use of network traffic, remote servers, and disk space. In the past few months we have made significant improvements to the system, allowing well-controlled specifications of the information gathering workload, much better gathering and indexing performance, support for more data formats, more sophisticated caching and replication, ports to popular platforms, and much more easily installed and used binary distributions of the basic system. At present we are working on extending the system to support taxonomies and query routing, more complex data models, interfaces with other popular systems and products (such as Verity's and WAIS Inc.'s search engines, SGML, and SQL search engines), more customizable searching schemes, non-textual index/search engines, and a number of experiments concerning system scalability. We are actively pursuing collaborative efforts with other projects in all sectors - commercial, government, academic, and others. Readers can get information about Harvest (including demos, papers, software, and documentation) from http://harvest.cs.colorado.edu/ - Mike Schwartz (schwartz@cs.colorado.edu) University of Colorado, Boulder IRTF-RD Chair and Harvest Project Principal Investigator Mike Schwartz@latour.cs.colorado.edu.