Harvest

Jill Foster <Jill.Foster@newcastle.ac.uk> Sun, 12 March 1995 22:14 UTC

Date: Sun, 12 Mar 1995 21:51:27 GMT
X-Sender: njf@burnmoor.ncl.ac.uk
Message-Id: <v02110106ab88bb03fd54@[128.240.3.154]>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
To: nir@mailbase.ac.uk
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: Jill Foster <Jill.Foster@newcastle.ac.uk>
Subject: Harvest
X-List: nir@mailbase.ac.uk
Reply-To: Jill Foster <Jill.Foster@newcastle.ac.uk>
X-Orig-Sender: nir-request@mailbase.ac.uk
Precedence: list

An update on Harvest from the Internet Monthly Report. Jill


Internet Monthly Report                                    February 1995



INTERNET RESEARCH REPORTS
-------------------------

     RESOURCE DISCOVERY AND DIRECTORY SERVICE
     ----------------------------------------

        The Internet Research Task Force research group on Resource
        Discovery has been developing and experimenting with the Harvest
        system for the past 1.5 years.

        Harvest provides an integrated set of tools to gather, extract,
        organize, search, cache, and replicate relevant information
        across the Internet.  With modest effort users can tailor
        Harvest to digest information in many different formats, and
        offer custom search services on the Internet.  Moreover, Harvest
        makes very efficient use of network traffic, remote servers, and
        disk space.

        In the past few months we have made significant improvements to
        the system, allowing well-controlled specifications of the
        information gathering workload, much better gathering and
        indexing performance, support for more data formats, more
        sophisticated caching and replication, ports to popular
        platforms, and binary distributions of the basic system that are
        much easier to install and use.  At present we are working on
        extending the system to support taxonomies and query routing,
        more complex data models, interfaces with other popular systems
        and products (such as Verity's and WAIS Inc.'s search engines,
        SGML, and SQL search engines), more customizable searching
        schemes, non-textual index/search engines, and a number of
        experiments concerning system scalability.  We are actively
        pursuing collaborative efforts with other projects in all
        sectors - commercial, government, academic, and others.

        Readers can get information about Harvest (including demos,
        papers, software, and documentation) from
        http://harvest.cs.colorado.edu/

        - Mike Schwartz (schwartz@cs.colorado.edu)
          University of Colorado, Boulder
          IRTF-RD Chair
          and Harvest Project Principal Investigator

        Mike Schwartz@latour.cs.colorado.edu.