Re: Discontinuing the mhonarc archives January 14

Nick Hilliard <nick@foobar.org> Fri, 08 March 2019 14:00 UTC

Return-Path: <nick@foobar.org>
X-Original-To: ietf@ietfa.amsl.com
Delivered-To: ietf@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id CEE58127917; Fri, 8 Mar 2019 06:00:53 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.199
X-Spam-Level:
X-Spam-Status: No, score=-4.199 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id dfrfXhtQJ1sa; Fri, 8 Mar 2019 06:00:51 -0800 (PST)
Received: from mail.netability.ie (mail.netability.ie [IPv6:2a03:8900:0:100::5]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B7C741240D3; Fri, 8 Mar 2019 06:00:50 -0800 (PST)
X-Envelope-To: irsg@irtf.org
Received: from cupcake.local (089-101-195156.ntlworld.ie [89.101.195.156] (may be forged)) (authenticated bits=0) by mail.netability.ie (8.15.2/8.15.2) with ESMTPSA id x28E0kkZ016207 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-SHA bits=256 verify=NO); Fri, 8 Mar 2019 14:00:47 GMT (envelope-from nick@foobar.org)
X-Authentication-Warning: cheesecake.ibn.ie: Host 089-101-195156.ntlworld.ie [89.101.195.156] (may be forged) claimed to be cupcake.local
Subject: Re: Discontinuing the mhonarc archives January 14
To: Robert Sparks <rjsparks@nostrum.com>
Cc: "ietf@ietf.org" <ietf@ietf.org>, irsg@irtf.org
References: <33d1a5a4-5117-a264-316f-52465fdae372@nostrum.com> <c0887bdb-0c2e-0802-eb9e-04fb16b3ed9e@nostrum.com> <5927e0a6-3717-38e8-ed3c-76cbee1f800f@nostrum.com> <df286e5d-0f68-4634-3879-829b1c33e6eb@foobar.org> <16208a03-0c96-a73c-70d8-85cdd08cdb5e@nostrum.com>
From: Nick Hilliard <nick@foobar.org>
Message-ID: <470e30eb-4a72-c0cd-8ee7-f501ae4d84f4@foobar.org>
Date: Fri, 08 Mar 2019 14:00:45 +0000
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.14; rv:52.0) Gecko/20100101 PostboxApp/6.1.11
MIME-Version: 1.0
In-Reply-To: <16208a03-0c96-a73c-70d8-85cdd08cdb5e@nostrum.com>
Content-Type: multipart/alternative; boundary="------------EE4802948ABECB6607E9C329"
Content-Language: en-US
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf/KCYoMIUsYyHUV0k4rvYOlxlQ2PY>
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf/>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 08 Mar 2019 14:00:54 -0000

Robert,

a search for "site:mailarchive.ietf.org" gives 5 results on both Google 
search, Bing and duckduckgo.  I.e. the entire IETF mail archive is 
unsearchable via the most commonly used public search engines.  Is there 
any way of resolving this?

Nick
> Robert Sparks <mailto:rjsparks@nostrum.com>
> 10 January 2019 at 15:29
> Thanks for the nudge, Nick. Some conversations started before the end 
> of the year had not run to conclusion.
>
> The robots.txt file has been changed, and some things put in place to 
> help the spiders find all the messages.
>
> RjS
>
>
> Nick Hilliard <mailto:nick@foobar.org>
> 8 January 2019 at 19:59
>
>
> Robert,
>
> https://mailarchive.ietf.org/robots.txt includes "Disallow: /", which 
> means that the entire mailarch repository is unavailable on regular 
> search engines.
>
> This was reported in ietfdb ticket 2658.
>
> Are there plans to fix this before mhonarc is discontinued?  It would 
> be hugely regressive for ietf mailing lists to be cut off from regular 
> internet searches.
>
> Nick
>
> Robert Sparks <mailto:rjsparks@nostrum.com>
> 8 January 2019 at 19:49
> We will be discontinuing Mhonarc during the day on Monday Jan 14.
>
> As noted below, all existing Mhonarc URLs will redirect to the same 
> message in mailarch.
>
> The mbox files available through rsync will continue to be available.
>
>
>
> ------------------------------------------------------------------------