Re: Google Scholar, was How to pay $47 for a copy of RFC 793

Harald Alvestrand <harald@alvestrand.no> Tue, 10 May 2011 18:22 UTC

Return-Path: <harald@alvestrand.no>
X-Original-To: ietf@ietfa.amsl.com
Delivered-To: ietf@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3F935E0698 for <ietf@ietfa.amsl.com>; Tue, 10 May 2011 11:22:10 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -102.599
X-Spam-Level:
X-Spam-Status: No, score=-102.599 tagged_above=-999 required=5 tests=[AWL=0.000, BAYES_00=-2.599, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id qCWbPQbRtInC for <ietf@ietfa.amsl.com>; Tue, 10 May 2011 11:22:08 -0700 (PDT)
Received: from eikenes.alvestrand.no (eikenes.alvestrand.no [158.38.152.233]) by ietfa.amsl.com (Postfix) with ESMTP id 728E6E06F1 for <ietf@ietf.org>; Tue, 10 May 2011 11:22:07 -0700 (PDT)
Received: from localhost (localhost [127.0.0.1]) by eikenes.alvestrand.no (Postfix) with ESMTP id DE68B39E119; Tue, 10 May 2011 20:21:17 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at eikenes.alvestrand.no
Received: from eikenes.alvestrand.no ([127.0.0.1]) by localhost (eikenes.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2bA1b7kxm5wp; Tue, 10 May 2011 20:21:17 +0200 (CEST)
Received: from hta-dell.lul.corp.google.com (62-20-124-50.customer.telia.com [62.20.124.50]) by eikenes.alvestrand.no (Postfix) with ESMTPS id 3D22C39E0BF; Tue, 10 May 2011 20:21:17 +0200 (CEST)
Message-ID: <4DC9824C.2070109@alvestrand.no>
Date: Tue, 10 May 2011 20:22:04 +0200
From: Harald Alvestrand <harald@alvestrand.no>
User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.17) Gecko/20110424 Thunderbird/3.1.10
MIME-Version: 1.0
To: Paul Hoffman <paul.hoffman@vpnc.org>
Subject: Re: Google Scholar, was How to pay $47 for a copy of RFC 793
References: <20110510152851.40727.qmail@joyce.lan> <4DC95CBE.60304@alvestrand.no> <1C26E7D5-1810-4B13-B51B-A1220121531F@vpnc.org>
In-Reply-To: <1C26E7D5-1810-4B13-B51B-A1220121531F@vpnc.org>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit
Cc: John Levine <johnl@iecc.com>, ietf@ietf.org
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ietf>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 10 May 2011 18:22:10 -0000

On 05/10/11 18:14, Paul Hoffman wrote:
> On May 10, 2011, at 8:41 AM, Harald Alvestrand wrote:
>
>> For some reason, scholar has indexed 151 docs from tools.ietf.org and then stopped.
>
> If only there was someone who worked at Google on this list who could send an internal message to get this rectified.... :-)
 From what I could tell from the instructions, Scholar is using some 
heuristics to figure out that "this is a paper" and "this is not a 
paper". The highest one on the list was a 3-slide presentation that 
really didn't say very much - I think this is one where heuristics had 
failed.

I think someone at the site could help them a lot more.

> --Paul Hoffman
>
>