Re: [Mtgvenue] Exploration of a "posting" metric - draft-elkins-mtgvenue-participation-metrics

Dave Crocker <dcrocker@bbiw.net> Wed, 20 July 2016 18:12 UTC

Return-Path: <dcrocker@bbiw.net>
X-Original-To: mtgvenue@ietfa.amsl.com
Delivered-To: mtgvenue@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id A529012D9EA for <mtgvenue@ietfa.amsl.com>; Wed, 20 Jul 2016 11:12:17 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.107
X-Spam-Level:
X-Spam-Status: No, score=-1.107 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RDNS_NONE=0.793] autolearn=no autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id DmosoHKF4NEH for <mtgvenue@ietfa.amsl.com>; Wed, 20 Jul 2016 11:12:16 -0700 (PDT)
Received: from simon.songbird.com (unknown [72.52.113.5]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id C318512D9A7 for <mtgvenue@ietf.org>; Wed, 20 Jul 2016 11:12:09 -0700 (PDT)
Received: from [192.168.100.164] (p578ab585.dip0.t-ipconnect.de [87.138.181.133]) (authenticated bits=0) by simon.songbird.com (8.14.4/8.14.4/Debian-4.1ubuntu1) with ESMTP id u6KICiV2028380 (version=TLSv1/SSLv3 cipher=DHE-RSA-CAMELLIA256-SHA bits=256 verify=NOT); Wed, 20 Jul 2016 11:12:45 -0700
To: "Fred Baker (fred)" <fred@cisco.com>
References: <F4EACEAA-0255-4985-AB4B-0247085C1D68@cisco.com> <6ca7e342-87b7-fa19-1ca4-ae8fa9908c12@dcrocker.net> <B6A66A5E-5DBA-43D5-8140-F4248A8E023F@cisco.com>
From: Dave Crocker <dcrocker@bbiw.net>
Organization: Brandenburg InternetWorking
Message-ID: <7c78abb8-d930-db2b-715d-70bb290ebea0@bbiw.net>
Date: Wed, 20 Jul 2016 20:11:49 +0200
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.2.0
MIME-Version: 1.0
In-Reply-To: <B6A66A5E-5DBA-43D5-8140-F4248A8E023F@cisco.com>
Content-Type: text/plain; charset="windows-1252"; format="flowed"
Content-Transfer-Encoding: 7bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/mtgvenue/edUgQ6Y0pMfgXiC5o3fl0mqWRNo>
Cc: "mtgvenue@ietf.org" <mtgvenue@ietf.org>, Nalini Elkins <nalini.elkins@insidethestack.com>
Subject: Re: [Mtgvenue] Exploration of a "posting" metric - draft-elkins-mtgvenue-participation-metrics
X-BeenThere: mtgvenue@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: "List for email discussion of the IAOC meeting venue selection process." <mtgvenue.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/mtgvenue>, <mailto:mtgvenue-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/mtgvenue/>
List-Post: <mailto:mtgvenue@ietf.org>
List-Help: <mailto:mtgvenue-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/mtgvenue>, <mailto:mtgvenue-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 20 Jul 2016 18:12:17 -0000

On 7/20/2016 8:08 PM, Fred Baker (fred) wrote:
> If you have some good heuristics, I'd be interested. I don't think "hire an intern" is a long term solution.


Well, it gets easier when the input is an xml source, but for classic 
ascii source, other than obvious header-searching algorithms, no I 
haven't a clue.

A simple algorithm would likely be acceptable at finding the right 
document sections, but would probably parse the contents into many 
possible names that aren't really names,  I suspect that correlating 
these candidate with various signup lists (ietf mailing lists, ietf 
registrations, whatever) could get pretty close.  mumble.

d/

-- 

   Dave Crocker
   Brandenburg InternetWorking
   bbiw.net