Re: [TOOLS-DEVELOPMENT] Narrowing the slowdown down...

Robert Sparks <rjsparks@nostrum.com> Mon, 27 June 2011 14:50 UTC

Return-Path: <rjsparks@nostrum.com>
X-Original-To: tools-development@ietfa.amsl.com
Delivered-To: tools-development@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E43D911E80F7; Mon, 27 Jun 2011 07:50:01 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -102.599
X-Spam-Level:
X-Spam-Status: No, score=-102.599 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id JNRvh+AWqKYz; Mon, 27 Jun 2011 07:50:01 -0700 (PDT)
Received: from nostrum.com (shaman.nostrum.com [72.232.179.90]) by ietfa.amsl.com (Postfix) with ESMTP id 2DD0211E80E0; Mon, 27 Jun 2011 07:50:00 -0700 (PDT)
Received: from dn3-177.estacado.net (vicuna-alt.estacado.net [75.53.54.121]) (authenticated bits=0) by nostrum.com (8.14.3/8.14.3) with ESMTP id p5REmFsq022271 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NO); Mon, 27 Jun 2011 09:48:16 -0500 (CDT) (envelope-from rjsparks@nostrum.com)
Mime-Version: 1.0 (Apple Message framework v1084)
Content-Type: text/plain; charset="us-ascii"
From: Robert Sparks <rjsparks@nostrum.com>
In-Reply-To: <20110627144404.GA29259@amsl.com>
Date: Mon, 27 Jun 2011 09:48:15 -0500
Content-Transfer-Encoding: quoted-printable
Message-Id: <0A2F9BE9-D22D-4D90-A296-5C2A60BEFE7B@nostrum.com>
References: <20110627144404.GA29259@amsl.com>
To: Glen <glen@amsl.com>
X-Mailer: Apple Mail (2.1084)
Received-SPF: pass (nostrum.com: 75.53.54.121 is authenticated by a trusted mechanism)
Cc: IETF WG chairs <wgchairs@ietf.org>, iab@ietf.org, IETF Tools Development <tools-development@ietf.org>, IAOC <iaoc@ietf.org>, Gonzalo Camarillo <Gonzalo.Camarillo@ericsson.com>, Dan Romascanu <dromasca@avaya.com>, The IESG <iesg@ietf.org>, pete resnick <presnick@qualcomm.com>, Henrik Levkowetz <henrik@levkowetz.com>, Stephen Farrell <stephen.farrell@cs.tcd.ie>
Subject: Re: [TOOLS-DEVELOPMENT] Narrowing the slowdown down...
X-BeenThere: tools-development@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Tools Development list server <tools-development.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tools-development>, <mailto:tools-development-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/tools-development>
List-Post: <mailto:tools-development@ietf.org>
List-Help: <mailto:tools-development-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tools-development>, <mailto:tools-development-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 27 Jun 2011 14:50:02 -0000

This was not the draft that was cleared.
On Jun 27, 2011, at 9:44 AM, Glen wrote:

> All -
> 
> I have sent detailed data to the tools team and Henrik, but I wanted to alert
> everyone to a pattern I've seen during my analysis:
> 
> This request:
> 
> POST /doc/draft-ietf-payload-rfc3016bis/edit/position/ HTTP/1.1" 302 - 
> https://datatracker.ietf.org/doc/draft-ietf-payload-rfc3016bis/edit/position/
> 
> was seen in the logs at the start of both slowdowns, and I now suspect that
> there may be database corruption and/or some problem with the code related
> either to ballot positions generally, or this draft specifically.
> 
> It comes to my mind that, while I was gone, a request came in to clear the
> ballot positions for a draft, which the secretariat did.  This may have been
> the draft that was cleared - and clearing it may have caused some type of
> problem for the datatracker.
> 
> Of course, the datatracker should not loop or fail even if data is bad, but
> not all possibilities can be forseen.
> 
> It is my hope that we will both be able to correct a potential database
> problem, and find and harden a potential datatracker bug, quickly.
> 
> In the meantime, until we hear from the tools team, it might be best to
> at least refrain from voting on the above draft, if not all drafts.
> 
> If you do vote on a draft, and get a response, don't get too excited either
> way.  The server actually survives for an hour or more once the bug starts
> using resources (I'm actually proud of this - it's a HUGE server with lots
> of resources - the old servers would have died much more quickly. ;-) so
> things can appear okay for a while.
> 
> Now that we know what to look for, we can catch it earlier, but I'm still
> hopeful for a quick fix and repair today.
> 
> Thanks,
> Glen
> _______________________________________________
> TOOLS-DEVELOPMENT mailing list
> TOOLS-DEVELOPMENT@ietf.org
> https://www.ietf.org/mailman/listinfo/tools-development