From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (qmail 9759 invoked by alias); 5 Apr 2005 15:17:35 -0000
Mailing-List: contact overseers-help@sources.redhat.com; run by ezmlm
Precedence: bulk
List-Archive:
List-Post:
List-Help: ,
Sender: overseers-owner@sources.redhat.com
Received: (qmail 9714 invoked from network); 5 Apr 2005 15:17:29 -0000
Received: from unknown (HELO mx1.redhat.com) (66.187.233.31) by sourceware.org with SMTP; 5 Apr 2005 15:17:29 -0000
Received: from int-mx1.corp.redhat.com (int-mx1.corp.redhat.com [172.16.52.254]) by mx1.redhat.com (8.12.11/8.12.11) with ESMTP id j35FHSJw004490; Tue, 5 Apr 2005 11:17:28 -0400
Received: from vpn26-8.sfbay.redhat.com (vpn26-8.sfbay.redhat.com [172.16.26.8]) by int-mx1.corp.redhat.com (8.11.6/8.11.6) with ESMTP id j35FHQO22052; Tue, 5 Apr 2005 11:17:27 -0400
Subject: Re: src crippled, cvs inaccessible.
From: Jeffrey A Law
Reply-To: law@redhat.com
To: Dave Korn
Cc: overseers@sourceware.org
In-Reply-To:
References:
Content-Type: text/plain
Organization: Red Hat, Inc
Date: Tue, 05 Apr 2005 15:17:00 -0000
Message-Id: <1112714244.13749.120.camel@localhost.localdomain>
Mime-Version: 1.0
Content-Transfer-Encoding: 7bit
X-SW-Source: 2005-q2/txt/msg00010.txt.bz2

On Tue, 2005-04-05 at 15:59 +0100, Dave Korn wrote:
> Hi overseers!
>
> I've been getting almost nothing but
>
> cvs [diff aborted]: reading from server: Software caused connection abort
>
> and
>
> cvs [diff aborted]: end of file from server (consult above messages if any)
>
> messages trying to access the cvs repository on src for about four hours
> now. I know that this is the sort of thing you expect to see when it's
> heavily loaded, but it seems to have been a lot worse and going on for
> longer than usual.
>
> Could someone with login access run a quick 'top' and make sure there
> isn't some stuck process hogging all the cpu (or similar)? TIA!

The machine seems to be running "OK" -- we're hitting the disks pretty
hard, which is causing us to spend a fair amount of time in disk wait.
The net result is we have a load average of ~20 due to all the processes
sitting in disk wait.

If you're using anoncvs, the connection refused messages are probably
due to the high load average.

jeff