From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path:
Received: (qmail 7600 invoked by alias); 28 Sep 2003 22:40:38 -0000
Mailing-List: contact overseers-help@sources.redhat.com; run by ezmlm
Precedence: bulk
List-Archive:
List-Post:
List-Help: ,
Sender: overseers-owner@sources.redhat.com
Received: (qmail 7578 invoked from network); 28 Sep 2003 22:40:38 -0000
Received: from unknown (HELO dair.pair.com) (209.68.1.49) by sources.redhat.com with SMTP; 28 Sep 2003 22:40:38 -0000
Received: (qmail 92623 invoked by uid 20157); 28 Sep 2003 22:40:37 -0000
Received: from localhost (sendmail-bs@127.0.0.1) by localhost with SMTP; 28 Sep 2003 22:40:37 -0000
Date: Sun, 28 Sep 2003 22:40:00 -0000
From: Hans-Peter Nilsson
X-X-Sender: hp@dair.pair.com
To: overseers@sources.redhat.com
Subject: Re: last days of htdig
In-Reply-To:
Message-ID:
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
X-SW-Source: 2003-q3/txt/msg00244.txt.bz2

On Sat, 27 Sep 2003, Hans-Peter Nilsson wrote:
> There's been no update since 2003-07-31; that's when some file
> passed 2G and it all fell apart, save for the existing DB. I'm
> going to try to exclude parts of gcc.gnu.org from indexing,
> probably some mailing lists.

By excluding /ml/gccadmin, documentation for released versions
matching /onlinedocs/gcc-, and adding some words (see the new file
gcc_bad_words in the htdig-conf dir) that appear in all or half the
messages, like "gcc", "gnu", "org", "patches", "from", abbrev. day
of month, day of week -- except "sun" :-) etc. to those not indexed,
the gcc htdig setup seems to be up and indexing again. For a few
months that is.

brgds, H-P
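
The message does not say how the words in gcc_bad_words were chosen beyond "appear in all or half the messages". As a rough illustration only, here is a minimal Python sketch of one way to find such candidates: count in how many messages each word occurs and keep those at or above a chosen fraction. The function name `frequent_words`, the `min_fraction` parameter, and the sample messages are all hypothetical and not part of the actual setup.

```python
from collections import Counter
import re


def frequent_words(messages, min_fraction=0.5):
    """Return, sorted, the words occurring in at least min_fraction of messages.

    Hypothetical helper: the real gcc_bad_words file may have been
    assembled by hand or by a different method.
    """
    doc_freq = Counter()
    for text in messages:
        # Count each word once per message (document frequency, not raw count).
        doc_freq.update(set(re.findall(r"[a-z]+", text.lower())))
    threshold = min_fraction * len(messages)
    return sorted(word for word, n in doc_freq.items() if n >= threshold)


if __name__ == "__main__":
    # Made-up sample messages, for illustration only.
    sample = [
        "From: a@gcc.gnu.org Subject: gcc patches",
        "From: b@gcc.gnu.org Subject: gnu org news",
        "From: c@example.com Subject: other",
    ]
    for word in frequent_words(sample):
        print(word)
```

A list produced this way would still need a manual pass before use, for instance to keep a word such as "sun" searchable even though it shows up in many date headers.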