From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 19186 invoked by alias); 11 Nov 2010 18:06:43 -0000 Received: (qmail 19163 invoked by uid 22791); 11 Nov 2010 18:06:42 -0000 X-SWARE-Spam-Status: No, hits=-1.7 required=5.0 tests=AWL,BAYES_00,FSL_RU_URL X-Spam-Check-By: sourceware.org Received: from aibo.runbox.com (HELO aibo.runbox.com) (87.238.52.70) by sourceware.org (qpsmtpd/0.43rc1) with ESMTP; Thu, 11 Nov 2010 18:06:37 +0000 Received: from [10.9.9.161] (helo=patch.runbox.com) by greyhound.runbox.com with esmtp (Exim 4.50) id 1PGbXf-0003DF-8W for overseers@sourceware.org; Thu, 11 Nov 2010 19:06:35 +0100 Received: from 99-197-203-76.cust.wildblue.net ([99.197.203.76] helo=Lenny.Bothner.com) by patch.runbox.com with esmtpsa (uid:757155 ) (TLS-1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.63) id 1PGbXd-0000LW-Ji for overseers@sourceware.org; Thu, 11 Nov 2010 19:06:35 +0100 Message-ID: <4CDC308D.3080603@bothner.com> Date: Thu, 11 Nov 2010 18:06:00 -0000 From: Per Bothner User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.12) Gecko/20101027 Fedora/3.1.6-1.fc13 Thunderbird/3.1.6 MIME-Version: 1.0 To: overseers@sourceware.org Subject: Re: sourceware on search engines References: <4CDA46DC.50709@jifvik.org> <20101110142102.GF26790@redhat.com> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Mailing-List: contact overseers-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: , Sender: overseers-owner@sourceware.org X-SW-Source: 2010-q4/txt/msg00037.txt.bz2 On 11/11/2010 10:03 AM, Tom Tromey wrote: >>>>>> "Frank" == Frank Ch Eigler writes: > > Frank> They generally are indexed (not included in robots.txt). Can you give > Frank> an example of what you see missing? > > I've seen this too. > Almost any search that I would expect to hit on sourceware.org instead > pulls up results from elsewhere, often cygwin.ru. > > E.g., search for "systemtap signedness roland" on Google. > This shows cygwin.ru, nabble.com, but not sourceware. > Now add "site:sourceware.org" -- I see no hits. I Googled for "kawa mailing list" and http://www.cygwin.com/ml/kawa/ came up, rather than sourceware,org, So it *is* being indexed - just under the wrong hostname. -- --Per Bothner per@bothner.com http://per.bothner.com/