From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mout.gmx.net (mout.gmx.net [212.227.17.21]) by sourceware.org (Postfix) with ESMTPS id BFF20394D81A for ; Wed, 6 May 2020 14:55:47 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org BFF20394D81A X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c Received: from [10.2.0.112] ([62.213.40.60]) by mail.gmx.com (mrgmx104 [212.227.17.174]) with ESMTPSA (Nemesis) id 1MWAOQ-1jcMmO18I6-00XZ4s; Wed, 06 May 2020 16:55:38 +0200 From: Arseny Solokha Subject: Re: Stability of pipermail ml archive URLs To: "Frank Ch. Eigler" , Jakub Jelinek , Overseers mailing list Cc: GCC Development References: <20200506141139.GJ2375@tucnak> <20200506144446.GB2466959@elastic.org> Message-ID: Date: Wed, 6 May 2020 21:55:31 +0700 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.8.0 MIME-Version: 1.0 In-Reply-To: <20200506144446.GB2466959@elastic.org> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: quoted-printable X-Provags-ID: V03:K1:PSX04KFciMnO6l0minN9+Nc/075a7OXXfpgbat6SolmuJ5l9o47 Q/fEo4iVcFiGLYARYDWCFqyLI1ZC2d3IvhgHIlv2A54WqVkyqXEaSA8NUgiqNvnftO3serK Pkk6Xh0FZ3zFXfGjf2XOmJWQgX2wVKAd8qo7OGa2mhNxRuABnEaqmi6VP88edyk+g0h715f FXZzOGYYrrf94LciR9+Yg== X-UI-Out-Filterresults: notjunk:1;V03:K0:mXfG5wuJODI=:wkGZL2IQ9qzRdbX/UsDark LvpFor7cvZ+qrP/ylX6XDcA0AvIibUJe5Bo4oKj4DjKjCA17Cr9xBEWsoUlkAPCVm4dx6zeKm 3+X7iY7ivzWoEIpw5hR+/2euO2qJFjjFu4ATfj23bZ6gEGbdqDmfOrfP3d8j5f5cz+n4+RstR LyOUHoGCCYMOSu2EFuOdVv/W4Pf5vcgJshahl6hCBkspHleAw64ZU31W62PsLuKi9YbqydzM4 AOkFUItY3LQI7u4VKdp3J5haZUQS4Mz+IpazNmM+YLfk6VyZlD4U7nE3+xN2YXeL63IRYteh4 Ci5JeonB1DVIgukleYUvNiDHoU7MZivYXkyzk/XCqcJ/6p8r8C185kvTQCpvxG3mbuDhpRPhA eogX1vTKZrGVGoqYtPQDw461uNg7/aj7T11rNBTSweLCEbIoVDBCA+72ly8Rs7cq6AJJttUTn 1C5xQJSiYYC71XOFVD1EZGRpOwbWkhAZX18EJNCufAbMZqZ8poxp/BgEz4dMEldx6JGWC2EEX jtfP5f5UIhdvvGSA9UBynzilwlAcD4LJGgdhcXGc0cyEPm5oOqaKAy+Slbhjmwo28yZ8XRvJM SwHkw2rNHDgLmUxdqOOGwRlhmX4wo3jyxXF5z2QVZX2ox9IWJV4v2bQcJXGzMcb1axYWfQ9l7 ON7yyHv8RtbomiaNmrNY7XK4joxjtytZ/yHuMYtuxwLxv+UVVyyH9fOllFfXNes2C33+25nQS FnrITOHhfWi9do5Y59YqrY701asWJCjXcyl8RWIkYS9I4OqK0HvLOuWnKSKpsbrmUbCO3BPJM 2ZMwP/jqfGsSVQWdNZP8EjvGjjGFc9QkR4y7OKbTsoBuqmoiuXERifq3ZH+iGsVDlRrVe0pJZ 5ZRvcQRswXbzpL1BfWlKhQTJyEeOf0IB8+VZBWLERm54l1x5l8ziGTAx+SsugKuRu+2UGUX2s N4AIu7QgdMis4dNb7jJTRb8iahda8QqgQqlYtyeeXHSGsopARps2W1Z1AJur1ExUTdJkWxMP0 PcpwHE9FN6ZBt1TSKkEQ8zDrX0if17IhgjvhS3dSrbKX03eVUORK8t7qsiYFAseMzcABxyLjh OEsT5I0InvtMx0QGAmfmAmvE4w+WNdbGfoAO0KQO5K7F5gwKAQ37OvMGIYeO+8i8Pdcv1DhmL bvk7RIV51geC/FCi0d4PduOn5BJTmAk/BlGNMAnQfC9TshNzZwhh0crviZgnTK1NIkcQE8hVq 1kjjQYnMeWcdmG7V4 X-Spam-Status: No, score=-3.9 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, FREEMAIL_FROM, KAM_SHORT, RCVD_IN_DNSWL_LOW, RCVD_IN_MSPIKE_H2, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=unavailable autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: overseers@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Overseers mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 06 May 2020 14:55:49 -0000 Hi, >> https://gcc.gnu.org/pipermail/gcc/2020-February/232205.html >> Looking around, the last two months of gcc now have very small >> numbers, but e.g. on gcc-patches the mails have very high numbers like >> 545238.html. Can pipermail provide stable URLs at all? We really >> need those, we reference those in commit messages, other mails, bugzill= a >> etc. > > Argh, that is a problem, sorry. We get mailman to regenerate web > archives for example in the case of spam that has gone through. Our > recipe has been to delete the spam from the apropriate .mbox, but this > does renumber things. > > The big vs. little numbers are probably an accidental function of > whether the email .mbox files were processed chronologically or not. > I'll tweak the mrefresh script to make sure it's chronological; that > should avoid gross jumps like that. I believe gcc-patches just wasn't > regenerated for spam removal whereas others have. There should not be > gross jumps in the future, except we'll have to regenerate everything > one more time. :-( > > Small jumps though --- darn, we'd have to do something else with spam > in the mbox, maybe replace it somehow in situ with something else. Or > catch it so quickly that subsequent URLs aren't archived anywhere > important. > > It would be good to have another way of making permanent URLs for > individual messages in mailing list archives. may I also chime in with a related (to some extent), even though a separat= e issue? It seems URL rewriting rules designed to replace old-style https://gcc.gnu.org/ml//current URLs pointing to monthly digests to current ones https://gcc.gnu.org/pipermail///date.html#end broke with onset of May. I mean, if I type https://gcc.gnu.org/ml/gcc/current I still get https://gcc.gnu.org/pipermail/gcc/2020-April/date.html#end (note 2020-April) instead.