From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mout.gmx.net (mout.gmx.net [212.227.17.20]) by sourceware.org (Postfix) with ESMTPS id 19B233951C87 for ; Wed, 6 May 2020 14:55:47 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org 19B233951C87 X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c Received: from [10.2.0.112] ([62.213.40.60]) by mail.gmx.com (mrgmx104 [212.227.17.174]) with ESMTPSA (Nemesis) id 1MTiTt-1jifdV1Y7x-00TyMg; Wed, 06 May 2020 16:55:36 +0200 Subject: Re: Stability of pipermail ml archive URLs To: "Frank Ch. Eigler" , Jakub Jelinek , Overseers mailing list Cc: GCC Development References: <20200506141139.GJ2375@tucnak> <20200506144446.GB2466959@elastic.org> From: Arseny Solokha Message-ID: Date: Wed, 6 May 2020 21:54:06 +0700 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.8.0 MIME-Version: 1.0 In-Reply-To: <20200506144446.GB2466959@elastic.org> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: quoted-printable X-Provags-ID: V03:K1:gQXgJbDUVPfwLIfP5yvfEJ5o96VIcfyKycp4mMlreLeCJfWmbHc wzeCK/uA/SjNDe4kGLqzjhMv3BkwAbCLzsNVj1tn4qAGnvpec8S6o0jtAUIpdquFK3/eyIh 5yLXlCI0aFN5+AED73DTn5OqVKd73EHUYzoLjvOkBNoQ7MF2Xf1oe7izNCLsZLkb5nIIQAk XiC8nL5+z3qa96vnjc65A== X-UI-Out-Filterresults: notjunk:1;V03:K0:nZidkdxRNu8=:ky8z94aucEapZ8Gu1voPPA YEErAM6NSk2sk0etvNckomWlvXOf1S22oAyMKYfh9BvrGhcrpJsBO6hi61i0M4VMVBtdUm/qv XLRlsS20KGcNoKVGdfQyFuKVq7o+bXOvRYc0ovkCYvFR/EzlZfu5BB2TLLFkHd76A89Cs/cvf 5FdMFZl7H96ChC7rl9RKfUlVVKy9737S2FU2iXlAOSv5MQRXmkw1vgqqhMtlEn0L/nfomSl9Z mAntRPRRYxhgpnTZ/l4XvxrfhJlv6hJgA0KYmei1UY/54drnJvER3CZ3Lmxe3R8qaXNk6z19N Au1btH6xh5VRAcHfZksDbBCxAjUWet8uG/oyvY4Ulr+cUMp5C2jO15yc3KfeCtQy4LPOOktRQ /mR88I/19ck62Xxa8ihoiHwstHvep53xQdWdnWRQVcMAgU2+rl3AGkgR5CYTn4Qwwi50ZGtn0 A3yQNCLyMcAtcK6wpZVlsQdsMchHw7rqEF9X+vQq12hXnOq9wEdshXyMD6KW/Oob95ZM434gj t/OS3HrUCEmA3B7yBbNwiSAO7LB3Y0iszm/+TguiULuwQA9TsfA1Ws8tyqWb2bv1ePB2A5CHg DOGY2HRA12NQm2RQI+QJBNbJgv6ITOf5e6wJf7Nh3+WhFvW+RIm+LyIT64NDCjVxuZNk/ORGm 9ffljOJ79xRDC49TNi3Dua1X1hYawkP3DgkfgJRzswW6Xnr88qkc5Rrmtmzw3qCmChzhXdhOp xFsyIhW2WJlXIx28/RAyhJ4iVRvXa+UEKmzyZWo9atcm7EM3O8G0eMw1zMcc59+aQmmOOZZ8X CmdP/ngcpnDzEoRuDTDDByuKKWzrCT493nm0wLsQHVFUDQRLC/ndkQAPcRfqV4vLCCkbfSCyC zWsVR+cx0du5CTPyAx0Shduhmtze4wL8q6ZjEcW5LFipQvhwnQAQKM45rUdl9jWtMfunQs278 zPalHgnGR81nnwmyFX4qlJuovH4tduZKKexSPs01sfeKWtdgpe27u1Cnh4rIbfbtXVz0EG5n0 PnNsKW80UCtvV3DzJrYYsbbRATG7UP/yyiguErqu8dkzRhsisCl8o81WeIqmxyPmBKmAm2AeV GwirSTZfUpY0/kN+/d2ro2xMQ0/GUzHWXtylOpjkIIBoB/bdF8Nmblyzfp9veyKmMxAv9pA+S 5wkGgp7BqL5Q6nv2r8SfTpRnc+CUZYur2cvEZ90FFKX/mNLPMeHw3NflwPh9hXIFbdXJG6xhE lF2IkiXqX/njV893N X-Spam-Status: No, score=-3.8 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, FREEMAIL_FROM, KAM_SHORT, RCVD_IN_DNSWL_LOW, RCVD_IN_MSPIKE_H3, RCVD_IN_MSPIKE_WL, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=unavailable autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: overseers@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Overseers mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 06 May 2020 14:55:49 -0000 Hi, >> https://gcc.gnu.org/pipermail/gcc/2020-February/232205.html >> Looking around, the last two months of gcc now have very small >> numbers, but e.g. on gcc-patches the mails have very high numbers like >> 545238.html. Can pipermail provide stable URLs at all? We really >> need those, we reference those in commit messages, other mails, bugzill= a >> etc. > > Argh, that is a problem, sorry. We get mailman to regenerate web > archives for example in the case of spam that has gone through. Our > recipe has been to delete the spam from the apropriate .mbox, but this > does renumber things. > > The big vs. little numbers are probably an accidental function of > whether the email .mbox files were processed chronologically or not. > I'll tweak the mrefresh script to make sure it's chronological; that > should avoid gross jumps like that. I believe gcc-patches just wasn't > regenerated for spam removal whereas others have. There should not be > gross jumps in the future, except we'll have to regenerate everything > one more time. :-( > > Small jumps though --- darn, we'd have to do something else with spam > in the mbox, maybe replace it somehow in situ with something else. Or > catch it so quickly that subsequent URLs aren't archived anywhere > important. > > It would be good to have another way of making permanent URLs for > individual messages in mailing list archives. may I also chime in with a related (to some extent), even though a separat= e issue? It seems URL rewriting rules designed to replace old-style https://gcc.gnu.org/ml//current URLs pointing to monthly digests to current ones https://gcc.gnu.org/pipermail///date.html#end broke with onset of May. I mean, if I type https://gcc.gnu.org/ml/gcc/current I still get https://gcc.gnu.org/pipermail/gcc/2020-April/date.html#end (note 2020-April) instead.