From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from esa2.mentor.iphmx.com (esa2.mentor.iphmx.com [68.232.141.98]) by sourceware.org (Postfix) with ESMTPS id 888E63858420; Fri, 17 Sep 2021 16:38:08 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 888E63858420 IronPort-SDR: 2kWn+XgSQyOyVUdeVwahHCsoPgQpshnNhe3D0C0jh0abIg/cL2Wz8BOOOXMc7hqLQANnhbTus1 rS+LAUy8UdjosAiuT2SqVhvDdcqmMWn0oENnzkkze+M3VZEpteKh/mVljNjro7mf8c1B9+h+mc 0wuyqdznhFmLOgT/ijmDcCJBte/31ErTpeuhMUMIEpotuWoiY93NJWges/WNiJFqUCiyN3e2ic lKV389bC5cr6KYP3aGSsh9YfTUnwuzMBCIvmUzSYZhw4DnkVToRS14j/V5ha10B+7H6OpB9IHy vqRUJCtyya3RZAr2AoHHMtmI X-IronPort-AV: E=Sophos;i="5.85,301,1624348800"; d="scan'208";a="65993065" Received: from orw-gwy-01-in.mentorg.com ([192.94.38.165]) by esa2.mentor.iphmx.com with ESMTP; 17 Sep 2021 08:38:07 -0800 IronPort-SDR: JXqlcvOfOBG5Hq/M1RTNU8WYmIDXlRu+JBPZJD9nKRliKUkj6+5uaIRh4zKj6M3w7jCN3uz/2A KFcBQMFisPylIOH5fZFnrII2uSBcw8/ziVoBUgKp7EmLqyp5UcKXf7HHI57Wu8Ys1CoJe7Q2sT 1ac1jL8kJ2xdvLbKy75canWT7gj0aRN7J1Vt0Y2uANDNuhOxgpjvnSNfxs0Fb6qDivJzQJ0n5a 5co1VS48iPcX1T1BMT8bBcSZv0GhRNur6eITN2M369lDA3/DcTLx0DdWj7r8E+MgR4HiY+WebV yMA= Date: Fri, 17 Sep 2021 16:38:02 +0000 From: Joseph Myers X-X-Sender: jsm28@digraph.polyomino.org.uk To: , Subject: The email-to-bugzilla bug-number-matching regular expression Message-ID: User-Agent: Alpine 2.22 (DEB 394 2020-01-19) MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" X-Originating-IP: [137.202.0.90] X-ClientProxiedBy: svr-ies-mbx-01.mgc.mentorg.com (139.181.222.1) To svr-ies-mbx-01.mgc.mentorg.com (139.181.222.1) X-Spam-Status: No, score=-3118.4 required=5.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS, KAM_DMARC_STATUS, SPF_HELO_PASS, SPF_PASS, TXREP autolearn=no autolearn_force=no version=3.4.4 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org X-BeenThere: overseers@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Overseers mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 17 Sep 2021 16:38:14 -0000 Since Apr 2020, /sourceware/infra/bin/email-to-bugzilla has matched bug numbers with: while ($log_txt =~ m/\s(?:bug|PR|BZ)\s+\#?\s*(?:[a-z0-9+-]+\/)?(?:\/)?(\d+)(.*)$/si) { That is, whitespace is required before "bug", "PR" or "BZ". That's fine for GCC commit conventions (bug numbers mentioned in a ChangeLog entry in the commit message, "PR c/123456" after a TAB). It doesn't work very well for glibc commit messages (no ChangeLog entries used, bug numbers often mentioned only in the form "(bug 12345)" or "[BZ #12345]" in the commit summary line. Could we use \b instead of \s at the start of the regular expression, so we still avoid matching "Apr 1", but do match "(bug" or "[BZ"? -- Joseph S. Myers joseph@codesourcery.com