From: "pgodbole at nvidia dot com"
To: gcc-bugs@gcc.gnu.org
Subject: [Bug tree-optimization/109154] New: [13 regression] aarch64 -mcpu=neoverse-v1 microbude performance regression
Date: Thu, 16 Mar 2023 11:57:47 +0000

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109154

            Bug ID: 109154
           Summary: [13 regression] aarch64 -mcpu=neoverse-v1 microbude
                    performance regression
           Product: gcc
           Version: 13.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: tree-optimization
          Assignee: unassigned at gcc dot gnu.org
          Reporter: pgodbole at nvidia dot com
                CC: ramana at gcc dot gnu.org
  Target Milestone: ---

Created
attachment 54681
  --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=54681&action=edit
Reduced microbude test case

We're observing a significant performance drop (~30%) in an application when
comparing gcc trunk against gcc 12, observed with -mcpu=neoverse-v1 on an
aarch64 Neoverse-V1. With OMP_NUM_THREADS=1 we see a regression of nearly 60%
between gcc 12 and gcc 13. The attached test case is reduced from the test
shared at https://github.com/UoB-HPC/microBUDE and has been made more suitable
for a gcc bug report.

$ install-gcc-12/bin/g++ --version
g++ (GCC) 12.2.1 20221222 [master r13-4850-g74544bdadc4]
Copyright (C) 2022 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

$ install-gcc-trunk/bin/g++ --version
g++ (GCC) 13.0.1 20230315 (experimental)
Copyright (C) 2023 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

Command lines used:

$ install-gcc-12/bin/g++ -std=c++17 -Wall -Wno-sign-compare -Wno-unused-variable -Ofast -mcpu=neoverse-v1 -fopenmp -g3 reduced_microbude.cpp -o microbude-12-neoverse-v1
$ ./microbude-12-neoverse-v1

$ install-gcc-trunk/bin/g++ -std=c++17 -Wall -Wno-sign-compare -Wno-unused-variable -Ofast -mcpu=neoverse-v1 -fopenmp -g3 reduced_microbude.cpp -o microbude-trunk-neoverse-v1

Bisecting suggests that commit
https://gcc.gnu.org/git/gitweb.cgi?p=gcc.git;h=4fbe3e6a is a possible
candidate. Thank you, Tom Lin, for the help with the bisection.

Thanks