public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
@ 2023-11-15 14:56 fkastl at suse dot cz
  2023-11-16  7:26 ` [Bug target/112547] " rguenth at gcc dot gnu.org
                   ` (8 more replies)
  0 siblings, 9 replies; 10+ messages in thread
From: fkastl at suse dot cz @ 2023-11-15 14:56 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

            Bug ID: 112547
           Summary: 9% exec time regression of 462.libquantum SPEC on AMD
                    zen4 CPU
           Product: gcc
           Version: 14.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: target
          Assignee: unassigned at gcc dot gnu.org
          Reporter: fkastl at suse dot cz
  Target Milestone: ---
              Host: x86_64-linux
            Target: x86_64-linux

As seen here

https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=956.210.0

between commits

g:53010f6ff6dfbf7b (2023-11-05 02:42)

and

g:1a55724f7870719d (2023-11-06 02:27)

there is about 9% slowdown of execution time of the 2006SPEC 462.libquantum.

The test is run with -O2 and lto on an amd zen4 machine.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
  2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
@ 2023-11-16  7:26 ` rguenth at gcc dot gnu.org
  2023-11-16  7:34 ` haochen.jiang at intel dot com
                   ` (7 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: rguenth at gcc dot gnu.org @ 2023-11-16  7:26 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|                            |missed-optimization,
                   |                            |needs-bisection
                 CC|                            |haochen.jiang at intel dot com

--- Comment #1 from Richard Biener <rguenth at gcc dot gnu.org> ---
The x86 changes from Haochen are the only relevant changes in that area I think
(but it's odd they affect -O2 generic tune).

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
  2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
  2023-11-16  7:26 ` [Bug target/112547] " rguenth at gcc dot gnu.org
@ 2023-11-16  7:34 ` haochen.jiang at intel dot com
  2023-11-16  7:36 ` haochen.jiang at intel dot com
                   ` (6 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: haochen.jiang at intel dot com @ 2023-11-16  7:34 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

--- Comment #2 from Haochen Jiang <haochen.jiang at intel dot com> ---
It is weird since I did not touch the tune.

Need a bisect to check that but I do not have a zen4 machine.

Could you try with this commit g:459866eaeec151e72aecd670695f014f4ec48588 to
see if the regression still occurs?

If that still occurs, a guess might be zmm vectorization is not enabled
corrected under some scenario.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
  2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
  2023-11-16  7:26 ` [Bug target/112547] " rguenth at gcc dot gnu.org
  2023-11-16  7:34 ` haochen.jiang at intel dot com
@ 2023-11-16  7:36 ` haochen.jiang at intel dot com
  2023-11-16  7:48 ` haochen.jiang at intel dot com
                   ` (5 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: haochen.jiang at intel dot com @ 2023-11-16  7:36 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

--- Comment #3 from Haochen Jiang <haochen.jiang at intel dot com> ---
(In reply to Haochen Jiang from comment #2)
> It is weird since I did not touch the tune.
> 
> Need a bisect to check that but I do not have a zen4 machine.
> 
> Could you try with this commit g:459866eaeec151e72aecd670695f014f4ec48588 to
> see if the regression still occurs?
> 
> If that still occurs, a guess might be zmm vectorization is not enabled
> corrected under some scenario.

Sorry, should be if that regression disappeared, not occured.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
  2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
                   ` (2 preceding siblings ...)
  2023-11-16  7:36 ` haochen.jiang at intel dot com
@ 2023-11-16  7:48 ` haochen.jiang at intel dot com
  2023-11-16  9:33 ` fkastl at suse dot cz
                   ` (4 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: haochen.jiang at intel dot com @ 2023-11-16  7:48 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

--- Comment #4 from Haochen Jiang <haochen.jiang at intel dot com> ---
I checked the znver3 plot on the site, it seems that no regression occurs.

Since znver4 enabled AVX512, that is the reason why I guessed previously.

Could you also provide the option you ran with? I could not find where it hides
in the site. Thx!

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
  2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
                   ` (3 preceding siblings ...)
  2023-11-16  7:48 ` haochen.jiang at intel dot com
@ 2023-11-16  9:33 ` fkastl at suse dot cz
  2023-11-17  1:31 ` liuhongt at gcc dot gnu.org
                   ` (3 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: fkastl at suse dot cz @ 2023-11-16  9:33 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

--- Comment #5 from Filip Kastl <fkastl at suse dot cz> ---
Compile options are:
-O2 -g -flto=128

Configure options of GCC are:
--enable-languages=c,c++,fortran,rust,m2 --disable-bootstrap
--disable-libsanitizer --disable-multilib --enable-checking=release

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
  2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
                   ` (4 preceding siblings ...)
  2023-11-16  9:33 ` fkastl at suse dot cz
@ 2023-11-17  1:31 ` liuhongt at gcc dot gnu.org
  2023-11-17  7:17 ` haochen.jiang at intel dot com
                   ` (2 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: liuhongt at gcc dot gnu.org @ 2023-11-17  1:31 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

liuhongt at gcc dot gnu.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |liuhongt at gcc dot gnu.org

--- Comment #6 from liuhongt at gcc dot gnu.org ---
(In reply to Filip Kastl from comment #5)
> Compile options are:
> -O2 -g -flto=128
> 
> Configure options of GCC are:
> --enable-languages=c,c++,fortran,rust,m2 --disable-bootstrap
> --disable-libsanitizer --disable-multilib --enable-checking=release

hmm, so there's no avx512 in the cmdline options, haochen's commit shouldn't
have any impact in theory(the binary should be exact the same).
haochen, could you double check that?

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
  2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
                   ` (5 preceding siblings ...)
  2023-11-17  1:31 ` liuhongt at gcc dot gnu.org
@ 2023-11-17  7:17 ` haochen.jiang at intel dot com
  2023-11-18 11:39 ` fkastl at suse dot cz
  2023-11-23  9:41 ` fkastl at suse dot cz
  8 siblings, 0 replies; 10+ messages in thread
From: haochen.jiang at intel dot com @ 2023-11-17  7:17 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

--- Comment #7 from Haochen Jiang <haochen.jiang at intel dot com> ---
I have got a same binary w/ and w/o my commit with the options if nothing went
wrong.

Seems we need more investigation.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
  2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
                   ` (6 preceding siblings ...)
  2023-11-17  7:17 ` haochen.jiang at intel dot com
@ 2023-11-18 11:39 ` fkastl at suse dot cz
  2023-11-23  9:41 ` fkastl at suse dot cz
  8 siblings, 0 replies; 10+ messages in thread
From: fkastl at suse dot cz @ 2023-11-18 11:39 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

--- Comment #8 from Filip Kastl <fkastl at suse dot cz> ---
I've just ran the test on another zen4 machine. Between the originally
mentioned commits g:53010f6ff6dfbf7b and g:1a55724f7870719d there was only 1%
slowdown on this other machine. I guess this means that the 9% slowdown is
specific to the machine where we measure the data I sent.

Since there seems to be no reason why there should be a general zen4 slowdown
between the two commits and if there are no objections, I'll mark this bug as
RESOLVED WORKSFORME.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
  2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
                   ` (7 preceding siblings ...)
  2023-11-18 11:39 ` fkastl at suse dot cz
@ 2023-11-23  9:41 ` fkastl at suse dot cz
  8 siblings, 0 replies; 10+ messages in thread
From: fkastl at suse dot cz @ 2023-11-23  9:41 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547

Filip Kastl <fkastl at suse dot cz> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |RESOLVED
         Resolution|---                         |WORKSFORME

--- Comment #9 from Filip Kastl <fkastl at suse dot cz> ---
Setting status to RESOLVED WORKSFORME

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2023-11-23  9:41 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-11-15 14:56 [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU fkastl at suse dot cz
2023-11-16  7:26 ` [Bug target/112547] " rguenth at gcc dot gnu.org
2023-11-16  7:34 ` haochen.jiang at intel dot com
2023-11-16  7:36 ` haochen.jiang at intel dot com
2023-11-16  7:48 ` haochen.jiang at intel dot com
2023-11-16  9:33 ` fkastl at suse dot cz
2023-11-17  1:31 ` liuhongt at gcc dot gnu.org
2023-11-17  7:17 ` haochen.jiang at intel dot com
2023-11-18 11:39 ` fkastl at suse dot cz
2023-11-23  9:41 ` fkastl at suse dot cz

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).