public inbox for gcc-bugs@sourceware.org
* [Bug target/112547] New: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: fkastl at suse dot cz @ 2023-11-15 14:56 UTC
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112547
Bug ID: 112547
Summary: 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
Product: gcc
Version: 14.0
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: target
Assignee: unassigned at gcc dot gnu.org
Reporter: fkastl at suse dot cz
Target Milestone: ---
Host: x86_64-linux
Target: x86_64-linux
As seen here
https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=956.210.0
between commits
g:53010f6ff6dfbf7b (2023-11-05 02:42)
and
g:1a55724f7870719d (2023-11-06 02:27)
there is about a 9% slowdown in the execution time of SPEC CPU2006 462.libquantum.
The test is built with -O2 and LTO and run on an AMD zen4 machine.
* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: rguenth at gcc dot gnu.org @ 2023-11-16 7:26 UTC
To: gcc-bugs
Richard Biener <rguenth at gcc dot gnu.org> changed:
What    |Removed                     |Added
----------------------------------------------------------------------------
Keywords|                            |missed-optimization,
        |                            |needs-bisection
CC      |                            |haochen.jiang at intel dot com
--- Comment #1 from Richard Biener <rguenth at gcc dot gnu.org> ---
I think the x86 changes from Haochen are the only relevant changes in that area
(but it's odd that they would affect -O2 with generic tuning).
* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: haochen.jiang at intel dot com @ 2023-11-16 7:34 UTC
To: gcc-bugs
--- Comment #2 from Haochen Jiang <haochen.jiang at intel dot com> ---
It is weird, since I did not touch the tuning.
This needs a bisect to check, but I do not have a zen4 machine.
Could you try commit g:459866eaeec151e72aecd670695f014f4ec48588 to
see whether the regression still occurs?
If it still occurs, my guess is that zmm vectorization is not enabled
correctly in some scenario.
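For reference, the bisect requested here amounts to a binary search over the commit range. The following is only a minimal sketch: the intermediate commit names (`c1`, `c2`, `c3`) and the `is_bad` predicate are hypothetical placeholders, and in practice `is_bad` would build GCC at that commit, run 462.libquantum, and compare the run time against a threshold. It also assumes the regression, once introduced, stays present in all later commits.

```python
from typing import Callable, Sequence


def first_bad(commits: Sequence[str], is_bad: Callable[[str], bool]) -> str:
    """Return the first commit for which is_bad() is true.

    Assumes commits are ordered oldest to newest, commits[0] is known good,
    and commits[-1] is known bad.
    """
    lo, hi = 0, len(commits) - 1  # lo: known good, hi: known bad
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if is_bad(commits[mid]):
            hi = mid  # regression is at mid or earlier
        else:
            lo = mid  # regression was introduced after mid
    return commits[hi]


# Hypothetical history: only the two endpoints come from the report.
history = ["53010f6ff6dfbf7b", "c1", "c2", "c3", "1a55724f7870719d"]
# Placeholder predicate: pretend the slowdown first appears at "c2".
slow = {"c2", "c3", "1a55724f7870719d"}
culprit = first_bad(history, lambda commit: commit in slow)
print(culprit)
```

With noisy timings the predicate would probably need several runs per commit before deciding, otherwise the search can converge on the wrong commit.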
* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: haochen.jiang at intel dot com @ 2023-11-16 7:36 UTC
To: gcc-bugs
--- Comment #3 from Haochen Jiang <haochen.jiang at intel dot com> ---
(In reply to Haochen Jiang from comment #2)
> It is weird, since I did not touch the tuning.
>
> This needs a bisect to check, but I do not have a zen4 machine.
>
> Could you try commit g:459866eaeec151e72aecd670695f014f4ec48588 to
> see whether the regression still occurs?
>
> If it still occurs, my guess is that zmm vectorization is not enabled
> correctly in some scenario.
Sorry, the last sentence should read "if the regression disappears", not "if it
still occurs".
* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: haochen.jiang at intel dot com @ 2023-11-16 7:48 UTC
To: gcc-bugs
--- Comment #4 from Haochen Jiang <haochen.jiang at intel dot com> ---
I checked the znver3 plot on the site, and no regression seems to occur there.
znver4 enables AVX512, which is why I made my earlier guess.
Could you also provide the options you ran with? I could not find them on the
site. Thanks!
* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: fkastl at suse dot cz @ 2023-11-16 9:33 UTC
To: gcc-bugs
--- Comment #5 from Filip Kastl <fkastl at suse dot cz> ---
Compile options are:
-O2 -g -flto=128
Configure options of GCC are:
--enable-languages=c,c++,fortran,rust,m2 --disable-bootstrap
--disable-libsanitizer --disable-multilib --enable-checking=release
* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: liuhongt at gcc dot gnu.org @ 2023-11-17 1:31 UTC
To: gcc-bugs
liuhongt at gcc dot gnu.org changed:
What    |Removed                     |Added
----------------------------------------------------------------------------
CC      |                            |liuhongt at gcc dot gnu.org
--- Comment #6 from liuhongt at gcc dot gnu.org ---
(In reply to Filip Kastl from comment #5)
> Compile options are:
> -O2 -g -flto=128
>
> Configure options of GCC are:
> --enable-languages=c,c++,fortran,rust,m2 --disable-bootstrap
> --disable-libsanitizer --disable-multilib --enable-checking=release
Hmm, so there's no AVX512 in the command-line options. In theory Haochen's
commit shouldn't have any impact (the binary should be exactly the same).
Haochen, could you double-check that?
* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: haochen.jiang at intel dot com @ 2023-11-17 7:17 UTC
To: gcc-bugs
--- Comment #7 from Haochen Jiang <haochen.jiang at intel dot com> ---
Unless something went wrong, I get the same binary with and without my commit
when building with those options.
It seems we need more investigation.
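A check like this can be scripted by hashing the two binaries. The sketch below is only an illustration, not the procedure actually used in this comment: the helper names are made up, and the demo hashes small placeholder files rather than real libquantum builds.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Hash a file in chunks so large binaries need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_binary(a: Path, b: Path) -> bool:
    """True when both files have byte-identical contents."""
    return sha256_of(a) == sha256_of(b)


# Demo with placeholder files standing in for the two builds.
with tempfile.TemporaryDirectory() as tmp:
    with_commit = Path(tmp) / "libquantum.with"
    without_commit = Path(tmp) / "libquantum.without"
    with_commit.write_bytes(b"placeholder contents")
    without_commit.write_bytes(b"placeholder contents")
    identical = same_binary(with_commit, without_commit)
    print("identical" if identical else "different")
```

Note that a mismatch alone would not be conclusive: with -g, debug information can embed details such as build paths, so two builds may differ byte-for-byte even when the generated code is the same.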
* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: fkastl at suse dot cz @ 2023-11-18 11:39 UTC
To: gcc-bugs
--- Comment #8 from Filip Kastl <fkastl at suse dot cz> ---
I've just run the test on another zen4 machine. Between the originally
mentioned commits g:53010f6ff6dfbf7b and g:1a55724f7870719d there was only a 1%
slowdown on this other machine. I guess this means that the 9% slowdown is
specific to the machine on which the data I sent was measured.
Since there seems to be no reason why there should be a general zen4 slowdown
between the two commits, I'll mark this bug as RESOLVED WORKSFORME if there
are no objections.
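To make the comparison concrete, the relative slowdown per machine can be computed as below. This is a small sketch only: the run times are hypothetical values chosen to reproduce the reported 9% and 1% figures, not the measured data.

```python
def slowdown_percent(before_s: float, after_s: float) -> float:
    """Relative increase in run time, in percent of the earlier run time."""
    return (after_s - before_s) / before_s * 100.0


# Hypothetical run times in seconds (before, after); not the measured data.
runs = {
    "original machine": (200.0, 218.0),
    "second machine": (200.0, 202.0),
}

for machine, (before, after) in runs.items():
    print(f"{machine}: {slowdown_percent(before, after):.1f}% slower")
```

A large gap between machines of the same microarchitecture, as here, points at the measurement setup rather than at the compiler change.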
* [Bug target/112547] 9% exec time regression of 462.libquantum SPEC on AMD zen4 CPU
From: fkastl at suse dot cz @ 2023-11-23 9:41 UTC
To: gcc-bugs
Filip Kastl <fkastl at suse dot cz> changed:
What      |Removed                     |Added
----------------------------------------------------------------------------
Status    |UNCONFIRMED                 |RESOLVED
Resolution|---                         |WORKSFORME
--- Comment #9 from Filip Kastl <fkastl at suse dot cz> ---
Setting status to RESOLVED WORKSFORME