public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug target/115029] New: FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform
@ 2024-05-10 13:30 colin.king at intel dot com
2024-05-10 13:35 ` [Bug target/115029] " colin.king at intel dot com
` (3 more replies)
0 siblings, 4 replies; 5+ messages in thread
From: colin.king at intel dot com @ 2024-05-10 13:30 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115029
Bug ID: 115029
Summary: FFT computation performance regression, x86, between
gcc-14 and gcc-13 on skylake platform
Product: gcc
Version: 14.0
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: target
Assignee: unassigned at gcc dot gnu.org
Reporter: colin.king at intel dot com
Target Milestone: ---
Created attachment 58172
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=58172&action=edit
reproducer source code
I'm seeing a ~0.8-1.4% performance regression in gcc-14 compared to gcc-13,
using gcc on Ubuntu 24.04 computing Fast Fourier Transforms on 4096 values.
Versions:
gcc version 13.2.0 (Ubuntu 13.2.0-23ubuntu4)
gcc version 14.0.1 20240412 (experimental) [master r14-9935-g67e1433a94f]
(Ubuntu 14-20240412-0ubuntu1)
cking@skylake:~$ CFLAGS="" gcc-13 reproducer-fft.c -lm -O2
cking@skylake:~$ ./a.out
1927.23 fft ops per sec
CFLAGS="" gcc-14 reproducer-fft.c -lm -O2
cking@skylake:~$ ./a.out
1906.73 fft ops per sec
I did some analysis on 20 runs of each gcc-13 and gcc-14 runs I noted a ~0.44
percentage std.deviation jitter in my results, but it's clear that there gcc-14
build is always 0.8%-1.4% slower on my i7-6700 test machine, so I think this is
a significant regression in performance to be reported.
Attached is the reproducer.
The original issue appeared when regression testing stress-ng cpu fft number
stressor [1]. I've managed to extract the attached reproducer from the original
code (see attached).
Attached are the reproducer C source and disassembled object code.
References: [1]
https://github.com/ColinIanKing/stress-ng/blob/master/stress-cpu.c
^ permalink raw reply [flat|nested] 5+ messages in thread
* [Bug target/115029] FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform
2024-05-10 13:30 [Bug target/115029] New: FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform colin.king at intel dot com
@ 2024-05-10 13:35 ` colin.king at intel dot com
2024-05-10 13:35 ` colin.king at intel dot com
` (2 subsequent siblings)
3 siblings, 0 replies; 5+ messages in thread
From: colin.king at intel dot com @ 2024-05-10 13:35 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115029
--- Comment #1 from Colin Ian King <colin.king at intel dot com> ---
Created attachment 58174
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=58174&action=edit
gcc-13 disassembly
^ permalink raw reply [flat|nested] 5+ messages in thread
* [Bug target/115029] FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform
2024-05-10 13:30 [Bug target/115029] New: FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform colin.king at intel dot com
2024-05-10 13:35 ` [Bug target/115029] " colin.king at intel dot com
@ 2024-05-10 13:35 ` colin.king at intel dot com
2024-05-16 1:49 ` [Bug target/115029] [14/15 regression] " sjames at gcc dot gnu.org
2024-05-23 6:04 ` lin1.hu at intel dot com
3 siblings, 0 replies; 5+ messages in thread
From: colin.king at intel dot com @ 2024-05-10 13:35 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115029
--- Comment #2 from Colin Ian King <colin.king at intel dot com> ---
Created attachment 58175
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=58175&action=edit
gcc-14 disassembly
^ permalink raw reply [flat|nested] 5+ messages in thread
* [Bug target/115029] [14/15 regression] FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform
2024-05-10 13:30 [Bug target/115029] New: FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform colin.king at intel dot com
2024-05-10 13:35 ` [Bug target/115029] " colin.king at intel dot com
2024-05-10 13:35 ` colin.king at intel dot com
@ 2024-05-16 1:49 ` sjames at gcc dot gnu.org
2024-05-23 6:04 ` lin1.hu at intel dot com
3 siblings, 0 replies; 5+ messages in thread
From: sjames at gcc dot gnu.org @ 2024-05-16 1:49 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115029
Sam James <sjames at gcc dot gnu.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Target Milestone|--- |14.2
^ permalink raw reply [flat|nested] 5+ messages in thread
* [Bug target/115029] [14/15 regression] FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform
2024-05-10 13:30 [Bug target/115029] New: FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform colin.king at intel dot com
` (2 preceding siblings ...)
2024-05-16 1:49 ` [Bug target/115029] [14/15 regression] " sjames at gcc dot gnu.org
@ 2024-05-23 6:04 ` lin1.hu at intel dot com
3 siblings, 0 replies; 5+ messages in thread
From: lin1.hu at intel dot com @ 2024-05-23 6:04 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115029
Hu Lin <lin1.hu at intel dot com> changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |lin1.hu at intel dot com
--- Comment #3 from Hu Lin <lin1.hu at intel dot com> ---
According to my investigation, the regression is about 0.9% regression on
cascadelake. And for Sapphire rapids, gcc14 has about a 4% improvement.
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2024-05-23 6:04 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2024-05-10 13:30 [Bug target/115029] New: FFT computation performance regression, x86, between gcc-14 and gcc-13 on skylake platform colin.king at intel dot com
2024-05-10 13:35 ` [Bug target/115029] " colin.king at intel dot com
2024-05-10 13:35 ` colin.king at intel dot com
2024-05-16 1:49 ` [Bug target/115029] [14/15 regression] " sjames at gcc dot gnu.org
2024-05-23 6:04 ` lin1.hu at intel dot com
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).