public inbox for gcc-patches@gcc.gnu.org
* [r14-6559 Regression] FAIL: gcc.dg/guality/pr58791-4.c   -Os -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i == 486 on Linux/x86_64
@ 2023-12-15 17:36 haochen.jiang
From: haochen.jiang @ 2023-12-15 17:36 UTC
  To: dizhao, gcc-regression, gcc-patches, haochen.jiang

On Linux/x86_64,

8afdbcdd7abe1e3c7a81e07f34c256e7f2dbc652 is the first bad commit
commit 8afdbcdd7abe1e3c7a81e07f34c256e7f2dbc652
Author: Di Zhao <dizhao@os.amperecomputing.com>
Date:   Fri Dec 15 03:22:32 2023 +0800

    Consider fully pipelined FMA in get_reassociation_width

caused

FAIL: gcc.dg/pr110279-2.c scan-tree-dump-not reassoc2 "was chosen for reassociation"
FAIL: gcc.dg/pr110279-2.c scan-tree-dump-times optimized "\\.FMA " 3

with GCC configured with

../../gcc/configure --prefix=/export/users/haochenj/src/gcc-bisect/master/master/r14-6559/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl --enable-libmpx x86_64-linux --disable-bootstrap

To reproduce:

$ cd {build_dir}/gcc && make check RUNTESTFLAGS="dg.exp=gcc.dg/pr110279-2.c --target_board='unix{-m32}'"
$ cd {build_dir}/gcc && make check RUNTESTFLAGS="dg.exp=gcc.dg/pr110279-2.c --target_board='unix{-m64}'"

(Please do not reply to this email. For questions about this report, contact me at haochen dot jiang at intel.com.)
(If you run into cascadelake-related problems, disabling AVX512F on the command line might help.)
(However, please make sure that there are no potential problems with AVX512.)


* Re: [r14-6559 Regression] FAIL: gcc.dg/guality/pr58791-4.c -Os -DPREVENT_OPTIMIZATION line pr58791-4.c:32 i == 486 on Linux/x86_64
  2023-12-15  1:23 haochen.jiang
@ 2023-12-15  8:06 ` Richard Biener
From: Richard Biener @ 2023-12-15  8:06 UTC
  To: haochen.jiang; +Cc: dizhao, gcc-regression, gcc-patches, haochen.jiang

On Fri, Dec 15, 2023 at 2:25 AM haochen.jiang
<haochenj@ecsmtp.sh.intel.com> wrote:
>
> On Linux/x86_64,
>
> 8afdbcdd7abe1e3c7a81e07f34c256e7f2dbc652 is the first bad commit
> commit 8afdbcdd7abe1e3c7a81e07f34c256e7f2dbc652
> Author: Di Zhao <dizhao@os.amperecomputing.com>
> Date:   Fri Dec 15 03:22:32 2023 +0800
>
>     Consider fully pipelined FMA in get_reassociation_width
>
> caused
>
> FAIL: gcc.dg/guality/pr58791-4.c   -O2  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i2 == 487
> FAIL: gcc.dg/guality/pr58791-4.c   -O2  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i == 486
> FAIL: gcc.dg/guality/pr58791-4.c   -O2 -flto -fno-use-linker-plugin -flto-partition=none  -DPREVENT_OPTIMIZATION line pr58791-4.c:32 i2 == 487
> FAIL: gcc.dg/guality/pr58791-4.c   -O2 -flto -fno-use-linker-plugin -flto-partition=none  -DPREVENT_OPTIMIZATION line pr58791-4.c:32 i == 486
> FAIL: gcc.dg/guality/pr58791-4.c   -O2 -flto -fuse-linker-plugin -fno-fat-lto-objects  -DPREVENT_OPTIMIZATION line pr58791-4.c:32 i2 == 487
> FAIL: gcc.dg/guality/pr58791-4.c   -O2 -flto -fuse-linker-plugin -fno-fat-lto-objects  -DPREVENT_OPTIMIZATION line pr58791-4.c:32 i == 486
> FAIL: gcc.dg/guality/pr58791-4.c   -O3 -g  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i2 == 487
> FAIL: gcc.dg/guality/pr58791-4.c   -O3 -g  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i == 486
> FAIL: gcc.dg/guality/pr58791-4.c   -Os  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i2 == 487
> FAIL: gcc.dg/guality/pr58791-4.c   -Os  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i == 486
>
> with GCC configured with
>
> ../../gcc/configure --prefix=/export/users/haochenj/src/gcc-bisect/master/master/r14-6559/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl --enable-libmpx x86_64-linux --disable-bootstrap
>
> To reproduce:
>
> $ cd {build_dir}/gcc && make check RUNTESTFLAGS="guality.exp=gcc.dg/guality/pr58791-4.c --target_board='unix{-m64\ -march=cascadelake}'"

There's an extra intermediate stmt inserted (for much later use, but
reassoc inserts close to defs) that is then also used for FMA forming.
This disturbs things in some way:

  g_5 = (double) f_4;
  # DEBUG g => g_5
  # DEBUG BEGIN_STMT
  h_7 = (double) b_6(D);
  # DEBUG h => h_7
  # DEBUG BEGIN_STMT
  _39 = h_7 * 3.25e+0;
  # DEBUG D#5 => g_5 * h_7
  # DEBUG i => D#5
  # DEBUG BEGIN_STMT
  # DEBUG i2 => D#5 + 1.0e+0
  # DEBUG BEGIN_STMT
  # DEBUG D#8 => g_5 * _39
  _3 = .FMA (g_5, _39, h_7);

g_5 is dead after the FMA.  Interestingly, removing the

  asm volatile (NOP : : : "memory");
  asm volatile (NOP : : : "memory");

lines fixes the regression because then we can TER the FMA, keeping
g_5 live for longer.

Richard.
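
A minimal C sketch of the two shapes discussed above may help when reading
the dump. It is a hypothetical reduction, not the source of pr58791-4.c:
the function names, the temporary `t`, and the inputs g = 243, h = 2
(picked only so that i comes out as 486, the value the failing check
expects) are all assumptions. The first function keeps the product g * h
that the debug bind for i refers to; the second mirrors the dump, where
h * 3.25 is formed first and a single multiply-add consumes g.

```c
/* Hypothetical reduction; names mirror the GIMPLE dump, not the
   actual test source.  */

/* Shape the debug binds describe: i = g * h exists as a value, so
   both g and h are needed where the debugger inspects i and i2.  */
double source_form (double g, double h)
{
  double i = g * h;      /* what "# DEBUG i => D#5" refers to */
  double j = i * 3.25;
  return h + j;
}

/* Shape after reassociation: the intermediate h * 3.25 is formed
   first, then one multiply-add uses g for the last time.  */
double reassociated_form (double g, double h)
{
  double t = h * 3.25;   /* plays the role of _39 */
  return g * t + h;      /* plays the role of .FMA (g_5, _39, h_7) */
}
```

With the assumed inputs both functions return 1581.5, so the arithmetic
result is unchanged. What differs is that the second form never
materializes g * h, so the debugger can only show i and i2 if g is still
available at the breakpoint, which is the liveness point made above.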


* [r14-6559 Regression] FAIL: gcc.dg/guality/pr58791-4.c   -Os -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i == 486 on Linux/x86_64
@ 2023-12-15  1:23 haochen.jiang
  2023-12-15  8:06 ` Richard Biener
From: haochen.jiang @ 2023-12-15  1:23 UTC
  To: dizhao, gcc-regression, gcc-patches, haochen.jiang

On Linux/x86_64,

8afdbcdd7abe1e3c7a81e07f34c256e7f2dbc652 is the first bad commit
commit 8afdbcdd7abe1e3c7a81e07f34c256e7f2dbc652
Author: Di Zhao <dizhao@os.amperecomputing.com>
Date:   Fri Dec 15 03:22:32 2023 +0800

    Consider fully pipelined FMA in get_reassociation_width

caused

FAIL: gcc.dg/guality/pr58791-4.c   -O2  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i2 == 487
FAIL: gcc.dg/guality/pr58791-4.c   -O2  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i == 486
FAIL: gcc.dg/guality/pr58791-4.c   -O2 -flto -fno-use-linker-plugin -flto-partition=none  -DPREVENT_OPTIMIZATION line pr58791-4.c:32 i2 == 487
FAIL: gcc.dg/guality/pr58791-4.c   -O2 -flto -fno-use-linker-plugin -flto-partition=none  -DPREVENT_OPTIMIZATION line pr58791-4.c:32 i == 486
FAIL: gcc.dg/guality/pr58791-4.c   -O2 -flto -fuse-linker-plugin -fno-fat-lto-objects  -DPREVENT_OPTIMIZATION line pr58791-4.c:32 i2 == 487
FAIL: gcc.dg/guality/pr58791-4.c   -O2 -flto -fuse-linker-plugin -fno-fat-lto-objects  -DPREVENT_OPTIMIZATION line pr58791-4.c:32 i == 486
FAIL: gcc.dg/guality/pr58791-4.c   -O3 -g  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i2 == 487
FAIL: gcc.dg/guality/pr58791-4.c   -O3 -g  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i == 486
FAIL: gcc.dg/guality/pr58791-4.c   -Os  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i2 == 487
FAIL: gcc.dg/guality/pr58791-4.c   -Os  -DPREVENT_OPTIMIZATION  line pr58791-4.c:32 i == 486

with GCC configured with

../../gcc/configure --prefix=/export/users/haochenj/src/gcc-bisect/master/master/r14-6559/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl --enable-libmpx x86_64-linux --disable-bootstrap

To reproduce:

$ cd {build_dir}/gcc && make check RUNTESTFLAGS="guality.exp=gcc.dg/guality/pr58791-4.c --target_board='unix{-m64\ -march=cascadelake}'"

(Please do not reply to this email. For questions about this report, contact me at haochen dot jiang at intel.com.)
(If you run into cascadelake-related problems, disabling AVX512F on the command line might help.)
(However, please make sure that there are no potential problems with AVX512.)

