public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug c/102889] New: -funroll-loops generates incorrect codes from inline assembly on aarch64
@ 2021-10-22  8:22 ariel at amazon dot com
  2021-10-22  8:28 ` [Bug c/102889] " pinskia at gcc dot gnu.org
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: ariel at amazon dot com @ 2021-10-22  8:22 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102889

            Bug ID: 102889
           Summary: -funroll-loops generates incorrect codes from inline
                    assembly on aarch64
           Product: gcc
           Version: 11.2.1
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: c
          Assignee: unassigned at gcc dot gnu.org
          Reporter: ariel at amazon dot com
  Target Milestone: ---

Created attachment 51651
  --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=51651&action=edit
Source code that reproduces bug.

The attached code works correctly with both -O2 and -O3 on GCC7 and 11.2, all
clang versions on AARCH64.
But generates incorrect code with:

/usr/local/gcc11/bin/gcc -O2 -funroll-loops -Wall -Werror -Wextra aa1.c
-fsanitize=undefined -fno-strict-aliasing -fwrapv
-fno-aggressive-loop-optimizations -o aa11y

(and in -O3 as well, just adding -funroll-loops breaks).

on all versions of gcc (triggers printf("Mismatch %i != %i\n", res1, res2);)

Attaching code to reproduce.

---


Using built-in specs.
COLLECT_GCC=/usr/local/gcc11/bin/gcc
COLLECT_LTO_WRAPPER=/usr/local/gcc11/libexec/gcc/aarch64-unknown-linux-gnu/11.2.0/lto-wrapper
Target: aarch64-unknown-linux-gnu
Configured with: ./configure --prefix=/usr/local/gcc11
Thread model: posix
Supported LTO compression algorithms: zlib
gcc version 11.2.0 (GCC)
COLLECT_GCC_OPTIONS='-v' '-save-temps' '-O3' '-funroll-loops' '-Wall' '-Werror'
'-Wextra' '-march=armv8.2-a+fp16+rcpc+dotprod+crypto' '-mtune=neoverse-n1'
'-fsanitize=undefined' '-fno-strict-aliasing' '-fwrapv'
'-fno-aggressive-loop-optimizations' '-o' 'aa11y' '-mlittle-endian'
'-mabi=lp64' '-dumpdir' 'aa11y-'
 /usr/local/gcc11/libexec/gcc/aarch64-unknown-linux-gnu/11.2.0/cc1 -E -quiet -v
aa1.c -march=armv8.2-a+fp16+rcpc+dotprod+crypto -mtune=neoverse-n1
-mlittle-endian -mabi=lp64 -Wall -Werror -Wextra -funroll-loops
-fsanitize=undefined -fno-strict-aliasing -fwrapv
-fno-aggressive-loop-optimizations -O3 -fpch-preprocess -o aa11y-aa1.i
ignoring nonexistent directory
"/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/../../../../aarch64-unknown-linux-gnu/include"
#include "..." search starts here:
#include <...> search starts here:
 /usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/include
 /usr/local/include
 /usr/local/gcc11/include
 /usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/include-fixed
 /usr/include
End of search list.
COLLECT_GCC_OPTIONS='-v' '-save-temps' '-O3' '-funroll-loops' '-Wall' '-Werror'
'-Wextra' '-march=armv8.2-a+fp16+rcpc+dotprod+crypto' '-mtune=neoverse-n1'
'-fsanitize=undefined' '-fno-strict-aliasing' '-fwrapv'
'-fno-aggressive-loop-optimizations' '-o' 'aa11y' '-mlittle-endian'
'-mabi=lp64' '-dumpdir' 'aa11y-'
 /usr/local/gcc11/libexec/gcc/aarch64-unknown-linux-gnu/11.2.0/cc1
-fpreprocessed aa11y-aa1.i -quiet -dumpdir aa11y- -dumpbase aa1.c -dumpbase-ext
.c -march=armv8.2-a+fp16+rcpc+dotprod+crypto -mtune=neoverse-n1 -mlittle-endian
-mabi=lp64 -O3 -Wall -Werror -Wextra -version -funroll-loops
-fsanitize=undefined -fno-strict-aliasing -fwrapv
-fno-aggressive-loop-optimizations -o aa11y-aa1.s
GNU C17 (GCC) version 11.2.0 (aarch64-unknown-linux-gnu)
        compiled by GNU C version 11.2.0, GMP version 6.2.1, MPFR version
4.1.0, MPC version 1.2.1, isl version none
GGC heuristics: --param ggc-min-expand=100 --param ggc-min-heapsize=131072
GNU C17 (GCC) version 11.2.0 (aarch64-unknown-linux-gnu)
        compiled by GNU C version 11.2.0, GMP version 6.2.1, MPFR version
4.1.0, MPC version 1.2.1, isl version none
GGC heuristics: --param ggc-min-expand=100 --param ggc-min-heapsize=131072
Compiler executable checksum: 3921e1b032be4ab2b700d43daf3de441
COLLECT_GCC_OPTIONS='-v' '-save-temps' '-O3' '-funroll-loops' '-Wall' '-Werror'
'-Wextra' '-march=armv8.2-a+fp16+rcpc+dotprod+crypto' '-mtune=neoverse-n1'
'-fsanitize=undefined' '-fno-strict-aliasing' '-fwrapv'
'-fno-aggressive-loop-optimizations' '-o' 'aa11y' '-mlittle-endian'
'-mabi=lp64' '-dumpdir' 'aa11y-'
 as -v -EL -march=armv8.2-a+fp16+rcpc+dotprod+crypto -mabi=lp64 -o aa11y-aa1.o
aa11y-aa1.s
GNU assembler version 2.29.1 (aarch64-redhat-linux) using BFD version version
2.29.1-30.amzn2
COMPILER_PATH=/usr/local/gcc11/libexec/gcc/aarch64-unknown-linux-gnu/11.2.0/:/usr/local/gcc11/libexec/gcc/aarch64-unknown-linux-gnu/11.2.0/:/usr/local/gcc11/libexec/gcc/aarch64-unknown-linux-gnu/:/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/:/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/
LIBRARY_PATH=/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/:/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/../../../../lib64/:/lib/../lib64/:/usr/lib/../lib64/:/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/../../../:/lib/:/usr/lib/
COLLECT_GCC_OPTIONS='-v' '-save-temps' '-O3' '-funroll-loops' '-Wall' '-Werror'
'-Wextra' '-march=armv8.2-a+fp16+rcpc+dotprod+crypto' '-mtune=neoverse-n1'
'-fsanitize=undefined' '-fno-strict-aliasing' '-fwrapv'
'-fno-aggressive-loop-optimizations' '-o' 'aa11y' '-mlittle-endian'
'-mabi=lp64' '-dumpdir' 'aa11y.'
 /usr/local/gcc11/libexec/gcc/aarch64-unknown-linux-gnu/11.2.0/collect2 -plugin
/usr/local/gcc11/libexec/gcc/aarch64-unknown-linux-gnu/11.2.0/liblto_plugin.so
-plugin-opt=/usr/local/gcc11/libexec/gcc/aarch64-unknown-linux-gnu/11.2.0/lto-wrapper
-plugin-opt=-fresolution=aa11y.res -plugin-opt=-pass-through=-lgcc
-plugin-opt=-pass-through=-lgcc_s -plugin-opt=-pass-through=-lc
-plugin-opt=-pass-through=-lgcc -plugin-opt=-pass-through=-lgcc_s
--eh-frame-hdr -dynamic-linker /lib/ld-linux-aarch64.so.1 -X -EL -maarch64linux
-o aa11y /lib/../lib64/crt1.o /lib/../lib64/crti.o
/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/crtbegin.o
-L/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0
-L/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/../../../../lib64
-L/lib/../lib64 -L/usr/lib/../lib64
-L/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/../../..
aa11y-aa1.o -lubsan -lgcc --push-state --as-needed -lgcc_s --pop-state -lc
-lgcc --push-state --as-needed -lgcc_s --pop-state
/usr/local/gcc11/lib/gcc/aarch64-unknown-linux-gnu/11.2.0/crtend.o
/lib/../lib64/crtn.o
COLLECT_GCC_OPTIONS='-v' '-save-temps' '-O3' '-funroll-loops' '-Wall' '-Werror'
'-Wextra' '-march=armv8.2-a+fp16+rcpc+dotprod+crypto' '-mtune=neoverse-n1'
'-fsanitize=undefined' '-fno-strict-aliasing' '-fwrapv'
'-fno-aggressive-loop-optimizations' '-o' 'aa11y' '-mlittle-endian'
'-mabi=lp64' '-dumpdir' 'aa11y.'

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug c/102889] -funroll-loops generates incorrect codes from inline assembly on aarch64
  2021-10-22  8:22 [Bug c/102889] New: -funroll-loops generates incorrect codes from inline assembly on aarch64 ariel at amazon dot com
@ 2021-10-22  8:28 ` pinskia at gcc dot gnu.org
  2021-10-22  8:30 ` pinskia at gcc dot gnu.org
  2021-10-22  9:20 ` ariel at amazon dot com
  2 siblings, 0 replies; 4+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-10-22  8:28 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102889

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|                            |inline-asm
         Resolution|---                         |INVALID
             Status|UNCONFIRMED                 |RESOLVED

--- Comment #1 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
This inline asm is totally wrong:
asm volatile("rbit %0,%0\n"
                    "clz %0,%0\n" : "=r" (val) : "r" (val) );
Try this:
asm volatile("rbit %0,%1\n"
                    "clz %0,%0\n" : "=&r" (val) : "r" (val) );

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug c/102889] -funroll-loops generates incorrect codes from inline assembly on aarch64
  2021-10-22  8:22 [Bug c/102889] New: -funroll-loops generates incorrect codes from inline assembly on aarch64 ariel at amazon dot com
  2021-10-22  8:28 ` [Bug c/102889] " pinskia at gcc dot gnu.org
@ 2021-10-22  8:30 ` pinskia at gcc dot gnu.org
  2021-10-22  9:20 ` ariel at amazon dot com
  2 siblings, 0 replies; 4+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-10-22  8:30 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102889

--- Comment #2 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Or this:
asm volatile("rbit %0,%0\n"
                    "clz %0,%0\n" : "=r" (val) : "0" (val) );

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug c/102889] -funroll-loops generates incorrect codes from inline assembly on aarch64
  2021-10-22  8:22 [Bug c/102889] New: -funroll-loops generates incorrect codes from inline assembly on aarch64 ariel at amazon dot com
  2021-10-22  8:28 ` [Bug c/102889] " pinskia at gcc dot gnu.org
  2021-10-22  8:30 ` pinskia at gcc dot gnu.org
@ 2021-10-22  9:20 ` ariel at amazon dot com
  2 siblings, 0 replies; 4+ messages in thread
From: ariel at amazon dot com @ 2021-10-22  9:20 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102889

--- Comment #3 from ariel at amazon dot com ---
Thanks for pointing out my mistake

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2021-10-22  9:20 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-10-22  8:22 [Bug c/102889] New: -funroll-loops generates incorrect codes from inline assembly on aarch64 ariel at amazon dot com
2021-10-22  8:28 ` [Bug c/102889] " pinskia at gcc dot gnu.org
2021-10-22  8:30 ` pinskia at gcc dot gnu.org
2021-10-22  9:20 ` ariel at amazon dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).