* [Bug tree-optimization/113900] [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
@ 2024-02-13 9:06 ` sjames at gcc dot gnu.org
2024-02-13 9:19 ` [Bug tree-optimization/113900] [14 regression] Hang and then ICE " sjames at gcc dot gnu.org
` (8 subsequent siblings)
9 siblings, 0 replies; 11+ messages in thread
From: sjames at gcc dot gnu.org @ 2024-02-13 9:06 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
--- Comment #1 from Sam James <sjames at gcc dot gnu.org> ---
Created attachment 57405
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=57405&action=edit
slarith.i.xz
^ permalink raw reply [flat|nested] 11+ messages in thread
* [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
2024-02-13 9:06 ` [Bug tree-optimization/113900] " sjames at gcc dot gnu.org
@ 2024-02-13 9:19 ` sjames at gcc dot gnu.org
2024-02-13 9:20 ` sjames at gcc dot gnu.org
` (7 subsequent siblings)
9 siblings, 0 replies; 11+ messages in thread
From: sjames at gcc dot gnu.org @ 2024-02-13 9:19 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
--- Comment #2 from Sam James <sjames at gcc dot gnu.org> ---
This actually takes ages even when it doesn't ICE:
```
$ gcc -c ./slarith.i -O2 -ftime-report
Time variable usr sys wall
GGC
phase setup : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
1841k ( 0%)
phase parsing : 0.47 ( 1%) 0.41 ( 37%) 0.88 ( 2%)
24M ( 4%)
phase opt and generate : 33.91 ( 99%) 0.70 ( 63%) 34.61 ( 97%)
633M ( 96%)
garbage collection : 0.53 ( 2%) 0.01 ( 1%) 0.54 ( 2%)
0 ( 0%)
dump files : 0.06 ( 0%) 0.00 ( 0%) 0.06 ( 0%)
0 ( 0%)
callgraph construction : 0.02 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
8742k ( 1%)
callgraph optimization : 0.05 ( 0%) 0.00 ( 0%) 0.06 ( 0%)
2200 ( 0%)
callgraph functions expansion : 28.02 ( 82%) 0.45 ( 41%) 28.47 ( 80%)
500M ( 76%)
callgraph ipa passes : 5.47 ( 16%) 0.21 ( 19%) 5.68 ( 16%)
70M ( 11%)
ipa function summary : 0.34 ( 1%) 0.00 ( 0%) 0.37 ( 1%)
14M ( 2%)
ipa devirtualization : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
ipa cp : 0.01 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
151k ( 0%)
ipa inlining heuristics : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
48k ( 0%)
ipa function splitting : 0.00 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
9192 ( 0%)
ipa reference : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
ipa profile : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
ipa pure const : 0.03 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
3792 ( 0%)
ipa icf : 0.03 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
48k ( 0%)
ipa SRA : 0.02 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
37k ( 0%)
ipa free inline summary : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
0 ( 0%)
ipa modref : 0.04 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
459k ( 0%)
cfg construction : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
2113k ( 0%)
cfg cleanup : 0.19 ( 1%) 0.00 ( 0%) 0.21 ( 1%)
4518k ( 1%)
CFG verifier : 0.90 ( 3%) 0.04 ( 4%) 0.82 ( 2%)
0 ( 0%)
trivially dead code : 0.02 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
0 ( 0%)
df scan insns : 0.05 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
12k ( 0%)
df reaching defs : 0.19 ( 1%) 0.00 ( 0%) 0.20 ( 1%)
0 ( 0%)
df live regs : 0.40 ( 1%) 0.01 ( 1%) 0.39 ( 1%)
0 ( 0%)
df live&initialized regs : 0.17 ( 0%) 0.00 ( 0%) 0.19 ( 1%)
0 ( 0%)
df must-initialized regs : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
df use-def / def-use chains : 0.08 ( 0%) 0.00 ( 0%) 0.10 ( 0%)
0 ( 0%)
df reg dead/unused notes : 0.17 ( 0%) 0.01 ( 1%) 0.15 ( 0%)
3314k ( 0%)
register information : 0.04 ( 0%) 0.01 ( 1%) 0.01 ( 0%)
0 ( 0%)
alias analysis : 0.08 ( 0%) 0.00 ( 0%) 0.12 ( 0%)
14M ( 2%)
alias stmt walking : 0.15 ( 0%) 0.02 ( 2%) 0.24 ( 1%)
227k ( 0%)
register scan : 0.03 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
154k ( 0%)
rebuild jump labels : 0.02 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
768 ( 0%)
preprocessing : 0.10 ( 0%) 0.11 ( 10%) 0.19 ( 1%)
500k ( 0%)
lexical analysis : 0.12 ( 0%) 0.14 ( 13%) 0.36 ( 1%)
0 ( 0%)
parser (global) : 0.02 ( 0%) 0.01 ( 1%) 0.03 ( 0%)
3270k ( 0%)
parser function body : 0.19 ( 1%) 0.14 ( 13%) 0.28 ( 1%)
20M ( 3%)
early inlining heuristics : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
23k ( 0%)
inline parameters : 0.11 ( 0%) 0.00 ( 0%) 0.12 ( 0%)
801k ( 0%)
tree gimplify : 0.12 ( 0%) 0.01 ( 1%) 0.12 ( 0%)
30M ( 5%)
tree CFG construction : 0.06 ( 0%) 0.01 ( 1%) 0.06 ( 0%)
22M ( 3%)
tree CFG cleanup : 0.36 ( 1%) 0.00 ( 0%) 0.24 ( 1%)
320k ( 0%)
tree tail merge : 0.02 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
3392k ( 1%)
tree VRP : 1.51 ( 4%) 0.01 ( 1%) 1.60 ( 5%)
20M ( 3%)
tree Early VRP : 0.78 ( 2%) 0.02 ( 2%) 0.84 ( 2%)
11M ( 2%)
tree copy propagation : 0.10 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
353k ( 0%)
tree PTA : 0.22 ( 1%) 0.00 ( 0%) 0.18 ( 1%)
2501k ( 0%)
tree SSA other : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
tree SSA rewrite : 0.03 ( 0%) 0.01 ( 1%) 0.09 ( 0%)
5749k ( 1%)
tree SSA incremental : 0.10 ( 0%) 0.01 ( 1%) 0.09 ( 0%)
4424k ( 1%)
tree operand scan : 0.08 ( 0%) 0.05 ( 5%) 0.09 ( 0%)
9419k ( 1%)
dominator optimization : 0.99 ( 3%) 0.00 ( 0%) 1.05 ( 3%)
7120k ( 1%)
backwards jump threading : 0.59 ( 2%) 0.00 ( 0%) 0.51 ( 1%)
1062k ( 0%)
tree SRA : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
0 ( 0%)
tree CCP : 1.81 ( 5%) 0.00 ( 0%) 1.88 ( 5%)
2790k ( 0%)
tree split crit edges : 0.04 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
3148k ( 0%)
tree reassociation : 0.05 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
11k ( 0%)
tree PRE : 0.58 ( 2%) 0.01 ( 1%) 0.54 ( 2%)
9776k ( 1%)
tree FRE : 0.71 ( 2%) 0.08 ( 7%) 0.74 ( 2%)
2240k ( 0%)
tree RPO VN : 0.03 ( 0%) 0.01 ( 1%) 0.00 ( 0%)
973k ( 0%)
tree code sinking : 0.03 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
5259k ( 1%)
tree linearize phis : 0.07 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
1195k ( 0%)
tree backward propagate : 0.00 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
0 ( 0%)
tree forward propagate : 0.15 ( 0%) 0.02 ( 2%) 0.13 ( 0%)
1359k ( 0%)
tree phiprop : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
0 ( 0%)
tree conservative DCE : 0.04 ( 0%) 0.03 ( 3%) 0.03 ( 0%)
104k ( 0%)
tree aggressive DCE : 0.18 ( 1%) 0.00 ( 0%) 0.26 ( 1%)
14M ( 2%)
tree DSE : 0.10 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
900k ( 0%)
PHI merge : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
1360k ( 0%)
tree loop optimization : 0.00 ( 0%) 0.01 ( 1%) 0.00 ( 0%)
0 ( 0%)
tree loop invariant motion : 0.06 ( 0%) 0.00 ( 0%) 0.08 ( 0%)
0 ( 0%)
tree canonical iv : 0.16 ( 0%) 0.00 ( 0%) 0.09 ( 0%)
5872k ( 1%)
scev constant prop : 0.07 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
2690k ( 0%)
complete unrolling : 0.56 ( 2%) 0.00 ( 0%) 0.52 ( 1%)
27M ( 4%)
tree vectorization : 0.96 ( 3%) 0.00 ( 0%) 0.92 ( 3%)
99M ( 15%)
tree slp vectorization : 0.35 ( 1%) 0.00 ( 0%) 0.32 ( 1%)
53M ( 8%)
tree loop distribution : 0.10 ( 0%) 0.00 ( 0%) 0.19 ( 1%)
5678k ( 1%)
tree iv optimization : 0.68 ( 2%) 0.02 ( 2%) 0.73 ( 2%)
25M ( 4%)
predictive commoning : 0.17 ( 0%) 0.00 ( 0%) 0.23 ( 1%)
9610k ( 1%)
tree copy headers : 0.36 ( 1%) 0.00 ( 0%) 0.37 ( 1%)
14M ( 2%)
tree SSA uncprop : 0.00 ( 0%) 0.01 ( 1%) 0.00 ( 0%)
0 ( 0%)
tree NRV optimization : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
4032 ( 0%)
tree SSA verifier : 2.42 ( 7%) 0.07 ( 6%) 2.31 ( 7%)
0 ( 0%)
tree STMT verifier : 2.45 ( 7%) 0.06 ( 5%) 2.61 ( 7%)
0 ( 0%)
gimple widening/fma detection : 0.01 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
256 ( 0%)
tree strlen optimization : 0.06 ( 0%) 0.00 ( 0%) 0.11 ( 0%)
3430k ( 1%)
tree modref : 0.06 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
497k ( 0%)
callgraph verifier : 0.09 ( 0%) 0.02 ( 2%) 0.08 ( 0%)
0 ( 0%)
dominance frontiers : 0.00 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
0 ( 0%)
dominance computation : 0.28 ( 1%) 0.03 ( 3%) 0.42 ( 1%)
0 ( 0%)
control dependences : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
out of ssa : 0.03 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
283k ( 0%)
expand vars : 0.02 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
6005k ( 1%)
expand : 0.11 ( 0%) 0.00 ( 0%) 0.12 ( 0%)
21M ( 3%)
post expand cleanups : 0.02 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
3074k ( 0%)
lower subreg : 0.02 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
0 ( 0%)
forward prop : 0.19 ( 1%) 0.00 ( 0%) 0.15 ( 0%)
740k ( 0%)
CSE : 0.32 ( 1%) 0.00 ( 0%) 0.25 ( 1%)
955k ( 0%)
dead code elimination : 0.11 ( 0%) 0.00 ( 0%) 0.12 ( 0%)
12k ( 0%)
dead store elim1 : 0.09 ( 0%) 0.00 ( 0%) 0.12 ( 0%)
3782k ( 1%)
dead store elim2 : 0.04 ( 0%) 0.00 ( 0%) 0.10 ( 0%)
3907k ( 1%)
loop analysis : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
loop init : 0.56 ( 2%) 0.00 ( 0%) 0.43 ( 1%)
41M ( 6%)
loop invariant motion : 2.79 ( 8%) 0.01 ( 1%) 2.83 ( 8%)
203k ( 0%)
loop unrolling : 2.96 ( 9%) 0.03 ( 3%) 3.01 ( 8%)
6601k ( 1%)
loop fini : 0.03 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
4096 ( 0%)
CPROP : 0.36 ( 1%) 0.00 ( 0%) 0.48 ( 1%)
6688k ( 1%)
PRE : 0.30 ( 1%) 0.00 ( 0%) 0.24 ( 1%)
33k ( 0%)
CSE 2 : 0.21 ( 1%) 0.01 ( 1%) 0.17 ( 0%)
460k ( 0%)
branch prediction : 0.23 ( 1%) 0.00 ( 0%) 0.22 ( 1%)
12M ( 2%)
combiner : 0.51 ( 1%) 0.00 ( 0%) 0.47 ( 1%)
17M ( 3%)
if-conversion : 0.16 ( 0%) 0.00 ( 0%) 0.17 ( 0%)
230k ( 0%)
mode switching : 0.01 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
integrated RA : 0.53 ( 2%) 0.01 ( 1%) 0.48 ( 1%)
25M ( 4%)
LRA non-specific : 0.17 ( 0%) 0.00 ( 0%) 0.15 ( 0%)
711k ( 0%)
LRA virtuals elimination : 0.02 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
107k ( 0%)
LRA reload inheritance : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
36k ( 0%)
LRA create live ranges : 0.06 ( 0%) 0.00 ( 0%) 0.08 ( 0%)
18k ( 0%)
LRA hard reg assignment : 0.01 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
LRA rematerialization : 0.02 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
0 ( 0%)
reload : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
6240 ( 0%)
reload CSE regs : 0.19 ( 1%) 0.00 ( 0%) 0.25 ( 1%)
5298k ( 1%)
ree : 0.09 ( 0%) 0.00 ( 0%) 0.06 ( 0%)
67k ( 0%)
thread pro- & epilogue : 0.09 ( 0%) 0.01 ( 1%) 0.05 ( 0%)
469k ( 0%)
if-conversion 2 : 0.05 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
19k ( 0%)
combine stack adjustments : 0.02 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
6240 ( 0%)
peephole 2 : 0.09 ( 0%) 0.00 ( 0%) 0.09 ( 0%)
667k ( 0%)
hard reg cprop : 0.10 ( 0%) 0.00 ( 0%) 0.07 ( 0%)
33k ( 0%)
scheduling 2 : 0.33 ( 1%) 0.00 ( 0%) 0.39 ( 1%)
996k ( 0%)
machine dep reorg : 0.09 ( 0%) 0.01 ( 1%) 0.12 ( 0%)
12k ( 0%)
reorder blocks : 0.08 ( 0%) 0.00 ( 0%) 0.07 ( 0%)
2289k ( 0%)
shorten branches : 0.03 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
0 ( 0%)
reg stack : 0.02 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
75k ( 0%)
final : 0.05 ( 0%) 0.01 ( 1%) 0.11 ( 0%)
5355k ( 1%)
tree if-combine : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
252k ( 0%)
if to switch conversion : 0.00 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
1144 ( 0%)
straight-line strength reduction : 0.04 ( 0%) 0.01 ( 1%) 0.04 ( 0%)
233k ( 0%)
store merging : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
480 ( 0%)
tree loop if-conversion : 0.11 ( 0%) 0.00 ( 0%) 0.12 ( 0%)
5909k ( 1%)
access analysis : 0.18 ( 1%) 0.00 ( 0%) 0.16 ( 0%)
4496 ( 0%)
early local passes : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
unaccounted optimizations : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
rest of compilation : 0.30 ( 1%) 0.01 ( 1%) 0.40 ( 1%)
2089k ( 0%)
unaccounted post reload : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
0 ( 0%)
remove unused locals : 0.02 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
0 ( 0%)
address taken : 0.03 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
0 ( 0%)
verify loop closed : 0.05 ( 0%) 0.01 ( 1%) 0.06 ( 0%)
0 ( 0%)
verify RTL sharing : 0.68 ( 2%) 0.00 ( 0%) 0.75 ( 2%)
0 ( 0%)
rebuild frequencies : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
repair loop structures : 0.03 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
824 ( 0%)
TOTAL : 34.38 1.11 35.50
660M
Extra diagnostic checks enabled; compiler may run slowly.
Configure with --enable-checking=release to disable checks.
```
Anyway, 'gcc -c ./slarith.i -O3 -march=znver2 -m32 -fno-strict-aliasing -fPIC
makes it ICE at least, I'm not able to test many combinations though. Sorry!
```
$ gcc -v
Using built-in specs.
COLLECT_GCC=gcc
COLLECT_LTO_WRAPPER=/usr/libexec/gcc/x86_64-pc-linux-gnu/14/lto-wrapper
Target: x86_64-pc-linux-gnu
Configured with:
/var/tmp/portage/sys-devel/gcc-14.0.9999/work/gcc-14.0.9999/configure
--host=x86_64-pc-linux-gnu --build=x86_64-pc-linux-gnu --prefix=/usr
--bindir=/usr/x86_64-pc-linux-gnu/gcc-bin/14
--includedir=/usr/lib/gcc/x86_64-pc-linux-gnu/14/include
--datadir=/usr/share/gcc-data/x86_64-pc-linux-gnu/14
--mandir=/usr/share/gcc-data/x86_64-pc-linux-gnu/14/man
--infodir=/usr/share/gcc-data/x86_64-pc-linux-gnu/14/info
--with-gxx-include-dir=/usr/lib/gcc/x86_64-pc-linux-gnu/14/include/g++-v14
--disable-silent-rules --disable-dependency-tracking
--with-python-dir=/share/gcc-data/x86_64-pc-linux-gnu/14/python
--enable-languages=c,c++,fortran --enable-obsolete --enable-secureplt
--disable-werror --with-system-zlib --enable-nls --without-included-gettext
--disable-libunwind-exceptions --enable-checking=yes,extra,rtl,df
--with-bugurl=https://bugs.gentoo.org/ --with-pkgversion='Gentoo 14.0.9999 p,
commit e6628b8411533cc14470a9404cca9047df81cb0f' --with-gcc-major-version-only
--enable-libstdcxx-time --enable-lto --disable-libstdcxx-pch --enable-shared
--enable-threads=posix --enable-__cxa_atexit --enable-clocale=gnu
--enable-multilib --with-multilib-list=m32,m64 --disable-fixed-point
--enable-targets=all --enable-libgomp --disable-libssp --disable-libada
--disable-cet --disable-systemtap --disable-valgrind-annotations
--disable-vtable-verify --disable-libvtv --with-zstd --without-isl
--enable-default-pie --enable-host-pie --disable-host-bind-now
--enable-default-ssp --with-build-config='bootstrap-O3 bootstrap-lto'
Thread model: posix
Supported LTO compression algorithms: zlib zstd
gcc version 14.0.1 20240212 (experimental)
f3306a9455f4d2556ba941e5783c94d4bede3b58 (Gentoo 14.0.9999 p, commit
e6628b8411533cc14470a9404cca9047df81cb0f)
```
^ permalink raw reply [flat|nested] 11+ messages in thread
* [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
2024-02-13 9:06 ` [Bug tree-optimization/113900] " sjames at gcc dot gnu.org
2024-02-13 9:19 ` [Bug tree-optimization/113900] [14 regression] Hang and then ICE " sjames at gcc dot gnu.org
@ 2024-02-13 9:20 ` sjames at gcc dot gnu.org
2024-02-13 9:23 ` rguenth at gcc dot gnu.org
` (6 subsequent siblings)
9 siblings, 0 replies; 11+ messages in thread
From: sjames at gcc dot gnu.org @ 2024-02-13 9:20 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
--- Comment #3 from Sam James <sjames at gcc dot gnu.org> ---
OK, it takes ages (at least with checking, can't check without right now) with
10/11/12/13/14, so that part isn't a regression. Weird how I didn't notice
before.
^ permalink raw reply [flat|nested] 11+ messages in thread
* [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
` (2 preceding siblings ...)
2024-02-13 9:20 ` sjames at gcc dot gnu.org
@ 2024-02-13 9:23 ` rguenth at gcc dot gnu.org
2024-02-13 10:18 ` sjames at gcc dot gnu.org
` (5 subsequent siblings)
9 siblings, 0 replies; 11+ messages in thread
From: rguenth at gcc dot gnu.org @ 2024-02-13 9:23 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
Richard Biener <rguenth at gcc dot gnu.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Target Milestone|--- |14.0
Keywords| |needs-bisection,
| |needs-reduction
^ permalink raw reply [flat|nested] 11+ messages in thread
* [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
` (3 preceding siblings ...)
2024-02-13 9:23 ` rguenth at gcc dot gnu.org
@ 2024-02-13 10:18 ` sjames at gcc dot gnu.org
2024-02-13 10:29 ` rguenth at gcc dot gnu.org
` (4 subsequent siblings)
9 siblings, 0 replies; 11+ messages in thread
From: sjames at gcc dot gnu.org @ 2024-02-13 10:18 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
--- Comment #4 from Sam James <sjames at gcc dot gnu.org> ---
I'll bisect and then also try reduce if it has tolerable speed without
checking.
^ permalink raw reply [flat|nested] 11+ messages in thread
* [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
` (4 preceding siblings ...)
2024-02-13 10:18 ` sjames at gcc dot gnu.org
@ 2024-02-13 10:29 ` rguenth at gcc dot gnu.org
2024-02-13 10:57 ` jakub at gcc dot gnu.org
` (3 subsequent siblings)
9 siblings, 0 replies; 11+ messages in thread
From: rguenth at gcc dot gnu.org @ 2024-02-13 10:29 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
Richard Biener <rguenth at gcc dot gnu.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Last reconfirmed| |2024-02-13
Keywords|compile-time-hog |
Ever confirmed|0 |1
Status|UNCONFIRMED |NEW
--- Comment #5 from Richard Biener <rguenth at gcc dot gnu.org> ---
What does -march=native resolve to? I suppose znver2? I can confirm the
compile-time-hog even with a release checking GCC 13 compiler, but nothing
really stands out here besides maybe RTL combine and load CSE after reload
(that's a usual suspect).
> gcc-13 slarith.i -S -m32 -mfpmath=sse -O3 -fPIC -march=znver2 -fno-strict-aliasing -Waddress -Warray-bounds -Wfree-nonheap-object -Wint-to-pointer-cast -Wmain -Wnonnull -Wodr -Wreturn-type -Wsizeof-pointer-memaccess -Wstrict-aliasing -Wstring-compare -Wuninitialized -Wvarargs -ftime-report
Time variable usr sys wall
GGC
phase setup : 0.00 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
2042k ( 0%)
phase parsing : 0.13 ( 0%) 0.40 ( 20%) 0.53 ( 1%)
25M ( 1%)
phase lang. deferred : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
96 ( 0%)
phase opt and generate : 46.65 (100%) 1.61 ( 80%) 48.27 ( 99%)
2563M ( 99%)
garbage collection : 0.12 ( 0%) 0.01 ( 0%) 0.12 ( 0%)
0 ( 0%)
dump files : 0.03 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
0 ( 0%)
callgraph construction : 0.05 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
552k ( 0%)
callgraph optimization : 0.01 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
2952 ( 0%)
callgraph functions expansion : 45.66 ( 98%) 1.46 ( 73%) 47.13 ( 97%)
2459M ( 95%)
callgraph ipa passes : 0.90 ( 2%) 0.15 ( 7%) 1.06 ( 2%)
60M ( 2%)
ipa function summary : 0.09 ( 0%) 0.00 ( 0%) 0.09 ( 0%)
9208k ( 0%)
ipa cp : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
175k ( 0%)
ipa inlining heuristics : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
68k ( 0%)
ipa function splitting : 0.01 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
8528 ( 0%)
ipa pure const : 0.02 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
3504 ( 0%)
ipa icf : 0.01 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
30k ( 0%)
ipa SRA : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
37k ( 0%)
ipa modref : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
325k ( 0%)
cfg construction : 0.00 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
3443k ( 0%)
cfg cleanup : 0.52 ( 1%) 0.01 ( 0%) 0.44 ( 1%)
37M ( 1%)
trivially dead code : 0.11 ( 0%) 0.00 ( 0%) 0.15 ( 0%)
0 ( 0%)
df scan insns : 0.07 ( 0%) 0.00 ( 0%) 0.10 ( 0%)
12k ( 0%)
df reaching defs : 0.37 ( 1%) 0.01 ( 0%) 0.29 ( 1%)
0 ( 0%)
df live regs : 1.22 ( 3%) 0.01 ( 0%) 1.15 ( 2%)
0 ( 0%)
df live&initialized regs : 0.53 ( 1%) 0.00 ( 0%) 0.65 ( 1%)
0 ( 0%)
df must-initialized regs : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
df use-def / def-use chains : 0.07 ( 0%) 0.00 ( 0%) 0.09 ( 0%)
0 ( 0%)
df live reg subwords : 0.01 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
df reg dead/unused notes : 0.55 ( 1%) 0.00 ( 0%) 0.51 ( 1%)
24M ( 1%)
register information : 0.09 ( 0%) 0.00 ( 0%) 0.09 ( 0%)
0 ( 0%)
alias analysis : 0.51 ( 1%) 0.00 ( 0%) 0.48 ( 1%)
125M ( 5%)
alias stmt walking : 0.91 ( 2%) 0.22 ( 11%) 0.95 ( 2%)
45M ( 2%)
register scan : 0.06 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
1524k ( 0%)
rebuild jump labels : 0.09 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
264 ( 0%)
preprocessing : 0.03 ( 0%) 0.10 ( 5%) 0.12 ( 0%)
500k ( 0%)
lexical analysis : 0.06 ( 0%) 0.19 ( 9%) 0.20 ( 0%)
0 ( 0%)
parser (global) : 0.00 ( 0%) 0.01 ( 0%) 0.01 ( 0%)
3313k ( 0%)
parser struct body : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
165k ( 0%)
parser function body : 0.04 ( 0%) 0.10 ( 5%) 0.18 ( 0%)
20M ( 1%)
parser inl. func. body : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
374k ( 0%)
inline parameters : 0.04 ( 0%) 0.02 ( 1%) 0.09 ( 0%)
779k ( 0%)
integration : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
382k ( 0%)
tree gimplify : 0.03 ( 0%) 0.00 ( 0%) 0.06 ( 0%)
26M ( 1%)
tree CFG construction : 0.02 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
14M ( 1%)
tree CFG cleanup : 0.32 ( 1%) 0.03 ( 1%) 0.28 ( 1%)
1884k ( 0%)
tree tail merge : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
3359k ( 0%)
tree VRP : 1.50 ( 3%) 0.09 ( 4%) 1.55 ( 3%)
26M ( 1%)
tree Early VRP : 0.22 ( 0%) 0.00 ( 0%) 0.20 ( 0%)
13M ( 1%)
tree copy propagation : 0.03 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
365k ( 0%)
tree PTA : 0.08 ( 0%) 0.02 ( 1%) 0.10 ( 0%)
2480k ( 0%)
tree SSA rewrite : 0.01 ( 0%) 0.03 ( 1%) 0.03 ( 0%)
6370k ( 0%)
tree SSA incremental : 0.24 ( 1%) 0.02 ( 1%) 0.43 ( 1%)
28M ( 1%)
tree operand scan : 0.44 ( 1%) 0.16 ( 8%) 0.64 ( 1%)
57M ( 2%)
dominator optimization : 1.32 ( 3%) 0.03 ( 1%) 1.34 ( 3%)
25M ( 1%)
backwards jump threading : 0.81 ( 2%) 0.04 ( 2%) 0.84 ( 2%)
923k ( 0%)
tree CCP : 0.62 ( 1%) 0.01 ( 0%) 0.54 ( 1%)
1512k ( 0%)
tree split crit edges : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
8949k ( 0%)
tree reassociation : 0.07 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
107k ( 0%)
tree PRE : 0.22 ( 0%) 0.00 ( 0%) 0.29 ( 1%)
10M ( 0%)
tree FRE : 0.74 ( 2%) 0.15 ( 7%) 0.89 ( 2%)
5140k ( 0%)
tree RPO VN : 0.23 ( 0%) 0.06 ( 3%) 0.33 ( 1%)
3884k ( 0%)
tree code sinking : 0.09 ( 0%) 0.02 ( 1%) 0.03 ( 0%)
11M ( 0%)
tree linearize phis : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
1003k ( 0%)
tree backward propagate : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
tree forward propagate : 0.10 ( 0%) 0.02 ( 1%) 0.19 ( 0%)
7716k ( 0%)
tree conservative DCE : 0.05 ( 0%) 0.03 ( 1%) 0.11 ( 0%)
4177k ( 0%)
tree aggressive DCE : 0.05 ( 0%) 0.01 ( 0%) 0.07 ( 0%)
1184k ( 0%)
tree DSE : 0.07 ( 0%) 0.00 ( 0%) 0.09 ( 0%)
1168k ( 0%)
PHI merge : 0.00 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
1057k ( 0%)
tree loop invariant motion : 0.06 ( 0%) 0.00 ( 0%) 0.07 ( 0%)
0 ( 0%)
tree loop interchange : 0.01 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
1476k ( 0%)
tree canonical iv : 0.05 ( 0%) 0.00 ( 0%) 0.10 ( 0%)
10M ( 0%)
scev constant prop : 0.04 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
3211k ( 0%)
tree loop unswitching : 0.05 ( 0%) 0.00 ( 0%) 0.08 ( 0%)
11M ( 0%)
loop splitting : 0.01 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
886k ( 0%)
complete unrolling : 0.70 ( 1%) 0.09 ( 4%) 0.86 ( 2%)
108M ( 4%)
tree vectorization : 0.81 ( 2%) 0.22 ( 11%) 0.95 ( 2%)
261M ( 10%)
tree slp vectorization : 0.58 ( 1%) 0.01 ( 0%) 0.56 ( 1%)
242M ( 9%)
tree loop distribution : 0.09 ( 0%) 0.00 ( 0%) 0.07 ( 0%)
9740k ( 0%)
tree iv optimization : 0.46 ( 1%) 0.05 ( 2%) 0.55 ( 1%)
69M ( 3%)
predictive commoning : 0.52 ( 1%) 0.01 ( 0%) 0.52 ( 1%)
25M ( 1%)
tree copy headers : 0.06 ( 0%) 0.00 ( 0%) 0.06 ( 0%)
4543k ( 0%)
tree SSA uncprop : 0.02 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
0 ( 0%)
tree switch lowering : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
188k ( 0%)
gimple CSE sin/cos : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
0 ( 0%)
gimple widening/fma detection : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
0 ( 0%)
tree strlen optimization : 0.15 ( 0%) 0.01 ( 0%) 0.18 ( 0%)
1363k ( 0%)
tree modref : 0.04 ( 0%) 0.00 ( 0%) 0.06 ( 0%)
367k ( 0%)
dominance frontiers : 0.00 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
0 ( 0%)
dominance computation : 0.24 ( 1%) 0.00 ( 0%) 0.25 ( 1%)
0 ( 0%)
out of ssa : 0.10 ( 0%) 0.01 ( 0%) 0.13 ( 0%)
514k ( 0%)
expand vars : 0.07 ( 0%) 0.00 ( 0%) 0.07 ( 0%)
29M ( 1%)
expand : 0.34 ( 1%) 0.01 ( 0%) 0.32 ( 1%)
149M ( 6%)
post expand cleanups : 0.01 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
6798k ( 0%)
varconst : 0.00 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
1944 ( 0%)
lower subreg : 0.03 ( 0%) 0.00 ( 0%) 0.11 ( 0%)
134k ( 0%)
jump : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
0 ( 0%)
forward prop : 0.40 ( 1%) 0.01 ( 0%) 0.36 ( 1%)
8490k ( 0%)
CSE : 0.99 ( 2%) 0.02 ( 1%) 1.06 ( 2%)
5692k ( 0%)
dead code elimination : 0.15 ( 0%) 0.00 ( 0%) 0.17 ( 0%)
0 ( 0%)
dead store elim1 : 0.21 ( 0%) 0.00 ( 0%) 0.23 ( 0%)
27M ( 1%)
dead store elim2 : 0.24 ( 1%) 0.00 ( 0%) 0.30 ( 1%)
32M ( 1%)
loop analysis : 0.01 ( 0%) 0.00 ( 0%) 0.00 ( 0%)
0 ( 0%)
loop init : 0.15 ( 0%) 0.02 ( 1%) 0.21 ( 0%)
54M ( 2%)
loop invariant motion : 0.05 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
71k ( 0%)
loop unrolling : 0.03 ( 0%) 0.01 ( 0%) 0.08 ( 0%)
6644k ( 0%)
loop fini : 0.02 ( 0%) 0.01 ( 0%) 0.01 ( 0%)
104k ( 0%)
CPROP : 0.51 ( 1%) 0.00 ( 0%) 0.47 ( 1%)
49M ( 2%)
PRE : 3.47 ( 7%) 0.00 ( 0%) 3.41 ( 7%)
2326k ( 0%)
CSE 2 : 0.51 ( 1%) 0.00 ( 0%) 0.47 ( 1%)
2598k ( 0%)
branch prediction : 0.08 ( 0%) 0.00 ( 0%) 0.08 ( 0%)
13M ( 1%)
combiner : 7.83 ( 17%) 0.00 ( 0%) 7.81 ( 16%)
475M ( 18%)
if-conversion : 0.03 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
1297k ( 0%)
integrated RA : 1.92 ( 4%) 0.02 ( 1%) 1.95 ( 4%)
232M ( 9%)
LRA non-specific : 1.86 ( 4%) 0.00 ( 0%) 1.84 ( 4%)
54M ( 2%)
LRA virtuals elimination : 0.08 ( 0%) 0.00 ( 0%) 0.12 ( 0%)
7192k ( 0%)
LRA reload inheritance : 0.39 ( 1%) 0.00 ( 0%) 0.30 ( 1%)
22M ( 1%)
LRA create live ranges : 1.58 ( 3%) 0.00 ( 0%) 1.68 ( 3%)
11M ( 0%)
LRA hard reg assignment : 1.51 ( 3%) 0.01 ( 0%) 1.58 ( 3%)
0 ( 0%)
LRA coalesce pseudo regs : 0.07 ( 0%) 0.00 ( 0%) 0.06 ( 0%)
0 ( 0%)
LRA rematerialization : 0.34 ( 1%) 0.00 ( 0%) 0.31 ( 1%)
200k ( 0%)
reload : 0.04 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
6240 ( 0%)
reload CSE regs : 1.17 ( 3%) 0.01 ( 0%) 1.16 ( 2%)
45M ( 2%)
load CSE after reload : 3.04 ( 6%) 0.00 ( 0%) 3.13 ( 6%)
3435k ( 0%)
ree : 0.03 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
319k ( 0%)
thread pro- & epilogue : 0.05 ( 0%) 0.00 ( 0%) 0.04 ( 0%)
739k ( 0%)
if-conversion 2 : 0.02 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
159k ( 0%)
split paths : 0.02 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
336k ( 0%)
combine stack adjustments : 0.01 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
5912 ( 0%)
peephole 2 : 0.08 ( 0%) 0.00 ( 0%) 0.09 ( 0%)
2472k ( 0%)
hard reg cprop : 0.14 ( 0%) 0.00 ( 0%) 0.15 ( 0%)
501k ( 0%)
scheduling 2 : 1.15 ( 2%) 0.00 ( 0%) 1.13 ( 2%)
6414k ( 0%)
machine dep reorg : 0.11 ( 0%) 0.00 ( 0%) 0.17 ( 0%)
6520k ( 0%)
reorder blocks : 0.07 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
5676k ( 0%)
shorten branches : 0.08 ( 0%) 0.01 ( 0%) 0.16 ( 0%)
113k ( 0%)
reg stack : 0.11 ( 0%) 0.00 ( 0%) 0.08 ( 0%)
200k ( 0%)
final : 0.27 ( 1%) 0.02 ( 1%) 0.20 ( 0%)
9063k ( 0%)
tree if-combine : 0.01 ( 0%) 0.00 ( 0%) 0.01 ( 0%)
267k ( 0%)
straight-line strength reduction : 0.06 ( 0%) 0.01 ( 0%) 0.07 ( 0%)
502k ( 0%)
tree loop if-conversion : 0.04 ( 0%) 0.00 ( 0%) 0.05 ( 0%)
10M ( 0%)
access analysis : 0.08 ( 0%) 0.03 ( 1%) 0.05 ( 0%)
2856 ( 0%)
rest of compilation : 0.32 ( 1%) 0.02 ( 1%) 0.32 ( 1%)
12M ( 0%)
remove unused locals : 0.03 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
0 ( 0%)
address taken : 0.02 ( 0%) 0.00 ( 0%) 0.03 ( 0%)
0 ( 0%)
repair loop structures : 0.00 ( 0%) 0.00 ( 0%) 0.02 ( 0%)
936 ( 0%)
TOTAL : 46.78 2.01 48.81
2591M
I can confirm the ICE with trunk. That's
/* We should not have to update virtual SSA form here but some
transforms involve creating new virtual definitions which makes
updating difficult.
We delay the actual update to the end of the pass but avoid
confusing ourselves by forcing need_ssa_update_p () to false. */
unsigned todo = 0;
if (need_ssa_update_p (cfun))
{
gcc_assert (loop_vinfo->any_known_not_updated_vssa);
and this is a new "feature", doing less SSA updates. It will eventually
get bisected to the commit introducing this.
Testcase reduction should be priority here.
^ permalink raw reply [flat|nested] 11+ messages in thread
* [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
` (5 preceding siblings ...)
2024-02-13 10:29 ` rguenth at gcc dot gnu.org
@ 2024-02-13 10:57 ` jakub at gcc dot gnu.org
2024-02-13 11:09 ` jakub at gcc dot gnu.org
` (2 subsequent siblings)
9 siblings, 0 replies; 11+ messages in thread
From: jakub at gcc dot gnu.org @ 2024-02-13 10:57 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
Jakub Jelinek <jakub at gcc dot gnu.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |jakub at gcc dot gnu.org
--- Comment #6 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
Created attachment 57408
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=57408&action=edit
slarith.i.xz
The testcase is insane.
I've hand reduced it a little bit, so that it ICEs within 2 seconds and will
reduce/bisect from there.
^ permalink raw reply [flat|nested] 11+ messages in thread
* [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
` (6 preceding siblings ...)
2024-02-13 10:57 ` jakub at gcc dot gnu.org
@ 2024-02-13 11:09 ` jakub at gcc dot gnu.org
2024-02-13 11:12 ` [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 since r14-8925 jakub at gcc dot gnu.org
2024-02-13 11:52 ` rguenth at gcc dot gnu.org
9 siblings, 0 replies; 11+ messages in thread
From: jakub at gcc dot gnu.org @ 2024-02-13 11:09 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
Jakub Jelinek <jakub at gcc dot gnu.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Keywords|needs-reduction |
--- Comment #7 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
Reduced testcase -O3 -msse4:
int a, b;
void foo (void);
int
bar (int j)
{
int i = b;
for (; j; j++)
{
if (a)
continue;
if (i)
return 1;
foo ();
}
return 42;
}
^ permalink raw reply [flat|nested] 11+ messages in thread
* [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 since r14-8925
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
` (7 preceding siblings ...)
2024-02-13 11:09 ` jakub at gcc dot gnu.org
@ 2024-02-13 11:12 ` jakub at gcc dot gnu.org
2024-02-13 11:52 ` rguenth at gcc dot gnu.org
9 siblings, 0 replies; 11+ messages in thread
From: jakub at gcc dot gnu.org @ 2024-02-13 11:12 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
Jakub Jelinek <jakub at gcc dot gnu.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Keywords|needs-bisection |
Priority|P3 |P1
Summary|[14 regression] Hang and |[14 regression] Hang and
|then ICE in |then ICE in
|vect_transform_loops, at |vect_transform_loops, at
|tree-vectorizer.cc:1031 |tree-vectorizer.cc:1031
|when building slang-2.3.3 |when building slang-2.3.3
| |since r14-8925
--- Comment #8 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
Started with r14-8925-g1e3f78dbb328a2f2db8def241372cb947d9cb7eb
^ permalink raw reply [flat|nested] 11+ messages in thread
* [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 since r14-8925
2024-02-13 9:03 [Bug tree-optimization/113900] New: [14 regression] Hang and then ICe in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 sjames at gcc dot gnu.org
` (8 preceding siblings ...)
2024-02-13 11:12 ` [Bug tree-optimization/113900] [14 regression] Hang and then ICE in vect_transform_loops, at tree-vectorizer.cc:1031 when building slang-2.3.3 since r14-8925 jakub at gcc dot gnu.org
@ 2024-02-13 11:52 ` rguenth at gcc dot gnu.org
9 siblings, 0 replies; 11+ messages in thread
From: rguenth at gcc dot gnu.org @ 2024-02-13 11:52 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113900
Richard Biener <rguenth at gcc dot gnu.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |RESOLVED
Resolution|--- |DUPLICATE
--- Comment #9 from Richard Biener <rguenth at gcc dot gnu.org> ---
It's an odd duplicate. I confirm the fix for PR113902 fixes both the original
and the reduced testcase.
*** This bug has been marked as a duplicate of bug 113902 ***
^ permalink raw reply [flat|nested] 11+ messages in thread