public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x.
@ 2010-08-26 18:33 jv244 at cam dot ac dot uk
2010-08-26 18:34 ` [Bug middle-end/45422] " jv244 at cam dot ac dot uk
` (24 more replies)
0 siblings, 25 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-26 18:33 UTC (permalink / raw)
To: gcc-bugs
compiling the to-be-attached testcase as:
gfortran -c -ftime-report -cpp -fbounds-check -g -O3 -ffast-math -funroll-loops
-ftree-vectorize -march=native -ffree-form test.f90
massively slowed down in 4.6
4.4: 51s
4.5: 50s
4.6: 434s
the timing report look like:
Execution times (seconds)
garbage collection : 15.35 ( 4%) usr 0.04 ( 2%) sys 15.35 ( 4%) wall
0 kB ( 0%) ggc
callgraph construction: 0.07 ( 0%) usr 0.01 ( 1%) sys 0.08 ( 0%) wall
5736 kB ( 0%) ggc
callgraph optimization: 0.41 ( 0%) usr 0.01 ( 0%) sys 0.34 ( 0%) wall
725 kB ( 0%) ggc
ipa cp : 0.11 ( 0%) usr 0.00 ( 0%) sys 0.11 ( 0%) wall
1347 kB ( 0%) ggc
ipa function splitting: 0.03 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
ipa reference : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
ipa profile : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
ipa pure const : 0.14 ( 0%) usr 0.01 ( 0%) sys 0.19 ( 0%) wall
0 kB ( 0%) ggc
cfg cleanup : 2.47 ( 1%) usr 0.00 ( 0%) sys 3.10 ( 1%) wall
4726 kB ( 0%) ggc
CFG verifier : 6.48 ( 1%) usr 0.02 ( 1%) sys 6.76 ( 2%) wall
0 kB ( 0%) ggc
trivially dead code : 0.94 ( 0%) usr 0.00 ( 0%) sys 0.76 ( 0%) wall
0 kB ( 0%) ggc
df multiple defs : 0.44 ( 0%) usr 0.01 ( 1%) sys 0.37 ( 0%) wall
0 kB ( 0%) ggc
df reaching defs : 2.38 ( 1%) usr 0.02 ( 1%) sys 2.41 ( 1%) wall
0 kB ( 0%) ggc
df live regs : 12.32 ( 3%) usr 0.03 ( 2%) sys 12.19 ( 3%) wall
0 kB ( 0%) ggc
df live&initialized regs: 4.24 ( 1%) usr 0.01 ( 1%) sys 4.31 ( 1%) wall
0 kB ( 0%) ggc
df use-def / def-use chains: 1.72 ( 0%) usr 0.01 ( 1%) sys 1.54 ( 0%)
wall 0 kB ( 0%) ggc
df live reg subwords : 0.35 ( 0%) usr 0.00 ( 0%) sys 0.32 ( 0%) wall
0 kB ( 0%) ggc
df reg dead/unused notes: 6.24 ( 1%) usr 0.01 ( 1%) sys 6.17 ( 1%) wall
8317 kB ( 0%) ggc
register information : 2.34 ( 1%) usr 0.00 ( 0%) sys 2.39 ( 1%) wall
0 kB ( 0%) ggc
alias analysis : 2.25 ( 1%) usr 0.02 ( 1%) sys 2.22 ( 1%) wall
47018 kB ( 3%) ggc
alias stmt walking : 0.63 ( 0%) usr 0.03 ( 2%) sys 0.83 ( 0%) wall
6938 kB ( 0%) ggc
register scan : 0.37 ( 0%) usr 0.00 ( 0%) sys 0.33 ( 0%) wall
298 kB ( 0%) ggc
rebuild jump labels : 0.70 ( 0%) usr 0.00 ( 0%) sys 0.67 ( 0%) wall
0 kB ( 0%) ggc
parser : 0.94 ( 0%) usr 0.11 ( 6%) sys 1.06 ( 0%) wall
55368 kB ( 3%) ggc
inline heuristics : 0.23 ( 0%) usr 0.00 ( 0%) sys 0.26 ( 0%) wall
0 kB ( 0%) ggc
tree gimplify : 0.64 ( 0%) usr 0.01 ( 0%) sys 0.53 ( 0%) wall
48405 kB ( 3%) ggc
tree eh : 0.02 ( 0%) usr 0.01 ( 1%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree CFG construction : 0.05 ( 0%) usr 0.00 ( 0%) sys 0.08 ( 0%) wall
11974 kB ( 1%) ggc
tree CFG cleanup : 1.10 ( 0%) usr 0.00 ( 0%) sys 1.34 ( 0%) wall
3531 kB ( 0%) ggc
tree VRP : 2.91 ( 1%) usr 0.04 ( 2%) sys 2.76 ( 1%) wall
67348 kB ( 4%) ggc
tree copy propagation : 0.21 ( 0%) usr 0.03 ( 2%) sys 0.25 ( 0%) wall
1384 kB ( 0%) ggc
tree find ref. vars : 0.11 ( 0%) usr 0.01 ( 1%) sys 0.06 ( 0%) wall
3806 kB ( 0%) ggc
tree PTA : 0.36 ( 0%) usr 0.01 ( 1%) sys 0.42 ( 0%) wall
5199 kB ( 0%) ggc
tree PHI insertion : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
3194 kB ( 0%) ggc
tree SSA rewrite : 0.57 ( 0%) usr 0.00 ( 0%) sys 0.55 ( 0%) wall
14011 kB ( 1%) ggc
tree SSA other : 0.10 ( 0%) usr 0.02 ( 1%) sys 0.13 ( 0%) wall
432 kB ( 0%) ggc
tree SSA incremental : 1.38 ( 0%) usr 0.15 ( 8%) sys 1.62 ( 0%) wall
7441 kB ( 0%) ggc
tree operand scan : 0.57 ( 0%) usr 0.36 (18%) sys 0.84 ( 0%) wall
58289 kB ( 3%) ggc
dominator optimization: 0.56 ( 0%) usr 0.02 ( 1%) sys 0.65 ( 0%) wall
8527 kB ( 0%) ggc
tree SRA : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
tree CCP : 1.14 ( 0%) usr 0.02 ( 1%) sys 1.15 ( 0%) wall
4845 kB ( 0%) ggc
tree PHI const/copy prop: 0.03 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
106 kB ( 0%) ggc
tree split crit edges : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
2014 kB ( 0%) ggc
tree reassociation : 0.24 ( 0%) usr 0.02 ( 1%) sys 0.29 ( 0%) wall
6030 kB ( 0%) ggc
tree PRE : 0.91 ( 0%) usr 0.02 ( 1%) sys 1.08 ( 0%) wall
7164 kB ( 0%) ggc
tree FRE : 0.53 ( 0%) usr 0.01 ( 1%) sys 0.52 ( 0%) wall
5411 kB ( 0%) ggc
tree code sinking : 0.05 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall
1311 kB ( 0%) ggc
tree linearize phis : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree forward propagate: 0.27 ( 0%) usr 0.00 ( 0%) sys 0.32 ( 0%) wall
11820 kB ( 1%) ggc
tree phiprop : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree conservative DCE : 0.17 ( 0%) usr 0.01 ( 1%) sys 0.18 ( 0%) wall
576 kB ( 0%) ggc
tree aggressive DCE : 1.04 ( 0%) usr 0.03 ( 2%) sys 1.10 ( 0%) wall
25495 kB ( 1%) ggc
tree buildin call DCE : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree DSE : 0.28 ( 0%) usr 0.00 ( 0%) sys 0.36 ( 0%) wall
260 kB ( 0%) ggc
PHI merge : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree loop bounds : 0.23 ( 0%) usr 0.00 ( 0%) sys 0.22 ( 0%) wall
6686 kB ( 0%) ggc
tree loop invariant motion: 0.06 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%)
wall 76 kB ( 0%) ggc
tree canonical iv : 0.09 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall
3421 kB ( 0%) ggc
scev constant prop : 0.06 ( 0%) usr 0.00 ( 0%) sys 0.07 ( 0%) wall
2302 kB ( 0%) ggc
tree loop unswitching : 0.01 ( 0%) usr 0.01 ( 1%) sys 0.03 ( 0%) wall
739 kB ( 0%) ggc
complete unrolling : 1.59 ( 0%) usr 0.12 ( 6%) sys 1.48 ( 0%) wall
101520 kB ( 6%) ggc
tree vectorization : 0.38 ( 0%) usr 0.00 ( 0%) sys 0.39 ( 0%) wall
20119 kB ( 1%) ggc
tree slp vectorization: 0.99 ( 0%) usr 0.00 ( 0%) sys 0.80 ( 0%) wall
52747 kB ( 3%) ggc
tree loop distribution: 0.01 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree prefetching : 3.37 ( 1%) usr 0.07 ( 3%) sys 3.36 ( 1%) wall
90902 kB ( 5%) ggc
tree iv optimization : 193.28 (44%) usr 0.05 ( 2%) sys 193.22 (44%) wall
322316 kB (18%) ggc
predictive commoning : 0.06 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall
1752 kB ( 0%) ggc
tree loop init : 0.07 ( 0%) usr 0.01 ( 1%) sys 0.04 ( 0%) wall
1307 kB ( 0%) ggc
tree loop fini : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree copy headers : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall
1658 kB ( 0%) ggc
tree SSA uncprop : 0.05 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree rename SSA copies: 0.05 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%) wall
0 kB ( 0%) ggc
tree SSA verifier : 7.77 ( 2%) usr 0.07 ( 3%) sys 8.45 ( 2%) wall
0 kB ( 0%) ggc
tree STMT verifier : 15.39 ( 4%) usr 0.00 ( 0%) sys 15.16 ( 3%) wall
0 kB ( 0%) ggc
tree switch initialization conversion: 0.00 ( 0%) usr 0.00 ( 0%) sys
0.01 ( 0%) wall 0 kB ( 0%) ggc
callgraph verifier : 1.12 ( 0%) usr 0.01 ( 1%) sys 1.12 ( 0%) wall
0 kB ( 0%) ggc
dominance frontiers : 0.35 ( 0%) usr 0.00 ( 0%) sys 0.36 ( 0%) wall
0 kB ( 0%) ggc
dominance computation : 0.90 ( 0%) usr 0.01 ( 1%) sys 0.82 ( 0%) wall
0 kB ( 0%) ggc
control dependences : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
out of ssa : 0.38 ( 0%) usr 0.00 ( 0%) sys 0.32 ( 0%) wall
225 kB ( 0%) ggc
expand vars : 0.12 ( 0%) usr 0.00 ( 0%) sys 0.11 ( 0%) wall
11174 kB ( 1%) ggc
expand : 16.44 ( 4%) usr 0.05 ( 2%) sys 16.36 ( 4%) wall
111440 kB ( 6%) ggc
post expand cleanups : 0.11 ( 0%) usr 0.00 ( 0%) sys 0.14 ( 0%) wall
5818 kB ( 0%) ggc
lower subreg : 0.18 ( 0%) usr 0.00 ( 0%) sys 0.24 ( 0%) wall
0 kB ( 0%) ggc
forward prop : 1.79 ( 0%) usr 0.00 ( 0%) sys 1.87 ( 0%) wall
15809 kB ( 1%) ggc
CSE : 1.59 ( 0%) usr 0.00 ( 0%) sys 1.66 ( 0%) wall
662 kB ( 0%) ggc
dead code elimination : 2.31 ( 1%) usr 0.00 ( 0%) sys 2.27 ( 1%) wall
0 kB ( 0%) ggc
dead store elim1 : 1.54 ( 0%) usr 0.01 ( 1%) sys 1.67 ( 0%) wall
23524 kB ( 1%) ggc
dead store elim2 : 2.41 ( 1%) usr 0.00 ( 0%) sys 2.49 ( 1%) wall
22835 kB ( 1%) ggc
loop analysis : 0.20 ( 0%) usr 0.00 ( 0%) sys 0.17 ( 0%) wall
2220 kB ( 0%) ggc
loop invariant motion : 0.28 ( 0%) usr 0.01 ( 0%) sys 0.22 ( 0%) wall
448 kB ( 0%) ggc
loop unswitching : 5.71 ( 1%) usr 0.01 ( 1%) sys 5.68 ( 1%) wall
218 kB ( 0%) ggc
loop unrolling : 30.36 ( 7%) usr 0.04 ( 2%) sys 30.75 ( 7%) wall
184992 kB (10%) ggc
CPROP : 2.56 ( 1%) usr 0.00 ( 0%) sys 2.62 ( 1%) wall
25399 kB ( 1%) ggc
PRE : 1.53 ( 0%) usr 0.00 ( 0%) sys 1.43 ( 0%) wall
1798 kB ( 0%) ggc
web : 2.76 ( 1%) usr 0.01 ( 1%) sys 2.57 ( 1%) wall
8429 kB ( 0%) ggc
CSE 2 : 2.39 ( 1%) usr 0.00 ( 0%) sys 2.41 ( 1%) wall
2123 kB ( 0%) ggc
branch prediction : 0.22 ( 0%) usr 0.01 ( 1%) sys 0.17 ( 0%) wall
6857 kB ( 0%) ggc
combiner : 4.33 ( 1%) usr 0.00 ( 0%) sys 4.57 ( 1%) wall
60529 kB ( 3%) ggc
if-conversion : 0.22 ( 0%) usr 0.00 ( 0%) sys 0.24 ( 0%) wall
2520 kB ( 0%) ggc
regmove : 0.88 ( 0%) usr 0.00 ( 0%) sys 0.90 ( 0%) wall
0 kB ( 0%) ggc
mode switching : 0.01 ( 0%) usr 0.01 ( 1%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
integrated RA : 16.20 ( 4%) usr 0.04 ( 2%) sys 16.28 ( 4%) wall
44292 kB ( 2%) ggc
reload : 7.72 ( 2%) usr 0.02 ( 1%) sys 7.47 ( 2%) wall
10065 kB ( 1%) ggc
reload CSE regs : 4.83 ( 1%) usr 0.00 ( 0%) sys 4.84 ( 1%) wall
36964 kB ( 2%) ggc
load CSE after reload : 0.50 ( 0%) usr 0.00 ( 0%) sys 0.52 ( 0%) wall
449 kB ( 0%) ggc
zee : 0.36 ( 0%) usr 0.00 ( 0%) sys 0.45 ( 0%) wall
45 kB ( 0%) ggc
thread pro- & epilogue: 0.15 ( 0%) usr 0.00 ( 0%) sys 0.09 ( 0%) wall
3988 kB ( 0%) ggc
if-conversion 2 : 0.12 ( 0%) usr 0.00 ( 0%) sys 0.19 ( 0%) wall
1056 kB ( 0%) ggc
combine stack adjustments: 0.08 ( 0%) usr 0.00 ( 0%) sys 0.18 ( 0%) wall
0 kB ( 0%) ggc
peephole 2 : 0.58 ( 0%) usr 0.00 ( 0%) sys 0.55 ( 0%) wall
2995 kB ( 0%) ggc
rename registers : 1.45 ( 0%) usr 0.00 ( 0%) sys 1.22 ( 0%) wall
2741 kB ( 0%) ggc
hard reg cprop : 1.27 ( 0%) usr 0.00 ( 0%) sys 1.26 ( 0%) wall
15 kB ( 0%) ggc
scheduling 2 : 6.64 ( 2%) usr 0.06 ( 3%) sys 6.71 ( 2%) wall
1284 kB ( 0%) ggc
machine dep reorg : 0.89 ( 0%) usr 0.00 ( 0%) sys 0.75 ( 0%) wall
77 kB ( 0%) ggc
reorder blocks : 0.90 ( 0%) usr 0.00 ( 0%) sys 0.76 ( 0%) wall
4788 kB ( 0%) ggc
final : 2.13 ( 0%) usr 0.12 ( 6%) sys 2.29 ( 1%) wall
9656 kB ( 1%) ggc
symout : 0.71 ( 0%) usr 0.11 ( 5%) sys 0.82 ( 0%) wall
58849 kB ( 3%) ggc
variable tracking : 3.12 ( 1%) usr 0.00 ( 0%) sys 3.06 ( 1%) wall
62059 kB ( 3%) ggc
var-tracking dataflow : 4.46 ( 1%) usr 0.01 ( 1%) sys 4.47 ( 1%) wall
0 kB ( 0%) ggc
var-tracking emit : 3.78 ( 1%) usr 0.01 ( 1%) sys 3.84 ( 1%) wall
19142 kB ( 1%) ggc
TOTAL : 434.63 2.00 436.64
1780330 kB
gfortran -v reports:
/data03/vondele/gcc_trunk/build/libexec/gcc/x86_64-unknown-linux-gnu/4.6.0/f951
test.f90 -cpp=/tmp/cceEHMzK.f90 -quiet -v test.f90 -march=k8-sse3 -mcx16 -msahf
--param l1-cache-size=64 --param l1-cache-line-size=64 --param
l2-cache-size=1024 -mtune=k8 -quiet -dumpbase test.f90 -auxbase test -g -O3
-version -ftime-report -fbounds-check -ffast-math -funroll-loops
-ftree-vectorize -ffree-form -fintrinsic-modules-path
/data03/vondele/gcc_trunk/build/lib/gcc/x86_64-unknown-linux-gnu/4.6.0/finclude
-o /tmp/ccQ56z0k.s
--
Summary: [4.6 Regression] compile time increases 8x.
Product: gcc
Version: 4.6.0
Status: UNCONFIRMED
Keywords: compile-time-hog
Severity: normal
Priority: P3
Component: middle-end
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: jv244 at cam dot ac dot uk
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
@ 2010-08-26 18:34 ` jv244 at cam dot ac dot uk
2010-08-27 10:23 ` rguenth at gcc dot gnu dot org
` (23 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-26 18:34 UTC (permalink / raw)
To: gcc-bugs
------- Comment #1 from jv244 at cam dot ac dot uk 2010-08-26 18:34 -------
Created an attachment (id=21573)
--> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=21573&action=view)
testcase
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
2010-08-26 18:34 ` [Bug middle-end/45422] " jv244 at cam dot ac dot uk
@ 2010-08-27 10:23 ` rguenth at gcc dot gnu dot org
2010-08-27 11:41 ` rguenth at gcc dot gnu dot org
` (22 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 10:23 UTC (permalink / raw)
To: gcc-bugs
------- Comment #2 from rguenth at gcc dot gnu dot org 2010-08-27 10:23 -------
Can also be seen here:
http://gcc.opensuse.org/SPEC/CFP/sb-barbella.suse.de-head-64-2006/times.html
and here:
http://gcc.opensuse.org/c++bench-frescobaldi/polyhedron/polyhedron-summary.txt-1-0.html
I'm looking at a profile.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|UNCONFIRMED |NEW
Ever Confirmed|0 |1
Last reconfirmed|0000-00-00 00:00:00 |2010-08-27 10:23:21
date| |
Target Milestone|--- |4.6.0
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
2010-08-26 18:34 ` [Bug middle-end/45422] " jv244 at cam dot ac dot uk
2010-08-27 10:23 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 11:41 ` rguenth at gcc dot gnu dot org
2010-08-27 11:46 ` jv244 at cam dot ac dot uk
` (21 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 11:41 UTC (permalink / raw)
To: gcc-bugs
------- Comment #3 from rguenth at gcc dot gnu dot org 2010-08-27 11:41 -------
Reproducable with -O3 alone, -fbounds-check makes it more visible. With plain
-O3:
tree iv optimization : 22.64 (33%) usr 0.18 (11%) sys 22.97 (32%) wall
113858 kB (16%) ggc
and with added -fbounds-check:
tree iv optimization : 184.70 (78%) usr 0.67 (31%) sys 186.67 (78%) wall
166066 kB (18%) ggc
this connection with bounds-checking makes it sound familiar.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (2 preceding siblings ...)
2010-08-27 11:41 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 11:46 ` jv244 at cam dot ac dot uk
2010-08-27 12:07 ` rguenth at gcc dot gnu dot org
` (20 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-27 11:46 UTC (permalink / raw)
To: gcc-bugs
------- Comment #4 from jv244 at cam dot ac dot uk 2010-08-27 11:45 -------
(In reply to comment #3)
>
> this connection with bounds-checking makes it sound familiar.
>
I had a similar bug open (and fixed) as PR43627
with a comment from you
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43627#c11
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (3 preceding siblings ...)
2010-08-27 11:46 ` jv244 at cam dot ac dot uk
@ 2010-08-27 12:07 ` rguenth at gcc dot gnu dot org
2010-08-27 12:16 ` rguenth at gcc dot gnu dot org
` (19 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 12:07 UTC (permalink / raw)
To: gcc-bugs
------- Comment #5 from rguenth at gcc dot gnu dot org 2010-08-27 12:06 -------
MODULE hfx_contract_block
INTEGER, PARAMETER :: dp=KIND(0.0D0)
CONTAINS
SUBROUTINE
block_9_2(mc_max,md_max,kbd,kbc,kad,kac,pbd,pbc,pad,pac,prim,scale)
INTEGER :: mc_max, md_max
REAL(KIND=dp) :: kbd(2*md_max), kbc(2*mc_max), kad(9*md_max), &
kac(9*mc_max), pbd(2*md_max), pbc(2*mc_max), pad(9*md_max), &
pac(9*mc_max), prim(9*2*mc_max*md_max), scale
INTEGER :: ma, mb, mc, md, p_index
REAL(KIND=dp) :: ks_bc, ks_bd, p_bc, p_bd, tmp
kbd(1:2*md_max) = 0.0_dp
kbc(1:2*mc_max) = 0.0_dp
kad(1:9*md_max) = 0.0_dp
kac(1:9*mc_max) = 0.0_dp
p_index = 0
DO md = 1,md_max
DO mc = 1,mc_max
DO mb = 1,2
ks_bd = 0.0_dp
ks_bc = 0.0_dp
p_bd = pbd((md-1)*2+mb)
p_bc = pbc((mc-1)*2+mb)
DO ma = 1,9
p_index=p_index+1
tmp = scale*prim(p_index)
ks_bc = ks_bc + tmp* pad((md-1)*9+ma)
ks_bd = ks_bd + tmp* pac((mc-1)*9+ma)
kad((md-1)*9+ma) = kad((md-1)*9+ma)-tmp*p_bc
kac((mc-1)*9+ma) = kac((mc-1)*9+ma)-tmp*p_bd
END DO
kbd((md-1)*2+mb) = kbd((md-1)*2+mb) - ks_bd
kbc((mc-1)*2+mb) = kbc((mc-1)*2+mb) - ks_bc
END DO
END DO
END DO
END SUBROUTINE block_9_2
END MODULE
gfortran -S -O3 -ffree-form -fbounds-check t.f90 -ftime-report
tree iv optimization : 3.49 (94%) usr 0.00 ( 0%) sys 3.52 (94%) wall
2272 kB (36%) ggc
can someone try to track down revisions that slowed that testcase down?
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (4 preceding siblings ...)
2010-08-27 12:07 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 12:16 ` rguenth at gcc dot gnu dot org
2010-08-27 13:01 ` rguenth at gcc dot gnu dot org
` (18 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 12:16 UTC (permalink / raw)
To: gcc-bugs
------- Comment #6 from rguenth at gcc dot gnu dot org 2010-08-27 12:16 -------
Without -fbounds-check we have around 650 million calls to
iv_ca_get_num_inv_exprs ... called from iv_ca_recount_cost. With
-fbounds-check
this balloons to nearly 5 billion calls to that function.
David - this sounds like your changes, can you have a look to possibly reduce
this number of calls?
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |xinliangli at gmail dot com
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (5 preceding siblings ...)
2010-08-27 12:16 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 13:01 ` rguenth at gcc dot gnu dot org
2010-08-27 15:52 ` hjl dot tools at gmail dot com
` (17 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 13:01 UTC (permalink / raw)
To: gcc-bugs
------- Comment #7 from rguenth at gcc dot gnu dot org 2010-08-27 13:01 -------
The few remaining bound checks do not look simple to remove (if at all).
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (6 preceding siblings ...)
2010-08-27 13:01 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 15:52 ` hjl dot tools at gmail dot com
2010-08-27 17:01 ` davidxl at gcc dot gnu dot org
` (16 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: hjl dot tools at gmail dot com @ 2010-08-27 15:52 UTC (permalink / raw)
To: gcc-bugs
------- Comment #8 from hjl dot tools at gmail dot com 2010-08-27 15:52 -------
It is caused by revision 162653:
http://gcc.gnu.org/ml/gcc-cvs/2010-07/msg01007.html
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (7 preceding siblings ...)
2010-08-27 15:52 ` hjl dot tools at gmail dot com
@ 2010-08-27 17:01 ` davidxl at gcc dot gnu dot org
2010-08-28 6:00 ` davidxl at gcc dot gnu dot org
` (15 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-27 17:01 UTC (permalink / raw)
To: gcc-bugs
------- Comment #9 from davidxl at gcc dot gnu dot org 2010-08-27 17:01 -------
Will take a look
--
davidxl at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
AssignedTo|unassigned at gcc dot gnu |davidxl at gcc dot gnu dot
|dot org |org
Status|NEW |ASSIGNED
Last reconfirmed|2010-08-27 10:23:21 |2010-08-27 17:01:01
date| |
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (8 preceding siblings ...)
2010-08-27 17:01 ` davidxl at gcc dot gnu dot org
@ 2010-08-28 6:00 ` davidxl at gcc dot gnu dot org
2010-08-29 5:09 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 5x jv244 at cam dot ac dot uk
` (14 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-28 6:00 UTC (permalink / raw)
To: gcc-bugs
------- Comment #10 from davidxl at gcc dot gnu dot org 2010-08-28 06:00 -------
fixed in r163610.
--
davidxl at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|ASSIGNED |RESOLVED
Resolution| |FIXED
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (9 preceding siblings ...)
2010-08-28 6:00 ` davidxl at gcc dot gnu dot org
@ 2010-08-29 5:09 ` jv244 at cam dot ac dot uk
2010-08-29 5:13 ` pinskia at gcc dot gnu dot org
` (13 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29 5:09 UTC (permalink / raw)
To: gcc-bugs
------- Comment #11 from jv244 at cam dot ac dot uk 2010-08-29 05:09 -------
After David's patch (thanks!), the testcase requires 240s, that's still a 5x
slowdown. I paste the new timing profile below, and reopen the bug. There is no
obvious candidate for the slowdown.
> gfortran -c -ftime-report -cpp -fbounds-check -g -O3 -ffast-math -funroll-loops -ftree-vectorize -march=native -ffree-form test.f90
Execution times (seconds)
garbage collection : 12.55 ( 5%) usr 0.03 ( 2%) sys 12.57 ( 5%) wall
0 kB ( 0%) ggc
callgraph construction: 0.08 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%) wall
5736 kB ( 0%) ggc
callgraph optimization: 0.40 ( 0%) usr 0.02 ( 1%) sys 0.41 ( 0%) wall
725 kB ( 0%) ggc
ipa cp : 0.07 ( 0%) usr 0.00 ( 0%) sys 0.07 ( 0%) wall
1347 kB ( 0%) ggc
ipa function splitting: 0.01 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
ipa reference : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
ipa profile : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
ipa pure const : 0.07 ( 0%) usr 0.01 ( 1%) sys 0.15 ( 0%) wall
0 kB ( 0%) ggc
cfg cleanup : 2.28 ( 1%) usr 0.00 ( 0%) sys 2.35 ( 1%) wall
4726 kB ( 0%) ggc
CFG verifier : 5.54 ( 2%) usr 0.03 ( 2%) sys 5.73 ( 2%) wall
0 kB ( 0%) ggc
trivially dead code : 0.67 ( 0%) usr 0.00 ( 0%) sys 0.65 ( 0%) wall
0 kB ( 0%) ggc
df multiple defs : 0.23 ( 0%) usr 0.00 ( 0%) sys 0.28 ( 0%) wall
0 kB ( 0%) ggc
df reaching defs : 2.00 ( 1%) usr 0.00 ( 0%) sys 2.12 ( 1%) wall
0 kB ( 0%) ggc
df live regs : 9.80 ( 4%) usr 0.01 ( 1%) sys 10.18 ( 4%) wall
0 kB ( 0%) ggc
df live&initialized regs: 3.62 ( 1%) usr 0.00 ( 0%) sys 3.08 ( 1%) wall
0 kB ( 0%) ggc
df use-def / def-use chains: 1.22 ( 0%) usr 0.00 ( 0%) sys 1.26 ( 1%)
wall 0 kB ( 0%) ggc
df live reg subwords : 0.32 ( 0%) usr 0.00 ( 0%) sys 0.27 ( 0%) wall
0 kB ( 0%) ggc
df reg dead/unused notes: 4.67 ( 2%) usr 0.00 ( 0%) sys 4.44 ( 2%) wall
8317 kB ( 0%) ggc
register information : 2.10 ( 1%) usr 0.00 ( 0%) sys 1.97 ( 1%) wall
0 kB ( 0%) ggc
alias analysis : 1.73 ( 1%) usr 0.00 ( 0%) sys 1.87 ( 1%) wall
47018 kB ( 3%) ggc
alias stmt walking : 0.61 ( 0%) usr 0.07 ( 4%) sys 0.61 ( 0%) wall
6938 kB ( 0%) ggc
register scan : 0.32 ( 0%) usr 0.00 ( 0%) sys 0.32 ( 0%) wall
202 kB ( 0%) ggc
rebuild jump labels : 0.72 ( 0%) usr 0.00 ( 0%) sys 0.67 ( 0%) wall
0 kB ( 0%) ggc
parser : 0.90 ( 0%) usr 0.09 ( 5%) sys 0.99 ( 0%) wall
55368 kB ( 3%) ggc
inline heuristics : 0.17 ( 0%) usr 0.01 ( 1%) sys 0.26 ( 0%) wall
0 kB ( 0%) ggc
tree gimplify : 0.51 ( 0%) usr 0.01 ( 1%) sys 0.57 ( 0%) wall
48405 kB ( 3%) ggc
tree eh : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree CFG construction : 0.02 ( 0%) usr 0.01 ( 1%) sys 0.03 ( 0%) wall
11974 kB ( 1%) ggc
tree CFG cleanup : 1.30 ( 1%) usr 0.02 ( 1%) sys 1.21 ( 0%) wall
3530 kB ( 0%) ggc
tree VRP : 2.50 ( 1%) usr 0.03 ( 2%) sys 2.44 ( 1%) wall
67364 kB ( 4%) ggc
tree copy propagation : 0.16 ( 0%) usr 0.05 ( 3%) sys 0.15 ( 0%) wall
1384 kB ( 0%) ggc
tree find ref. vars : 0.05 ( 0%) usr 0.01 ( 1%) sys 0.05 ( 0%) wall
3806 kB ( 0%) ggc
tree PTA : 0.34 ( 0%) usr 0.00 ( 0%) sys 0.33 ( 0%) wall
5198 kB ( 0%) ggc
tree PHI insertion : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
3194 kB ( 0%) ggc
tree SSA rewrite : 0.39 ( 0%) usr 0.00 ( 0%) sys 0.35 ( 0%) wall
14011 kB ( 1%) ggc
tree SSA other : 0.10 ( 0%) usr 0.04 ( 2%) sys 0.10 ( 0%) wall
432 kB ( 0%) ggc
tree SSA incremental : 1.18 ( 0%) usr 0.14 ( 8%) sys 1.44 ( 1%) wall
7441 kB ( 0%) ggc
tree operand scan : 0.47 ( 0%) usr 0.33 (19%) sys 0.78 ( 0%) wall
58289 kB ( 3%) ggc
dominator optimization: 0.52 ( 0%) usr 0.00 ( 0%) sys 0.61 ( 0%) wall
8527 kB ( 0%) ggc
tree SRA : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree CCP : 1.05 ( 0%) usr 0.05 ( 3%) sys 1.28 ( 1%) wall
4845 kB ( 0%) ggc
tree PHI const/copy prop: 0.01 ( 0%) usr 0.00 ( 0%) sys 0.05 ( 0%) wall
106 kB ( 0%) ggc
tree split crit edges : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
2014 kB ( 0%) ggc
tree reassociation : 0.27 ( 0%) usr 0.03 ( 2%) sys 0.27 ( 0%) wall
6030 kB ( 0%) ggc
tree PRE : 0.85 ( 0%) usr 0.00 ( 0%) sys 0.89 ( 0%) wall
7164 kB ( 0%) ggc
tree FRE : 0.47 ( 0%) usr 0.02 ( 1%) sys 0.56 ( 0%) wall
5411 kB ( 0%) ggc
tree code sinking : 0.11 ( 0%) usr 0.02 ( 1%) sys 0.03 ( 0%) wall
1311 kB ( 0%) ggc
tree linearize phis : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
tree forward propagate: 0.22 ( 0%) usr 0.02 ( 1%) sys 0.26 ( 0%) wall
11820 kB ( 1%) ggc
tree phiprop : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree conservative DCE : 0.11 ( 0%) usr 0.01 ( 1%) sys 0.04 ( 0%) wall
576 kB ( 0%) ggc
tree aggressive DCE : 0.84 ( 0%) usr 0.01 ( 1%) sys 0.92 ( 0%) wall
25495 kB ( 1%) ggc
tree buildin call DCE : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree DSE : 0.26 ( 0%) usr 0.00 ( 0%) sys 0.19 ( 0%) wall
260 kB ( 0%) ggc
tree loop bounds : 0.20 ( 0%) usr 0.00 ( 0%) sys 0.27 ( 0%) wall
6686 kB ( 0%) ggc
tree loop invariant motion: 0.07 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%)
wall 76 kB ( 0%) ggc
tree canonical iv : 0.11 ( 0%) usr 0.00 ( 0%) sys 0.09 ( 0%) wall
3421 kB ( 0%) ggc
scev constant prop : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.11 ( 0%) wall
2302 kB ( 0%) ggc
tree loop unswitching : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall
739 kB ( 0%) ggc
complete unrolling : 1.60 ( 1%) usr 0.12 ( 7%) sys 1.40 ( 1%) wall
101520 kB ( 6%) ggc
tree vectorization : 0.31 ( 0%) usr 0.02 ( 1%) sys 0.27 ( 0%) wall
20116 kB ( 1%) ggc
tree slp vectorization: 0.92 ( 0%) usr 0.00 ( 0%) sys 0.90 ( 0%) wall
52747 kB ( 3%) ggc
tree loop distribution: 0.07 ( 0%) usr 0.00 ( 0%) sys 0.07 ( 0%) wall
0 kB ( 0%) ggc
tree prefetching : 3.09 ( 1%) usr 0.06 ( 3%) sys 3.07 ( 1%) wall
90905 kB ( 5%) ggc
tree iv optimization : 32.77 (13%) usr 0.03 ( 2%) sys 32.96 (13%) wall
322284 kB (18%) ggc
predictive commoning : 0.08 ( 0%) usr 0.00 ( 0%) sys 0.09 ( 0%) wall
1752 kB ( 0%) ggc
tree loop init : 0.05 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%) wall
1307 kB ( 0%) ggc
tree loop fini : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
tree copy headers : 0.10 ( 0%) usr 0.00 ( 0%) sys 0.14 ( 0%) wall
1658 kB ( 0%) ggc
tree SSA uncprop : 0.04 ( 0%) usr 0.01 ( 1%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree rename SSA copies: 0.03 ( 0%) usr 0.00 ( 0%) sys 0.09 ( 0%) wall
0 kB ( 0%) ggc
tree SSA verifier : 7.34 ( 3%) usr 0.02 ( 1%) sys 7.29 ( 3%) wall
0 kB ( 0%) ggc
tree STMT verifier : 15.08 ( 6%) usr 0.00 ( 0%) sys 15.11 ( 6%) wall
0 kB ( 0%) ggc
callgraph verifier : 0.85 ( 0%) usr 0.00 ( 0%) sys 0.87 ( 0%) wall
0 kB ( 0%) ggc
dominance frontiers : 0.26 ( 0%) usr 0.00 ( 0%) sys 0.32 ( 0%) wall
0 kB ( 0%) ggc
dominance computation : 0.86 ( 0%) usr 0.00 ( 0%) sys 0.89 ( 0%) wall
0 kB ( 0%) ggc
control dependences : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
out of ssa : 0.33 ( 0%) usr 0.00 ( 0%) sys 0.35 ( 0%) wall
225 kB ( 0%) ggc
expand vars : 0.16 ( 0%) usr 0.00 ( 0%) sys 0.09 ( 0%) wall
11294 kB ( 1%) ggc
expand : 14.67 ( 6%) usr 0.04 ( 2%) sys 13.89 ( 6%) wall
111424 kB ( 6%) ggc
post expand cleanups : 0.06 ( 0%) usr 0.00 ( 0%) sys 0.05 ( 0%) wall
5818 kB ( 0%) ggc
lower subreg : 0.19 ( 0%) usr 0.00 ( 0%) sys 0.13 ( 0%) wall
0 kB ( 0%) ggc
forward prop : 1.58 ( 1%) usr 0.00 ( 0%) sys 1.45 ( 1%) wall
15809 kB ( 1%) ggc
CSE : 1.60 ( 1%) usr 0.00 ( 0%) sys 1.73 ( 1%) wall
662 kB ( 0%) ggc
dead code elimination : 1.72 ( 1%) usr 0.00 ( 0%) sys 1.77 ( 1%) wall
0 kB ( 0%) ggc
dead store elim1 : 1.36 ( 1%) usr 0.01 ( 1%) sys 1.29 ( 1%) wall
23524 kB ( 1%) ggc
dead store elim2 : 2.01 ( 1%) usr 0.00 ( 0%) sys 2.10 ( 1%) wall
22835 kB ( 1%) ggc
loop analysis : 0.14 ( 0%) usr 0.00 ( 0%) sys 0.14 ( 0%) wall
2220 kB ( 0%) ggc
loop invariant motion : 0.24 ( 0%) usr 0.00 ( 0%) sys 0.25 ( 0%) wall
448 kB ( 0%) ggc
loop unswitching : 5.19 ( 2%) usr 0.01 ( 1%) sys 5.40 ( 2%) wall
218 kB ( 0%) ggc
loop unrolling : 26.07 (11%) usr 0.02 ( 1%) sys 25.98 (11%) wall
184992 kB (10%) ggc
CPROP : 2.20 ( 1%) usr 0.00 ( 0%) sys 2.48 ( 1%) wall
25399 kB ( 1%) ggc
PRE : 1.33 ( 1%) usr 0.00 ( 0%) sys 1.25 ( 1%) wall
1798 kB ( 0%) ggc
web : 2.26 ( 1%) usr 0.00 ( 0%) sys 2.29 ( 1%) wall
8429 kB ( 0%) ggc
CSE 2 : 2.07 ( 1%) usr 0.01 ( 1%) sys 2.30 ( 1%) wall
2123 kB ( 0%) ggc
branch prediction : 0.21 ( 0%) usr 0.00 ( 0%) sys 0.19 ( 0%) wall
6857 kB ( 0%) ggc
combiner : 4.11 ( 2%) usr 0.00 ( 0%) sys 4.21 ( 2%) wall
60529 kB ( 3%) ggc
if-conversion : 0.20 ( 0%) usr 0.00 ( 0%) sys 0.28 ( 0%) wall
2520 kB ( 0%) ggc
regmove : 0.63 ( 0%) usr 0.00 ( 0%) sys 0.80 ( 0%) wall
0 kB ( 0%) ggc
integrated RA : 14.33 ( 6%) usr 0.05 ( 3%) sys 14.22 ( 6%) wall
44292 kB ( 2%) ggc
reload : 6.75 ( 3%) usr 0.00 ( 0%) sys 6.74 ( 3%) wall
10065 kB ( 1%) ggc
reload CSE regs : 4.55 ( 2%) usr 0.01 ( 1%) sys 4.67 ( 2%) wall
36964 kB ( 2%) ggc
load CSE after reload : 0.36 ( 0%) usr 0.01 ( 1%) sys 0.45 ( 0%) wall
449 kB ( 0%) ggc
zee : 0.35 ( 0%) usr 0.00 ( 0%) sys 0.35 ( 0%) wall
45 kB ( 0%) ggc
thread pro- & epilogue: 0.11 ( 0%) usr 0.00 ( 0%) sys 0.11 ( 0%) wall
3988 kB ( 0%) ggc
if-conversion 2 : 0.11 ( 0%) usr 0.00 ( 0%) sys 0.10 ( 0%) wall
1056 kB ( 0%) ggc
combine stack adjustments: 0.15 ( 0%) usr 0.00 ( 0%) sys 0.17 ( 0%) wall
0 kB ( 0%) ggc
peephole 2 : 0.41 ( 0%) usr 0.00 ( 0%) sys 0.40 ( 0%) wall
2995 kB ( 0%) ggc
rename registers : 1.23 ( 1%) usr 0.00 ( 0%) sys 1.31 ( 1%) wall
2741 kB ( 0%) ggc
hard reg cprop : 1.23 ( 1%) usr 0.02 ( 1%) sys 1.11 ( 0%) wall
15 kB ( 0%) ggc
scheduling 2 : 6.25 ( 3%) usr 0.04 ( 2%) sys 6.24 ( 3%) wall
1284 kB ( 0%) ggc
machine dep reorg : 0.82 ( 0%) usr 0.00 ( 0%) sys 0.89 ( 0%) wall
77 kB ( 0%) ggc
reorder blocks : 0.68 ( 0%) usr 0.00 ( 0%) sys 0.73 ( 0%) wall
4788 kB ( 0%) ggc
final : 1.86 ( 1%) usr 0.08 ( 5%) sys 2.10 ( 1%) wall
9656 kB ( 1%) ggc
symout : 0.65 ( 0%) usr 0.06 ( 3%) sys 0.69 ( 0%) wall
58849 kB ( 3%) ggc
variable tracking : 2.74 ( 1%) usr 0.00 ( 0%) sys 2.83 ( 1%) wall
62059 kB ( 3%) ggc
var-tracking dataflow : 4.21 ( 2%) usr 0.01 ( 1%) sys 4.24 ( 2%) wall
0 kB ( 0%) ggc
var-tracking emit : 3.79 ( 2%) usr 0.01 ( 1%) sys 3.58 ( 1%) wall
19142 kB ( 1%) ggc
TOTAL : 244.77 1.72 246.49
1780321 kB
Extra diagnostic checks enabled; compiler may run slowly.
--
jv244 at cam dot ac dot uk changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|RESOLVED |REOPENED
Resolution|FIXED |
Summary|[4.6 Regression] compile |[4.6 Regression] compile
|time increases 8x. |time increases 5x.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (10 preceding siblings ...)
2010-08-29 5:09 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 5x jv244 at cam dot ac dot uk
@ 2010-08-29 5:13 ` pinskia at gcc dot gnu dot org
2010-08-29 5:21 ` jv244 at cam dot ac dot uk
` (12 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2010-08-29 5:13 UTC (permalink / raw)
To: gcc-bugs
------- Comment #12 from pinskia at gcc dot gnu dot org 2010-08-29 05:13 -------
>Extra diagnostic checks enabled; compiler may run slowly.
Make sure you configure the trunk with --enable-checking=release to get the
same timing results as what a release would be.
--
pinskia at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|REOPENED |WAITING
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (11 preceding siblings ...)
2010-08-29 5:13 ` pinskia at gcc dot gnu dot org
@ 2010-08-29 5:21 ` jv244 at cam dot ac dot uk
2010-08-29 5:24 ` pinskia at gcc dot gnu dot org
` (11 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29 5:21 UTC (permalink / raw)
To: gcc-bugs
------- Comment #13 from jv244 at cam dot ac dot uk 2010-08-29 05:20 -------
(In reply to comment #12)
> >Extra diagnostic checks enabled; compiler may run slowly.
>
> Make sure you configure the trunk with --enable-checking=release to get the
> same timing results as what a release would be.
>
The comparison is actually against the branches, not releases. However, I'm
rebuilding gcc and will report back.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (12 preceding siblings ...)
2010-08-29 5:21 ` jv244 at cam dot ac dot uk
@ 2010-08-29 5:24 ` pinskia at gcc dot gnu dot org
2010-08-29 5:31 ` jv244 at cam dot ac dot uk
` (10 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2010-08-29 5:24 UTC (permalink / raw)
To: gcc-bugs
------- Comment #14 from pinskia at gcc dot gnu dot org 2010-08-29 05:23 -------
(In reply to comment #12)
> >Extra diagnostic checks enabled; compiler may run slowly.
>
> Make sure you configure the trunk with --enable-checking=release to get the
> same timing results as what a release would be.
s/release/release branch/ :).
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (13 preceding siblings ...)
2010-08-29 5:24 ` pinskia at gcc dot gnu dot org
@ 2010-08-29 5:31 ` jv244 at cam dot ac dot uk
2010-08-29 6:38 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 3x jv244 at cam dot ac dot uk
` (9 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29 5:31 UTC (permalink / raw)
To: gcc-bugs
------- Comment #15 from jv244 at cam dot ac dot uk 2010-08-29 05:31 -------
Similar times (a bit faster) with release checking:
Execution times (seconds)
garbage collection : 1.17 ( 1%) usr 0.00 ( 0%) sys 1.18 ( 1%) wall
0 kB ( 0%) ggc
callgraph construction: 0.04 ( 0%) usr 0.01 ( 1%) sys 0.04 ( 0%) wall
5670 kB ( 0%) ggc
callgraph optimization: 0.32 ( 0%) usr 0.00 ( 0%) sys 0.25 ( 0%) wall
599 kB ( 0%) ggc
ipa cp : 0.07 ( 0%) usr 0.00 ( 0%) sys 0.09 ( 0%) wall
1345 kB ( 0%) ggc
ipa function splitting: 0.02 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall
0 kB ( 0%) ggc
ipa reference : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
ipa pure const : 0.11 ( 0%) usr 0.02 ( 1%) sys 0.14 ( 0%) wall
0 kB ( 0%) ggc
cfg cleanup : 2.67 ( 2%) usr 0.02 ( 1%) sys 2.59 ( 2%) wall
4726 kB ( 0%) ggc
trivially dead code : 0.74 ( 0%) usr 0.00 ( 0%) sys 0.72 ( 0%) wall
0 kB ( 0%) ggc
df multiple defs : 0.48 ( 0%) usr 0.01 ( 1%) sys 0.35 ( 0%) wall
0 kB ( 0%) ggc
df reaching defs : 1.73 ( 1%) usr 0.00 ( 0%) sys 2.12 ( 1%) wall
0 kB ( 0%) ggc
df live regs : 10.78 ( 7%) usr 0.01 ( 1%) sys 11.16 ( 7%) wall
0 kB ( 0%) ggc
df live&initialized regs: 3.60 ( 2%) usr 0.00 ( 0%) sys 3.87 ( 2%) wall
0 kB ( 0%) ggc
df use-def / def-use chains: 1.52 ( 1%) usr 0.00 ( 0%) sys 1.18 ( 1%)
wall 0 kB ( 0%) ggc
df live reg subwords : 0.33 ( 0%) usr 0.00 ( 0%) sys 0.34 ( 0%) wall
0 kB ( 0%) ggc
df reg dead/unused notes: 5.27 ( 3%) usr 0.00 ( 0%) sys 5.42 ( 3%) wall
7568 kB ( 0%) ggc
register information : 2.24 ( 1%) usr 0.00 ( 0%) sys 2.19 ( 1%) wall
0 kB ( 0%) ggc
alias analysis : 2.33 ( 1%) usr 0.00 ( 0%) sys 2.30 ( 1%) wall
47018 kB ( 3%) ggc
alias stmt walking : 0.48 ( 0%) usr 0.05 ( 3%) sys 0.44 ( 0%) wall
6938 kB ( 0%) ggc
register scan : 0.22 ( 0%) usr 0.00 ( 0%) sys 0.37 ( 0%) wall
394 kB ( 0%) ggc
rebuild jump labels : 0.73 ( 0%) usr 0.00 ( 0%) sys 0.61 ( 0%) wall
0 kB ( 0%) ggc
parser : 0.85 ( 1%) usr 0.13 ( 7%) sys 0.98 ( 1%) wall
55365 kB ( 3%) ggc
inline heuristics : 0.24 ( 0%) usr 0.00 ( 0%) sys 0.19 ( 0%) wall
0 kB ( 0%) ggc
tree gimplify : 0.40 ( 0%) usr 0.06 ( 3%) sys 0.47 ( 0%) wall
48405 kB ( 3%) ggc
tree eh : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
tree CFG construction : 0.03 ( 0%) usr 0.02 ( 1%) sys 0.08 ( 0%) wall
11971 kB ( 1%) ggc
tree CFG cleanup : 1.02 ( 1%) usr 0.03 ( 2%) sys 1.14 ( 1%) wall
3522 kB ( 0%) ggc
tree VRP : 2.25 ( 1%) usr 0.05 ( 3%) sys 2.18 ( 1%) wall
67051 kB ( 4%) ggc
tree copy propagation : 0.24 ( 0%) usr 0.00 ( 0%) sys 0.16 ( 0%) wall
1384 kB ( 0%) ggc
tree find ref. vars : 0.09 ( 0%) usr 0.00 ( 0%) sys 0.07 ( 0%) wall
3806 kB ( 0%) ggc
tree PTA : 0.36 ( 0%) usr 0.00 ( 0%) sys 0.26 ( 0%) wall
5193 kB ( 0%) ggc
tree PHI insertion : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall
3194 kB ( 0%) ggc
tree SSA rewrite : 0.40 ( 0%) usr 0.02 ( 1%) sys 0.53 ( 0%) wall
14011 kB ( 1%) ggc
tree SSA other : 0.09 ( 0%) usr 0.01 ( 1%) sys 0.13 ( 0%) wall
428 kB ( 0%) ggc
tree SSA incremental : 1.40 ( 1%) usr 0.09 ( 5%) sys 1.50 ( 1%) wall
7431 kB ( 0%) ggc
tree operand scan : 0.45 ( 0%) usr 0.33 (18%) sys 0.82 ( 0%) wall
58289 kB ( 3%) ggc
dominator optimization: 0.41 ( 0%) usr 0.04 ( 2%) sys 0.60 ( 0%) wall
8526 kB ( 0%) ggc
tree SRA : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall
0 kB ( 0%) ggc
tree CCP : 1.05 ( 1%) usr 0.02 ( 1%) sys 1.16 ( 1%) wall
4845 kB ( 0%) ggc
tree PHI const/copy prop: 0.03 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
88 kB ( 0%) ggc
tree split crit edges : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
2014 kB ( 0%) ggc
tree reassociation : 0.25 ( 0%) usr 0.05 ( 3%) sys 0.23 ( 0%) wall
6023 kB ( 0%) ggc
tree PRE : 0.81 ( 0%) usr 0.00 ( 0%) sys 0.82 ( 0%) wall
7164 kB ( 0%) ggc
tree FRE : 0.43 ( 0%) usr 0.03 ( 2%) sys 0.51 ( 0%) wall
5410 kB ( 0%) ggc
tree code sinking : 0.07 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall
1311 kB ( 0%) ggc
tree linearize phis : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree forward propagate: 0.33 ( 0%) usr 0.00 ( 0%) sys 0.30 ( 0%) wall
11812 kB ( 1%) ggc
tree phiprop : 0.05 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree conservative DCE : 0.09 ( 0%) usr 0.01 ( 1%) sys 0.09 ( 0%) wall
575 kB ( 0%) ggc
tree aggressive DCE : 0.72 ( 0%) usr 0.00 ( 0%) sys 0.82 ( 0%) wall
25487 kB ( 1%) ggc
tree DSE : 0.23 ( 0%) usr 0.00 ( 0%) sys 0.18 ( 0%) wall
259 kB ( 0%) ggc
tree loop bounds : 0.14 ( 0%) usr 0.00 ( 0%) sys 0.22 ( 0%) wall
6651 kB ( 0%) ggc
tree loop invariant motion: 0.04 ( 0%) usr 0.01 ( 1%) sys 0.07 ( 0%)
wall 76 kB ( 0%) ggc
tree canonical iv : 0.07 ( 0%) usr 0.00 ( 0%) sys 0.13 ( 0%) wall
3418 kB ( 0%) ggc
scev constant prop : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
2301 kB ( 0%) ggc
tree loop unswitching : 0.00 ( 0%) usr 0.01 ( 1%) sys 0.02 ( 0%) wall
739 kB ( 0%) ggc
complete unrolling : 1.25 ( 1%) usr 0.15 ( 8%) sys 1.56 ( 1%) wall
101449 kB ( 6%) ggc
tree vectorization : 0.35 ( 0%) usr 0.01 ( 1%) sys 0.26 ( 0%) wall
20110 kB ( 1%) ggc
tree slp vectorization: 0.76 ( 0%) usr 0.00 ( 0%) sys 0.72 ( 0%) wall
52730 kB ( 3%) ggc
tree loop distribution: 0.10 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%) wall
0 kB ( 0%) ggc
tree prefetching : 1.64 ( 1%) usr 0.08 ( 4%) sys 1.39 ( 1%) wall
90832 kB ( 5%) ggc
tree iv optimization : 32.57 (20%) usr 0.10 ( 5%) sys 32.73 (20%) wall
322095 kB (18%) ggc
predictive commoning : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%) wall
1747 kB ( 0%) ggc
tree loop init : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
1306 kB ( 0%) ggc
tree loop fini : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree copy headers : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.05 ( 0%) wall
1652 kB ( 0%) ggc
tree SSA uncprop : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree rename SSA copies: 0.05 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall
0 kB ( 0%) ggc
dominance frontiers : 0.19 ( 0%) usr 0.01 ( 1%) sys 0.29 ( 0%) wall
0 kB ( 0%) ggc
dominance computation : 0.50 ( 0%) usr 0.01 ( 1%) sys 0.64 ( 0%) wall
0 kB ( 0%) ggc
control dependences : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
out of ssa : 0.28 ( 0%) usr 0.00 ( 0%) sys 0.28 ( 0%) wall
215 kB ( 0%) ggc
expand vars : 0.25 ( 0%) usr 0.00 ( 0%) sys 0.26 ( 0%) wall
11358 kB ( 1%) ggc
expand : 6.48 ( 4%) usr 0.04 ( 2%) sys 6.30 ( 4%) wall
111660 kB ( 6%) ggc
post expand cleanups : 0.14 ( 0%) usr 0.00 ( 0%) sys 0.12 ( 0%) wall
5818 kB ( 0%) ggc
lower subreg : 0.14 ( 0%) usr 0.00 ( 0%) sys 0.20 ( 0%) wall
0 kB ( 0%) ggc
jump : 0.00 ( 0%) usr 0.01 ( 1%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
forward prop : 1.67 ( 1%) usr 0.00 ( 0%) sys 1.68 ( 1%) wall
15809 kB ( 1%) ggc
CSE : 1.77 ( 1%) usr 0.01 ( 1%) sys 1.59 ( 1%) wall
661 kB ( 0%) ggc
dead code elimination : 2.19 ( 1%) usr 0.00 ( 0%) sys 1.99 ( 1%) wall
0 kB ( 0%) ggc
dead store elim1 : 1.42 ( 1%) usr 0.01 ( 1%) sys 1.34 ( 1%) wall
23523 kB ( 1%) ggc
dead store elim2 : 1.14 ( 1%) usr 0.00 ( 0%) sys 1.37 ( 1%) wall
22811 kB ( 1%) ggc
loop analysis : 0.12 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall
2220 kB ( 0%) ggc
loop invariant motion : 0.31 ( 0%) usr 0.00 ( 0%) sys 0.22 ( 0%) wall
448 kB ( 0%) ggc
loop unswitching : 0.12 ( 0%) usr 0.00 ( 0%) sys 0.15 ( 0%) wall
0 kB ( 0%) ggc
loop unrolling : 4.31 ( 3%) usr 0.03 ( 2%) sys 4.35 ( 3%) wall
184514 kB (10%) ggc
CPROP : 2.35 ( 1%) usr 0.00 ( 0%) sys 2.36 ( 1%) wall
25112 kB ( 1%) ggc
PRE : 1.42 ( 1%) usr 0.00 ( 0%) sys 1.34 ( 1%) wall
1627 kB ( 0%) ggc
web : 1.12 ( 1%) usr 0.00 ( 0%) sys 1.33 ( 1%) wall
8405 kB ( 0%) ggc
CSE 2 : 2.29 ( 1%) usr 0.00 ( 0%) sys 2.11 ( 1%) wall
2121 kB ( 0%) ggc
branch prediction : 0.20 ( 0%) usr 0.00 ( 0%) sys 0.17 ( 0%) wall
6854 kB ( 0%) ggc
combiner : 4.28 ( 3%) usr 0.01 ( 1%) sys 4.22 ( 3%) wall
60382 kB ( 3%) ggc
if-conversion : 0.13 ( 0%) usr 0.00 ( 0%) sys 0.20 ( 0%) wall
2520 kB ( 0%) ggc
regmove : 0.74 ( 0%) usr 0.00 ( 0%) sys 0.63 ( 0%) wall
0 kB ( 0%) ggc
integrated RA : 11.28 ( 7%) usr 0.03 ( 2%) sys 11.20 ( 7%) wall
44198 kB ( 2%) ggc
reload : 7.00 ( 4%) usr 0.03 ( 2%) sys 7.05 ( 4%) wall
10026 kB ( 1%) ggc
reload CSE regs : 4.45 ( 3%) usr 0.00 ( 0%) sys 4.48 ( 3%) wall
36964 kB ( 2%) ggc
load CSE after reload : 0.42 ( 0%) usr 0.01 ( 1%) sys 0.55 ( 0%) wall
449 kB ( 0%) ggc
zee : 0.41 ( 0%) usr 0.01 ( 1%) sys 0.36 ( 0%) wall
45 kB ( 0%) ggc
thread pro- & epilogue: 0.13 ( 0%) usr 0.00 ( 0%) sys 0.18 ( 0%) wall
3984 kB ( 0%) ggc
if-conversion 2 : 0.11 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%) wall
1056 kB ( 0%) ggc
combine stack adjustments: 0.12 ( 0%) usr 0.00 ( 0%) sys 0.10 ( 0%) wall
0 kB ( 0%) ggc
peephole 2 : 0.54 ( 0%) usr 0.00 ( 0%) sys 0.46 ( 0%) wall
2995 kB ( 0%) ggc
rename registers : 1.21 ( 1%) usr 0.00 ( 0%) sys 1.19 ( 1%) wall
2741 kB ( 0%) ggc
hard reg cprop : 0.93 ( 1%) usr 0.00 ( 0%) sys 1.13 ( 1%) wall
15 kB ( 0%) ggc
scheduling 2 : 6.06 ( 4%) usr 0.02 ( 1%) sys 6.13 ( 4%) wall
1138 kB ( 0%) ggc
machine dep reorg : 0.87 ( 1%) usr 0.00 ( 0%) sys 0.99 ( 1%) wall
77 kB ( 0%) ggc
reorder blocks : 0.65 ( 0%) usr 0.01 ( 1%) sys 0.58 ( 0%) wall
4789 kB ( 0%) ggc
final : 1.90 ( 1%) usr 0.15 ( 8%) sys 1.83 ( 1%) wall
9656 kB ( 1%) ggc
symout : 0.62 ( 0%) usr 0.07 ( 4%) sys 0.72 ( 0%) wall
58777 kB ( 3%) ggc
variable tracking : 2.65 ( 2%) usr 0.00 ( 0%) sys 2.79 ( 2%) wall
62056 kB ( 3%) ggc
var-tracking dataflow : 3.74 ( 2%) usr 0.01 ( 1%) sys 3.73 ( 2%) wall
0 kB ( 0%) ggc
var-tracking emit : 3.47 ( 2%) usr 0.00 ( 0%) sys 3.44 ( 2%) wall
19142 kB ( 1%) ggc
TOTAL : 164.53 1.84 166.37
1777349 kB
--
jv244 at cam dot ac dot uk changed:
What |Removed |Added
----------------------------------------------------------------------------
Last reconfirmed|2010-08-27 17:01:01 |2010-08-29 05:31:37
date| |
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (14 preceding siblings ...)
2010-08-29 5:31 ` jv244 at cam dot ac dot uk
@ 2010-08-29 6:38 ` jv244 at cam dot ac dot uk
2010-08-29 9:26 ` rguenth at gcc dot gnu dot org
` (8 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29 6:38 UTC (permalink / raw)
To: gcc-bugs
------- Comment #16 from jv244 at cam dot ac dot uk 2010-08-29 06:38 -------
adjust summary according to the last timings
--
jv244 at cam dot ac dot uk changed:
What |Removed |Added
----------------------------------------------------------------------------
Last reconfirmed|2010-08-29 05:31:37 |2010-08-29 06:38:26
date| |
Summary|[4.6 Regression] compile |[4.6 Regression] compile
|time increases 5x. |time increases 3x.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (15 preceding siblings ...)
2010-08-29 6:38 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 3x jv244 at cam dot ac dot uk
@ 2010-08-29 9:26 ` rguenth at gcc dot gnu dot org
2010-08-29 15:07 ` jv244 at cam dot ac dot uk
` (7 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-29 9:26 UTC (permalink / raw)
To: gcc-bugs
------- Comment #17 from rguenth at gcc dot gnu dot org 2010-08-29 09:25 -------
tree iv optimization : 32.57 (20%) usr 0.10 ( 5%) sys 32.73 (20%) wall
322095 kB (18%) ggc
20% is still completely unreasonable for IV optimization.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|WAITING |NEW
Last reconfirmed|2010-08-29 06:38:26 |2010-08-29 09:25:52
date| |
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (16 preceding siblings ...)
2010-08-29 9:26 ` rguenth at gcc dot gnu dot org
@ 2010-08-29 15:07 ` jv244 at cam dot ac dot uk
2010-08-30 3:11 ` davidxl at gcc dot gnu dot org
` (6 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29 15:07 UTC (permalink / raw)
To: gcc-bugs
------- Comment #18 from jv244 at cam dot ac dot uk 2010-08-29 15:07 -------
FYI, these are the 4.5 branch timings:
Execution times (seconds)
garbage collection : 0.47 ( 1%) usr 0.00 ( 0%) sys 0.47 ( 1%) wall
0 kB ( 0%) ggc
callgraph construction: 0.05 ( 0%) usr 0.01 ( 1%) sys 0.09 ( 0%) wall
5996 kB ( 1%) ggc
callgraph optimization: 0.21 ( 0%) usr 0.02 ( 1%) sys 0.26 ( 0%) wall
606 kB ( 0%) ggc
ipa cp : 0.09 ( 0%) usr 0.00 ( 0%) sys 0.08 ( 0%) wall
1381 kB ( 0%) ggc
ipa reference : 0.06 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%) wall
0 kB ( 0%) ggc
ipa pure const : 0.06 ( 0%) usr 0.01 ( 1%) sys 0.09 ( 0%) wall
0 kB ( 0%) ggc
cfg cleanup : 0.39 ( 1%) usr 0.00 ( 0%) sys 0.51 ( 1%) wall
2459 kB ( 0%) ggc
trivially dead code : 0.34 ( 1%) usr 0.00 ( 0%) sys 0.30 ( 1%) wall
0 kB ( 0%) ggc
df multiple defs : 0.08 ( 0%) usr 0.00 ( 0%) sys 0.13 ( 0%) wall
0 kB ( 0%) ggc
df reaching defs : 0.33 ( 1%) usr 0.00 ( 0%) sys 0.27 ( 1%) wall
0 kB ( 0%) ggc
df live regs : 2.08 ( 4%) usr 0.01 ( 1%) sys 2.19 ( 4%) wall
0 kB ( 0%) ggc
df live&initialized regs: 0.98 ( 2%) usr 0.00 ( 0%) sys 0.92 ( 2%) wall
0 kB ( 0%) ggc
df use-def / def-use chains: 0.24 ( 0%) usr 0.00 ( 0%) sys 0.19 ( 0%)
wall 0 kB ( 0%) ggc
df reg dead/unused notes: 0.93 ( 2%) usr 0.00 ( 0%) sys 1.04 ( 2%) wall
5756 kB ( 1%) ggc
register information : 0.51 ( 1%) usr 0.01 ( 1%) sys 0.39 ( 1%) wall
0 kB ( 0%) ggc
alias analysis : 0.78 ( 1%) usr 0.01 ( 1%) sys 0.91 ( 2%) wall
22384 kB ( 3%) ggc
alias stmt walking : 0.50 ( 1%) usr 0.03 ( 2%) sys 0.38 ( 1%) wall
5563 kB ( 1%) ggc
register scan : 0.13 ( 0%) usr 0.00 ( 0%) sys 0.09 ( 0%) wall
0 kB ( 0%) ggc
rebuild jump labels : 0.19 ( 0%) usr 0.00 ( 0%) sys 0.19 ( 0%) wall
0 kB ( 0%) ggc
parser : 0.82 ( 2%) usr 0.13 ( 9%) sys 0.94 ( 2%) wall
55603 kB ( 6%) ggc
inline heuristics : 0.20 ( 0%) usr 0.01 ( 1%) sys 0.16 ( 0%) wall
0 kB ( 0%) ggc
tree gimplify : 0.38 ( 1%) usr 0.03 ( 2%) sys 0.40 ( 1%) wall
46588 kB ( 5%) ggc
tree eh : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
tree CFG construction : 0.04 ( 0%) usr 0.02 ( 1%) sys 0.05 ( 0%) wall
11964 kB ( 1%) ggc
tree CFG cleanup : 0.47 ( 1%) usr 0.00 ( 0%) sys 0.79 ( 1%) wall
1829 kB ( 0%) ggc
tree VRP : 1.46 ( 3%) usr 0.05 ( 4%) sys 1.27 ( 2%) wall
56376 kB ( 6%) ggc
tree copy propagation : 0.09 ( 0%) usr 0.02 ( 1%) sys 0.22 ( 0%) wall
746 kB ( 0%) ggc
tree find ref. vars : 0.09 ( 0%) usr 0.01 ( 1%) sys 0.07 ( 0%) wall
3806 kB ( 0%) ggc
tree PTA : 0.30 ( 1%) usr 0.00 ( 0%) sys 0.33 ( 1%) wall
3836 kB ( 0%) ggc
tree PHI insertion : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall
3194 kB ( 0%) ggc
tree SSA rewrite : 0.24 ( 0%) usr 0.01 ( 1%) sys 0.29 ( 1%) wall
13860 kB ( 2%) ggc
tree SSA other : 0.13 ( 0%) usr 0.02 ( 1%) sys 0.11 ( 0%) wall
418 kB ( 0%) ggc
tree SSA incremental : 0.89 ( 2%) usr 0.06 ( 4%) sys 0.97 ( 2%) wall
6811 kB ( 1%) ggc
tree operand scan : 0.34 ( 1%) usr 0.23 (17%) sys 0.59 ( 1%) wall
44776 kB ( 5%) ggc
dominator optimization: 0.29 ( 1%) usr 0.01 ( 1%) sys 0.35 ( 1%) wall
5152 kB ( 1%) ggc
tree CCP : 0.51 ( 1%) usr 0.02 ( 1%) sys 0.43 ( 1%) wall
4620 kB ( 1%) ggc
tree PHI const/copy prop: 0.01 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
106 kB ( 0%) ggc
tree split crit edges : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
2019 kB ( 0%) ggc
tree reassociation : 0.12 ( 0%) usr 0.01 ( 1%) sys 0.12 ( 0%) wall
2946 kB ( 0%) ggc
tree PRE : 0.92 ( 2%) usr 0.00 ( 0%) sys 0.95 ( 2%) wall
7315 kB ( 1%) ggc
tree FRE : 0.45 ( 1%) usr 0.04 ( 3%) sys 0.35 ( 1%) wall
5518 kB ( 1%) ggc
tree code sinking : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall
1400 kB ( 0%) ggc
tree linearize phis : 0.02 ( 0%) usr 0.01 ( 1%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree forward propagate: 0.18 ( 0%) usr 0.02 ( 1%) sys 0.16 ( 0%) wall
10006 kB ( 1%) ggc
tree conservative DCE : 0.05 ( 0%) usr 0.01 ( 1%) sys 0.13 ( 0%) wall
576 kB ( 0%) ggc
tree aggressive DCE : 0.28 ( 1%) usr 0.01 ( 1%) sys 0.37 ( 1%) wall
8853 kB ( 1%) ggc
tree buildin call DCE : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
tree DSE : 0.20 ( 0%) usr 0.00 ( 0%) sys 0.11 ( 0%) wall
132 kB ( 0%) ggc
PHI merge : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
37 kB ( 0%) ggc
tree loop bounds : 0.22 ( 0%) usr 0.00 ( 0%) sys 0.18 ( 0%) wall
8266 kB ( 1%) ggc
tree loop invariant motion: 0.06 ( 0%) usr 0.00 ( 0%) sys 0.08 ( 0%)
wall 67 kB ( 0%) ggc
tree canonical iv : 0.07 ( 0%) usr 0.00 ( 0%) sys 0.09 ( 0%) wall
4779 kB ( 1%) ggc
scev constant prop : 0.08 ( 0%) usr 0.00 ( 0%) sys 0.07 ( 0%) wall
2345 kB ( 0%) ggc
tree loop unswitching : 0.00 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
573 kB ( 0%) ggc
complete unrolling : 1.05 ( 2%) usr 0.11 ( 8%) sys 1.39 ( 3%) wall
98553 kB (11%) ggc
tree vectorization : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
883 kB ( 0%) ggc
tree slp vectorization: 0.61 ( 1%) usr 0.00 ( 0%) sys 0.60 ( 1%) wall
53236 kB ( 6%) ggc
tree iv optimization : 5.80 (11%) usr 0.06 ( 4%) sys 5.94 (11%) wall
95356 kB (11%) ggc
predictive commoning : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall
1054 kB ( 0%) ggc
tree loop init : 0.05 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
1339 kB ( 0%) ggc
tree copy headers : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall
1613 kB ( 0%) ggc
tree SSA uncprop : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall
0 kB ( 0%) ggc
tree rename SSA copies: 0.06 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall
0 kB ( 0%) ggc
dominance frontiers : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.05 ( 0%) wall
0 kB ( 0%) ggc
dominance computation : 0.23 ( 0%) usr 0.00 ( 0%) sys 0.26 ( 0%) wall
0 kB ( 0%) ggc
expand : 3.24 ( 6%) usr 0.07 ( 5%) sys 3.34 ( 6%) wall
69633 kB ( 8%) ggc
lower subreg : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
forward prop : 0.48 ( 1%) usr 0.01 ( 1%) sys 0.48 ( 1%) wall
9984 kB ( 1%) ggc
CSE : 0.73 ( 1%) usr 0.00 ( 0%) sys 0.92 ( 2%) wall
248 kB ( 0%) ggc
dead code elimination : 0.24 ( 0%) usr 0.00 ( 0%) sys 0.28 ( 1%) wall
0 kB ( 0%) ggc
dead store elim1 : 0.33 ( 1%) usr 0.01 ( 1%) sys 0.32 ( 1%) wall
5987 kB ( 1%) ggc
dead store elim2 : 0.44 ( 1%) usr 0.02 ( 1%) sys 0.39 ( 1%) wall
7831 kB ( 1%) ggc
loop analysis : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
718 kB ( 0%) ggc
loop invariant motion : 0.05 ( 0%) usr 0.00 ( 0%) sys 0.07 ( 0%) wall
305 kB ( 0%) ggc
loop unswitching : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
loop unrolling : 0.65 ( 1%) usr 0.00 ( 0%) sys 0.62 ( 1%) wall
32780 kB ( 4%) ggc
CPROP : 0.70 ( 1%) usr 0.00 ( 0%) sys 0.60 ( 1%) wall
7825 kB ( 1%) ggc
PRE : 0.32 ( 1%) usr 0.00 ( 0%) sys 0.33 ( 1%) wall
719 kB ( 0%) ggc
web : 0.13 ( 0%) usr 0.00 ( 0%) sys 0.07 ( 0%) wall
594 kB ( 0%) ggc
CSE 2 : 0.75 ( 1%) usr 0.01 ( 1%) sys 0.60 ( 1%) wall
470 kB ( 0%) ggc
branch prediction : 0.19 ( 0%) usr 0.01 ( 1%) sys 0.14 ( 0%) wall
7344 kB ( 1%) ggc
combiner : 1.19 ( 2%) usr 0.01 ( 1%) sys 1.33 ( 2%) wall
19980 kB ( 2%) ggc
if-conversion : 0.04 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall
746 kB ( 0%) ggc
regmove : 0.37 ( 1%) usr 0.01 ( 1%) sys 0.33 ( 1%) wall
0 kB ( 0%) ggc
integrated RA : 3.51 ( 7%) usr 0.01 ( 1%) sys 3.74 ( 7%) wall
12746 kB ( 1%) ggc
reload : 2.16 ( 4%) usr 0.02 ( 1%) sys 2.01 ( 4%) wall
7755 kB ( 1%) ggc
reload CSE regs : 1.38 ( 3%) usr 0.00 ( 0%) sys 1.26 ( 2%) wall
12331 kB ( 1%) ggc
load CSE after reload : 0.18 ( 0%) usr 0.00 ( 0%) sys 0.14 ( 0%) wall
162 kB ( 0%) ggc
thread pro- & epilogue: 0.11 ( 0%) usr 0.00 ( 0%) sys 0.12 ( 0%) wall
4370 kB ( 0%) ggc
if-conversion 2 : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
357 kB ( 0%) ggc
combine stack adjustments: 0.03 ( 0%) usr 0.00 ( 0%) sys 0.02 ( 0%) wall
0 kB ( 0%) ggc
peephole 2 : 0.12 ( 0%) usr 0.00 ( 0%) sys 0.16 ( 0%) wall
1899 kB ( 0%) ggc
rename registers : 0.46 ( 1%) usr 0.00 ( 0%) sys 0.55 ( 1%) wall
2237 kB ( 0%) ggc
hard reg cprop : 0.37 ( 1%) usr 0.00 ( 0%) sys 0.48 ( 1%) wall
13 kB ( 0%) ggc
scheduling 2 : 3.30 ( 6%) usr 0.04 ( 3%) sys 3.10 ( 6%) wall
1216 kB ( 0%) ggc
machine dep reorg : 0.38 ( 1%) usr 0.00 ( 0%) sys 0.36 ( 1%) wall
11 kB ( 0%) ggc
reorder blocks : 0.15 ( 0%) usr 0.00 ( 0%) sys 0.13 ( 0%) wall
1283 kB ( 0%) ggc
final : 0.93 ( 2%) usr 0.07 ( 5%) sys 0.84 ( 2%) wall
6610 kB ( 1%) ggc
symout : 0.30 ( 1%) usr 0.03 ( 2%) sys 0.34 ( 1%) wall
27006 kB ( 3%) ggc
variable tracking : 3.86 ( 7%) usr 0.03 ( 2%) sys 3.99 ( 7%) wall
39804 kB ( 4%) ggc
plugin execution : 0.00 ( 0%) usr 0.01 ( 1%) sys 0.05 ( 0%) wall
0 kB ( 0%) ggc
rest of compilation : 0.00 ( 0%) usr 0.01 ( 1%) sys 0.00 ( 0%) wall
0 kB ( 0%) ggc
TOTAL : 52.50 1.37 53.88
893901 kB
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (17 preceding siblings ...)
2010-08-29 15:07 ` jv244 at cam dot ac dot uk
@ 2010-08-30 3:11 ` davidxl at gcc dot gnu dot org
2010-08-30 3:19 ` davidxl at gcc dot gnu dot org
` (5 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-30 3:11 UTC (permalink / raw)
To: gcc-bugs
------- Comment #20 from davidxl at gcc dot gnu dot org 2010-08-30 03:10 -------
(In reply to comment #16)
> adjust summary according to the last timings
>
I am surprised to see such big differences between trunk and previous releases.
Compiling this test case with the those options on my core2 box (2.4GHz ) took
only 56seconds which is comparable with the timing with a 4.4.3 compiler (with
google local patches including ivopt improvements).
David
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (18 preceding siblings ...)
2010-08-30 3:11 ` davidxl at gcc dot gnu dot org
@ 2010-08-30 3:19 ` davidxl at gcc dot gnu dot org
2010-08-30 7:12 ` rguenth at gcc dot gnu dot org
` (4 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-30 3:19 UTC (permalink / raw)
To: gcc-bugs
------- Comment #21 from davidxl at gcc dot gnu dot org 2010-08-30 03:19 -------
(In reply to comment #17)
> tree iv optimization : 32.57 (20%) usr 0.10 ( 5%) sys 32.73 (20%) wall
> 322095 kB (18%) ggc
>
>
> 20% is still completely unreasonable for IV optimization.
>
There was a patch in trunk that may double the time in ivopt -- i.e.
find_optimal_iv_set_1 is done twice, one with the original iv set while the
other with full set. This probably needs to be revisited.
David
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (20 preceding siblings ...)
2010-08-30 7:12 ` rguenth at gcc dot gnu dot org
@ 2010-08-30 7:12 ` rguenth at gcc dot gnu dot org
2010-08-30 16:41 ` davidxl at gcc dot gnu dot org
` (2 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-30 7:12 UTC (permalink / raw)
To: gcc-bugs
------- Comment #23 from rguenth at gcc dot gnu dot org 2010-08-30 07:11 -------
(In reply to comment #22)
> Given the fact that the solution space is really large -- M^N where M is the
> number of candidates and M is the number of uses (here M == 70 and N == 48),
> and the cost function is complicated, it will be challenging to come up with
> algorithm that converges really fast, and most importantly -- 'guarantees' an
> optimal solution..
Well - we can't guarantee an optimal solution. We have to take compile-time
into account which means that O(M^N) is not acceptable but we need to come
up with something that can complete in O((M+N) log (M+N)) time at most.
I btw doubt that the solution found is anywhere near optimal for 32bit
x86 - using 15 IVs instead of 2 can't be cheaper.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (19 preceding siblings ...)
2010-08-30 3:19 ` davidxl at gcc dot gnu dot org
@ 2010-08-30 7:12 ` rguenth at gcc dot gnu dot org
2010-08-30 7:12 ` rguenth at gcc dot gnu dot org
` (3 subsequent siblings)
24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-30 7:12 UTC (permalink / raw)
To: gcc-bugs
------- Comment #24 from rguenth at gcc dot gnu dot org 2010-08-30 07:12 -------
(In reply to comment #20)
> (In reply to comment #16)
> > adjust summary according to the last timings
> >
>
> I am surprised to see such big differences between trunk and previous releases.
> Compiling this test case with the those options on my core2 box (2.4GHz ) took
> only 56seconds which is comparable with the timing with a 4.4.3 compiler (with
> google local patches including ivopt improvements).
Of course - because the ivopt improvement patches are the problem.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (21 preceding siblings ...)
2010-08-30 7:12 ` rguenth at gcc dot gnu dot org
@ 2010-08-30 16:41 ` davidxl at gcc dot gnu dot org
2010-08-31 17:45 ` davidxl at gcc dot gnu dot org
2010-09-02 11:25 ` rguenth at gcc dot gnu dot org
24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-30 16:41 UTC (permalink / raw)
To: gcc-bugs
------- Comment #25 from davidxl at gcc dot gnu dot org 2010-08-30 16:41 -------
(In reply to comment #24)
> (In reply to comment #20)
> > (In reply to comment #16)
> > > adjust summary according to the last timings
> > >
> >
> > I am surprised to see such big differences between trunk and previous releases.
> > Compiling this test case with the those options on my core2 box (2.4GHz ) took
> > only 56seconds which is comparable with the timing with a 4.4.3 compiler (with
> > google local patches including ivopt improvements).
>
> Of course - because the ivopt improvement patches are the problem.
>
It is just the total time diff from Joost's measure can be just explained by
ivopt component.
David
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (22 preceding siblings ...)
2010-08-30 16:41 ` davidxl at gcc dot gnu dot org
@ 2010-08-31 17:45 ` davidxl at gcc dot gnu dot org
2010-09-02 11:25 ` rguenth at gcc dot gnu dot org
24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-31 17:45 UTC (permalink / raw)
To: gcc-bugs
------- Comment #26 from davidxl at gcc dot gnu dot org 2010-08-31 17:45 -------
Good observation re. the number of IVs in the final set. This usually points to
some problem/bug in the cost function. I briefly looked at this case -- it
indeed exposes two more bugs in the cost model:
1) the computation cost of the all the cost pairs in an assignment can actually
not simply be added together, because many rewrite expressions can be commoned.
We now have the mechanism to compute with common loop invariants for register
pressure estimation, and this mechnasim needs to be extended for computation
cost.
2) the offset is not stripped when computing loop invariant expression ids --
this can cause problem in overestimating reg pressure. (The case arises more
often with loop unrolling).
David
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
` (23 preceding siblings ...)
2010-08-31 17:45 ` davidxl at gcc dot gnu dot org
@ 2010-09-02 11:25 ` rguenth at gcc dot gnu dot org
24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-09-02 11:25 UTC (permalink / raw)
To: gcc-bugs
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Priority|P3 |P1
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422
^ permalink raw reply [flat|nested] 26+ messages in thread
end of thread, other threads:[~2010-09-02 11:25 UTC | newest]
Thread overview: 26+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
2010-08-26 18:34 ` [Bug middle-end/45422] " jv244 at cam dot ac dot uk
2010-08-27 10:23 ` rguenth at gcc dot gnu dot org
2010-08-27 11:41 ` rguenth at gcc dot gnu dot org
2010-08-27 11:46 ` jv244 at cam dot ac dot uk
2010-08-27 12:07 ` rguenth at gcc dot gnu dot org
2010-08-27 12:16 ` rguenth at gcc dot gnu dot org
2010-08-27 13:01 ` rguenth at gcc dot gnu dot org
2010-08-27 15:52 ` hjl dot tools at gmail dot com
2010-08-27 17:01 ` davidxl at gcc dot gnu dot org
2010-08-28 6:00 ` davidxl at gcc dot gnu dot org
2010-08-29 5:09 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 5x jv244 at cam dot ac dot uk
2010-08-29 5:13 ` pinskia at gcc dot gnu dot org
2010-08-29 5:21 ` jv244 at cam dot ac dot uk
2010-08-29 5:24 ` pinskia at gcc dot gnu dot org
2010-08-29 5:31 ` jv244 at cam dot ac dot uk
2010-08-29 6:38 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 3x jv244 at cam dot ac dot uk
2010-08-29 9:26 ` rguenth at gcc dot gnu dot org
2010-08-29 15:07 ` jv244 at cam dot ac dot uk
2010-08-30 3:11 ` davidxl at gcc dot gnu dot org
2010-08-30 3:19 ` davidxl at gcc dot gnu dot org
2010-08-30 7:12 ` rguenth at gcc dot gnu dot org
2010-08-30 7:12 ` rguenth at gcc dot gnu dot org
2010-08-30 16:41 ` davidxl at gcc dot gnu dot org
2010-08-31 17:45 ` davidxl at gcc dot gnu dot org
2010-09-02 11:25 ` rguenth at gcc dot gnu dot org
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).