public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug middle-end/45422]  New: [4.6 Regression] compile time increases 8x.
@ 2010-08-26 18:33 jv244 at cam dot ac dot uk
  2010-08-26 18:34 ` [Bug middle-end/45422] " jv244 at cam dot ac dot uk
                   ` (24 more replies)
  0 siblings, 25 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-26 18:33 UTC (permalink / raw)
  To: gcc-bugs

compiling the to-be-attached testcase as:

gfortran -c -ftime-report -cpp -fbounds-check -g -O3 -ffast-math -funroll-loops
-ftree-vectorize -march=native -ffree-form test.f90

massively slowed down in 4.6

4.4:  51s
4.5:  50s
4.6: 434s

the timing report look like:

Execution times (seconds)
 garbage collection    :  15.35 ( 4%) usr   0.04 ( 2%) sys  15.35 ( 4%) wall   
   0 kB ( 0%) ggc
 callgraph construction:   0.07 ( 0%) usr   0.01 ( 1%) sys   0.08 ( 0%) wall   
5736 kB ( 0%) ggc
 callgraph optimization:   0.41 ( 0%) usr   0.01 ( 0%) sys   0.34 ( 0%) wall   
 725 kB ( 0%) ggc
 ipa cp                :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall   
1347 kB ( 0%) ggc
 ipa function splitting:   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa reference         :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa profile           :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa pure const        :   0.14 ( 0%) usr   0.01 ( 0%) sys   0.19 ( 0%) wall   
   0 kB ( 0%) ggc
 cfg cleanup           :   2.47 ( 1%) usr   0.00 ( 0%) sys   3.10 ( 1%) wall   
4726 kB ( 0%) ggc
 CFG verifier          :   6.48 ( 1%) usr   0.02 ( 1%) sys   6.76 ( 2%) wall   
   0 kB ( 0%) ggc
 trivially dead code   :   0.94 ( 0%) usr   0.00 ( 0%) sys   0.76 ( 0%) wall   
   0 kB ( 0%) ggc
 df multiple defs      :   0.44 ( 0%) usr   0.01 ( 1%) sys   0.37 ( 0%) wall   
   0 kB ( 0%) ggc
 df reaching defs      :   2.38 ( 1%) usr   0.02 ( 1%) sys   2.41 ( 1%) wall   
   0 kB ( 0%) ggc
 df live regs          :  12.32 ( 3%) usr   0.03 ( 2%) sys  12.19 ( 3%) wall   
   0 kB ( 0%) ggc
 df live&initialized regs:   4.24 ( 1%) usr   0.01 ( 1%) sys   4.31 ( 1%) wall 
     0 kB ( 0%) ggc
 df use-def / def-use chains:   1.72 ( 0%) usr   0.01 ( 1%) sys   1.54 ( 0%)
wall       0 kB ( 0%) ggc
 df live reg subwords  :   0.35 ( 0%) usr   0.00 ( 0%) sys   0.32 ( 0%) wall   
   0 kB ( 0%) ggc
 df reg dead/unused notes:   6.24 ( 1%) usr   0.01 ( 1%) sys   6.17 ( 1%) wall 
  8317 kB ( 0%) ggc
 register information  :   2.34 ( 1%) usr   0.00 ( 0%) sys   2.39 ( 1%) wall   
   0 kB ( 0%) ggc
 alias analysis        :   2.25 ( 1%) usr   0.02 ( 1%) sys   2.22 ( 1%) wall  
47018 kB ( 3%) ggc
 alias stmt walking    :   0.63 ( 0%) usr   0.03 ( 2%) sys   0.83 ( 0%) wall   
6938 kB ( 0%) ggc
 register scan         :   0.37 ( 0%) usr   0.00 ( 0%) sys   0.33 ( 0%) wall   
 298 kB ( 0%) ggc
 rebuild jump labels   :   0.70 ( 0%) usr   0.00 ( 0%) sys   0.67 ( 0%) wall   
   0 kB ( 0%) ggc
 parser                :   0.94 ( 0%) usr   0.11 ( 6%) sys   1.06 ( 0%) wall  
55368 kB ( 3%) ggc
 inline heuristics     :   0.23 ( 0%) usr   0.00 ( 0%) sys   0.26 ( 0%) wall   
   0 kB ( 0%) ggc
 tree gimplify         :   0.64 ( 0%) usr   0.01 ( 0%) sys   0.53 ( 0%) wall  
48405 kB ( 3%) ggc
 tree eh               :   0.02 ( 0%) usr   0.01 ( 1%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree CFG construction :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall  
11974 kB ( 1%) ggc
 tree CFG cleanup      :   1.10 ( 0%) usr   0.00 ( 0%) sys   1.34 ( 0%) wall   
3531 kB ( 0%) ggc
 tree VRP              :   2.91 ( 1%) usr   0.04 ( 2%) sys   2.76 ( 1%) wall  
67348 kB ( 4%) ggc
 tree copy propagation :   0.21 ( 0%) usr   0.03 ( 2%) sys   0.25 ( 0%) wall   
1384 kB ( 0%) ggc
 tree find ref. vars   :   0.11 ( 0%) usr   0.01 ( 1%) sys   0.06 ( 0%) wall   
3806 kB ( 0%) ggc
 tree PTA              :   0.36 ( 0%) usr   0.01 ( 1%) sys   0.42 ( 0%) wall   
5199 kB ( 0%) ggc
 tree PHI insertion    :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
3194 kB ( 0%) ggc
 tree SSA rewrite      :   0.57 ( 0%) usr   0.00 ( 0%) sys   0.55 ( 0%) wall  
14011 kB ( 1%) ggc
 tree SSA other        :   0.10 ( 0%) usr   0.02 ( 1%) sys   0.13 ( 0%) wall   
 432 kB ( 0%) ggc
 tree SSA incremental  :   1.38 ( 0%) usr   0.15 ( 8%) sys   1.62 ( 0%) wall   
7441 kB ( 0%) ggc
 tree operand scan     :   0.57 ( 0%) usr   0.36 (18%) sys   0.84 ( 0%) wall  
58289 kB ( 3%) ggc
 dominator optimization:   0.56 ( 0%) usr   0.02 ( 1%) sys   0.65 ( 0%) wall   
8527 kB ( 0%) ggc
 tree SRA              :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 tree CCP              :   1.14 ( 0%) usr   0.02 ( 1%) sys   1.15 ( 0%) wall   
4845 kB ( 0%) ggc
 tree PHI const/copy prop:   0.03 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall 
   106 kB ( 0%) ggc
 tree split crit edges :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
2014 kB ( 0%) ggc
 tree reassociation    :   0.24 ( 0%) usr   0.02 ( 1%) sys   0.29 ( 0%) wall   
6030 kB ( 0%) ggc
 tree PRE              :   0.91 ( 0%) usr   0.02 ( 1%) sys   1.08 ( 0%) wall   
7164 kB ( 0%) ggc
 tree FRE              :   0.53 ( 0%) usr   0.01 ( 1%) sys   0.52 ( 0%) wall   
5411 kB ( 0%) ggc
 tree code sinking     :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
1311 kB ( 0%) ggc
 tree linearize phis   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree forward propagate:   0.27 ( 0%) usr   0.00 ( 0%) sys   0.32 ( 0%) wall  
11820 kB ( 1%) ggc
 tree phiprop          :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree conservative DCE :   0.17 ( 0%) usr   0.01 ( 1%) sys   0.18 ( 0%) wall   
 576 kB ( 0%) ggc
 tree aggressive DCE   :   1.04 ( 0%) usr   0.03 ( 2%) sys   1.10 ( 0%) wall  
25495 kB ( 1%) ggc
 tree buildin call DCE :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree DSE              :   0.28 ( 0%) usr   0.00 ( 0%) sys   0.36 ( 0%) wall   
 260 kB ( 0%) ggc
 PHI merge             :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree loop bounds      :   0.23 ( 0%) usr   0.00 ( 0%) sys   0.22 ( 0%) wall   
6686 kB ( 0%) ggc
 tree loop invariant motion:   0.06 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%)
wall      76 kB ( 0%) ggc
 tree canonical iv     :   0.09 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
3421 kB ( 0%) ggc
 scev constant prop    :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall   
2302 kB ( 0%) ggc
 tree loop unswitching :   0.01 ( 0%) usr   0.01 ( 1%) sys   0.03 ( 0%) wall   
 739 kB ( 0%) ggc
 complete unrolling    :   1.59 ( 0%) usr   0.12 ( 6%) sys   1.48 ( 0%) wall 
101520 kB ( 6%) ggc
 tree vectorization    :   0.38 ( 0%) usr   0.00 ( 0%) sys   0.39 ( 0%) wall  
20119 kB ( 1%) ggc
 tree slp vectorization:   0.99 ( 0%) usr   0.00 ( 0%) sys   0.80 ( 0%) wall  
52747 kB ( 3%) ggc
 tree loop distribution:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree prefetching      :   3.37 ( 1%) usr   0.07 ( 3%) sys   3.36 ( 1%) wall  
90902 kB ( 5%) ggc
 tree iv optimization  : 193.28 (44%) usr   0.05 ( 2%) sys 193.22 (44%) wall 
322316 kB (18%) ggc
 predictive commoning  :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
1752 kB ( 0%) ggc
 tree loop init        :   0.07 ( 0%) usr   0.01 ( 1%) sys   0.04 ( 0%) wall   
1307 kB ( 0%) ggc
 tree loop fini        :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree copy headers     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
1658 kB ( 0%) ggc
 tree SSA uncprop      :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree rename SSA copies:   0.05 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall   
   0 kB ( 0%) ggc
 tree SSA verifier     :   7.77 ( 2%) usr   0.07 ( 3%) sys   8.45 ( 2%) wall   
   0 kB ( 0%) ggc
 tree STMT verifier    :  15.39 ( 4%) usr   0.00 ( 0%) sys  15.16 ( 3%) wall   
   0 kB ( 0%) ggc
 tree switch initialization conversion:   0.00 ( 0%) usr   0.00 ( 0%) sys  
0.01 ( 0%) wall       0 kB ( 0%) ggc
 callgraph verifier    :   1.12 ( 0%) usr   0.01 ( 1%) sys   1.12 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance frontiers   :   0.35 ( 0%) usr   0.00 ( 0%) sys   0.36 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance computation :   0.90 ( 0%) usr   0.01 ( 1%) sys   0.82 ( 0%) wall   
   0 kB ( 0%) ggc
 control dependences   :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 out of ssa            :   0.38 ( 0%) usr   0.00 ( 0%) sys   0.32 ( 0%) wall   
 225 kB ( 0%) ggc
 expand vars           :   0.12 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall  
11174 kB ( 1%) ggc
 expand                :  16.44 ( 4%) usr   0.05 ( 2%) sys  16.36 ( 4%) wall 
111440 kB ( 6%) ggc
 post expand cleanups  :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
5818 kB ( 0%) ggc
 lower subreg          :   0.18 ( 0%) usr   0.00 ( 0%) sys   0.24 ( 0%) wall   
   0 kB ( 0%) ggc
 forward prop          :   1.79 ( 0%) usr   0.00 ( 0%) sys   1.87 ( 0%) wall  
15809 kB ( 1%) ggc
 CSE                   :   1.59 ( 0%) usr   0.00 ( 0%) sys   1.66 ( 0%) wall   
 662 kB ( 0%) ggc
 dead code elimination :   2.31 ( 1%) usr   0.00 ( 0%) sys   2.27 ( 1%) wall   
   0 kB ( 0%) ggc
 dead store elim1      :   1.54 ( 0%) usr   0.01 ( 1%) sys   1.67 ( 0%) wall  
23524 kB ( 1%) ggc
 dead store elim2      :   2.41 ( 1%) usr   0.00 ( 0%) sys   2.49 ( 1%) wall  
22835 kB ( 1%) ggc
 loop analysis         :   0.20 ( 0%) usr   0.00 ( 0%) sys   0.17 ( 0%) wall   
2220 kB ( 0%) ggc
 loop invariant motion :   0.28 ( 0%) usr   0.01 ( 0%) sys   0.22 ( 0%) wall   
 448 kB ( 0%) ggc
 loop unswitching      :   5.71 ( 1%) usr   0.01 ( 1%) sys   5.68 ( 1%) wall   
 218 kB ( 0%) ggc
 loop unrolling        :  30.36 ( 7%) usr   0.04 ( 2%) sys  30.75 ( 7%) wall 
184992 kB (10%) ggc
 CPROP                 :   2.56 ( 1%) usr   0.00 ( 0%) sys   2.62 ( 1%) wall  
25399 kB ( 1%) ggc
 PRE                   :   1.53 ( 0%) usr   0.00 ( 0%) sys   1.43 ( 0%) wall   
1798 kB ( 0%) ggc
 web                   :   2.76 ( 1%) usr   0.01 ( 1%) sys   2.57 ( 1%) wall   
8429 kB ( 0%) ggc
 CSE 2                 :   2.39 ( 1%) usr   0.00 ( 0%) sys   2.41 ( 1%) wall   
2123 kB ( 0%) ggc
 branch prediction     :   0.22 ( 0%) usr   0.01 ( 1%) sys   0.17 ( 0%) wall   
6857 kB ( 0%) ggc
 combiner              :   4.33 ( 1%) usr   0.00 ( 0%) sys   4.57 ( 1%) wall  
60529 kB ( 3%) ggc
 if-conversion         :   0.22 ( 0%) usr   0.00 ( 0%) sys   0.24 ( 0%) wall   
2520 kB ( 0%) ggc
 regmove               :   0.88 ( 0%) usr   0.00 ( 0%) sys   0.90 ( 0%) wall   
   0 kB ( 0%) ggc
 mode switching        :   0.01 ( 0%) usr   0.01 ( 1%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 integrated RA         :  16.20 ( 4%) usr   0.04 ( 2%) sys  16.28 ( 4%) wall  
44292 kB ( 2%) ggc
 reload                :   7.72 ( 2%) usr   0.02 ( 1%) sys   7.47 ( 2%) wall  
10065 kB ( 1%) ggc
 reload CSE regs       :   4.83 ( 1%) usr   0.00 ( 0%) sys   4.84 ( 1%) wall  
36964 kB ( 2%) ggc
 load CSE after reload :   0.50 ( 0%) usr   0.00 ( 0%) sys   0.52 ( 0%) wall   
 449 kB ( 0%) ggc
 zee                   :   0.36 ( 0%) usr   0.00 ( 0%) sys   0.45 ( 0%) wall   
  45 kB ( 0%) ggc
 thread pro- & epilogue:   0.15 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall   
3988 kB ( 0%) ggc
 if-conversion 2       :   0.12 ( 0%) usr   0.00 ( 0%) sys   0.19 ( 0%) wall   
1056 kB ( 0%) ggc
 combine stack adjustments:   0.08 ( 0%) usr   0.00 ( 0%) sys   0.18 ( 0%) wall
      0 kB ( 0%) ggc
 peephole 2            :   0.58 ( 0%) usr   0.00 ( 0%) sys   0.55 ( 0%) wall   
2995 kB ( 0%) ggc
 rename registers      :   1.45 ( 0%) usr   0.00 ( 0%) sys   1.22 ( 0%) wall   
2741 kB ( 0%) ggc
 hard reg cprop        :   1.27 ( 0%) usr   0.00 ( 0%) sys   1.26 ( 0%) wall   
  15 kB ( 0%) ggc
 scheduling 2          :   6.64 ( 2%) usr   0.06 ( 3%) sys   6.71 ( 2%) wall   
1284 kB ( 0%) ggc
 machine dep reorg     :   0.89 ( 0%) usr   0.00 ( 0%) sys   0.75 ( 0%) wall   
  77 kB ( 0%) ggc
 reorder blocks        :   0.90 ( 0%) usr   0.00 ( 0%) sys   0.76 ( 0%) wall   
4788 kB ( 0%) ggc
 final                 :   2.13 ( 0%) usr   0.12 ( 6%) sys   2.29 ( 1%) wall   
9656 kB ( 1%) ggc
 symout                :   0.71 ( 0%) usr   0.11 ( 5%) sys   0.82 ( 0%) wall  
58849 kB ( 3%) ggc
 variable tracking     :   3.12 ( 1%) usr   0.00 ( 0%) sys   3.06 ( 1%) wall  
62059 kB ( 3%) ggc
 var-tracking dataflow :   4.46 ( 1%) usr   0.01 ( 1%) sys   4.47 ( 1%) wall   
   0 kB ( 0%) ggc
 var-tracking emit     :   3.78 ( 1%) usr   0.01 ( 1%) sys   3.84 ( 1%) wall  
19142 kB ( 1%) ggc
 TOTAL                 : 434.63             2.00           436.64           
1780330 kB

gfortran -v reports:


/data03/vondele/gcc_trunk/build/libexec/gcc/x86_64-unknown-linux-gnu/4.6.0/f951
test.f90 -cpp=/tmp/cceEHMzK.f90 -quiet -v test.f90 -march=k8-sse3 -mcx16 -msahf
--param l1-cache-size=64 --param l1-cache-line-size=64 --param
l2-cache-size=1024 -mtune=k8 -quiet -dumpbase test.f90 -auxbase test -g -O3
-version -ftime-report -fbounds-check -ffast-math -funroll-loops
-ftree-vectorize -ffree-form -fintrinsic-modules-path
/data03/vondele/gcc_trunk/build/lib/gcc/x86_64-unknown-linux-gnu/4.6.0/finclude
-o /tmp/ccQ56z0k.s


-- 
           Summary: [4.6 Regression] compile time increases 8x.
           Product: gcc
           Version: 4.6.0
            Status: UNCONFIRMED
          Keywords: compile-time-hog
          Severity: normal
          Priority: P3
         Component: middle-end
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: jv244 at cam dot ac dot uk


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
@ 2010-08-26 18:34 ` jv244 at cam dot ac dot uk
  2010-08-27 10:23 ` rguenth at gcc dot gnu dot org
                   ` (23 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-26 18:34 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #1 from jv244 at cam dot ac dot uk  2010-08-26 18:34 -------
Created an attachment (id=21573)
 --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=21573&action=view)
testcase


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
  2010-08-26 18:34 ` [Bug middle-end/45422] " jv244 at cam dot ac dot uk
@ 2010-08-27 10:23 ` rguenth at gcc dot gnu dot org
  2010-08-27 11:41 ` rguenth at gcc dot gnu dot org
                   ` (22 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 10:23 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #2 from rguenth at gcc dot gnu dot org  2010-08-27 10:23 -------
Can also be seen here:

http://gcc.opensuse.org/SPEC/CFP/sb-barbella.suse.de-head-64-2006/times.html

and here:

http://gcc.opensuse.org/c++bench-frescobaldi/polyhedron/polyhedron-summary.txt-1-0.html

I'm looking at a profile.


-- 

rguenth at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
     Ever Confirmed|0                           |1
   Last reconfirmed|0000-00-00 00:00:00         |2010-08-27 10:23:21
               date|                            |
   Target Milestone|---                         |4.6.0


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
  2010-08-26 18:34 ` [Bug middle-end/45422] " jv244 at cam dot ac dot uk
  2010-08-27 10:23 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 11:41 ` rguenth at gcc dot gnu dot org
  2010-08-27 11:46 ` jv244 at cam dot ac dot uk
                   ` (21 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 11:41 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #3 from rguenth at gcc dot gnu dot org  2010-08-27 11:41 -------
Reproducable with -O3 alone, -fbounds-check makes it more visible.  With plain
-O3:

 tree iv optimization  :  22.64 (33%) usr   0.18 (11%) sys  22.97 (32%) wall 
113858 kB (16%) ggc

and with added -fbounds-check:

 tree iv optimization  : 184.70 (78%) usr   0.67 (31%) sys 186.67 (78%) wall 
166066 kB (18%) ggc

this connection with bounds-checking makes it sound familiar.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (2 preceding siblings ...)
  2010-08-27 11:41 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 11:46 ` jv244 at cam dot ac dot uk
  2010-08-27 12:07 ` rguenth at gcc dot gnu dot org
                   ` (20 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-27 11:46 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #4 from jv244 at cam dot ac dot uk  2010-08-27 11:45 -------
(In reply to comment #3)
> 
> this connection with bounds-checking makes it sound familiar.
> 
I had a similar bug open (and fixed) as PR43627
with a comment from you
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43627#c11


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (3 preceding siblings ...)
  2010-08-27 11:46 ` jv244 at cam dot ac dot uk
@ 2010-08-27 12:07 ` rguenth at gcc dot gnu dot org
  2010-08-27 12:16 ` rguenth at gcc dot gnu dot org
                   ` (19 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 12:07 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #5 from rguenth at gcc dot gnu dot org  2010-08-27 12:06 -------
MODULE hfx_contract_block
  INTEGER, PARAMETER :: dp=KIND(0.0D0)
CONTAINS
  SUBROUTINE
block_9_2(mc_max,md_max,kbd,kbc,kad,kac,pbd,pbc,pad,pac,prim,scale)
    INTEGER                                  :: mc_max, md_max
    REAL(KIND=dp) :: kbd(2*md_max), kbc(2*mc_max), kad(9*md_max), &
      kac(9*mc_max), pbd(2*md_max), pbc(2*mc_max), pad(9*md_max), &
      pac(9*mc_max), prim(9*2*mc_max*md_max), scale

    INTEGER                                  :: ma, mb, mc, md, p_index
    REAL(KIND=dp)                            :: ks_bc, ks_bd, p_bc, p_bd, tmp

      kbd(1:2*md_max) = 0.0_dp
      kbc(1:2*mc_max) = 0.0_dp
      kad(1:9*md_max) = 0.0_dp
      kac(1:9*mc_max) = 0.0_dp
      p_index = 0
      DO md = 1,md_max
        DO mc = 1,mc_max
          DO mb = 1,2
            ks_bd = 0.0_dp
            ks_bc = 0.0_dp
            p_bd = pbd((md-1)*2+mb)
            p_bc = pbc((mc-1)*2+mb)
            DO ma = 1,9
              p_index=p_index+1
              tmp = scale*prim(p_index)
              ks_bc = ks_bc + tmp* pad((md-1)*9+ma)
              ks_bd = ks_bd + tmp* pac((mc-1)*9+ma)
              kad((md-1)*9+ma) =  kad((md-1)*9+ma)-tmp*p_bc
              kac((mc-1)*9+ma) = kac((mc-1)*9+ma)-tmp*p_bd
            END DO
            kbd((md-1)*2+mb) = kbd((md-1)*2+mb) - ks_bd
            kbc((mc-1)*2+mb) = kbc((mc-1)*2+mb) - ks_bc
          END DO
        END DO
      END DO
  END SUBROUTINE block_9_2
END MODULE

gfortran -S -O3 -ffree-form -fbounds-check t.f90 -ftime-report

 tree iv optimization  :   3.49 (94%) usr   0.00 ( 0%) sys   3.52 (94%) wall   
2272 kB (36%) ggc

can someone try to track down revisions that slowed that testcase down?


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (4 preceding siblings ...)
  2010-08-27 12:07 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 12:16 ` rguenth at gcc dot gnu dot org
  2010-08-27 13:01 ` rguenth at gcc dot gnu dot org
                   ` (18 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 12:16 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #6 from rguenth at gcc dot gnu dot org  2010-08-27 12:16 -------
Without -fbounds-check we have around 650 million calls to
iv_ca_get_num_inv_exprs ... called from iv_ca_recount_cost.  With
-fbounds-check
this balloons to nearly 5 billion calls to that function.

David - this sounds like your changes, can you have a look to possibly reduce
this number of calls?


-- 

rguenth at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |xinliangli at gmail dot com


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (5 preceding siblings ...)
  2010-08-27 12:16 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 13:01 ` rguenth at gcc dot gnu dot org
  2010-08-27 15:52 ` hjl dot tools at gmail dot com
                   ` (17 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-27 13:01 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #7 from rguenth at gcc dot gnu dot org  2010-08-27 13:01 -------
The few remaining bound checks do not look simple to remove (if at all).


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (6 preceding siblings ...)
  2010-08-27 13:01 ` rguenth at gcc dot gnu dot org
@ 2010-08-27 15:52 ` hjl dot tools at gmail dot com
  2010-08-27 17:01 ` davidxl at gcc dot gnu dot org
                   ` (16 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: hjl dot tools at gmail dot com @ 2010-08-27 15:52 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #8 from hjl dot tools at gmail dot com  2010-08-27 15:52 -------
It is caused by revision 162653:

http://gcc.gnu.org/ml/gcc-cvs/2010-07/msg01007.html


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (7 preceding siblings ...)
  2010-08-27 15:52 ` hjl dot tools at gmail dot com
@ 2010-08-27 17:01 ` davidxl at gcc dot gnu dot org
  2010-08-28  6:00 ` davidxl at gcc dot gnu dot org
                   ` (15 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-27 17:01 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #9 from davidxl at gcc dot gnu dot org  2010-08-27 17:01 -------
Will take a look


-- 

davidxl at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         AssignedTo|unassigned at gcc dot gnu   |davidxl at gcc dot gnu dot
                   |dot org                     |org
             Status|NEW                         |ASSIGNED
   Last reconfirmed|2010-08-27 10:23:21         |2010-08-27 17:01:01
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 8x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (8 preceding siblings ...)
  2010-08-27 17:01 ` davidxl at gcc dot gnu dot org
@ 2010-08-28  6:00 ` davidxl at gcc dot gnu dot org
  2010-08-29  5:09 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 5x jv244 at cam dot ac dot uk
                   ` (14 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-28  6:00 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #10 from davidxl at gcc dot gnu dot org  2010-08-28 06:00 -------
fixed in r163610.


-- 

davidxl at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
         Resolution|                            |FIXED


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (9 preceding siblings ...)
  2010-08-28  6:00 ` davidxl at gcc dot gnu dot org
@ 2010-08-29  5:09 ` jv244 at cam dot ac dot uk
  2010-08-29  5:13 ` pinskia at gcc dot gnu dot org
                   ` (13 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29  5:09 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #11 from jv244 at cam dot ac dot uk  2010-08-29 05:09 -------
After David's patch (thanks!), the testcase requires 240s, that's still a 5x
slowdown. I paste the new timing profile below, and reopen the bug. There is no
obvious candidate for the slowdown.

> gfortran -c -ftime-report -cpp -fbounds-check -g -O3 -ffast-math -funroll-loops -ftree-vectorize -march=native -ffree-form test.f90

Execution times (seconds)
 garbage collection    :  12.55 ( 5%) usr   0.03 ( 2%) sys  12.57 ( 5%) wall   
   0 kB ( 0%) ggc
 callgraph construction:   0.08 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall   
5736 kB ( 0%) ggc
 callgraph optimization:   0.40 ( 0%) usr   0.02 ( 1%) sys   0.41 ( 0%) wall   
 725 kB ( 0%) ggc
 ipa cp                :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall   
1347 kB ( 0%) ggc
 ipa function splitting:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa reference         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa profile           :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa pure const        :   0.07 ( 0%) usr   0.01 ( 1%) sys   0.15 ( 0%) wall   
   0 kB ( 0%) ggc
 cfg cleanup           :   2.28 ( 1%) usr   0.00 ( 0%) sys   2.35 ( 1%) wall   
4726 kB ( 0%) ggc
 CFG verifier          :   5.54 ( 2%) usr   0.03 ( 2%) sys   5.73 ( 2%) wall   
   0 kB ( 0%) ggc
 trivially dead code   :   0.67 ( 0%) usr   0.00 ( 0%) sys   0.65 ( 0%) wall   
   0 kB ( 0%) ggc
 df multiple defs      :   0.23 ( 0%) usr   0.00 ( 0%) sys   0.28 ( 0%) wall   
   0 kB ( 0%) ggc
 df reaching defs      :   2.00 ( 1%) usr   0.00 ( 0%) sys   2.12 ( 1%) wall   
   0 kB ( 0%) ggc
 df live regs          :   9.80 ( 4%) usr   0.01 ( 1%) sys  10.18 ( 4%) wall   
   0 kB ( 0%) ggc
 df live&initialized regs:   3.62 ( 1%) usr   0.00 ( 0%) sys   3.08 ( 1%) wall 
     0 kB ( 0%) ggc
 df use-def / def-use chains:   1.22 ( 0%) usr   0.00 ( 0%) sys   1.26 ( 1%)
wall       0 kB ( 0%) ggc
 df live reg subwords  :   0.32 ( 0%) usr   0.00 ( 0%) sys   0.27 ( 0%) wall   
   0 kB ( 0%) ggc
 df reg dead/unused notes:   4.67 ( 2%) usr   0.00 ( 0%) sys   4.44 ( 2%) wall 
  8317 kB ( 0%) ggc
 register information  :   2.10 ( 1%) usr   0.00 ( 0%) sys   1.97 ( 1%) wall   
   0 kB ( 0%) ggc
 alias analysis        :   1.73 ( 1%) usr   0.00 ( 0%) sys   1.87 ( 1%) wall  
47018 kB ( 3%) ggc
 alias stmt walking    :   0.61 ( 0%) usr   0.07 ( 4%) sys   0.61 ( 0%) wall   
6938 kB ( 0%) ggc
 register scan         :   0.32 ( 0%) usr   0.00 ( 0%) sys   0.32 ( 0%) wall   
 202 kB ( 0%) ggc
 rebuild jump labels   :   0.72 ( 0%) usr   0.00 ( 0%) sys   0.67 ( 0%) wall   
   0 kB ( 0%) ggc
 parser                :   0.90 ( 0%) usr   0.09 ( 5%) sys   0.99 ( 0%) wall  
55368 kB ( 3%) ggc
 inline heuristics     :   0.17 ( 0%) usr   0.01 ( 1%) sys   0.26 ( 0%) wall   
   0 kB ( 0%) ggc
 tree gimplify         :   0.51 ( 0%) usr   0.01 ( 1%) sys   0.57 ( 0%) wall  
48405 kB ( 3%) ggc
 tree eh               :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree CFG construction :   0.02 ( 0%) usr   0.01 ( 1%) sys   0.03 ( 0%) wall  
11974 kB ( 1%) ggc
 tree CFG cleanup      :   1.30 ( 1%) usr   0.02 ( 1%) sys   1.21 ( 0%) wall   
3530 kB ( 0%) ggc
 tree VRP              :   2.50 ( 1%) usr   0.03 ( 2%) sys   2.44 ( 1%) wall  
67364 kB ( 4%) ggc
 tree copy propagation :   0.16 ( 0%) usr   0.05 ( 3%) sys   0.15 ( 0%) wall   
1384 kB ( 0%) ggc
 tree find ref. vars   :   0.05 ( 0%) usr   0.01 ( 1%) sys   0.05 ( 0%) wall   
3806 kB ( 0%) ggc
 tree PTA              :   0.34 ( 0%) usr   0.00 ( 0%) sys   0.33 ( 0%) wall   
5198 kB ( 0%) ggc
 tree PHI insertion    :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
3194 kB ( 0%) ggc
 tree SSA rewrite      :   0.39 ( 0%) usr   0.00 ( 0%) sys   0.35 ( 0%) wall  
14011 kB ( 1%) ggc
 tree SSA other        :   0.10 ( 0%) usr   0.04 ( 2%) sys   0.10 ( 0%) wall   
 432 kB ( 0%) ggc
 tree SSA incremental  :   1.18 ( 0%) usr   0.14 ( 8%) sys   1.44 ( 1%) wall   
7441 kB ( 0%) ggc
 tree operand scan     :   0.47 ( 0%) usr   0.33 (19%) sys   0.78 ( 0%) wall  
58289 kB ( 3%) ggc
 dominator optimization:   0.52 ( 0%) usr   0.00 ( 0%) sys   0.61 ( 0%) wall   
8527 kB ( 0%) ggc
 tree SRA              :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree CCP              :   1.05 ( 0%) usr   0.05 ( 3%) sys   1.28 ( 1%) wall   
4845 kB ( 0%) ggc
 tree PHI const/copy prop:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall 
   106 kB ( 0%) ggc
 tree split crit edges :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
2014 kB ( 0%) ggc
 tree reassociation    :   0.27 ( 0%) usr   0.03 ( 2%) sys   0.27 ( 0%) wall   
6030 kB ( 0%) ggc
 tree PRE              :   0.85 ( 0%) usr   0.00 ( 0%) sys   0.89 ( 0%) wall   
7164 kB ( 0%) ggc
 tree FRE              :   0.47 ( 0%) usr   0.02 ( 1%) sys   0.56 ( 0%) wall   
5411 kB ( 0%) ggc
 tree code sinking     :   0.11 ( 0%) usr   0.02 ( 1%) sys   0.03 ( 0%) wall   
1311 kB ( 0%) ggc
 tree linearize phis   :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 tree forward propagate:   0.22 ( 0%) usr   0.02 ( 1%) sys   0.26 ( 0%) wall  
11820 kB ( 1%) ggc
 tree phiprop          :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree conservative DCE :   0.11 ( 0%) usr   0.01 ( 1%) sys   0.04 ( 0%) wall   
 576 kB ( 0%) ggc
 tree aggressive DCE   :   0.84 ( 0%) usr   0.01 ( 1%) sys   0.92 ( 0%) wall  
25495 kB ( 1%) ggc
 tree buildin call DCE :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree DSE              :   0.26 ( 0%) usr   0.00 ( 0%) sys   0.19 ( 0%) wall   
 260 kB ( 0%) ggc
 tree loop bounds      :   0.20 ( 0%) usr   0.00 ( 0%) sys   0.27 ( 0%) wall   
6686 kB ( 0%) ggc
 tree loop invariant motion:   0.07 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%)
wall      76 kB ( 0%) ggc
 tree canonical iv     :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall   
3421 kB ( 0%) ggc
 scev constant prop    :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall   
2302 kB ( 0%) ggc
 tree loop unswitching :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
 739 kB ( 0%) ggc
 complete unrolling    :   1.60 ( 1%) usr   0.12 ( 7%) sys   1.40 ( 1%) wall 
101520 kB ( 6%) ggc
 tree vectorization    :   0.31 ( 0%) usr   0.02 ( 1%) sys   0.27 ( 0%) wall  
20116 kB ( 1%) ggc
 tree slp vectorization:   0.92 ( 0%) usr   0.00 ( 0%) sys   0.90 ( 0%) wall  
52747 kB ( 3%) ggc
 tree loop distribution:   0.07 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall   
   0 kB ( 0%) ggc
 tree prefetching      :   3.09 ( 1%) usr   0.06 ( 3%) sys   3.07 ( 1%) wall  
90905 kB ( 5%) ggc
 tree iv optimization  :  32.77 (13%) usr   0.03 ( 2%) sys  32.96 (13%) wall 
322284 kB (18%) ggc
 predictive commoning  :   0.08 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall   
1752 kB ( 0%) ggc
 tree loop init        :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall   
1307 kB ( 0%) ggc
 tree loop fini        :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 tree copy headers     :   0.10 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
1658 kB ( 0%) ggc
 tree SSA uncprop      :   0.04 ( 0%) usr   0.01 ( 1%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree rename SSA copies:   0.03 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall   
   0 kB ( 0%) ggc
 tree SSA verifier     :   7.34 ( 3%) usr   0.02 ( 1%) sys   7.29 ( 3%) wall   
   0 kB ( 0%) ggc
 tree STMT verifier    :  15.08 ( 6%) usr   0.00 ( 0%) sys  15.11 ( 6%) wall   
   0 kB ( 0%) ggc
 callgraph verifier    :   0.85 ( 0%) usr   0.00 ( 0%) sys   0.87 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance frontiers   :   0.26 ( 0%) usr   0.00 ( 0%) sys   0.32 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance computation :   0.86 ( 0%) usr   0.00 ( 0%) sys   0.89 ( 0%) wall   
   0 kB ( 0%) ggc
 control dependences   :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 out of ssa            :   0.33 ( 0%) usr   0.00 ( 0%) sys   0.35 ( 0%) wall   
 225 kB ( 0%) ggc
 expand vars           :   0.16 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall  
11294 kB ( 1%) ggc
 expand                :  14.67 ( 6%) usr   0.04 ( 2%) sys  13.89 ( 6%) wall 
111424 kB ( 6%) ggc
 post expand cleanups  :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall   
5818 kB ( 0%) ggc
 lower subreg          :   0.19 ( 0%) usr   0.00 ( 0%) sys   0.13 ( 0%) wall   
   0 kB ( 0%) ggc
 forward prop          :   1.58 ( 1%) usr   0.00 ( 0%) sys   1.45 ( 1%) wall  
15809 kB ( 1%) ggc
 CSE                   :   1.60 ( 1%) usr   0.00 ( 0%) sys   1.73 ( 1%) wall   
 662 kB ( 0%) ggc
 dead code elimination :   1.72 ( 1%) usr   0.00 ( 0%) sys   1.77 ( 1%) wall   
   0 kB ( 0%) ggc
 dead store elim1      :   1.36 ( 1%) usr   0.01 ( 1%) sys   1.29 ( 1%) wall  
23524 kB ( 1%) ggc
 dead store elim2      :   2.01 ( 1%) usr   0.00 ( 0%) sys   2.10 ( 1%) wall  
22835 kB ( 1%) ggc
 loop analysis         :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
2220 kB ( 0%) ggc
 loop invariant motion :   0.24 ( 0%) usr   0.00 ( 0%) sys   0.25 ( 0%) wall   
 448 kB ( 0%) ggc
 loop unswitching      :   5.19 ( 2%) usr   0.01 ( 1%) sys   5.40 ( 2%) wall   
 218 kB ( 0%) ggc
 loop unrolling        :  26.07 (11%) usr   0.02 ( 1%) sys  25.98 (11%) wall 
184992 kB (10%) ggc
 CPROP                 :   2.20 ( 1%) usr   0.00 ( 0%) sys   2.48 ( 1%) wall  
25399 kB ( 1%) ggc
 PRE                   :   1.33 ( 1%) usr   0.00 ( 0%) sys   1.25 ( 1%) wall   
1798 kB ( 0%) ggc
 web                   :   2.26 ( 1%) usr   0.00 ( 0%) sys   2.29 ( 1%) wall   
8429 kB ( 0%) ggc
 CSE 2                 :   2.07 ( 1%) usr   0.01 ( 1%) sys   2.30 ( 1%) wall   
2123 kB ( 0%) ggc
 branch prediction     :   0.21 ( 0%) usr   0.00 ( 0%) sys   0.19 ( 0%) wall   
6857 kB ( 0%) ggc
 combiner              :   4.11 ( 2%) usr   0.00 ( 0%) sys   4.21 ( 2%) wall  
60529 kB ( 3%) ggc
 if-conversion         :   0.20 ( 0%) usr   0.00 ( 0%) sys   0.28 ( 0%) wall   
2520 kB ( 0%) ggc
 regmove               :   0.63 ( 0%) usr   0.00 ( 0%) sys   0.80 ( 0%) wall   
   0 kB ( 0%) ggc
 integrated RA         :  14.33 ( 6%) usr   0.05 ( 3%) sys  14.22 ( 6%) wall  
44292 kB ( 2%) ggc
 reload                :   6.75 ( 3%) usr   0.00 ( 0%) sys   6.74 ( 3%) wall  
10065 kB ( 1%) ggc
 reload CSE regs       :   4.55 ( 2%) usr   0.01 ( 1%) sys   4.67 ( 2%) wall  
36964 kB ( 2%) ggc
 load CSE after reload :   0.36 ( 0%) usr   0.01 ( 1%) sys   0.45 ( 0%) wall   
 449 kB ( 0%) ggc
 zee                   :   0.35 ( 0%) usr   0.00 ( 0%) sys   0.35 ( 0%) wall   
  45 kB ( 0%) ggc
 thread pro- & epilogue:   0.11 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall   
3988 kB ( 0%) ggc
 if-conversion 2       :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.10 ( 0%) wall   
1056 kB ( 0%) ggc
 combine stack adjustments:   0.15 ( 0%) usr   0.00 ( 0%) sys   0.17 ( 0%) wall
      0 kB ( 0%) ggc
 peephole 2            :   0.41 ( 0%) usr   0.00 ( 0%) sys   0.40 ( 0%) wall   
2995 kB ( 0%) ggc
 rename registers      :   1.23 ( 1%) usr   0.00 ( 0%) sys   1.31 ( 1%) wall   
2741 kB ( 0%) ggc
 hard reg cprop        :   1.23 ( 1%) usr   0.02 ( 1%) sys   1.11 ( 0%) wall   
  15 kB ( 0%) ggc
 scheduling 2          :   6.25 ( 3%) usr   0.04 ( 2%) sys   6.24 ( 3%) wall   
1284 kB ( 0%) ggc
 machine dep reorg     :   0.82 ( 0%) usr   0.00 ( 0%) sys   0.89 ( 0%) wall   
  77 kB ( 0%) ggc
 reorder blocks        :   0.68 ( 0%) usr   0.00 ( 0%) sys   0.73 ( 0%) wall   
4788 kB ( 0%) ggc
 final                 :   1.86 ( 1%) usr   0.08 ( 5%) sys   2.10 ( 1%) wall   
9656 kB ( 1%) ggc
 symout                :   0.65 ( 0%) usr   0.06 ( 3%) sys   0.69 ( 0%) wall  
58849 kB ( 3%) ggc
 variable tracking     :   2.74 ( 1%) usr   0.00 ( 0%) sys   2.83 ( 1%) wall  
62059 kB ( 3%) ggc
 var-tracking dataflow :   4.21 ( 2%) usr   0.01 ( 1%) sys   4.24 ( 2%) wall   
   0 kB ( 0%) ggc
 var-tracking emit     :   3.79 ( 2%) usr   0.01 ( 1%) sys   3.58 ( 1%) wall  
19142 kB ( 1%) ggc
 TOTAL                 : 244.77             1.72           246.49           
1780321 kB
Extra diagnostic checks enabled; compiler may run slowly.


-- 

jv244 at cam dot ac dot uk changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
         Resolution|FIXED                       |
            Summary|[4.6 Regression] compile    |[4.6 Regression] compile
                   |time increases 8x.          |time increases 5x.


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (10 preceding siblings ...)
  2010-08-29  5:09 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 5x jv244 at cam dot ac dot uk
@ 2010-08-29  5:13 ` pinskia at gcc dot gnu dot org
  2010-08-29  5:21 ` jv244 at cam dot ac dot uk
                   ` (12 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2010-08-29  5:13 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #12 from pinskia at gcc dot gnu dot org  2010-08-29 05:13 -------
>Extra diagnostic checks enabled; compiler may run slowly.

Make sure you configure the trunk with --enable-checking=release to get the
same timing results as what a release would be.


-- 

pinskia at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|REOPENED                    |WAITING


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (11 preceding siblings ...)
  2010-08-29  5:13 ` pinskia at gcc dot gnu dot org
@ 2010-08-29  5:21 ` jv244 at cam dot ac dot uk
  2010-08-29  5:24 ` pinskia at gcc dot gnu dot org
                   ` (11 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29  5:21 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #13 from jv244 at cam dot ac dot uk  2010-08-29 05:20 -------
(In reply to comment #12)
> >Extra diagnostic checks enabled; compiler may run slowly.
> 
> Make sure you configure the trunk with --enable-checking=release to get the
> same timing results as what a release would be.
> 
The comparison is actually against the branches, not releases. However, I'm
rebuilding gcc and will report back.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (12 preceding siblings ...)
  2010-08-29  5:21 ` jv244 at cam dot ac dot uk
@ 2010-08-29  5:24 ` pinskia at gcc dot gnu dot org
  2010-08-29  5:31 ` jv244 at cam dot ac dot uk
                   ` (10 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2010-08-29  5:24 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #14 from pinskia at gcc dot gnu dot org  2010-08-29 05:23 -------
(In reply to comment #12)
> >Extra diagnostic checks enabled; compiler may run slowly.
> 
> Make sure you configure the trunk with --enable-checking=release to get the
> same timing results as what a release would be.

s/release/release branch/ :).


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 5x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (13 preceding siblings ...)
  2010-08-29  5:24 ` pinskia at gcc dot gnu dot org
@ 2010-08-29  5:31 ` jv244 at cam dot ac dot uk
  2010-08-29  6:38 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 3x jv244 at cam dot ac dot uk
                   ` (9 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29  5:31 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #15 from jv244 at cam dot ac dot uk  2010-08-29 05:31 -------
Similar times (a bit faster) with release checking:

Execution times (seconds)
 garbage collection    :   1.17 ( 1%) usr   0.00 ( 0%) sys   1.18 ( 1%) wall   
   0 kB ( 0%) ggc
 callgraph construction:   0.04 ( 0%) usr   0.01 ( 1%) sys   0.04 ( 0%) wall   
5670 kB ( 0%) ggc
 callgraph optimization:   0.32 ( 0%) usr   0.00 ( 0%) sys   0.25 ( 0%) wall   
 599 kB ( 0%) ggc
 ipa cp                :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall   
1345 kB ( 0%) ggc
 ipa function splitting:   0.02 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa reference         :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa pure const        :   0.11 ( 0%) usr   0.02 ( 1%) sys   0.14 ( 0%) wall   
   0 kB ( 0%) ggc
 cfg cleanup           :   2.67 ( 2%) usr   0.02 ( 1%) sys   2.59 ( 2%) wall   
4726 kB ( 0%) ggc
 trivially dead code   :   0.74 ( 0%) usr   0.00 ( 0%) sys   0.72 ( 0%) wall   
   0 kB ( 0%) ggc
 df multiple defs      :   0.48 ( 0%) usr   0.01 ( 1%) sys   0.35 ( 0%) wall   
   0 kB ( 0%) ggc
 df reaching defs      :   1.73 ( 1%) usr   0.00 ( 0%) sys   2.12 ( 1%) wall   
   0 kB ( 0%) ggc
 df live regs          :  10.78 ( 7%) usr   0.01 ( 1%) sys  11.16 ( 7%) wall   
   0 kB ( 0%) ggc
 df live&initialized regs:   3.60 ( 2%) usr   0.00 ( 0%) sys   3.87 ( 2%) wall 
     0 kB ( 0%) ggc
 df use-def / def-use chains:   1.52 ( 1%) usr   0.00 ( 0%) sys   1.18 ( 1%)
wall       0 kB ( 0%) ggc
 df live reg subwords  :   0.33 ( 0%) usr   0.00 ( 0%) sys   0.34 ( 0%) wall   
   0 kB ( 0%) ggc
 df reg dead/unused notes:   5.27 ( 3%) usr   0.00 ( 0%) sys   5.42 ( 3%) wall 
  7568 kB ( 0%) ggc
 register information  :   2.24 ( 1%) usr   0.00 ( 0%) sys   2.19 ( 1%) wall   
   0 kB ( 0%) ggc
 alias analysis        :   2.33 ( 1%) usr   0.00 ( 0%) sys   2.30 ( 1%) wall  
47018 kB ( 3%) ggc
 alias stmt walking    :   0.48 ( 0%) usr   0.05 ( 3%) sys   0.44 ( 0%) wall   
6938 kB ( 0%) ggc
 register scan         :   0.22 ( 0%) usr   0.00 ( 0%) sys   0.37 ( 0%) wall   
 394 kB ( 0%) ggc
 rebuild jump labels   :   0.73 ( 0%) usr   0.00 ( 0%) sys   0.61 ( 0%) wall   
   0 kB ( 0%) ggc
 parser                :   0.85 ( 1%) usr   0.13 ( 7%) sys   0.98 ( 1%) wall  
55365 kB ( 3%) ggc
 inline heuristics     :   0.24 ( 0%) usr   0.00 ( 0%) sys   0.19 ( 0%) wall   
   0 kB ( 0%) ggc
 tree gimplify         :   0.40 ( 0%) usr   0.06 ( 3%) sys   0.47 ( 0%) wall  
48405 kB ( 3%) ggc
 tree eh               :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 tree CFG construction :   0.03 ( 0%) usr   0.02 ( 1%) sys   0.08 ( 0%) wall  
11971 kB ( 1%) ggc
 tree CFG cleanup      :   1.02 ( 1%) usr   0.03 ( 2%) sys   1.14 ( 1%) wall   
3522 kB ( 0%) ggc
 tree VRP              :   2.25 ( 1%) usr   0.05 ( 3%) sys   2.18 ( 1%) wall  
67051 kB ( 4%) ggc
 tree copy propagation :   0.24 ( 0%) usr   0.00 ( 0%) sys   0.16 ( 0%) wall   
1384 kB ( 0%) ggc
 tree find ref. vars   :   0.09 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall   
3806 kB ( 0%) ggc
 tree PTA              :   0.36 ( 0%) usr   0.00 ( 0%) sys   0.26 ( 0%) wall   
5193 kB ( 0%) ggc
 tree PHI insertion    :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
3194 kB ( 0%) ggc
 tree SSA rewrite      :   0.40 ( 0%) usr   0.02 ( 1%) sys   0.53 ( 0%) wall  
14011 kB ( 1%) ggc
 tree SSA other        :   0.09 ( 0%) usr   0.01 ( 1%) sys   0.13 ( 0%) wall   
 428 kB ( 0%) ggc
 tree SSA incremental  :   1.40 ( 1%) usr   0.09 ( 5%) sys   1.50 ( 1%) wall   
7431 kB ( 0%) ggc
 tree operand scan     :   0.45 ( 0%) usr   0.33 (18%) sys   0.82 ( 0%) wall  
58289 kB ( 3%) ggc
 dominator optimization:   0.41 ( 0%) usr   0.04 ( 2%) sys   0.60 ( 0%) wall   
8526 kB ( 0%) ggc
 tree SRA              :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
   0 kB ( 0%) ggc
 tree CCP              :   1.05 ( 1%) usr   0.02 ( 1%) sys   1.16 ( 1%) wall   
4845 kB ( 0%) ggc
 tree PHI const/copy prop:   0.03 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall 
    88 kB ( 0%) ggc
 tree split crit edges :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
2014 kB ( 0%) ggc
 tree reassociation    :   0.25 ( 0%) usr   0.05 ( 3%) sys   0.23 ( 0%) wall   
6023 kB ( 0%) ggc
 tree PRE              :   0.81 ( 0%) usr   0.00 ( 0%) sys   0.82 ( 0%) wall   
7164 kB ( 0%) ggc
 tree FRE              :   0.43 ( 0%) usr   0.03 ( 2%) sys   0.51 ( 0%) wall   
5410 kB ( 0%) ggc
 tree code sinking     :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
1311 kB ( 0%) ggc
 tree linearize phis   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree forward propagate:   0.33 ( 0%) usr   0.00 ( 0%) sys   0.30 ( 0%) wall  
11812 kB ( 1%) ggc
 tree phiprop          :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree conservative DCE :   0.09 ( 0%) usr   0.01 ( 1%) sys   0.09 ( 0%) wall   
 575 kB ( 0%) ggc
 tree aggressive DCE   :   0.72 ( 0%) usr   0.00 ( 0%) sys   0.82 ( 0%) wall  
25487 kB ( 1%) ggc
 tree DSE              :   0.23 ( 0%) usr   0.00 ( 0%) sys   0.18 ( 0%) wall   
 259 kB ( 0%) ggc
 tree loop bounds      :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.22 ( 0%) wall   
6651 kB ( 0%) ggc
 tree loop invariant motion:   0.04 ( 0%) usr   0.01 ( 1%) sys   0.07 ( 0%)
wall      76 kB ( 0%) ggc
 tree canonical iv     :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.13 ( 0%) wall   
3418 kB ( 0%) ggc
 scev constant prop    :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
2301 kB ( 0%) ggc
 tree loop unswitching :   0.00 ( 0%) usr   0.01 ( 1%) sys   0.02 ( 0%) wall   
 739 kB ( 0%) ggc
 complete unrolling    :   1.25 ( 1%) usr   0.15 ( 8%) sys   1.56 ( 1%) wall 
101449 kB ( 6%) ggc
 tree vectorization    :   0.35 ( 0%) usr   0.01 ( 1%) sys   0.26 ( 0%) wall  
20110 kB ( 1%) ggc
 tree slp vectorization:   0.76 ( 0%) usr   0.00 ( 0%) sys   0.72 ( 0%) wall  
52730 kB ( 3%) ggc
 tree loop distribution:   0.10 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall   
   0 kB ( 0%) ggc
 tree prefetching      :   1.64 ( 1%) usr   0.08 ( 4%) sys   1.39 ( 1%) wall  
90832 kB ( 5%) ggc
 tree iv optimization  :  32.57 (20%) usr   0.10 ( 5%) sys  32.73 (20%) wall 
322095 kB (18%) ggc
 predictive commoning  :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall   
1747 kB ( 0%) ggc
 tree loop init        :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
1306 kB ( 0%) ggc
 tree loop fini        :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree copy headers     :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall   
1652 kB ( 0%) ggc
 tree SSA uncprop      :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree rename SSA copies:   0.05 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance frontiers   :   0.19 ( 0%) usr   0.01 ( 1%) sys   0.29 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance computation :   0.50 ( 0%) usr   0.01 ( 1%) sys   0.64 ( 0%) wall   
   0 kB ( 0%) ggc
 control dependences   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 out of ssa            :   0.28 ( 0%) usr   0.00 ( 0%) sys   0.28 ( 0%) wall   
 215 kB ( 0%) ggc
 expand vars           :   0.25 ( 0%) usr   0.00 ( 0%) sys   0.26 ( 0%) wall  
11358 kB ( 1%) ggc
 expand                :   6.48 ( 4%) usr   0.04 ( 2%) sys   6.30 ( 4%) wall 
111660 kB ( 6%) ggc
 post expand cleanups  :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.12 ( 0%) wall   
5818 kB ( 0%) ggc
 lower subreg          :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.20 ( 0%) wall   
   0 kB ( 0%) ggc
 jump                  :   0.00 ( 0%) usr   0.01 ( 1%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 forward prop          :   1.67 ( 1%) usr   0.00 ( 0%) sys   1.68 ( 1%) wall  
15809 kB ( 1%) ggc
 CSE                   :   1.77 ( 1%) usr   0.01 ( 1%) sys   1.59 ( 1%) wall   
 661 kB ( 0%) ggc
 dead code elimination :   2.19 ( 1%) usr   0.00 ( 0%) sys   1.99 ( 1%) wall   
   0 kB ( 0%) ggc
 dead store elim1      :   1.42 ( 1%) usr   0.01 ( 1%) sys   1.34 ( 1%) wall  
23523 kB ( 1%) ggc
 dead store elim2      :   1.14 ( 1%) usr   0.00 ( 0%) sys   1.37 ( 1%) wall  
22811 kB ( 1%) ggc
 loop analysis         :   0.12 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
2220 kB ( 0%) ggc
 loop invariant motion :   0.31 ( 0%) usr   0.00 ( 0%) sys   0.22 ( 0%) wall   
 448 kB ( 0%) ggc
 loop unswitching      :   0.12 ( 0%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall   
   0 kB ( 0%) ggc
 loop unrolling        :   4.31 ( 3%) usr   0.03 ( 2%) sys   4.35 ( 3%) wall 
184514 kB (10%) ggc
 CPROP                 :   2.35 ( 1%) usr   0.00 ( 0%) sys   2.36 ( 1%) wall  
25112 kB ( 1%) ggc
 PRE                   :   1.42 ( 1%) usr   0.00 ( 0%) sys   1.34 ( 1%) wall   
1627 kB ( 0%) ggc
 web                   :   1.12 ( 1%) usr   0.00 ( 0%) sys   1.33 ( 1%) wall   
8405 kB ( 0%) ggc
 CSE 2                 :   2.29 ( 1%) usr   0.00 ( 0%) sys   2.11 ( 1%) wall   
2121 kB ( 0%) ggc
 branch prediction     :   0.20 ( 0%) usr   0.00 ( 0%) sys   0.17 ( 0%) wall   
6854 kB ( 0%) ggc
 combiner              :   4.28 ( 3%) usr   0.01 ( 1%) sys   4.22 ( 3%) wall  
60382 kB ( 3%) ggc
 if-conversion         :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.20 ( 0%) wall   
2520 kB ( 0%) ggc
 regmove               :   0.74 ( 0%) usr   0.00 ( 0%) sys   0.63 ( 0%) wall   
   0 kB ( 0%) ggc
 integrated RA         :  11.28 ( 7%) usr   0.03 ( 2%) sys  11.20 ( 7%) wall  
44198 kB ( 2%) ggc
 reload                :   7.00 ( 4%) usr   0.03 ( 2%) sys   7.05 ( 4%) wall  
10026 kB ( 1%) ggc
 reload CSE regs       :   4.45 ( 3%) usr   0.00 ( 0%) sys   4.48 ( 3%) wall  
36964 kB ( 2%) ggc
 load CSE after reload :   0.42 ( 0%) usr   0.01 ( 1%) sys   0.55 ( 0%) wall   
 449 kB ( 0%) ggc
 zee                   :   0.41 ( 0%) usr   0.01 ( 1%) sys   0.36 ( 0%) wall   
  45 kB ( 0%) ggc
 thread pro- & epilogue:   0.13 ( 0%) usr   0.00 ( 0%) sys   0.18 ( 0%) wall   
3984 kB ( 0%) ggc
 if-conversion 2       :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall   
1056 kB ( 0%) ggc
 combine stack adjustments:   0.12 ( 0%) usr   0.00 ( 0%) sys   0.10 ( 0%) wall
      0 kB ( 0%) ggc
 peephole 2            :   0.54 ( 0%) usr   0.00 ( 0%) sys   0.46 ( 0%) wall   
2995 kB ( 0%) ggc
 rename registers      :   1.21 ( 1%) usr   0.00 ( 0%) sys   1.19 ( 1%) wall   
2741 kB ( 0%) ggc
 hard reg cprop        :   0.93 ( 1%) usr   0.00 ( 0%) sys   1.13 ( 1%) wall   
  15 kB ( 0%) ggc
 scheduling 2          :   6.06 ( 4%) usr   0.02 ( 1%) sys   6.13 ( 4%) wall   
1138 kB ( 0%) ggc
 machine dep reorg     :   0.87 ( 1%) usr   0.00 ( 0%) sys   0.99 ( 1%) wall   
  77 kB ( 0%) ggc
 reorder blocks        :   0.65 ( 0%) usr   0.01 ( 1%) sys   0.58 ( 0%) wall   
4789 kB ( 0%) ggc
 final                 :   1.90 ( 1%) usr   0.15 ( 8%) sys   1.83 ( 1%) wall   
9656 kB ( 1%) ggc
 symout                :   0.62 ( 0%) usr   0.07 ( 4%) sys   0.72 ( 0%) wall  
58777 kB ( 3%) ggc
 variable tracking     :   2.65 ( 2%) usr   0.00 ( 0%) sys   2.79 ( 2%) wall  
62056 kB ( 3%) ggc
 var-tracking dataflow :   3.74 ( 2%) usr   0.01 ( 1%) sys   3.73 ( 2%) wall   
   0 kB ( 0%) ggc
 var-tracking emit     :   3.47 ( 2%) usr   0.00 ( 0%) sys   3.44 ( 2%) wall  
19142 kB ( 1%) ggc
 TOTAL                 : 164.53             1.84           166.37           
1777349 kB


-- 

jv244 at cam dot ac dot uk changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|2010-08-27 17:01:01         |2010-08-29 05:31:37
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (14 preceding siblings ...)
  2010-08-29  5:31 ` jv244 at cam dot ac dot uk
@ 2010-08-29  6:38 ` jv244 at cam dot ac dot uk
  2010-08-29  9:26 ` rguenth at gcc dot gnu dot org
                   ` (8 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29  6:38 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #16 from jv244 at cam dot ac dot uk  2010-08-29 06:38 -------
adjust summary according to the last timings


-- 

jv244 at cam dot ac dot uk changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|2010-08-29 05:31:37         |2010-08-29 06:38:26
               date|                            |
            Summary|[4.6 Regression] compile    |[4.6 Regression] compile
                   |time increases 5x.          |time increases 3x.


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (15 preceding siblings ...)
  2010-08-29  6:38 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 3x jv244 at cam dot ac dot uk
@ 2010-08-29  9:26 ` rguenth at gcc dot gnu dot org
  2010-08-29 15:07 ` jv244 at cam dot ac dot uk
                   ` (7 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-29  9:26 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #17 from rguenth at gcc dot gnu dot org  2010-08-29 09:25 -------
 tree iv optimization  :  32.57 (20%) usr   0.10 ( 5%) sys  32.73 (20%) wall 
322095 kB (18%) ggc


20% is still completely unreasonable for IV optimization.


-- 

rguenth at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|WAITING                     |NEW
   Last reconfirmed|2010-08-29 06:38:26         |2010-08-29 09:25:52
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (16 preceding siblings ...)
  2010-08-29  9:26 ` rguenth at gcc dot gnu dot org
@ 2010-08-29 15:07 ` jv244 at cam dot ac dot uk
  2010-08-30  3:11 ` davidxl at gcc dot gnu dot org
                   ` (6 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jv244 at cam dot ac dot uk @ 2010-08-29 15:07 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #18 from jv244 at cam dot ac dot uk  2010-08-29 15:07 -------
FYI, these are the 4.5 branch timings:

Execution times (seconds)
 garbage collection    :   0.47 ( 1%) usr   0.00 ( 0%) sys   0.47 ( 1%) wall   
   0 kB ( 0%) ggc
 callgraph construction:   0.05 ( 0%) usr   0.01 ( 1%) sys   0.09 ( 0%) wall   
5996 kB ( 1%) ggc
 callgraph optimization:   0.21 ( 0%) usr   0.02 ( 1%) sys   0.26 ( 0%) wall   
 606 kB ( 0%) ggc
 ipa cp                :   0.09 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall   
1381 kB ( 0%) ggc
 ipa reference         :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa pure const        :   0.06 ( 0%) usr   0.01 ( 1%) sys   0.09 ( 0%) wall   
   0 kB ( 0%) ggc
 cfg cleanup           :   0.39 ( 1%) usr   0.00 ( 0%) sys   0.51 ( 1%) wall   
2459 kB ( 0%) ggc
 trivially dead code   :   0.34 ( 1%) usr   0.00 ( 0%) sys   0.30 ( 1%) wall   
   0 kB ( 0%) ggc
 df multiple defs      :   0.08 ( 0%) usr   0.00 ( 0%) sys   0.13 ( 0%) wall   
   0 kB ( 0%) ggc
 df reaching defs      :   0.33 ( 1%) usr   0.00 ( 0%) sys   0.27 ( 1%) wall   
   0 kB ( 0%) ggc
 df live regs          :   2.08 ( 4%) usr   0.01 ( 1%) sys   2.19 ( 4%) wall   
   0 kB ( 0%) ggc
 df live&initialized regs:   0.98 ( 2%) usr   0.00 ( 0%) sys   0.92 ( 2%) wall 
     0 kB ( 0%) ggc
 df use-def / def-use chains:   0.24 ( 0%) usr   0.00 ( 0%) sys   0.19 ( 0%)
wall       0 kB ( 0%) ggc
 df reg dead/unused notes:   0.93 ( 2%) usr   0.00 ( 0%) sys   1.04 ( 2%) wall 
  5756 kB ( 1%) ggc
 register information  :   0.51 ( 1%) usr   0.01 ( 1%) sys   0.39 ( 1%) wall   
   0 kB ( 0%) ggc
 alias analysis        :   0.78 ( 1%) usr   0.01 ( 1%) sys   0.91 ( 2%) wall  
22384 kB ( 3%) ggc
 alias stmt walking    :   0.50 ( 1%) usr   0.03 ( 2%) sys   0.38 ( 1%) wall   
5563 kB ( 1%) ggc
 register scan         :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall   
   0 kB ( 0%) ggc
 rebuild jump labels   :   0.19 ( 0%) usr   0.00 ( 0%) sys   0.19 ( 0%) wall   
   0 kB ( 0%) ggc
 parser                :   0.82 ( 2%) usr   0.13 ( 9%) sys   0.94 ( 2%) wall  
55603 kB ( 6%) ggc
 inline heuristics     :   0.20 ( 0%) usr   0.01 ( 1%) sys   0.16 ( 0%) wall   
   0 kB ( 0%) ggc
 tree gimplify         :   0.38 ( 1%) usr   0.03 ( 2%) sys   0.40 ( 1%) wall  
46588 kB ( 5%) ggc
 tree eh               :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree CFG construction :   0.04 ( 0%) usr   0.02 ( 1%) sys   0.05 ( 0%) wall  
11964 kB ( 1%) ggc
 tree CFG cleanup      :   0.47 ( 1%) usr   0.00 ( 0%) sys   0.79 ( 1%) wall   
1829 kB ( 0%) ggc
 tree VRP              :   1.46 ( 3%) usr   0.05 ( 4%) sys   1.27 ( 2%) wall  
56376 kB ( 6%) ggc
 tree copy propagation :   0.09 ( 0%) usr   0.02 ( 1%) sys   0.22 ( 0%) wall   
 746 kB ( 0%) ggc
 tree find ref. vars   :   0.09 ( 0%) usr   0.01 ( 1%) sys   0.07 ( 0%) wall   
3806 kB ( 0%) ggc
 tree PTA              :   0.30 ( 1%) usr   0.00 ( 0%) sys   0.33 ( 1%) wall   
3836 kB ( 0%) ggc
 tree PHI insertion    :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
3194 kB ( 0%) ggc
 tree SSA rewrite      :   0.24 ( 0%) usr   0.01 ( 1%) sys   0.29 ( 1%) wall  
13860 kB ( 2%) ggc
 tree SSA other        :   0.13 ( 0%) usr   0.02 ( 1%) sys   0.11 ( 0%) wall   
 418 kB ( 0%) ggc
 tree SSA incremental  :   0.89 ( 2%) usr   0.06 ( 4%) sys   0.97 ( 2%) wall   
6811 kB ( 1%) ggc
 tree operand scan     :   0.34 ( 1%) usr   0.23 (17%) sys   0.59 ( 1%) wall  
44776 kB ( 5%) ggc
 dominator optimization:   0.29 ( 1%) usr   0.01 ( 1%) sys   0.35 ( 1%) wall   
5152 kB ( 1%) ggc
 tree CCP              :   0.51 ( 1%) usr   0.02 ( 1%) sys   0.43 ( 1%) wall   
4620 kB ( 1%) ggc
 tree PHI const/copy prop:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall 
   106 kB ( 0%) ggc
 tree split crit edges :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
2019 kB ( 0%) ggc
 tree reassociation    :   0.12 ( 0%) usr   0.01 ( 1%) sys   0.12 ( 0%) wall   
2946 kB ( 0%) ggc
 tree PRE              :   0.92 ( 2%) usr   0.00 ( 0%) sys   0.95 ( 2%) wall   
7315 kB ( 1%) ggc
 tree FRE              :   0.45 ( 1%) usr   0.04 ( 3%) sys   0.35 ( 1%) wall   
5518 kB ( 1%) ggc
 tree code sinking     :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
1400 kB ( 0%) ggc
 tree linearize phis   :   0.02 ( 0%) usr   0.01 ( 1%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree forward propagate:   0.18 ( 0%) usr   0.02 ( 1%) sys   0.16 ( 0%) wall  
10006 kB ( 1%) ggc
 tree conservative DCE :   0.05 ( 0%) usr   0.01 ( 1%) sys   0.13 ( 0%) wall   
 576 kB ( 0%) ggc
 tree aggressive DCE   :   0.28 ( 1%) usr   0.01 ( 1%) sys   0.37 ( 1%) wall   
8853 kB ( 1%) ggc
 tree buildin call DCE :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 tree DSE              :   0.20 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall   
 132 kB ( 0%) ggc
 PHI merge             :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
  37 kB ( 0%) ggc
 tree loop bounds      :   0.22 ( 0%) usr   0.00 ( 0%) sys   0.18 ( 0%) wall   
8266 kB ( 1%) ggc
 tree loop invariant motion:   0.06 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%)
wall      67 kB ( 0%) ggc
 tree canonical iv     :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall   
4779 kB ( 1%) ggc
 scev constant prop    :   0.08 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall   
2345 kB ( 0%) ggc
 tree loop unswitching :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
 573 kB ( 0%) ggc
 complete unrolling    :   1.05 ( 2%) usr   0.11 ( 8%) sys   1.39 ( 3%) wall  
98553 kB (11%) ggc
 tree vectorization    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
 883 kB ( 0%) ggc
 tree slp vectorization:   0.61 ( 1%) usr   0.00 ( 0%) sys   0.60 ( 1%) wall  
53236 kB ( 6%) ggc
 tree iv optimization  :   5.80 (11%) usr   0.06 ( 4%) sys   5.94 (11%) wall  
95356 kB (11%) ggc
 predictive commoning  :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
1054 kB ( 0%) ggc
 tree loop init        :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
1339 kB ( 0%) ggc
 tree copy headers     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
1613 kB ( 0%) ggc
 tree SSA uncprop      :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree rename SSA copies:   0.06 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance frontiers   :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance computation :   0.23 ( 0%) usr   0.00 ( 0%) sys   0.26 ( 0%) wall   
   0 kB ( 0%) ggc
 expand                :   3.24 ( 6%) usr   0.07 ( 5%) sys   3.34 ( 6%) wall  
69633 kB ( 8%) ggc
 lower subreg          :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 forward prop          :   0.48 ( 1%) usr   0.01 ( 1%) sys   0.48 ( 1%) wall   
9984 kB ( 1%) ggc
 CSE                   :   0.73 ( 1%) usr   0.00 ( 0%) sys   0.92 ( 2%) wall   
 248 kB ( 0%) ggc
 dead code elimination :   0.24 ( 0%) usr   0.00 ( 0%) sys   0.28 ( 1%) wall   
   0 kB ( 0%) ggc
 dead store elim1      :   0.33 ( 1%) usr   0.01 ( 1%) sys   0.32 ( 1%) wall   
5987 kB ( 1%) ggc
 dead store elim2      :   0.44 ( 1%) usr   0.02 ( 1%) sys   0.39 ( 1%) wall   
7831 kB ( 1%) ggc
 loop analysis         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
 718 kB ( 0%) ggc
 loop invariant motion :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall   
 305 kB ( 0%) ggc
 loop unswitching      :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 loop unrolling        :   0.65 ( 1%) usr   0.00 ( 0%) sys   0.62 ( 1%) wall  
32780 kB ( 4%) ggc
 CPROP                 :   0.70 ( 1%) usr   0.00 ( 0%) sys   0.60 ( 1%) wall   
7825 kB ( 1%) ggc
 PRE                   :   0.32 ( 1%) usr   0.00 ( 0%) sys   0.33 ( 1%) wall   
 719 kB ( 0%) ggc
 web                   :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall   
 594 kB ( 0%) ggc
 CSE 2                 :   0.75 ( 1%) usr   0.01 ( 1%) sys   0.60 ( 1%) wall   
 470 kB ( 0%) ggc
 branch prediction     :   0.19 ( 0%) usr   0.01 ( 1%) sys   0.14 ( 0%) wall   
7344 kB ( 1%) ggc
 combiner              :   1.19 ( 2%) usr   0.01 ( 1%) sys   1.33 ( 2%) wall  
19980 kB ( 2%) ggc
 if-conversion         :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
 746 kB ( 0%) ggc
 regmove               :   0.37 ( 1%) usr   0.01 ( 1%) sys   0.33 ( 1%) wall   
   0 kB ( 0%) ggc
 integrated RA         :   3.51 ( 7%) usr   0.01 ( 1%) sys   3.74 ( 7%) wall  
12746 kB ( 1%) ggc
 reload                :   2.16 ( 4%) usr   0.02 ( 1%) sys   2.01 ( 4%) wall   
7755 kB ( 1%) ggc
 reload CSE regs       :   1.38 ( 3%) usr   0.00 ( 0%) sys   1.26 ( 2%) wall  
12331 kB ( 1%) ggc
 load CSE after reload :   0.18 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
 162 kB ( 0%) ggc
 thread pro- & epilogue:   0.11 ( 0%) usr   0.00 ( 0%) sys   0.12 ( 0%) wall   
4370 kB ( 0%) ggc
 if-conversion 2       :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
 357 kB ( 0%) ggc
 combine stack adjustments:   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall
      0 kB ( 0%) ggc
 peephole 2            :   0.12 ( 0%) usr   0.00 ( 0%) sys   0.16 ( 0%) wall   
1899 kB ( 0%) ggc
 rename registers      :   0.46 ( 1%) usr   0.00 ( 0%) sys   0.55 ( 1%) wall   
2237 kB ( 0%) ggc
 hard reg cprop        :   0.37 ( 1%) usr   0.00 ( 0%) sys   0.48 ( 1%) wall   
  13 kB ( 0%) ggc
 scheduling 2          :   3.30 ( 6%) usr   0.04 ( 3%) sys   3.10 ( 6%) wall   
1216 kB ( 0%) ggc
 machine dep reorg     :   0.38 ( 1%) usr   0.00 ( 0%) sys   0.36 ( 1%) wall   
  11 kB ( 0%) ggc
 reorder blocks        :   0.15 ( 0%) usr   0.00 ( 0%) sys   0.13 ( 0%) wall   
1283 kB ( 0%) ggc
 final                 :   0.93 ( 2%) usr   0.07 ( 5%) sys   0.84 ( 2%) wall   
6610 kB ( 1%) ggc
 symout                :   0.30 ( 1%) usr   0.03 ( 2%) sys   0.34 ( 1%) wall  
27006 kB ( 3%) ggc
 variable tracking     :   3.86 ( 7%) usr   0.03 ( 2%) sys   3.99 ( 7%) wall  
39804 kB ( 4%) ggc
 plugin execution      :   0.00 ( 0%) usr   0.01 ( 1%) sys   0.05 ( 0%) wall   
   0 kB ( 0%) ggc
 rest of compilation   :   0.00 ( 0%) usr   0.01 ( 1%) sys   0.00 ( 0%) wall   
   0 kB ( 0%) ggc
 TOTAL                 :  52.50             1.37            53.88            
893901 kB


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (17 preceding siblings ...)
  2010-08-29 15:07 ` jv244 at cam dot ac dot uk
@ 2010-08-30  3:11 ` davidxl at gcc dot gnu dot org
  2010-08-30  3:19 ` davidxl at gcc dot gnu dot org
                   ` (5 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-30  3:11 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #20 from davidxl at gcc dot gnu dot org  2010-08-30 03:10 -------
(In reply to comment #16)
> adjust summary according to the last timings
> 

I am surprised to see such big differences between trunk and previous releases.
Compiling this test case with the those options on my core2 box (2.4GHz ) took
only 56seconds which is comparable with the timing with a 4.4.3 compiler (with
google local patches including ivopt improvements).

David


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (18 preceding siblings ...)
  2010-08-30  3:11 ` davidxl at gcc dot gnu dot org
@ 2010-08-30  3:19 ` davidxl at gcc dot gnu dot org
  2010-08-30  7:12 ` rguenth at gcc dot gnu dot org
                   ` (4 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-30  3:19 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #21 from davidxl at gcc dot gnu dot org  2010-08-30 03:19 -------
(In reply to comment #17)
>  tree iv optimization  :  32.57 (20%) usr   0.10 ( 5%) sys  32.73 (20%) wall 
> 322095 kB (18%) ggc
> 
> 
> 20% is still completely unreasonable for IV optimization.
> 

There was a patch in trunk that may double the time in ivopt -- i.e.
find_optimal_iv_set_1 is done twice, one with the original iv set while the
other with full set. This probably needs to be revisited. 

David


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (20 preceding siblings ...)
  2010-08-30  7:12 ` rguenth at gcc dot gnu dot org
@ 2010-08-30  7:12 ` rguenth at gcc dot gnu dot org
  2010-08-30 16:41 ` davidxl at gcc dot gnu dot org
                   ` (2 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-30  7:12 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #23 from rguenth at gcc dot gnu dot org  2010-08-30 07:11 -------
(In reply to comment #22)
> Given the fact that the solution space is really large -- M^N where M is the
> number of candidates and M is the number of uses (here M == 70 and N == 48), 
> and the cost function is complicated, it will be challenging to come up with
> algorithm that converges really fast, and most importantly -- 'guarantees' an
> optimal solution..

Well - we can't guarantee an optimal solution.  We have to take compile-time
into account which means that O(M^N) is not acceptable but we need to come
up with something that can complete in O((M+N) log (M+N)) time at most.

I btw doubt that the solution found is anywhere near optimal for 32bit
x86 - using 15 IVs instead of 2 can't be cheaper.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (19 preceding siblings ...)
  2010-08-30  3:19 ` davidxl at gcc dot gnu dot org
@ 2010-08-30  7:12 ` rguenth at gcc dot gnu dot org
  2010-08-30  7:12 ` rguenth at gcc dot gnu dot org
                   ` (3 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-08-30  7:12 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #24 from rguenth at gcc dot gnu dot org  2010-08-30 07:12 -------
(In reply to comment #20)
> (In reply to comment #16)
> > adjust summary according to the last timings
> > 
> 
> I am surprised to see such big differences between trunk and previous releases.
> Compiling this test case with the those options on my core2 box (2.4GHz ) took
> only 56seconds which is comparable with the timing with a 4.4.3 compiler (with
> google local patches including ivopt improvements).

Of course - because the ivopt improvement patches are the problem.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (21 preceding siblings ...)
  2010-08-30  7:12 ` rguenth at gcc dot gnu dot org
@ 2010-08-30 16:41 ` davidxl at gcc dot gnu dot org
  2010-08-31 17:45 ` davidxl at gcc dot gnu dot org
  2010-09-02 11:25 ` rguenth at gcc dot gnu dot org
  24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-30 16:41 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #25 from davidxl at gcc dot gnu dot org  2010-08-30 16:41 -------
(In reply to comment #24)
> (In reply to comment #20)
> > (In reply to comment #16)
> > > adjust summary according to the last timings
> > > 
> > 
> > I am surprised to see such big differences between trunk and previous releases.
> > Compiling this test case with the those options on my core2 box (2.4GHz ) took
> > only 56seconds which is comparable with the timing with a 4.4.3 compiler (with
> > google local patches including ivopt improvements).
> 
> Of course - because the ivopt improvement patches are the problem.
> 

It is just the total time diff from Joost's measure can be just explained by
ivopt component.

David


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (22 preceding siblings ...)
  2010-08-30 16:41 ` davidxl at gcc dot gnu dot org
@ 2010-08-31 17:45 ` davidxl at gcc dot gnu dot org
  2010-09-02 11:25 ` rguenth at gcc dot gnu dot org
  24 siblings, 0 replies; 26+ messages in thread
From: davidxl at gcc dot gnu dot org @ 2010-08-31 17:45 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #26 from davidxl at gcc dot gnu dot org  2010-08-31 17:45 -------
Good observation re. the number of IVs in the final set. This usually points to
some problem/bug in the cost function. I briefly looked at this case -- it
indeed exposes two more bugs in the cost model:

1) the computation cost of the all the cost pairs in an assignment can actually
not simply be added together, because many rewrite expressions can be commoned.
We now have the mechanism to compute with common loop invariants for register
pressure estimation, and this mechnasim needs to be extended for computation
cost.

2) the offset is not stripped when computing loop invariant expression ids --
this can cause problem in overestimating reg pressure. (The case arises more
often with loop unrolling).

David


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/45422] [4.6 Regression] compile time increases 3x.
  2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
                   ` (23 preceding siblings ...)
  2010-08-31 17:45 ` davidxl at gcc dot gnu dot org
@ 2010-09-02 11:25 ` rguenth at gcc dot gnu dot org
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2010-09-02 11:25 UTC (permalink / raw)
  To: gcc-bugs



-- 

rguenth at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Priority|P3                          |P1


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45422


^ permalink raw reply	[flat|nested] 26+ messages in thread

end of thread, other threads:[~2010-09-02 11:25 UTC | newest]

Thread overview: 26+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2010-08-26 18:33 [Bug middle-end/45422] New: [4.6 Regression] compile time increases 8x jv244 at cam dot ac dot uk
2010-08-26 18:34 ` [Bug middle-end/45422] " jv244 at cam dot ac dot uk
2010-08-27 10:23 ` rguenth at gcc dot gnu dot org
2010-08-27 11:41 ` rguenth at gcc dot gnu dot org
2010-08-27 11:46 ` jv244 at cam dot ac dot uk
2010-08-27 12:07 ` rguenth at gcc dot gnu dot org
2010-08-27 12:16 ` rguenth at gcc dot gnu dot org
2010-08-27 13:01 ` rguenth at gcc dot gnu dot org
2010-08-27 15:52 ` hjl dot tools at gmail dot com
2010-08-27 17:01 ` davidxl at gcc dot gnu dot org
2010-08-28  6:00 ` davidxl at gcc dot gnu dot org
2010-08-29  5:09 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 5x jv244 at cam dot ac dot uk
2010-08-29  5:13 ` pinskia at gcc dot gnu dot org
2010-08-29  5:21 ` jv244 at cam dot ac dot uk
2010-08-29  5:24 ` pinskia at gcc dot gnu dot org
2010-08-29  5:31 ` jv244 at cam dot ac dot uk
2010-08-29  6:38 ` [Bug middle-end/45422] [4.6 Regression] compile time increases 3x jv244 at cam dot ac dot uk
2010-08-29  9:26 ` rguenth at gcc dot gnu dot org
2010-08-29 15:07 ` jv244 at cam dot ac dot uk
2010-08-30  3:11 ` davidxl at gcc dot gnu dot org
2010-08-30  3:19 ` davidxl at gcc dot gnu dot org
2010-08-30  7:12 ` rguenth at gcc dot gnu dot org
2010-08-30  7:12 ` rguenth at gcc dot gnu dot org
2010-08-30 16:41 ` davidxl at gcc dot gnu dot org
2010-08-31 17:45 ` davidxl at gcc dot gnu dot org
2010-09-02 11:25 ` rguenth at gcc dot gnu dot org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).