public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug middle-end/29374]  New: Inordinate space required for modulo scheduling
@ 2006-10-06 23:05 lucier at math dot purdue dot edu
  2006-10-06 23:06 ` [Bug middle-end/29374] " lucier at math dot purdue dot edu
  2006-10-06 23:31 ` lucier at math dot purdue dot edu
  0 siblings, 2 replies; 3+ messages in thread
From: lucier at math dot purdue dot edu @ 2006-10-06 23:05 UTC (permalink / raw)
  To: gcc-bugs

Compiling this file took about 2.9GB of ram with -fmodulo-sched
-freschedule-modulo-scheduled-loops and 1.8 gigs without (visual inspection of
"top").  I guess even the "without" space requirements are somewhat outside my
expectations.

all.i.gz will be attached to the next message.

euler-7% gcc -v
Using built-in specs.
Target: x86_64-unknown-linux-gnu
Configured with: ../configure --prefix=/pkgs/gcc-mainline --disable-checking
--enable-languages=c
Thread model: posix
gcc version 4.2.0 20061006 (experimental)

With modulo-scheduling:

euler-6% gcc -I../include -I. -Wall -W -Wno-unused -O1 -fno-math-errno
-fschedule-insns2 -fno-trapping-math -fno-strict-aliasing -fwrapv
-fexpensive-optimizations -fforce-addr -fpeephole2 -falign-jumps
-falign-functions -fno-function-cse -ftree-copyrename -ftree-fre -ftree-dce
-fregmove -fgcse-las -freorder-functions -fcaller-saves -fno-if-conversion2
-foptimize-sibling-calls -fcse-skip-blocks -funit-at-a-time -finline-functions
-fmodulo-sched -freschedule-modulo-scheduled-loops -fomit-frame-pointer -fPIC
-fno-common -mieee-fp -DHAVE_CONFIG_H -D___PRIMAL -D___LIBRARY
-D___GAMBCDIR=\"/pkgs/Gambit-C/4.0b20\" -c _io.c -save-temps -ftime-report
-fmem-report
Memory still allocated at the end of the compilation process
Size   Allocated        Used    Overhead
8             16k         15k        480 
16            16k         12k        352 
64          4096         640          64 
256           12k       9216         168 
512           56k         53k        784 
1024         136k        136k       1904 
2048         116k        114k       1624 
4096         100k        100k       1400 
8192          56k         56k        392 
16384         16k         16k         56 
112         4096         672          56 
208           12k       8112         168 
192           12k       8256         168 
160           88k         82k       1232 
176          160k        157k       2240 
96          1500k       1475k         20k
416          188k        171k       2632 
128           52k         51k        728 
48           228k        225k       3648 
224          368k        359k       5152 
32           224k        222k       4032 
80            12k         11k        168 
Total       3376k       3286k         47k

String pool
entries         15401
identifiers     15401 (100.00%)
slots           32768
bytes           426k (18k overhead)
table size      256k
coll/search     0.3413
ins/search      0.0729
avg. entry      28.37 bytes (+/- 15.54)
longest entry   92

??? tree nodes created

(No per-node statistics)
Type hash: size 1021, 372 elements, 0.247253 collisions
DECL_DEBUG_EXPR  hash: size 1021, 0 elements, 0.000000 collisions
DECL_VALUE_EXPR  hash: size 1021, 0 elements, 0.000000 collisions

Execution times (seconds)
 TOTAL                 :   0.46             0.02             0.64              
3231 kB

Memory still allocated at the end of the compilation process
Size   Allocated        Used    Overhead
8             16k         13k        480 
16            92k         39k       2024 
64           820k        729k         12k
256         4096        1024          56 
512         4096         512          56 
1024         124k        120k       1736 
2048          12k         10k        168 
4096          64k         64k        896 
8192          40k         40k        280 
16384         16k         16k         56 
32768         96k         96k        168 
65536        704k        704k        616 
131072        512k        512k        224 
524288       1024k       1024k        112 
112          216k        200k       3024 
208           20k         17k        280 
192         1572k       1539k         21k
160           40k         18k        560 
176          976k        723k         13k
96          5536k       4706k         75k
416           16k       8320         224 
48          1744k        826k         27k
224          440k        390k       6160 
32          1616k        257k         28k
80          9624k       1018k        131k
Total         24M         12M        327k

String pool
entries         49980
identifiers     49980 (100.00%)
slots           131072
bytes           736k (54k overhead)
table size      1024k
coll/search     0.4881
ins/search      0.1914
avg. entry      15.09 bytes (+/- 11.94)
longest entry   92

??? tree nodes created

(No per-node statistics)
Type hash: size 1021, 515 elements, 0.801737 collisions
DECL_DEBUG_EXPR  hash: size 4093, 0 elements, 0.732265 collisions
DECL_VALUE_EXPR  hash: size 1021, 0 elements, 0.000000 collisions

Execution times (seconds)
 garbage collection    :   1.69 ( 2%) usr   0.08 ( 1%) sys   1.81 ( 2%) wall   
   0 kB ( 0%) ggc
 callgraph construction:   0.21 ( 0%) usr   0.03 ( 1%) sys   0.26 ( 0%) wall   
4932 kB ( 0%) ggc
 callgraph optimization:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa reference         :   0.06 ( 0%) usr   0.02 ( 0%) sys   0.08 ( 0%) wall   
   8 kB ( 0%) ggc
 cfg cleanup           :   1.30 ( 2%) usr   0.00 ( 0%) sys   1.38 ( 2%) wall   
 432 kB ( 0%) ggc
 trivially dead code   :   0.32 ( 0%) usr   0.00 ( 0%) sys   0.33 ( 0%) wall   
   0 kB ( 0%) ggc
 life analysis         :   4.40 ( 6%) usr   0.00 ( 0%) sys   4.58 ( 5%) wall   
3687 kB ( 0%) ggc
 life info update      :   0.30 ( 0%) usr   0.00 ( 0%) sys   0.27 ( 0%) wall   
   0 kB ( 0%) ggc
 alias analysis        :   0.44 ( 1%) usr   0.00 ( 0%) sys   0.50 ( 1%) wall   
5208 kB ( 0%) ggc
 register scan         :   0.17 ( 0%) usr   0.00 ( 0%) sys   0.20 ( 0%) wall   
   1 kB ( 0%) ggc
 rebuild jump labels   :   0.12 ( 0%) usr   0.00 ( 0%) sys   0.12 ( 0%) wall   
   0 kB ( 0%) ggc
 preprocessing         :   0.34 ( 0%) usr   0.39 ( 7%) sys   0.62 ( 1%) wall   
1187 kB ( 0%) ggc
 lexical analysis      :   0.26 ( 0%) usr   0.67 (12%) sys   1.19 ( 1%) wall   
   0 kB ( 0%) ggc
 parser                :   0.46 ( 1%) usr   0.46 ( 8%) sys   0.88 ( 1%) wall  
26574 kB ( 2%) ggc
 integration           :   0.08 ( 0%) usr   0.02 ( 0%) sys   0.10 ( 0%) wall   
   0 kB ( 0%) ggc
 tree gimplify         :   0.27 ( 0%) usr   0.01 ( 0%) sys   0.29 ( 0%) wall   
8869 kB ( 1%) ggc
 tree eh               :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   0 kB ( 0%) ggc
 tree CFG construction :   0.10 ( 0%) usr   0.01 ( 0%) sys   0.12 ( 0%) wall  
21804 kB ( 2%) ggc
 tree CFG cleanup      :   1.07 ( 1%) usr   0.00 ( 0%) sys   1.08 ( 1%) wall   
3029 kB ( 0%) ggc
 tree copy propagation :   0.40 ( 1%) usr   0.01 ( 0%) sys   0.40 ( 0%) wall   
 931 kB ( 0%) ggc
 tree store copy prop  :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.16 ( 0%) wall   
 288 kB ( 0%) ggc
 tree find ref. vars   :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall   
1300 kB ( 0%) ggc
 tree PTA              :   1.19 ( 2%) usr   0.02 ( 0%) sys   1.28 ( 1%) wall   
1176 kB ( 0%) ggc
 tree alias analysis   :   0.62 ( 1%) usr   0.13 ( 2%) sys   0.70 ( 1%) wall   
 106 kB ( 0%) ggc
 tree PHI insertion    :   0.14 ( 0%) usr   0.02 ( 0%) sys   0.15 ( 0%) wall   
8889 kB ( 1%) ggc
 tree SSA rewrite      :   0.62 ( 1%) usr   0.01 ( 0%) sys   0.67 ( 1%) wall  
10323 kB ( 1%) ggc
 tree SSA other        :   0.03 ( 0%) usr   0.02 ( 0%) sys   0.11 ( 0%) wall   
   0 kB ( 0%) ggc
 tree SSA incremental  :   1.53 ( 2%) usr   0.01 ( 0%) sys   1.62 ( 2%) wall   
1494 kB ( 0%) ggc
 tree operand scan     :   0.53 ( 1%) usr   0.28 ( 5%) sys   0.87 ( 1%) wall  
10327 kB ( 1%) ggc
 dominator optimization:   0.74 ( 1%) usr   0.00 ( 0%) sys   0.81 ( 1%) wall  
17425 kB ( 2%) ggc
 tree SRA              :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 tree STORE-CCP        :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
   2 kB ( 0%) ggc
 tree CCP              :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall   
   9 kB ( 0%) ggc
 tree PHI const/copy prop:   0.04 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall 
     0 kB ( 0%) ggc
 tree split crit edges :   0.03 ( 0%) usr   0.01 ( 0%) sys   0.04 ( 0%) wall   
8831 kB ( 1%) ggc
 tree reassociation    :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.16 ( 0%) wall   
  15 kB ( 0%) ggc
 tree FRE              :   3.54 ( 4%) usr   0.04 ( 1%) sys   3.76 ( 4%) wall   
6874 kB ( 1%) ggc
 tree code sinking     :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall   
   1 kB ( 0%) ggc
 tree linearize phis   :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
   0 kB ( 0%) ggc
 tree forward propagate:   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   1 kB ( 0%) ggc
 tree conservative DCE :   0.42 ( 1%) usr   0.00 ( 0%) sys   0.44 ( 0%) wall   
   0 kB ( 0%) ggc
 tree aggressive DCE   :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
   0 kB ( 0%) ggc
 tree DSE              :   0.10 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall   
   1 kB ( 0%) ggc
 PHI merge             :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
  15 kB ( 0%) ggc
 tree loop bounds      :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   3 kB ( 0%) ggc
 loop invariant motion :   0.10 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall   
   0 kB ( 0%) ggc
 tree canonical iv     :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   3 kB ( 0%) ggc
 scev constant prop    :   0.15 ( 0%) usr   0.01 ( 0%) sys   0.16 ( 0%) wall   
 586 kB ( 0%) ggc
 complete unrolling    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall   
   7 kB ( 0%) ggc
 tree iv optimization  :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall   
  46 kB ( 0%) ggc
 tree loop init        :   0.43 ( 1%) usr   0.02 ( 0%) sys   0.45 ( 1%) wall  
12011 kB ( 1%) ggc
 tree copy headers     :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   8 kB ( 0%) ggc
 tree SSA uncprop      :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall   
   0 kB ( 0%) ggc
 tree SSA to normal    :   4.47 ( 6%) usr   0.02 ( 0%) sys   4.69 ( 5%) wall  
23550 kB ( 2%) ggc
 tree rename SSA copies:   0.13 ( 0%) usr   0.00 ( 0%) sys   0.13 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance frontiers   :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance computation :   0.56 ( 1%) usr   0.00 ( 0%) sys   0.59 ( 1%) wall   
   0 kB ( 0%) ggc
 expand                :   2.06 ( 3%) usr   0.03 ( 1%) sys   2.21 ( 2%) wall  
39692 kB ( 4%) ggc
 jump                  :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   0 kB ( 0%) ggc
 CSE                   :   0.51 ( 1%) usr   0.01 ( 0%) sys   0.55 ( 1%) wall   
1357 kB ( 0%) ggc
 loop analysis         :   3.37 ( 4%) usr   0.89 (16%) sys   4.46 ( 5%) wall   
1712 kB ( 0%) ggc
 branch prediction     :   0.22 ( 0%) usr   0.00 ( 0%) sys   0.23 ( 0%) wall   
 534 kB ( 0%) ggc
 flow analysis         :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall   
   0 kB ( 0%) ggc
 combiner              :   0.51 ( 1%) usr   0.02 ( 0%) sys   0.54 ( 1%) wall   
6020 kB ( 1%) ggc
 if-conversion         :   0.18 ( 0%) usr   0.00 ( 0%) sys   0.19 ( 0%) wall   
  93 kB ( 0%) ggc
 regmove               :   0.28 ( 0%) usr   0.00 ( 0%) sys   0.31 ( 0%) wall   
   0 kB ( 0%) ggc
 mode switching        :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 sms modulo scheduling :  34.03 (43%) usr   1.44 (26%) sys  37.23 (42%) wall   
1692 kB ( 0%) ggc
 local alloc           :   0.63 ( 1%) usr   0.00 ( 0%) sys   0.67 ( 1%) wall   
1722 kB ( 0%) ggc
 global alloc          :   2.13 ( 3%) usr   0.04 ( 1%) sys   2.28 ( 3%) wall   
2239 kB ( 0%) ggc
 reload CSE regs       :   0.63 ( 1%) usr   0.01 ( 0%) sys   0.65 ( 1%) wall   
6964 kB ( 1%) ggc
 flow 2                :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall   
 889 kB ( 0%) ggc
 peephole 2            :   0.09 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall   
 289 kB ( 0%) ggc
 rename registers      :   0.18 ( 0%) usr   0.02 ( 0%) sys   0.19 ( 0%) wall   
  22 kB ( 0%) ggc
 scheduling 2          :   0.93 ( 1%) usr   0.01 ( 0%) sys   0.96 ( 1%) wall   
5634 kB ( 0%) ggc
 machine dep reorg     :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.16 ( 0%) wall   
  50 kB ( 0%) ggc
 reorder blocks        :   3.74 ( 5%) usr   0.62 (11%) sys   4.60 ( 5%) wall 
879054 kB (78%) ggc
 final                 :   0.53 ( 1%) usr   0.02 ( 0%) sys   0.57 ( 1%) wall   
1073 kB ( 0%) ggc
 TOTAL                 :  79.08             5.58            88.85           
1131330 kB

Without modulo-scheduling:

euler-12% gcc -I../include -I. -Wall -W -Wno-unused -O1 -fno-math-errno
-fschedule-insns2 -fno-trapping-math -fno-strict-aliasing -fwrapv
-fexpensive-optimizations -fforce-addr -fpeephole2 -falign-jumps
-falign-functions -fno-function-cse -ftree-copyrename -ftree-fre -ftree-dce
-fregmove -fgcse-las -freorder-functions -fcaller-saves -fno-if-conversion2
-foptimize-sibling-calls -fcse-skip-blocks -funit-at-a-time -finline-functions
-fomit-frame-pointer -fPIC -fno-common -mieee-fp -DHAVE_CONFIG_H -D___PRIMAL
-D___LIBRARY -D___GAMBCDIR=\"/pkgs/Gambit-C/4.0b20\" -c _io.c -save-temps
-ftime-report -fmem-report
Memory still allocated at the end of the compilation process
Size   Allocated        Used    Overhead
8             16k         15k        480 
16            16k         12k        352 
64          4096         640          64 
256           12k       9216         168 
512           56k         53k        784 
1024         136k        136k       1904 
2048         116k        114k       1624 
4096         100k        100k       1400 
8192          56k         56k        392 
16384         16k         16k         56 
112         4096         672          56 
208           12k       8112         168 
192           12k       8256         168 
160           88k         82k       1232 
176          160k        157k       2240 
96          1500k       1475k         20k
416          188k        171k       2632 
128           52k         51k        728 
48           228k        225k       3648 
224          368k        359k       5152 
32           224k        222k       4032 
80            12k         11k        168 
Total       3376k       3286k         47k

String pool
entries         15401
identifiers     15401 (100.00%)
slots           32768
bytes           426k (18k overhead)
table size      256k
coll/search     0.3413
ins/search      0.0729
avg. entry      28.37 bytes (+/- 15.54)
longest entry   92

??? tree nodes created

(No per-node statistics)
Type hash: size 1021, 372 elements, 0.247253 collisions
DECL_DEBUG_EXPR  hash: size 1021, 0 elements, 0.000000 collisions
DECL_VALUE_EXPR  hash: size 1021, 0 elements, 0.000000 collisions

Execution times (seconds)
 TOTAL                 :   0.47             0.02             0.59              
3231 kB
Memory still allocated at the end of the compilation process
Size   Allocated        Used    Overhead
8             16k         13k        480 
16            92k         39k       2024 
64           820k        729k         12k
256         4096        1024          56 
512         4096         512          56 
1024         124k        120k       1736 
2048          12k         10k        168 
4096          64k         64k        896 
8192          40k         40k        280 
16384         16k         16k         56 
32768         96k         96k        168 
65536        704k        704k        616 
131072        512k        512k        224 
524288       1024k       1024k        112 
112          216k        200k       3024 
208           20k         17k        280 
192         1572k       1539k         21k
160           40k         18k        560 
176          976k        723k         13k
96          5536k       4706k         75k
416           16k       8320         224 
48          1744k        826k         27k
224          440k        390k       6160 
32          1620k        257k         28k
80          9636k       1018k        131k
Total         24M         12M        327k

String pool
entries         49980
identifiers     49980 (100.00%)
slots           131072
bytes           736k (54k overhead)
table size      1024k
coll/search     0.4881
ins/search      0.1914
avg. entry      15.09 bytes (+/- 11.94)
longest entry   92

??? tree nodes created

(No per-node statistics)
Type hash: size 1021, 515 elements, 0.801737 collisions
DECL_DEBUG_EXPR  hash: size 4093, 0 elements, 0.758810 collisions
DECL_VALUE_EXPR  hash: size 1021, 0 elements, 0.000000 collisions

Execution times (seconds)
 garbage collection    :   1.62 ( 4%) usr   0.07 ( 2%) sys   1.70 ( 3%) wall   
   0 kB ( 0%) ggc
 callgraph construction:   0.20 ( 0%) usr   0.03 ( 1%) sys   0.24 ( 0%) wall   
4932 kB ( 0%) ggc
 callgraph optimization:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   0 kB ( 0%) ggc
 ipa reference         :   0.07 ( 0%) usr   0.02 ( 1%) sys   0.09 ( 0%) wall   
   8 kB ( 0%) ggc
 cfg cleanup           :   1.22 ( 3%) usr   0.00 ( 0%) sys   1.23 ( 3%) wall   
 344 kB ( 0%) ggc
 trivially dead code   :   0.30 ( 1%) usr   0.00 ( 0%) sys   0.32 ( 1%) wall   
   0 kB ( 0%) ggc
 life analysis         :   3.98 ( 9%) usr   0.00 ( 0%) sys   4.02 ( 8%) wall   
3773 kB ( 0%) ggc
 life info update      :   0.19 ( 0%) usr   0.00 ( 0%) sys   0.19 ( 0%) wall   
   0 kB ( 0%) ggc
 alias analysis        :   0.37 ( 1%) usr   0.00 ( 0%) sys   0.37 ( 1%) wall   
4696 kB ( 0%) ggc
 register scan         :   0.19 ( 0%) usr   0.00 ( 0%) sys   0.20 ( 0%) wall   
   1 kB ( 0%) ggc
 rebuild jump labels   :   0.10 ( 0%) usr   0.00 ( 0%) sys   0.10 ( 0%) wall   
   0 kB ( 0%) ggc
 preprocessing         :   0.38 ( 1%) usr   0.41 (10%) sys   0.66 ( 1%) wall   
1187 kB ( 0%) ggc
 lexical analysis      :   0.21 ( 0%) usr   0.66 (17%) sys   1.05 ( 2%) wall   
   0 kB ( 0%) ggc
 parser                :   0.54 ( 1%) usr   0.38 (10%) sys   1.00 ( 2%) wall  
26574 kB ( 2%) ggc
 integration           :   0.08 ( 0%) usr   0.02 ( 1%) sys   0.10 ( 0%) wall   
   0 kB ( 0%) ggc
 tree gimplify         :   0.27 ( 1%) usr   0.01 ( 0%) sys   0.29 ( 1%) wall   
8869 kB ( 1%) ggc
 tree eh               :   0.02 ( 0%) usr   0.01 ( 0%) sys   0.03 ( 0%) wall   
   0 kB ( 0%) ggc
 tree CFG construction :   0.10 ( 0%) usr   0.01 ( 0%) sys   0.13 ( 0%) wall  
21804 kB ( 2%) ggc
 tree CFG cleanup      :   1.03 ( 2%) usr   0.00 ( 0%) sys   1.08 ( 2%) wall   
3029 kB ( 0%) ggc
 tree copy propagation :   0.40 ( 1%) usr   0.01 ( 0%) sys   0.40 ( 1%) wall   
 931 kB ( 0%) ggc
 tree store copy prop  :   0.15 ( 0%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall   
 288 kB ( 0%) ggc
 tree find ref. vars   :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
1300 kB ( 0%) ggc
 tree PTA              :   1.16 ( 3%) usr   0.04 ( 1%) sys   1.26 ( 3%) wall   
1176 kB ( 0%) ggc
 tree alias analysis   :   0.61 ( 1%) usr   0.16 ( 4%) sys   0.78 ( 2%) wall   
 106 kB ( 0%) ggc
 tree PHI insertion    :   0.13 ( 0%) usr   0.01 ( 0%) sys   0.15 ( 0%) wall   
8889 kB ( 1%) ggc
 tree SSA rewrite      :   0.65 ( 1%) usr   0.00 ( 0%) sys   0.65 ( 1%) wall  
10323 kB ( 1%) ggc
 tree SSA other        :   0.03 ( 0%) usr   0.04 ( 1%) sys   0.08 ( 0%) wall   
   0 kB ( 0%) ggc
 tree SSA incremental  :   1.56 ( 4%) usr   0.00 ( 0%) sys   1.57 ( 3%) wall   
1494 kB ( 0%) ggc
 tree operand scan     :   0.54 ( 1%) usr   0.22 ( 6%) sys   0.75 ( 2%) wall  
10327 kB ( 1%) ggc
 dominator optimization:   0.76 ( 2%) usr   0.02 ( 1%) sys   0.82 ( 2%) wall  
17425 kB ( 2%) ggc
 tree STORE-CCP        :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
   2 kB ( 0%) ggc
 tree CCP              :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall   
   9 kB ( 0%) ggc
 tree PHI const/copy prop:   0.06 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall 
     0 kB ( 0%) ggc
 tree split crit edges :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall   
8831 kB ( 1%) ggc
 tree reassociation    :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.16 ( 0%) wall   
  15 kB ( 0%) ggc
 tree FRE              :   3.59 ( 8%) usr   0.04 ( 1%) sys   3.83 ( 8%) wall   
6874 kB ( 1%) ggc
 tree code sinking     :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
   1 kB ( 0%) ggc
 tree linearize phis   :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall   
   0 kB ( 0%) ggc
 tree forward propagate:   0.07 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall   
   1 kB ( 0%) ggc
 tree conservative DCE :   0.42 ( 1%) usr   0.00 ( 0%) sys   0.42 ( 1%) wall   
   0 kB ( 0%) ggc
 tree aggressive DCE   :   0.14 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 0%) wall   
   0 kB ( 0%) ggc
 tree DSE              :   0.09 ( 0%) usr   0.00 ( 0%) sys   0.12 ( 0%) wall   
   1 kB ( 0%) ggc
 PHI merge             :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
  15 kB ( 0%) ggc
 tree loop bounds      :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   3 kB ( 0%) ggc
 loop invariant motion :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.10 ( 0%) wall   
   0 kB ( 0%) ggc
 tree canonical iv     :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   3 kB ( 0%) ggc
 scev constant prop    :   0.15 ( 0%) usr   0.00 ( 0%) sys   0.16 ( 0%) wall   
 586 kB ( 0%) ggc
 complete unrolling    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
   7 kB ( 0%) ggc
 tree iv optimization  :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall   
  46 kB ( 0%) ggc
 tree loop init        :   0.39 ( 1%) usr   0.02 ( 1%) sys   0.44 ( 1%) wall  
12011 kB ( 1%) ggc
 tree copy headers     :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   8 kB ( 0%) ggc
 tree SSA uncprop      :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall   
   0 kB ( 0%) ggc
 tree SSA to normal    :   4.65 (10%) usr   0.02 ( 1%) sys   4.69 (10%) wall  
23550 kB ( 2%) ggc
 tree rename SSA copies:   0.13 ( 0%) usr   0.00 ( 0%) sys   0.13 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance frontiers   :   0.13 ( 0%) usr   0.00 ( 0%) sys   0.16 ( 0%) wall   
   0 kB ( 0%) ggc
 dominance computation :   0.57 ( 1%) usr   0.00 ( 0%) sys   0.57 ( 1%) wall   
   0 kB ( 0%) ggc
 expand                :   2.04 ( 5%) usr   0.03 ( 1%) sys   2.07 ( 4%) wall  
39692 kB ( 3%) ggc
 varconst              :   0.00 ( 0%) usr   0.01 ( 0%) sys   0.00 ( 0%) wall   
   8 kB ( 0%) ggc
 jump                  :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall   
   0 kB ( 0%) ggc
 CSE                   :   0.53 ( 1%) usr   0.01 ( 0%) sys   0.54 ( 1%) wall   
1357 kB ( 0%) ggc
 loop analysis         :   3.17 ( 7%) usr   0.76 (19%) sys   3.93 ( 8%) wall   
1712 kB ( 0%) ggc
 branch prediction     :   0.22 ( 0%) usr   0.00 ( 0%) sys   0.23 ( 0%) wall   
 534 kB ( 0%) ggc
 flow analysis         :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall   
   0 kB ( 0%) ggc
 combiner              :   0.51 ( 1%) usr   0.00 ( 0%) sys   0.52 ( 1%) wall   
6020 kB ( 1%) ggc
 if-conversion         :   0.17 ( 0%) usr   0.00 ( 0%) sys   0.18 ( 0%) wall   
  93 kB ( 0%) ggc
 regmove               :   0.30 ( 1%) usr   0.00 ( 0%) sys   0.30 ( 1%) wall   
   0 kB ( 0%) ggc
 local alloc           :   0.65 ( 1%) usr   0.01 ( 0%) sys   0.66 ( 1%) wall   
2420 kB ( 0%) ggc
 global alloc          :   2.03 ( 5%) usr   0.04 ( 1%) sys   2.07 ( 4%) wall   
2856 kB ( 0%) ggc
 reload CSE regs       :   0.61 ( 1%) usr   0.01 ( 0%) sys   0.61 ( 1%) wall   
6964 kB ( 1%) ggc
 flow 2                :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 0%) wall   
 888 kB ( 0%) ggc
 peephole 2            :   0.08 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall   
 287 kB ( 0%) ggc
 rename registers      :   0.17 ( 0%) usr   0.00 ( 0%) sys   0.17 ( 0%) wall   
   4 kB ( 0%) ggc
 scheduling 2          :   0.91 ( 2%) usr   0.02 ( 1%) sys   0.94 ( 2%) wall   
5694 kB ( 0%) ggc
 machine dep reorg     :   0.16 ( 0%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall   
  50 kB ( 0%) ggc
 reorder blocks        :   3.74 ( 8%) usr   0.70 (18%) sys   4.44 ( 9%) wall 
892891 kB (78%) ggc
 final                 :   0.53 ( 1%) usr   0.00 ( 0%) sys   0.53 ( 1%) wall   
1073 kB ( 0%) ggc
 TOTAL                 :  44.31             3.97            49.15           
1144316 kB


-- 
           Summary: Inordinate space required for modulo scheduling
           Product: gcc
           Version: 4.2.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: middle-end
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: lucier at math dot purdue dot edu
 GCC build triplet: x86_64-unknown-linux-gnu
  GCC host triplet: x86_64-unknown-linux-gnu
GCC target triplet: x86_64-unknown-linux-gnu


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29374


^ permalink raw reply	[flat|nested] 3+ messages in thread

* [Bug middle-end/29374] Inordinate space required for modulo scheduling
  2006-10-06 23:05 [Bug middle-end/29374] New: Inordinate space required for modulo scheduling lucier at math dot purdue dot edu
@ 2006-10-06 23:06 ` lucier at math dot purdue dot edu
  2006-10-06 23:31 ` lucier at math dot purdue dot edu
  1 sibling, 0 replies; 3+ messages in thread
From: lucier at math dot purdue dot edu @ 2006-10-06 23:06 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #1 from lucier at math dot purdue dot edu  2006-10-06 23:06 -------
Created an attachment (id=12394)
 --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=12394&action=view)
macro-expanded test file


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29374


^ permalink raw reply	[flat|nested] 3+ messages in thread

* [Bug middle-end/29374] Inordinate space required for modulo scheduling
  2006-10-06 23:05 [Bug middle-end/29374] New: Inordinate space required for modulo scheduling lucier at math dot purdue dot edu
  2006-10-06 23:06 ` [Bug middle-end/29374] " lucier at math dot purdue dot edu
@ 2006-10-06 23:31 ` lucier at math dot purdue dot edu
  1 sibling, 0 replies; 3+ messages in thread
From: lucier at math dot purdue dot edu @ 2006-10-06 23:31 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #2 from lucier at math dot purdue dot edu  2006-10-06 23:31 -------
On Darwin you can't compile the PPC64 version of _num.c, an even smaller file,
with Apple's gcc 4.0.1, and I can't build a 64-bit version of 4.2 to test it.

Blah.

gcc -mcpu=970 -m64 -I../include -I. -no-cpp-precomp -Wall -W -Wno-unused -O1
-fno-math-errno -fschedule-insns2 -fno-trapping-math -fno-strict-aliasing
-fwrapv -fexpensive-optimizations -fforce-addr -fpeephole2 -falign-jumps
-falign-functions -fno-function-cse -ftree-copyrename -ftree-fre -ftree-dce
-fregmove -fgcse-las -freorder-functions -fcaller-saves -fno-if-conversion2
-foptimize-sibling-calls -fcse-skip-blocks -funit-at-a-time -finline-functions
-fmodulo-sched -freschedule-modulo-scheduled-loops -fomit-frame-pointer -fPIC
-fno-common -DHAVE_CONFIG_H -D___PRIMAL -D___LIBRARY
-D___GAMBCDIR=\"/usr/local/Gambit-C/4.0b20\" -c _num.c
cc1(10820) malloc: *** vm_allocate(size=220135424) failed (error code=3)
cc1(10820) malloc: *** error: can't allocate region
cc1(10820) malloc: *** set a breakpoint in szone_error to debug

cc1: out of memory allocating 220132608 bytes after a total of 0 bytes
make[1]: *** [_num.o] Error 1


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29374


^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2006-10-06 23:31 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2006-10-06 23:05 [Bug middle-end/29374] New: Inordinate space required for modulo scheduling lucier at math dot purdue dot edu
2006-10-06 23:06 ` [Bug middle-end/29374] " lucier at math dot purdue dot edu
2006-10-06 23:31 ` lucier at math dot purdue dot edu

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).