public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug optimization/8361] [3.3/3.4 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
@ 2003-10-16  2:39 ` mmitchel at gcc dot gnu dot org
  2003-12-29 16:23 ` steven at gcc dot gnu dot org
                   ` (37 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: mmitchel at gcc dot gnu dot org @ 2003-10-16  2:39 UTC (permalink / raw)
  To: gcc-bugs

PLEASE REPLY TO gcc-bugzilla@gcc.gnu.org ONLY, *NOT* gcc-bugs@gcc.gnu.org.

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


mmitchel at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|3.3.2                       |3.4


------- Additional Comments From mmitchel at gcc dot gnu dot org  2003-10-16 02:39 -------
Well, here we go postponing this PR yet again...  This time until GCC 3.4.


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug optimization/8361] [3.3/3.4 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
  2003-10-16  2:39 ` [Bug optimization/8361] [3.3/3.4 regression] C++ compile-time performance regression mmitchel at gcc dot gnu dot org
@ 2003-12-29 16:23 ` steven at gcc dot gnu dot org
  2004-01-02  2:31 ` giovannibajo at libero dot it
                   ` (36 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: steven at gcc dot gnu dot org @ 2003-12-29 16:23 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From steven at gcc dot gnu dot org  2003-12-29 15:40 -------
Zdenek's new dominator interface helps, see: 
http://gcc.gnu.org/ml/gcc-patches/2003-12/msg02164.html 

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug optimization/8361] [3.3/3.4 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
  2003-10-16  2:39 ` [Bug optimization/8361] [3.3/3.4 regression] C++ compile-time performance regression mmitchel at gcc dot gnu dot org
  2003-12-29 16:23 ` steven at gcc dot gnu dot org
@ 2004-01-02  2:31 ` giovannibajo at libero dot it
  2004-01-14  3:22 ` pinskia at gcc dot gnu dot org
                   ` (35 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: giovannibajo at libero dot it @ 2004-01-02  2:31 UTC (permalink / raw)
  To: gcc-bugs



-- 
Bug 8361 depends on bug 13479, which changed state.

Bug 13479 Summary: [3.4 regression] 20-30% compile-time regression with template-heavy code
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=13479

           What    |Old Value                   |New Value
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |INVALID

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug optimization/8361] [3.3/3.4 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (2 preceding siblings ...)
  2004-01-02  2:31 ` giovannibajo at libero dot it
@ 2004-01-14  3:22 ` pinskia at gcc dot gnu dot org
  2004-01-21 18:28 ` [Bug optimization/8361] [3.3/3.4/3.5 " gerald at pfeifer dot com
                   ` (34 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-01-14  3:22 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2004-01-14 03:22 -------
Some improvements lately made by Jan.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug optimization/8361] [3.3/3.4/3.5 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (3 preceding siblings ...)
  2004-01-14  3:22 ` pinskia at gcc dot gnu dot org
@ 2004-01-21 18:28 ` gerald at pfeifer dot com
  2004-02-06  1:44 ` pinskia at gcc dot gnu dot org
                   ` (33 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: gerald at pfeifer dot com @ 2004-01-21 18:28 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From gerald at pfeifer dot com  2004-01-21 18:28 -------
Some additional benchmark data (which will soon be outdated, and for the better
it seems) by work Jan is doing.

http://gcc.gnu.org/ml/gcc/2004-01/msg00657.html

-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|2003-12-19 06:46:21         |2004-01-21 18:28:31
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug optimization/8361] [3.3/3.4/3.5 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (4 preceding siblings ...)
  2004-01-21 18:28 ` [Bug optimization/8361] [3.3/3.4/3.5 " gerald at pfeifer dot com
@ 2004-02-06  1:44 ` pinskia at gcc dot gnu dot org
  2004-03-13 16:38 ` mmitchel at gcc dot gnu dot org
                   ` (32 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-02-06  1:44 UTC (permalink / raw)
  To: gcc-bugs



-- 
Bug 8361 depends on bug 10944, which changed state.

Bug 10944 Summary: alloc_page in ggc-page.c is slow
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=10944

           What    |Old Value                   |New Value
----------------------------------------------------------------------------
             Status|WAITING                     |RESOLVED
         Resolution|                            |WONTFIX

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug optimization/8361] [3.3/3.4/3.5 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (5 preceding siblings ...)
  2004-02-06  1:44 ` pinskia at gcc dot gnu dot org
@ 2004-03-13 16:38 ` mmitchel at gcc dot gnu dot org
  2004-06-18 23:39 ` [Bug rtl-optimization/8361] " mmitchel at gcc dot gnu dot org
                   ` (31 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: mmitchel at gcc dot gnu dot org @ 2004-03-13 16:38 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From mmitchel at gcc dot gnu dot org  2004-03-13 16:38 -------
This PR just keeps hanging around.  How sad.  But, no more work will be done
this before 3.4.0, so I've postponed until 3.4.1.

-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|3.4.0                       |3.4.1


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/3.5 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (6 preceding siblings ...)
  2004-03-13 16:38 ` mmitchel at gcc dot gnu dot org
@ 2004-06-18 23:39 ` mmitchel at gcc dot gnu dot org
  2004-06-22  8:38 ` gerald at pfeifer dot com
                   ` (30 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: mmitchel at gcc dot gnu dot org @ 2004-06-18 23:39 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From mmitchel at gcc dot gnu dot org  2004-06-18 23:39 -------
Postponed until GCC 3.4.2.

-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|3.4.1                       |3.4.2


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/3.5 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (7 preceding siblings ...)
  2004-06-18 23:39 ` [Bug rtl-optimization/8361] " mmitchel at gcc dot gnu dot org
@ 2004-06-22  8:38 ` gerald at pfeifer dot com
  2004-06-25  4:32 ` pinskia at gcc dot gnu dot org
                   ` (29 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: gerald at pfeifer dot com @ 2004-06-22  8:38 UTC (permalink / raw)
  To: gcc-bugs



-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
  BugsThisDependsOn|                            |16131


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/3.5 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (8 preceding siblings ...)
  2004-06-22  8:38 ` gerald at pfeifer dot com
@ 2004-06-25  4:32 ` pinskia at gcc dot gnu dot org
  2004-06-25  7:44 ` pinskia at gcc dot gnu dot org
                   ` (28 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-06-25  4:32 UTC (permalink / raw)
  To: gcc-bugs



-- 
Bug 8361 depends on bug 16131, which changed state.

Bug 16131 Summary: [3.5 Regression] internal compiler error: in create_tmp_var, at gimplify.c:361
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=16131

           What    |Old Value                   |New Value
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
             Status|NEW                         |RESOLVED
         Resolution|                            |FIXED

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/3.5 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (9 preceding siblings ...)
  2004-06-25  4:32 ` pinskia at gcc dot gnu dot org
@ 2004-06-25  7:44 ` pinskia at gcc dot gnu dot org
  2004-08-29 18:50 ` mmitchel at gcc dot gnu dot org
                   ` (27 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-06-25  7:44 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2004-06-25 07:43 -------
For powerpc-apple-darwin I posted two patches which helps at -O0 which goes from 18.0 seconds to 
15.3 seconds:
<http://gcc.gnu.org/ml/gcc-patches/2004-06/msg02029.html>
<http://gcc.gnu.org/ml/gcc-patches/2004-06/msg02031.html>
Note these patches solve problems specific to darwin and only helps there.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/3.5 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (10 preceding siblings ...)
  2004-06-25  7:44 ` pinskia at gcc dot gnu dot org
@ 2004-08-29 18:50 ` mmitchel at gcc dot gnu dot org
  2004-09-15 10:13 ` [Bug rtl-optimization/8361] [3.3/3.4/4.0 " pinskia at gcc dot gnu dot org
                   ` (26 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: mmitchel at gcc dot gnu dot org @ 2004-08-29 18:50 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From mmitchel at gcc dot gnu dot org  2004-08-29 18:47 -------
Postponed until GCC 3.4.3.

-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|3.4.2                       |3.4.3


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (11 preceding siblings ...)
  2004-08-29 18:50 ` mmitchel at gcc dot gnu dot org
@ 2004-09-15 10:13 ` pinskia at gcc dot gnu dot org
  2004-09-18 14:40 ` pinskia at gcc dot gnu dot org
                   ` (25 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-09-15 10:13 UTC (permalink / raw)
  To: gcc-bugs



-- 
Bug 8361 depends on bug 17497, which changed state.

Bug 17497 Summary: [4.0 Regression] ICE: "address taken, but ADDRESSABLE bit not set"
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=17497

           What    |Old Value                   |New Value
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |DUPLICATE

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (12 preceding siblings ...)
  2004-09-15 10:13 ` [Bug rtl-optimization/8361] [3.3/3.4/4.0 " pinskia at gcc dot gnu dot org
@ 2004-09-18 14:40 ` pinskia at gcc dot gnu dot org
  2004-09-24 15:01 ` bonzini at gcc dot gnu dot org
                   ` (24 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-09-18 14:40 UTC (permalink / raw)
  To: gcc-bugs



-- 
Bug 8361 depends on bug 17483, which changed state.

Bug 17483 Summary: [4.0 regression] ICE at -O3 when passing a reference.
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=17483

           What    |Old Value                   |New Value
----------------------------------------------------------------------------
             Status|NEW                         |ASSIGNED
             Status|ASSIGNED                    |NEW
             Status|NEW                         |RESOLVED
         Resolution|                            |FIXED

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (13 preceding siblings ...)
  2004-09-18 14:40 ` pinskia at gcc dot gnu dot org
@ 2004-09-24 15:01 ` bonzini at gcc dot gnu dot org
  2004-10-26 12:34 ` nathan at gcc dot gnu dot org
                   ` (23 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: bonzini at gcc dot gnu dot org @ 2004-09-24 15:01 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From bonzini at gcc dot gnu dot org  2004-09-24 15:01 -------
This PR is unlikely to be closed ever, but some fresh numbers ought to be taken
for mainline.  Unfortunately I don't have even a fraction of the compilers in
the PR description here (only 3.3.4-debian and mainline), so no, I'm not
volunteering to do it. :-)

Paolo

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (14 preceding siblings ...)
  2004-09-24 15:01 ` bonzini at gcc dot gnu dot org
@ 2004-10-26 12:34 ` nathan at gcc dot gnu dot org
  2004-10-26 13:34 ` gerald at pfeifer dot com
                   ` (22 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: nathan at gcc dot gnu dot org @ 2004-10-26 12:34 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From nathan at gcc dot gnu dot org  2004-10-26 12:34 -------
The updated testcase doesn't compile on i686-pc-linux-gnu, with what looks to be
target independent errors. Here are the first few,

/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error: type/value mismatch
at argument 1 in template parameter list for '
template<class _Category, class _Tp, class _Distance, class _Pointer, class
_Reference> struct std::iterator'
/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error:   expected a type,
got 'std::iterator_traits<_Iterator>::iterator_
category'
/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error: type/value mismatch
at argument 2 in template parameter list for '
template<class _Category, class _Tp, class _Distance, class _Pointer, class
_Reference> struct std::iterator'
/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error:   expected a type,
got 'std::iterator_traits<_Iterator>::value_typ
e'
/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error: type/value mismatch
at argument 3 in template parameter list for '
template<class _Category, class _Tp, class _Distance, class _Pointer, class
_Reference> struct std::iterator'
/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error:   expected a type,
got 'std::iterator_traits<_Iterator>::differenc
e_type'
/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error: type/value mismatch
at argument 4 in template parameter list for '
template<class _Category, class _Tp, class _Distance, class _Pointer, class
_Reference> struct std::iterator'
/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error:   expected a type,
got 'std::iterator_traits<_Iterator>::pointer'
/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error: type/value mismatch
at argument 5 in template parameter list for '
template<class _Category, class _Tp, class _Distance, class _Pointer, class
_Reference> struct std::iterator'
/sw/gcc-3.0.4/include/g++-v3/bits/stl_iterator.h:452: error:   expected a type,
got 'std::iterator_traits<_Iterator>::reference
'

what's up?

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (15 preceding siblings ...)
  2004-10-26 12:34 ` nathan at gcc dot gnu dot org
@ 2004-10-26 13:34 ` gerald at pfeifer dot com
  2004-10-26 13:37 ` pinskia at gcc dot gnu dot org
                   ` (21 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: gerald at pfeifer dot com @ 2004-10-26 13:34 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From gerald at pfeifer dot com  2004-10-26 13:34 -------
Is there anything left to do wrt. the testcases?  I saw that Nathan made
some (description-only?) changes.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (16 preceding siblings ...)
  2004-10-26 13:34 ` gerald at pfeifer dot com
@ 2004-10-26 13:37 ` pinskia at gcc dot gnu dot org
  2004-10-30 19:30 ` mmitchel at gcc dot gnu dot org
                   ` (20 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-10-26 13:37 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2004-10-26 13:37 -------
No, Nathan just got confused on which attachment to take.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (17 preceding siblings ...)
  2004-10-26 13:37 ` pinskia at gcc dot gnu dot org
@ 2004-10-30 19:30 ` mmitchel at gcc dot gnu dot org
  2004-11-12 12:52 ` steven at gcc dot gnu dot org
                   ` (19 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: mmitchel at gcc dot gnu dot org @ 2004-10-30 19:30 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From mmitchel at gcc dot gnu dot org  2004-10-30 19:30 -------
I'm not sure how interesting it is to keep this PR open.  

I'll be postponing it every time we get to a release for the forseeable future.

-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|3.4.3                       |4.1.0


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (18 preceding siblings ...)
  2004-10-30 19:30 ` mmitchel at gcc dot gnu dot org
@ 2004-11-12 12:52 ` steven at gcc dot gnu dot org
  2004-11-18 18:56 ` pinskia at gcc dot gnu dot org
                   ` (18 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: steven at gcc dot gnu dot org @ 2004-11-12 12:52 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From steven at gcc dot gnu dot org  2004-11-12 12:52 -------
GCC 3.4 (CVS today) takes 35s usr on my machine.
GCC 4.0 (CVS today) takes 46s usr on the same machine.

The difference is entirely in DOM, into-SSA and SSA-other
which is really also into-SSA:

			usr	sys	wall
dominator optimization	3.16	0.02	3.26
tree SSA rewrite	3.24	0.01	3.27
tree SSA other		3.47	0.09	3.40


Per-pass and cummulative time spent (top 10 only):
integration             1.09    2.30%   48.88%
tree PHI insertion      1.21    2.56%   51.44%
loop invariant motion   1.30    2.75%   54.18%
global alloc            1.30    2.75%   56.93%
CSE                     1.72    3.63%   60.56%
parser                  3.05    6.44%   67.00%
dominator optimization  3.16    6.68%   73.68%
tree SSA rewrite        3.24    6.84%   80.52%
tree SSA other          3.47    7.33%   87.85%
expand                  5.75    12.15%  100.00%


Flat profile:
                                                                               
                
Each sample counts as 0.01 seconds.
  %   cumulative   self              self     total
 time   seconds   seconds    calls   s/call   s/call  name
  1.82      8.19     8.19 13878865     0.00     0.00  is_gimple_reg
  1.50     14.95     6.76     6594     0.00     0.00  synth_mult
  1.28     20.69     5.74 12785589     0.00     0.00  ggc_alloc_stat
  1.27     26.38     5.69  3433257     0.00     0.00  free_df_for_stmt
  1.25     32.01     5.63 16868123     0.00     0.00  bitmap_set_bit
  1.19     37.35     5.34  4846931     0.00     0.00  get_stmt_operands
  1.17     42.59     5.24    62034     0.00     0.00  alloc_page
  1.15     47.75     5.16     3559     0.00     0.01  compute_immediate_uses
  0.99     52.18     4.43  6408238     0.00     0.00  htab_find_slot_with_hash
  0.98     56.60     4.42  2104725     0.00     0.00  compute_immediate_uses_for_phi
  0.93     60.76     4.16   821051     0.00     0.00  gt_ggc_mx_lang_tree_node
  0.91     64.83     4.07  7802758     0.00     0.00  register_new_def
  0.90     68.87     4.04   951728     0.00     0.00  rewrite_stmt
  0.88     72.82     3.95 30035694     0.00     0.00  bitmap_bit_p
  0.84     76.61     3.79   574332     0.00     0.00  cse_insn
  0.81     80.26     3.65   196671     0.00     0.00  compute_global_livein
  0.81     83.91     3.65   177070     0.00     0.00  insert_phi_nodes_for
  0.81     87.54     3.63  2697441     0.00     0.00  for_each_rtx
  0.81     91.16     3.62  1079773     0.00     0.00  check_phi_redundancy

which is a different way of saying "all over the map" :-(



-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (19 preceding siblings ...)
  2004-11-12 12:52 ` steven at gcc dot gnu dot org
@ 2004-11-18 18:56 ` pinskia at gcc dot gnu dot org
  2004-11-19 18:22 ` pinskia at gcc dot gnu dot org
                   ` (17 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-11-18 18:56 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2004-11-18 18:56 -------
I noticed today that my patch for PR 18507 also helps this testcase.

-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
  BugsThisDependsOn|                            |18507


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (20 preceding siblings ...)
  2004-11-18 18:56 ` pinskia at gcc dot gnu dot org
@ 2004-11-19 18:22 ` pinskia at gcc dot gnu dot org
  2004-12-04 16:31 ` pinskia at gcc dot gnu dot org
                   ` (16 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-11-19 18:22 UTC (permalink / raw)
  To: gcc-bugs



-- 
Bug 8361 depends on bug 18507, which changed state.

Bug 18507 Summary: block_defs_stack varrray should not be GC'ed
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18507

           What    |Old Value                   |New Value
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
         Resolution|                            |FIXED

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (21 preceding siblings ...)
  2004-11-19 18:22 ` pinskia at gcc dot gnu dot org
@ 2004-12-04 16:31 ` pinskia at gcc dot gnu dot org
  2004-12-18  4:13 ` pinskia at gcc dot gnu dot org
                   ` (15 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-12-04 16:31 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2004-12-04 16:31 -------
Here is the current results for 3.3.2 vs the mainline:
                    -O0      -O1       -O2      -O3
3.3.2          28.93      42.81   61.13    58.140
mainline     11.06      43.18   54.86    58.35

So we are faster at -O0 but slightly slower at optimization levels  but if we trust the numbers for 3.0.4 
compared to 3.3, we are still 30% slower than 3.0.4 except at -O0.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (22 preceding siblings ...)
  2004-12-04 16:31 ` pinskia at gcc dot gnu dot org
@ 2004-12-18  4:13 ` pinskia at gcc dot gnu dot org
  2004-12-23 11:16 ` steven at gcc dot gnu dot org
                   ` (14 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2004-12-18  4:13 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2004-12-18 04:13 -------
(In reply to comment #39)
> Here is the current results for 3.3.2 vs the mainline:
Now I am getting results that -O3 is faster than -O2, that is not right.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (23 preceding siblings ...)
  2004-12-18  4:13 ` pinskia at gcc dot gnu dot org
@ 2004-12-23 11:16 ` steven at gcc dot gnu dot org
  2005-01-10  1:35 ` pinskia at gcc dot gnu dot org
                   ` (13 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: steven at gcc dot gnu dot org @ 2004-12-23 11:16 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From steven at gcc dot gnu dot org  2004-12-23 11:16 -------
Gerald, you think you can find some cycles to see where we
stand?  I'm very curious how we do for this file, and for
the rest of your test suite.

(It'd be nice if you can compare mainline with some other
official FSF build (3.3, 3.4), because our system compilers
are profiledbootstraped so that gives a skewed picture...)


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (24 preceding siblings ...)
  2004-12-23 11:16 ` steven at gcc dot gnu dot org
@ 2005-01-10  1:35 ` pinskia at gcc dot gnu dot org
  2005-01-16  4:50 ` pinskia at gcc dot gnu dot org
                   ` (12 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2005-01-10  1:35 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2005-01-10 01:35 -------
I am now getting results which say at -O1, we are now faster than 3.3.2, could someone test to make 
sure that they get close results to mine?

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (25 preceding siblings ...)
  2005-01-10  1:35 ` pinskia at gcc dot gnu dot org
@ 2005-01-16  4:50 ` pinskia at gcc dot gnu dot org
  2005-01-16 14:16 ` belyshev at depni dot sinp dot msu dot ru
                   ` (11 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2005-01-16  4:50 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2005-01-16 04:50 -------
(In reply to comment #39)
> Here is the current results for 3.3.2 vs the mainline:
>                     -O0      -O1       -O2      -O3
> 3.3.2          28.93      42.81   61.13    58.140
> mainline     11.06      43.18   54.86    58.35

And more current results for the mainline on powerpc-darwin:
                      11.09      30.55    39.09   38.74

So it looks like this is fixed really and we are 40 % faster than 3.3.2 at -O1 on this testcase.
56% faster at -O2 and 50% at -O3. (which means we have caught back up to and past 3.0.4's number if 
the numbers in comment #0 scales the same on powerpc).

Someone should really do timings on x86 to make sure that they give about the same as powerpc.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (26 preceding siblings ...)
  2005-01-16  4:50 ` pinskia at gcc dot gnu dot org
@ 2005-01-16 14:16 ` belyshev at depni dot sinp dot msu dot ru
  2005-01-16 14:21 ` steven at gcc dot gnu dot org
                   ` (10 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: belyshev at depni dot sinp dot msu dot ru @ 2005-01-16 14:16 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From belyshev at depni dot sinp dot msu dot ru  2005-01-16 14:16 -------
here is the timings for i686-pc-linux-gnu:

        3.0.5   3.2.3   3.3.6   3.4.4   4.0.0   4.0.0/3.0.5

-O0      24.5    26.0    22.4    20.5    16.9    -31%
-O1      41.8    48.3    42.8    37.3    44.8     +7%
-O2      53.4    64.9    59.0    61.6    55.9     +5%
-O3      54.5    68.8    62.8    64.8    57.2     +5%

compilers are:

3.0.5 20030502 (prerelease)
3.2.3
3.3.6 20050116 (prerelease)
3.4.4 20050116 (prerelease)
4.0.0 20050116 (experimental)

all compilers compiled by GNU C version 3.3.6 20050116 (prerelease).


-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|2004-02-25 08:21:05         |2005-01-16 14:16:42
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (27 preceding siblings ...)
  2005-01-16 14:16 ` belyshev at depni dot sinp dot msu dot ru
@ 2005-01-16 14:21 ` steven at gcc dot gnu dot org
  2005-01-28 14:48 ` pinskia at gcc dot gnu dot org
                   ` (9 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: steven at gcc dot gnu dot org @ 2005-01-16 14:21 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From steven at gcc dot gnu dot org  2005-01-16 14:21 -------
Please don't close this bug, ever!  It's GCC nostalgia.  ;-) 

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (28 preceding siblings ...)
  2005-01-16 14:21 ` steven at gcc dot gnu dot org
@ 2005-01-28 14:48 ` pinskia at gcc dot gnu dot org
  2005-01-28 15:15 ` steven at gcc dot gnu dot org
                   ` (8 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2005-01-28 14:48 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2005-01-28 14:48 -------
Can someone do the timings again on x86, I think we are faster at -O1 now than previous versions and 
faster for all other optimization levels?
On ppc-darwin we speed up about 3% (-O2/-O3) to 16% (-O1) between the 15th and now.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (29 preceding siblings ...)
  2005-01-28 14:48 ` pinskia at gcc dot gnu dot org
@ 2005-01-28 15:15 ` steven at gcc dot gnu dot org
  2005-02-06 15:34 ` steven@gcc.gnu.org
                   ` (7 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: steven at gcc dot gnu dot org @ 2005-01-28 15:15 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From steven at gcc dot gnu dot org  2005-01-28 15:15 -------
I will do timings with a bunch of gcc3.x compilers and gcc4.0.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (30 preceding siblings ...)
  2005-01-28 15:15 ` steven at gcc dot gnu dot org
@ 2005-02-06 15:34 ` steven@gcc.gnu.org
  2005-02-06 16:26 ` steven@gcc.gnu.org
                   ` (6 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: steven@gcc.gnu.org @ 2005-02-06 15:34 UTC (permalink / raw)
  To: gcc-bugs

------- Additional Comments From steven at gcc dot gnu dot org  2005-02-06 16:04 -------
All compilers were bootstrapped, with the following flags: 
 
"--disable-{nls,checking} --enable-languages=c,c++" 
 
Below, gcc40 is CVS HEAD.  This was on a 1.6GHz Opteron, with -m32. 
The machine has 4GB of memory so garbage collection times are zero, 
which may account for some of the rather unexpected results. 
For gcc34 and gcc40 I used generate-3.4.ii.bz2 (attachment 3) and 
for the other two I used the latest generate.ii.bz2 (attachment 4). 
 
        gcc32   gcc33   gcc34   gcc40 
-O0     16.439s 16.172s 15.223s 6.674s 
-O1     30.265s 25.115s 20.678s 20.305s 
-O2     42.678s 34.908s 34.526s 27.418s 
-O3     47.469s 47.538s 35.706s 27.896s 
 
I'll try to get numbers on a 32bits machine (i686) as well. 
 

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (31 preceding siblings ...)
  2005-02-06 15:34 ` steven@gcc.gnu.org
@ 2005-02-06 16:26 ` steven@gcc.gnu.org
  2005-02-06 16:54 ` steven@gcc.gnu.org
                   ` (5 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: steven@gcc.gnu.org @ 2005-02-06 16:26 UTC (permalink / raw)
  To: gcc-bugs

------- Additional Comments From steven at gcc dot gnu dot org  2005-02-06 16:49 -------
Similar numbers on a 1.4GHz Xeon (i686): 
 
        gcc32   gcc33   gcc34   gcc40 
-O0     18.865s 15.107s 13.286s 10.193s 
-O1     33.511s 30.096s 24.693s 23.543s 
-O2     46.527s 42.657s 42.618s 33.549s 
-O3     49.537s 43.887s 44.056s 33.917s 
 

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (32 preceding siblings ...)
  2005-02-06 16:26 ` steven@gcc.gnu.org
@ 2005-02-06 16:54 ` steven@gcc.gnu.org
  2005-02-07  2:13 ` ghazi at gcc dot gnu dot org
                   ` (4 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: steven@gcc.gnu.org @ 2005-02-06 16:54 UTC (permalink / raw)
  To: gcc-bugs

------- Additional Comments From steven at gcc dot gnu dot org  2005-02-06 16:54 -------
Considering the numbers from #44, #48, and #49, I think we can conclude 
that we are back to the compile times GCC 3.0 used to have.  It should 
be noted that we have a significantly larger memory foot print though, 
and some of the speedups (especially from GCC 3.2 to GCC 3.3) came from 
smaller hacks to the GC system (collect less often, etc.).  But then, 
most people just use the compiler with -O[0123] and no fancy --params 
and similar hacks, so from a user POV this bug really is fixed, mostly. 
I'm not sure if it is useful to keep this bug open any longer. 

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (33 preceding siblings ...)
  2005-02-06 16:54 ` steven@gcc.gnu.org
@ 2005-02-07  2:13 ` ghazi at gcc dot gnu dot org
  2005-02-08 10:19 ` gerald at pfeifer dot com
                   ` (3 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: ghazi at gcc dot gnu dot org @ 2005-02-07  2:13 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From ghazi at gcc dot gnu dot org  2005-02-06 18:08 -------
If you want to compare how the memory footprint has affected performance, use 
these flags in 3.3 and later:

--param ggc-min-expand=30 --param ggc-min-heapsize=4096

Those are the hardcoded values that 3.2 uses to tune how often the collector 
runs.  I would be interested to see how later versions behave when supplied 
these flags, this will simulate how fast we compile on memory constrained boxes 
relative to 3.2.

Another perhaps more interesting test (but one which will take slightly more 
effort for you) would be to see how raising these values in 3.2 will affect 
performance.  Some distros (RH?) did in fact raise them in their releases so 
users may be comparing their cranked distro gcc-3.2 to our FSF releases.

Of course since these values are hardcoded in 3.2, you'd have to rebuild that 
compiler, however I think an apples-to-apples comparsion is in order before 
closing this PR.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.3/3.4/4.0 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (34 preceding siblings ...)
  2005-02-07  2:13 ` ghazi at gcc dot gnu dot org
@ 2005-02-08 10:19 ` gerald at pfeifer dot com
  2005-07-23 22:14 ` [Bug rtl-optimization/8361] [3.4/4.0/4.1 " pinskia at gcc dot gnu dot org
                   ` (2 subsequent siblings)
  38 siblings, 0 replies; 43+ messages in thread
From: gerald at pfeifer dot com @ 2005-02-08 10:19 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From gerald at pfeifer dot com  2005-02-07 23:09 -------
I had done extensive benchmarks around New Year, based on Steven's request in
comment #41.  Unfortunately I lost most of that data directly before posting
it here and couldn't repeat everything, but coincidently I could save exactly
those parts that Steven did not check now. ;-)  CVS refers to the state in
early January.

The following are for the full application which generate.ii is only one part
of, albeit a representative one.

First the time to build with -O3 and the resulting binary size:

  --------+ stripped-+ build time
  2.95    |  4577588 | 170.78 real
  3.2.3   |  4106176 | 219.70 real
  3.3 CVS |  1073280 | 209.02 real
  3.4 CVS |  1079120 | 189.82 real
  4.0 CVS |  1081776 | 164.86 real

Then some benchmarks results for the binaries; times in seconds, smaller is
better:
      
                        |   2.95 |  3.2.3 | 3.3 CVS | 3.4 CVS | 4.0 CVS |
          --------------+--------+--------+---------+---------+---------+
          STRATCOMP2-ALL|  17.96 | 127.44 |   89.51 |   21.02 |   20.47 |
         STRATCOMP-BRAVE|  77.09 |  78.33 |   77.70 |   83.33 |   82.83 |
                   2QBF1|  11.68 |  13.72 |   13.45 |   13.75 |   12.31 |
              PRIMEIMPL2|   7.52 |   8.05 |    7.21 |    7.00 |    7.42 |
                ANCESTOR|  70.44 |  69.91 |   71.22 |   67.36 |   61.36 |
           3COL-SIMPLEX1|   3.67 |   3.81 |    3.86 |    3.77 |    3.52 |
             3COL-LADDER|  77.99 |  81.11 |   81.72 |   73.23 |   71.58 |
           3COL-N-LADDER|   1.68 |   2.82 |    2.76 |    1.81 |    1.81 |
            3COL-RANDOM1|   8.38 |   8.33 |    7.84 |    8.13 |    8.61 |
              HP-RANDOM1|   6.52 |   7.29 |    7.19 |    7.90 |    7.65 |
           HAMCYCLE-FREE|  68.46 |  88.72 |   82.77 |   64.63 |   66.40 |
                 DECOMP2|   7.75 |   8.48 |    8.98 |    9.87 |    8.80 |
            BW-P5-Esra-a|  34.76 |  36.23 |   35.20 |   31.39 |   31.41 |
            BW-P8-nopush|  90.17 |  89.79 |   88.17 |   81.97 |   83.51 |
           BW-P6-pushbin|  60.23 |  62.86 |   61.34 |   59.09 |   59.94 |
         BW-P7-nopushbin|  84.94 |  87.46 |   83.80 |   79.93 |   81.23 |
                  3SAT-1|  23.91 |  24.91 |   22.55 |   22.23 |   23.19 |
       3SAT-1-CONSTRAINT|  13.97 |  14.76 |   13.51 |   13.37 |   14.15 |
            HANOI-Towers| 737.91 | 632.95 |  636.27 |  680.56 |  661.77 |
         RAMSEY(3,7)!=21|  68.93 |  73.92 |   71.77 |   74.71 |   73.59 |
 RAMSEY(3,7)!=21, normal|  83.92 |  84.02 |   83.32 |   81.23 |   79.21 |
         RAMSEY(4,6)!=25|  92.53 |  99.69 |   95.06 |   96.33 |   90.40 |
         RAMSEY(4,6)!=26| 130.68 | 142.55 |  134.61 |  134.75 |  124.73 |
                 CRISTAL|   5.75 |   5.98 |    5.67 |    5.56 |    5.29 |
                 HANOI-K|1176.06 |1289.65 | 1252.41 | 1154.43 | 1082.85 |
               21-QUEENS|   7.09 |   7.12 |    6.30 |    6.30 |    6.31 |
       MSTDir[V=13,A=40]|  14.34 |  13.02 |   12.34 |   11.50 |   11.69 |
       MSTDir[V=15,A=40]|  14.20 |  12.98 |   12.43 |   11.47 |   11.65 |
     MSTUndir[V=13,A=40]|   7.18 |   7.07 |    6.53 |    6.14 |    6.34 |
     MSTUndir[V=15,A=40]| 116.86 | 113.12 |  104.71 |   99.37 |  103.56 |
          TIMETABLING_4C| 137.64 | 140.79 |  138.66 |  173.87 |  165.50 |
      SCHOOL_TIMETABLING| 328.57 |    -   |     -   |  329.02 |  310.30 | 

So, in terms of build time and binary size we are fine, and also benchmark
performance is nicely improved on average (with some regressions, though).

For whether we can close this now, I'll just refer to comment #32 and
comment #45 (and Kaveh's note on memory usage).

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.4/4.0/4.1 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (35 preceding siblings ...)
  2005-02-08 10:19 ` gerald at pfeifer dot com
@ 2005-07-23 22:14 ` pinskia at gcc dot gnu dot org
  2005-09-18 19:17 ` pinskia at gcc dot gnu dot org
  2005-09-18 19:27 ` pinskia at gcc dot gnu dot org
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2005-07-23 22:14 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2005-07-23 21:58 -------
We have regressioned since the last time someone reported on this one:
-O0   -O1   -O2   -O3
11.1  41.7   55.6  65.9

For -O3, the following passes stand out for compile time:
 tree PTA              :   4.04 ( 6%) usr   0.11 ( 1%) sys   4.45 ( 5%) wall    9319 kB ( 2%) ggc
 tree alias analysis   :   5.34 ( 7%) usr   1.42 ( 9%) sys   7.07 ( 8%) wall   11463 kB ( 2%) ggc
 parser                :   4.48 ( 6%) usr   2.16 (14%) sys   7.11 ( 8%) wall   95214 kB (18%) ggc
 tree operand scan     :   4.28 ( 6%) usr   2.86 (19%) sys   7.41 ( 8%) wall   22145 kB ( 4%) ggc
 dominator optimization:   3.60 ( 5%) usr   0.21 ( 1%) sys   4.02 ( 4%) wall   16448 kB ( 3%) ggc
 expand                :   3.13 ( 4%) usr   0.27 ( 2%) sys   3.53 ( 4%) wall   34210 kB ( 6%) ggc

For memory usage:
 integration           :   2.70 ( 4%) usr   0.30 ( 2%) sys   3.24 ( 4%) wall  124856 kB (24%) ggc
 parser                :   4.48 ( 6%) usr   2.16 (14%) sys   7.11 ( 8%) wall   95214 kB (18%) ggc


At -O0 compile time:
 parser                :   4.55 (33%) usr   2.00 (29%) sys   6.75 (31%) wall   94454 kB (50%) ggc
 name lookup           :   1.82 (13%) usr   2.98 (43%) sys   5.02 (23%) wall   17923 kB ( 9%) ggc
 expand                :   1.57 (11%) usr   0.40 ( 6%) sys   2.04 ( 9%) wall   33674 kB (18%) ggc
 global alloc          :   1.22 ( 9%) usr   0.06 ( 1%) sys   1.36 ( 6%) wall    8858 kB ( 5%) ggc

for memory usage, just the parser.

at -O1:
 parser                :   4.23 ( 9%) usr   2.23 (17%) sys   6.94 (11%) wall   94371 kB (22%) ggc
 integration           :   2.46 ( 5%) usr   0.29 ( 2%) sys   2.70 ( 4%) wall  104683 kB (25%) ggc
 tree PTA              :   3.48 ( 7%) usr   0.09 ( 1%) sys   3.76 ( 6%) wall    8378 kB ( 2%) ggc
 tree alias analysis   :   3.22 ( 7%) usr   1.23 ( 9%) sys   4.69 ( 7%) wall    6203 kB ( 1%) ggc
 tree SSA incremental  :   2.52 ( 5%) usr   0.30 ( 2%) sys   3.06 ( 5%) wall    3278 kB ( 1%) ggc
 tree operand scan     :   3.56 ( 7%) usr   2.32 (18%) sys   6.40 (10%) wall   18232 kB ( 4%) ggc

memory usage:
 integration           :   2.46 ( 5%) usr   0.29 ( 2%) sys   2.70 ( 4%) wall  104683 kB (25%) ggc
 parser                :   4.23 ( 9%) usr   2.23 (17%) sys   6.94 (11%) wall   94371 kB (22%) ggc


-O2:
 expand                :   2.90 ( 5%) usr   0.24 ( 2%) sys   3.02 ( 4%) wall   31476 kB ( 7%) ggc
 tree SSA incremental  :   2.67 ( 4%) usr   0.38 ( 3%) sys   3.30 ( 4%) wall    6252 kB ( 1%) ggc
 tree operand scan     :   3.76 ( 6%) usr   2.49 (18%) sys   6.05 ( 8%) wall   19509 kB ( 4%) ggc
 dominator optimization:   2.91 ( 5%) usr   0.13 ( 1%) sys   3.14 ( 4%) wall   14117 kB ( 3%) ggc
 tree PTA              :   3.46 ( 6%) usr   0.15 ( 1%) sys   3.79 ( 5%) wall    8394 kB ( 2%) ggc
 tree alias analysis   :   3.97 ( 6%) usr   1.40 (10%) sys   5.65 ( 7%) wall   10165 kB ( 2%) ggc
 parser                :   4.41 ( 7%) usr   2.34 (17%) sys   7.21 ( 9%) wall   94371 kB (20%) ggc
 integration           :   2.48 ( 4%) usr   0.23 ( 2%) sys   2.70 ( 3%) wall  104710 kB (22%) ggc

memory usage:
 parser                :   4.41 ( 7%) usr   2.34 (17%) sys   7.21 ( 9%) wall   94371 kB (20%) ggc
 integration           :   2.48 ( 4%) usr   0.23 ( 2%) sys   2.70 ( 3%) wall  104710 kB (22%) ggc


-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|2005-01-16 14:16:42         |2005-07-23 21:58:55
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.4/4.0/4.1 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (36 preceding siblings ...)
  2005-07-23 22:14 ` [Bug rtl-optimization/8361] [3.4/4.0/4.1 " pinskia at gcc dot gnu dot org
@ 2005-09-18 19:17 ` pinskia at gcc dot gnu dot org
  2005-09-18 19:27 ` pinskia at gcc dot gnu dot org
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2005-09-18 19:17 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2005-09-18 19:17 -------
Current numbers for 4.0.0 vs 4.1.0:
pc64:~/src/pr8361> time ~/onetest.release/bin/gcc pr8361.ii -S -m32 -O1
21.137u 0.399s 0:21.89 98.3%    0+0k 0+0io 3pf+0w
pc64:~/src/pr8361> time gcc-4.0 pr8361.ii -S -m32 -O1
14.059u 0.269s 0:14.46 98.9%    0+0k 0+0io 2pf+0w

This on x86_64-pc-linux-gnu.

-ftime-report for 4.1.0:

Execution times (seconds)
 garbage collection    :   0.35 ( 2%) usr   0.01 ( 1%) sys   0.37 ( 2%) wall       0 kB ( 0%) ggc
 callgraph construction:   0.13 ( 1%) usr   0.01 ( 1%) sys   0.18 ( 1%) wall    4538 kB ( 1%) ggc
 callgraph optimization:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall    1193 kB ( 0%) ggc
 ipa reference         :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall     273 kB ( 0%) ggc
 ipa pure const        :   0.04 ( 0%) usr   0.01 ( 1%) sys   0.04 ( 0%) wall       0 kB ( 0%) ggc
 cfg construction      :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall    1607 kB ( 0%) ggc
 cfg cleanup           :   0.14 ( 1%) usr   0.01 ( 1%) sys   0.13 ( 1%) wall     103 kB ( 0%) ggc
 trivially dead code   :   0.10 ( 0%) usr   0.00 ( 0%) sys   0.17 ( 1%) wall       0 kB ( 0%) ggc
 life analysis         :   0.52 ( 2%) usr   0.00 ( 0%) sys   0.52 ( 2%) wall    3245 kB ( 0%) ggc
 life info update      :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     292 kB ( 0%) ggc
 alias analysis        :   0.12 ( 1%) usr   0.00 ( 0%) sys   0.17 ( 1%) wall    2150 kB ( 0%) ggc
 register scan         :   0.16 ( 1%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall     211 kB ( 0%) ggc
 rebuild jump labels   :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall       1 kB ( 0%) ggc
 preprocessing         :   0.13 ( 1%) usr   0.15 ( 8%) sys   0.28 ( 1%) wall     591 kB ( 0%) ggc
 parser                :   1.80 ( 8%) usr   0.42 (23%) sys   2.35 (10%) wall  154459 kB (23%) ggc
 name lookup           :   0.57 ( 3%) usr   0.46 (25%) sys   0.97 ( 4%) wall   31048 kB ( 5%) ggc
 inline heuristics     :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall    7605 kB ( 1%) ggc
 integration           :   1.14 ( 5%) usr   0.01 ( 1%) sys   1.14 ( 5%) wall  162853 kB (24%) ggc
 tree gimplify         :   0.30 ( 1%) usr   0.02 ( 1%) sys   0.28 ( 1%) wall   14133 kB ( 2%) ggc
 tree eh               :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall    1795 kB ( 0%) ggc
 tree CFG construction :   0.02 ( 0%) usr   0.01 ( 1%) sys   0.04 ( 0%) wall   11718 kB ( 2%) ggc
 tree CFG cleanup      :   0.49 ( 2%) usr   0.00 ( 0%) sys   0.68 ( 3%) wall    3669 kB ( 1%) ggc
 tree copy propagation :   0.60 ( 3%) usr   0.00 ( 0%) sys   0.63 ( 3%) wall    1441 kB ( 0%) ggc
 tree store copy prop  :   0.12 ( 1%) usr   0.00 ( 0%) sys   0.10 ( 0%) wall     181 kB ( 0%) ggc
 tree find ref. vars   :   0.25 ( 1%) usr   0.01 ( 1%) sys   0.29 ( 1%) wall   22675 kB ( 3%) ggc
 tree PTA              :   1.61 ( 7%) usr   0.02 ( 1%) sys   1.72 ( 7%) wall   10266 kB ( 2%) ggc
 tree alias analysis   :   1.05 ( 5%) usr   0.15 ( 8%) sys   1.23 ( 5%) wall   11045 kB ( 2%) ggc
 tree PHI insertion    :   0.29 ( 1%) usr   0.00 ( 0%) sys   0.29 ( 1%) wall   16546 kB ( 2%) ggc
 tree SSA rewrite      :   0.65 ( 3%) usr   0.01 ( 1%) sys   0.76 ( 3%) wall   30896 kB ( 5%) ggc
 tree SSA other        :   0.15 ( 1%) usr   0.06 ( 3%) sys   0.20 ( 1%) wall     580 kB ( 0%) ggc
 tree SSA incremental  :   1.58 ( 7%) usr   0.01 ( 1%) sys   1.34 ( 6%) wall    6475 kB ( 1%) ggc
 tree operand scan     :   1.15 ( 5%) usr   0.25 (14%) sys   1.47 ( 6%) wall   15753 kB ( 2%) ggc
 dominator optimization:   0.80 ( 4%) usr   0.04 ( 2%) sys   0.84 ( 4%) wall   14884 kB ( 2%) ggc
 tree SRA              :   0.22 ( 1%) usr   0.02 ( 1%) sys   0.20 ( 1%) wall   11416 kB ( 2%) ggc
 tree STORE-CCP        :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     165 kB ( 0%) ggc
 tree CCP              :   0.21 ( 1%) usr   0.00 ( 0%) sys   0.19 ( 1%) wall     601 kB ( 0%) ggc
 tree split crit edges :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall    6441 kB ( 1%) ggc
 tree reassociation    :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall       1 kB ( 0%) ggc
 tree FRE              :   0.51 ( 2%) usr   0.02 ( 1%) sys   0.53 ( 2%) wall   16049 kB ( 2%) ggc
 tree code sinking     :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall      54 kB ( 0%) ggc
 tree linearize phis   :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      16 kB ( 0%) ggc
 tree forward propagate:   0.04 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 1%) wall    3515 kB ( 1%) ggc
 tree conservative DCE :   0.39 ( 2%) usr   0.00 ( 0%) sys   0.49 ( 2%) wall       0 kB ( 0%) ggc
 tree aggressive DCE   :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall       0 kB ( 0%) ggc
 tree DSE              :   0.11 ( 1%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall      85 kB ( 0%) ggc
 PHI merge             :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     966 kB ( 0%) ggc
 tree loop bounds      :   0.14 ( 1%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall    1796 kB ( 0%) ggc
 loop invariant motion :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall     158 kB ( 0%) ggc
 tree canonical iv     :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall     955 kB ( 0%) ggc
 scev constant prop    :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall     721 kB ( 0%) ggc
 complete unrolling    :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall    1340 kB ( 0%) ggc
 tree iv optimization  :   0.15 ( 1%) usr   0.01 ( 1%) sys   0.18 ( 1%) wall    7715 kB ( 1%) ggc
 tree loop init        :   0.11 ( 1%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall       6 kB ( 0%) ggc
 tree copy headers     :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall    6478 kB ( 1%) ggc
 tree SSA uncprop      :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 tree SSA to normal    :   0.31 ( 1%) usr   0.00 ( 0%) sys   0.41 ( 2%) wall    3411 kB ( 1%) ggc
 tree rename SSA copies:   0.10 ( 0%) usr   0.01 ( 1%) sys   0.16 ( 1%) wall       0 kB ( 0%) ggc
 dominance frontiers   :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall       0 kB ( 0%) ggc
 expand                :   1.28 ( 6%) usr   0.02 ( 1%) sys   1.12 ( 5%) wall   43499 kB ( 7%) ggc
 varconst              :   0.08 ( 0%) usr   0.02 ( 1%) sys   0.05 ( 0%) wall     403 kB ( 0%) ggc
 jump                  :   0.05 ( 0%) usr   0.01 ( 1%) sys   0.06 ( 0%) wall    1203 kB ( 0%) ggc
 CSE                   :   0.22 ( 1%) usr   0.00 ( 0%) sys   0.20 ( 1%) wall     647 kB ( 0%) ggc
 loop analysis         :   0.10 ( 0%) usr   0.00 ( 0%) sys   0.12 ( 1%) wall    1936 kB ( 0%) ggc
 branch prediction     :   0.20 ( 1%) usr   0.01 ( 1%) sys   0.18 ( 1%) wall    1979 kB ( 0%) ggc
 flow analysis         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       2 kB ( 0%) ggc
 combiner              :   0.46 ( 2%) usr   0.00 ( 0%) sys   0.50 ( 2%) wall    5390 kB ( 1%) ggc
 if-conversion         :   0.08 ( 0%) usr   0.01 ( 1%) sys   0.06 ( 0%) wall     308 kB ( 0%) ggc
 local alloc           :   0.25 ( 1%) usr   0.01 ( 1%) sys   0.26 ( 1%) wall    1622 kB ( 0%) ggc
 global alloc          :   0.85 ( 4%) usr   0.01 ( 1%) sys   0.78 ( 3%) wall    9331 kB ( 1%) ggc
 reload CSE regs       :   0.17 ( 1%) usr   0.00 ( 0%) sys   0.17 ( 1%) wall    2917 kB ( 0%) ggc
 flow 2                :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall    1058 kB ( 0%) ggc
 if-conversion 2       :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall      37 kB ( 0%) ggc
 rename registers      :   0.14 ( 1%) usr   0.00 ( 0%) sys   0.15 ( 1%) wall      21 kB ( 0%) ggc
 machine dep reorg     :   0.12 ( 1%) usr   0.00 ( 0%) sys   0.19 ( 1%) wall      86 kB ( 0%) ggc
 shorten branches      :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall       0 kB ( 0%) ggc
 final                 :   0.24 ( 1%) usr   0.00 ( 0%) sys   0.17 ( 1%) wall    1199 kB ( 0%) ggc
 symout                :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     180 kB ( 0%) ggc
 TOTAL                 :  21.70             1.81            23.79             667733 kB

for 4.0.0:

Execution times (seconds)
 garbage collection    :   0.32 ( 2%) usr   0.00 ( 0%) sys   0.32 ( 2%) wall
 callgraph construction:   0.08 ( 1%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall
 callgraph optimization:   0.04 ( 0%) usr   0.01 ( 1%) sys   0.04 ( 0%) wall
 cfg construction      :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall
 cfg cleanup           :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall
 trivially dead code   :   0.08 ( 1%) usr   0.00 ( 0%) sys   0.09 ( 1%) wall
 life analysis         :   0.43 ( 3%) usr   0.00 ( 0%) sys   0.44 ( 3%) wall
 life info update      :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.13 ( 1%) wall
 alias analysis        :   0.08 ( 1%) usr   0.00 ( 0%) sys   0.13 ( 1%) wall
 register scan         :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.14 ( 1%) wall
 rebuild jump labels   :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall
 preprocessing         :   0.11 ( 1%) usr   0.15 ( 9%) sys   0.22 ( 1%) wall
 parser                :   2.00 (14%) usr   0.46 (29%) sys   2.20 (13%) wall
 name lookup           :   0.49 ( 3%) usr   0.44 (28%) sys   1.12 ( 7%) wall
 integration           :   0.67 ( 5%) usr   0.03 ( 2%) sys   0.77 ( 5%) wall
 tree gimplify         :   0.23 ( 2%) usr   0.01 ( 1%) sys   0.35 ( 2%) wall
 tree eh               :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.06 ( 0%) wall
 tree CFG construction :   0.12 ( 1%) usr   0.00 ( 0%) sys   0.11 ( 1%) wall
 tree CFG cleanup      :   0.22 ( 2%) usr   0.00 ( 0%) sys   0.15 ( 1%) wall
 tree find referenced vars:   0.23 ( 2%) usr   0.00 ( 0%) sys   0.20 ( 1%) wall
 tree PTA              :   0.37 ( 3%) usr   0.01 ( 1%) sys   0.45 ( 3%) wall
 tree alias analysis   :   0.51 ( 3%) usr   0.00 ( 0%) sys   0.56 ( 3%) wall
 tree PHI insertion    :   0.17 ( 1%) usr   0.00 ( 0%) sys   0.24 ( 1%) wall
 tree SSA rewrite      :   0.54 ( 4%) usr   0.01 ( 1%) sys   0.47 ( 3%) wall
 tree SSA other        :   0.58 ( 4%) usr   0.16 (10%) sys   0.85 ( 5%) wall
 tree operand scan     :   0.51 ( 3%) usr   0.21 (13%) sys   0.68 ( 4%) wall
 dominator optimization:   0.76 ( 5%) usr   0.05 ( 3%) sys   0.66 ( 4%) wall
 tree SRA              :   0.12 ( 1%) usr   0.00 ( 0%) sys   0.13 ( 1%) wall
 tree CCP              :   0.07 ( 0%) usr   0.00 ( 0%) sys   0.12 ( 1%) wall
 tree split crit edges :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall
 tree remove redundant PHIs:   0.26 ( 2%) usr   0.00 ( 0%) sys   0.33 ( 2%) wall
 tree linearize phis   :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall
 tree forward propagate:   0.15 ( 1%) usr   0.00 ( 0%) sys   0.12 ( 1%) wall
 tree conservative DCE :   0.26 ( 2%) usr   0.00 ( 0%) sys   0.26 ( 2%) wall
 tree aggressive DCE   :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall
 tree DSE              :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall
 PHI merge             :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall
 tree record loop bounds:   0.05 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall
 loop invariant motion :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall
 tree canonical iv creation:   0.03 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall
 tree iv optimization  :   0.22 ( 2%) usr   0.01 ( 1%) sys   0.13 ( 1%) wall
 tree loop init        :   0.12 ( 1%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall
 tree loop fini        :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall
 tree copy headers     :   0.12 ( 1%) usr   0.00 ( 0%) sys   0.17 ( 1%) wall
 tree SSA to normal    :   0.33 ( 2%) usr   0.00 ( 0%) sys   0.30 ( 2%) wall
 tree NRV optimization :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall
 tree rename SSA copies:   0.11 ( 1%) usr   0.00 ( 0%) sys   0.10 ( 1%) wall
 dominance frontiers   :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall
 expand                :   0.97 ( 7%) usr   0.01 ( 1%) sys   1.06 ( 6%) wall
 varconst              :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 1%) wall
 jump                  :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall
 CSE                   :   0.27 ( 2%) usr   0.00 ( 0%) sys   0.22 ( 1%) wall
 loop analysis         :   0.08 ( 1%) usr   0.00 ( 0%) sys   0.09 ( 1%) wall
 branch prediction     :   0.18 ( 1%) usr   0.00 ( 0%) sys   0.19 ( 1%) wall
 flow analysis         :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall
 combiner              :   0.39 ( 3%) usr   0.00 ( 0%) sys   0.35 ( 2%) wall
 if-conversion         :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall
 local alloc           :   0.22 ( 2%) usr   0.00 ( 0%) sys   0.20 ( 1%) wall
 global alloc          :   0.64 ( 4%) usr   0.01 ( 1%) sys   0.66 ( 4%) wall
 reload CSE regs       :   0.21 ( 1%) usr   0.00 ( 0%) sys   0.13 ( 1%) wall
 flow 2                :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall
 if-conversion 2       :   0.07 ( 0%) usr   0.01 ( 1%) sys   0.02 ( 0%) wall
 rename registers      :   0.12 ( 1%) usr   0.00 ( 0%) sys   0.12 ( 1%) wall
 machine dep reorg     :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.11 ( 1%) wall
 shorten branches      :   0.10 ( 1%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall
 final                 :   0.18 ( 1%) usr   0.02 ( 1%) sys   0.24 ( 1%) wall
 symout                :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall
 rest of compilation   :   0.12 ( 1%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall
 TOTAL                 :  14.62             1.60            16.43
14.630u 1.630s 0:16.46 98.7%    0+0k 0+0io 0pf+0w

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.4/4.0/4.1 regression] C++ compile-time performance regression
       [not found] <20021025133601.8361.gerald@pfeifer.com>
                   ` (37 preceding siblings ...)
  2005-09-18 19:17 ` pinskia at gcc dot gnu dot org
@ 2005-09-18 19:27 ` pinskia at gcc dot gnu dot org
  38 siblings, 0 replies; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2005-09-18 19:27 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2005-09-18 19:27 -------
Even the -fno-inline case slowed down too:
pc64:~/src/pr8361> time ~/onetest.release/bin/gcc pr8361.ii -S -m32 -O1 -fno-inline
11.171u 0.359s 0:11.66 98.7%    0+0k 0+0io 0pf+0w
pc64:~/src/pr8361> time gcc-4.0 pr8361.ii -S -m32 -O1 -fno-inline
9.578u 0.295s 0:10.02 98.4%     0+0k 0+0io 0pf+0w


interesting part of 4.1 time report:
 combiner              :   0.36 ( 3%) usr   0.00 ( 0%) sys   0.31 ( 2%) wall    1933 kB ( 0%) ggc
 local alloc           :   0.33 ( 3%) usr   0.00 ( 0%) sys   0.31 ( 2%) wall    3654 kB ( 1%) ggc
 global alloc          :   0.85 ( 8%) usr   0.00 ( 0%) sys   0.75 ( 6%) wall   12187 kB ( 3%) ggc
 tree operand scan     :   0.25 ( 2%) usr   0.10 ( 7%) sys   0.29 ( 2%) wall    9315 kB ( 2%) ggc
 dominator optimization:   0.35 ( 3%) usr   0.01 ( 1%) sys   0.34 ( 3%) wall    3938 kB ( 1%) ggc
 tree PTA              :   0.46 ( 4%) usr   0.02 ( 1%) sys   0.53 ( 4%) wall   19358 kB ( 5%) ggc
 tree alias analysis   :   0.25 ( 2%) usr   0.04 ( 3%) sys   0.27 ( 2%) wall    9734 kB ( 2%) ggc
 tree SSA rewrite      :   0.19 ( 2%) usr   0.00 ( 0%) sys   0.24 ( 2%) wall   10314 kB ( 3%) ggc
 tree SSA incremental  :   0.32 ( 3%) usr   0.00 ( 0%) sys   0.26 ( 2%) wall    2956 kB ( 1%) ggc
 parser                :   1.71 (15%) usr   0.42 (28%) sys   2.07 (16%) wall  154459 kB (39%) ggc
 life analysis         :   0.57 ( 5%) usr   0.01 ( 1%) sys   0.51 ( 4%) wall    3127 kB ( 1%) ggc


corresponding 4.0 time report:
 combiner              :   0.31 ( 3%) usr   0.01 ( 1%) sys   0.28 ( 2%) wall
 local alloc           :   0.35 ( 4%) usr   0.00 ( 0%) sys   0.47 ( 4%) wall
 global alloc          :   0.74 ( 7%) usr   0.01 ( 1%) sys   0.96 ( 8%) wall
 tree operand scan     :   0.18 ( 2%) usr   0.07 ( 5%) sys   0.23 ( 2%) wall
 dominator optimization:   0.42 ( 4%) usr   0.01 ( 1%) sys   0.31 ( 3%) wall
 tree PTA              :   0.14 ( 1%) usr   0.01 ( 1%) sys   0.11 ( 1%) wall
 tree alias analysis   :   0.05 ( 1%) usr   0.00 ( 0%) sys   0.07 ( 1%) wall
 tree SSA rewrite      :   0.17 ( 2%) usr   0.00 ( 0%) sys   0.18 ( 2%) wall
 parser                :   1.71 (17%) usr   0.41 (32%) sys   2.22 (20%) wall
 life analysis         :   0.43 ( 4%) usr   0.00 ( 0%) sys   0.61 ( 5%) wall




-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|2005-07-25 01:54:59         |2005-09-18 19:27:11
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.4/4.0/4.1 regression] C++ compile-time performance regression
       [not found] <bug-8361-231@http.gcc.gnu.org/bugzilla/>
  2005-10-12  5:26 ` ian at airs dot com
  2005-10-13  3:34 ` pinskia at gcc dot gnu dot org
@ 2005-10-13  4:07 ` dberlin at dberlin dot org
  2 siblings, 0 replies; 43+ messages in thread
From: dberlin at dberlin dot org @ 2005-10-13  4:07 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #58 from dberlin at gcc dot gnu dot org  2005-10-13 04:07 -------
Subject: Re:  [3.4/4.0/4.1 regression] C++
        compile-time performance regression

On Thu, 2005-10-13 at 03:34 +0000, pinskia at gcc dot gnu dot org wrote:
> 
> ------- Comment #57 from pinskia at gcc dot gnu dot org  2005-10-13 03:34 -------
> A semi recent 4.1 (the 10th) gives:
>  tree PTA              :   1.60 ( 6%) usr   0.02 ( 1%) sys   1.73 ( 6%) wall  
> 10338 kB ( 1%) ggc
>  tree alias analysis   :   1.32 ( 5%) usr   0.19 (10%) sys   1.48 ( 5%) wall  
> 18910 kB ( 3%) ggc
> 
> while 4.0 gave:
>  tree PTA              :   0.50 ( 2%) usr   0.00 ( 0%) sys   0.48 ( 2%) wall
>  tree alias analysis   :   0.73 ( 3%) usr   0.00 ( 0%) sys   0.76 ( 3%) wall
> 
> So this is definitely a 4.1 regression.
> 
> 

I'm pretty sure we run PTA more times in 4.1 than 4.0
Maybe i'm wrong.
Can you oprofile this and give me some kind of hotspot to look into in
PTA?


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* Re: [Bug rtl-optimization/8361] [3.4/4.0/4.1 regression] C++ compile-time performance regression
  2005-10-13  3:34 ` pinskia at gcc dot gnu dot org
@ 2005-10-13  4:07   ` Daniel Berlin
  0 siblings, 0 replies; 43+ messages in thread
From: Daniel Berlin @ 2005-10-13  4:07 UTC (permalink / raw)
  To: gcc-bugzilla; +Cc: gcc-bugs

On Thu, 2005-10-13 at 03:34 +0000, pinskia at gcc dot gnu dot org wrote:
> 
> ------- Comment #57 from pinskia at gcc dot gnu dot org  2005-10-13 03:34 -------
> A semi recent 4.1 (the 10th) gives:
>  tree PTA              :   1.60 ( 6%) usr   0.02 ( 1%) sys   1.73 ( 6%) wall  
> 10338 kB ( 1%) ggc
>  tree alias analysis   :   1.32 ( 5%) usr   0.19 (10%) sys   1.48 ( 5%) wall  
> 18910 kB ( 3%) ggc
> 
> while 4.0 gave:
>  tree PTA              :   0.50 ( 2%) usr   0.00 ( 0%) sys   0.48 ( 2%) wall
>  tree alias analysis   :   0.73 ( 3%) usr   0.00 ( 0%) sys   0.76 ( 3%) wall
> 
> So this is definitely a 4.1 regression.
> 
> 

I'm pretty sure we run PTA more times in 4.1 than 4.0
Maybe i'm wrong.
Can you oprofile this and give me some kind of hotspot to look into in
PTA?



^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.4/4.0/4.1 regression] C++ compile-time performance regression
       [not found] <bug-8361-231@http.gcc.gnu.org/bugzilla/>
  2005-10-12  5:26 ` ian at airs dot com
@ 2005-10-13  3:34 ` pinskia at gcc dot gnu dot org
  2005-10-13  4:07   ` Daniel Berlin
  2005-10-13  4:07 ` dberlin at dberlin dot org
  2 siblings, 1 reply; 43+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2005-10-13  3:34 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #57 from pinskia at gcc dot gnu dot org  2005-10-13 03:34 -------
A semi recent 4.1 (the 10th) gives:
 tree PTA              :   1.60 ( 6%) usr   0.02 ( 1%) sys   1.73 ( 6%) wall  
10338 kB ( 1%) ggc
 tree alias analysis   :   1.32 ( 5%) usr   0.19 (10%) sys   1.48 ( 5%) wall  
18910 kB ( 3%) ggc

while 4.0 gave:
 tree PTA              :   0.50 ( 2%) usr   0.00 ( 0%) sys   0.48 ( 2%) wall
 tree alias analysis   :   0.73 ( 3%) usr   0.00 ( 0%) sys   0.76 ( 3%) wall

So this is definitely a 4.1 regression.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

* [Bug rtl-optimization/8361] [3.4/4.0/4.1 regression] C++ compile-time performance regression
       [not found] <bug-8361-231@http.gcc.gnu.org/bugzilla/>
@ 2005-10-12  5:26 ` ian at airs dot com
  2005-10-13  3:34 ` pinskia at gcc dot gnu dot org
  2005-10-13  4:07 ` dberlin at dberlin dot org
  2 siblings, 0 replies; 43+ messages in thread
From: ian at airs dot com @ 2005-10-12  5:26 UTC (permalink / raw)
  To: gcc-bugs



------- Comment #56 from ian at airs dot com  2005-10-12 05:26 -------
Is this PR really a 4.0 regression?  The timings which I see in the comments
suggest that 4.0 is just as fast as earlier releases.

That is, the PR may have become a 4.1 regression, but I don't see that it is a
4.0 regression.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=8361


^ permalink raw reply	[flat|nested] 43+ messages in thread

end of thread, other threads:[~2005-10-13  4:07 UTC | newest]

Thread overview: 43+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <20021025133601.8361.gerald@pfeifer.com>
2003-10-16  2:39 ` [Bug optimization/8361] [3.3/3.4 regression] C++ compile-time performance regression mmitchel at gcc dot gnu dot org
2003-12-29 16:23 ` steven at gcc dot gnu dot org
2004-01-02  2:31 ` giovannibajo at libero dot it
2004-01-14  3:22 ` pinskia at gcc dot gnu dot org
2004-01-21 18:28 ` [Bug optimization/8361] [3.3/3.4/3.5 " gerald at pfeifer dot com
2004-02-06  1:44 ` pinskia at gcc dot gnu dot org
2004-03-13 16:38 ` mmitchel at gcc dot gnu dot org
2004-06-18 23:39 ` [Bug rtl-optimization/8361] " mmitchel at gcc dot gnu dot org
2004-06-22  8:38 ` gerald at pfeifer dot com
2004-06-25  4:32 ` pinskia at gcc dot gnu dot org
2004-06-25  7:44 ` pinskia at gcc dot gnu dot org
2004-08-29 18:50 ` mmitchel at gcc dot gnu dot org
2004-09-15 10:13 ` [Bug rtl-optimization/8361] [3.3/3.4/4.0 " pinskia at gcc dot gnu dot org
2004-09-18 14:40 ` pinskia at gcc dot gnu dot org
2004-09-24 15:01 ` bonzini at gcc dot gnu dot org
2004-10-26 12:34 ` nathan at gcc dot gnu dot org
2004-10-26 13:34 ` gerald at pfeifer dot com
2004-10-26 13:37 ` pinskia at gcc dot gnu dot org
2004-10-30 19:30 ` mmitchel at gcc dot gnu dot org
2004-11-12 12:52 ` steven at gcc dot gnu dot org
2004-11-18 18:56 ` pinskia at gcc dot gnu dot org
2004-11-19 18:22 ` pinskia at gcc dot gnu dot org
2004-12-04 16:31 ` pinskia at gcc dot gnu dot org
2004-12-18  4:13 ` pinskia at gcc dot gnu dot org
2004-12-23 11:16 ` steven at gcc dot gnu dot org
2005-01-10  1:35 ` pinskia at gcc dot gnu dot org
2005-01-16  4:50 ` pinskia at gcc dot gnu dot org
2005-01-16 14:16 ` belyshev at depni dot sinp dot msu dot ru
2005-01-16 14:21 ` steven at gcc dot gnu dot org
2005-01-28 14:48 ` pinskia at gcc dot gnu dot org
2005-01-28 15:15 ` steven at gcc dot gnu dot org
2005-02-06 15:34 ` steven@gcc.gnu.org
2005-02-06 16:26 ` steven@gcc.gnu.org
2005-02-06 16:54 ` steven@gcc.gnu.org
2005-02-07  2:13 ` ghazi at gcc dot gnu dot org
2005-02-08 10:19 ` gerald at pfeifer dot com
2005-07-23 22:14 ` [Bug rtl-optimization/8361] [3.4/4.0/4.1 " pinskia at gcc dot gnu dot org
2005-09-18 19:17 ` pinskia at gcc dot gnu dot org
2005-09-18 19:27 ` pinskia at gcc dot gnu dot org
     [not found] <bug-8361-231@http.gcc.gnu.org/bugzilla/>
2005-10-12  5:26 ` ian at airs dot com
2005-10-13  3:34 ` pinskia at gcc dot gnu dot org
2005-10-13  4:07   ` Daniel Berlin
2005-10-13  4:07 ` dberlin at dberlin dot org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).