From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 19078 invoked by alias); 31 Aug 2004 09:58:44 -0000 Mailing-List: contact gcc-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Archive: List-Post: List-Help: Sender: gcc-owner@gcc.gnu.org Received: (qmail 19070 invoked from network); 31 Aug 2004 09:58:42 -0000 Received: from unknown (HELO thinkpad.gardas.net) (80.188.250.3) by sourceware.org with SMTP; 31 Aug 2004 09:58:42 -0000 Received: from karel (helo=localhost) by thinkpad.gardas.net with local-esmtp (Exim 3.35 #1 (Debian)) id 1C25PV-0000hh-00; Tue, 31 Aug 2004 11:58:41 +0200 Date: Tue, 31 Aug 2004 10:28:00 -0000 From: Karel Gardas X-X-Sender: karel@thinkpad.gardas.net To: Steven Bosscher cc: GCC Mailing List Subject: Re: Compilation performance comparison of gcc3.4.1 and gcc3.5.0 2004-08-30 on MICO sources In-Reply-To: <200408311141.06568.stevenb@suse.de> Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-SW-Source: 2004-08/txt/msg01600.txt.bz2 On Tue, 31 Aug 2004, Steven Bosscher wrote: > On Tuesday 31 August 2004 11:11, Karel Gardas wrote: > > Hello, > > > > several times promised here are finally the results obtained for > > yesterday's main-trunk and -O0/1/2 compilations (whole table is below) > > > > As I've already reported -O0 is better, which is great! And O1 and O2 are > > slower for about 8.5% and 7%. > > > > Interesting files seem to be: > > > > 1) typecode.cc: 40% regression on O1 while 7% speedup on O2 > > Can you show us the time report for the 40% regression? Here we go. Execution times (seconds) garbage collection : 0.52 ( 2%) usr 0.00 ( 0%) sys 0.53 ( 2%) wall callgraph construction: 0.19 ( 1%) usr 0.00 ( 0%) sys 0.20 ( 1%) wall callgraph optimization: 0.03 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall cfg construction : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall cfg cleanup : 0.10 ( 0%) usr 0.00 ( 0%) sys 0.15 ( 1%) wall trivially dead code : 0.12 ( 1%) usr 0.00 ( 0%) sys 0.14 ( 1%) wall life analysis : 0.97 ( 4%) usr 0.00 ( 0%) sys 0.84 ( 3%) wall life info update : 0.19 ( 1%) usr 0.00 ( 0%) sys 0.19 ( 1%) wall alias analysis : 0.17 ( 1%) usr 0.01 ( 1%) sys 0.14 ( 1%) wall register scan : 0.17 ( 1%) usr 0.00 ( 0%) sys 0.17 ( 1%) wall rebuild jump labels : 0.02 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall preprocessing : 0.48 ( 2%) usr 0.23 (12%) sys 0.57 ( 2%) wall parser : 3.93 (17%) usr 0.58 (30%) sys 4.67 (18%) wall name lookup : 1.09 ( 5%) usr 0.46 (24%) sys 1.79 ( 7%) wall integration : 1.01 ( 4%) usr 0.06 ( 3%) sys 0.88 ( 3%) wall tree gimplify : 0.60 ( 3%) usr 0.04 ( 2%) sys 0.60 ( 2%) wall tree eh : 0.10 ( 0%) usr 0.00 ( 0%) sys 0.12 ( 0%) wall tree CFG construction : 0.10 ( 0%) usr 0.03 ( 2%) sys 0.13 ( 0%) wall tree CFG cleanup : 0.21 ( 1%) usr 0.01 ( 1%) sys 0.14 ( 1%) wall tree PTA : 0.20 ( 1%) usr 0.00 ( 0%) sys 0.26 ( 1%) wall tree alias analysis : 0.33 ( 1%) usr 0.00 ( 0%) sys 0.37 ( 1%) wall tree PHI insertion : 0.42 ( 2%) usr 0.01 ( 1%) sys 0.50 ( 2%) wall tree SSA rewrite : 0.58 ( 3%) usr 0.00 ( 0%) sys 0.71 ( 3%) wall tree SSA other : 0.82 ( 4%) usr 0.12 ( 6%) sys 0.98 ( 4%) wall tree operand scan : 0.59 ( 3%) usr 0.16 ( 8%) sys 0.98 ( 4%) wall dominator optimization: 1.48 ( 6%) usr 0.02 ( 1%) sys 1.50 ( 6%) wall tree SRA : 0.08 ( 0%) usr 0.00 ( 0%) sys 0.05 ( 0%) wall tree CCP : 0.22 ( 1%) usr 0.00 ( 0%) sys 0.18 ( 1%) wall tree split crit edges : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.03 ( 0%) wall tree PRE : 0.41 ( 2%) usr 0.01 ( 1%) sys 0.41 ( 2%) wall tree forward propagate: 0.04 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall tree conservative DCE : 0.30 ( 1%) usr 0.01 ( 1%) sys 0.28 ( 1%) wall tree aggressive DCE : 0.09 ( 0%) usr 0.00 ( 0%) sys 0.12 ( 0%) wall tree DSE : 0.28 ( 1%) usr 0.00 ( 0%) sys 0.31 ( 1%) wall loop invariant motion : 0.21 ( 1%) usr 0.00 ( 0%) sys 0.22 ( 1%) wall tree copy headers : 0.06 ( 0%) usr 0.01 ( 1%) sys 0.04 ( 0%) wall tree SSA to normal : 0.26 ( 1%) usr 0.01 ( 1%) sys 0.36 ( 1%) wall tree rename SSA copies: 0.11 ( 0%) usr 0.00 ( 0%) sys 0.13 ( 0%) wall dominance frontiers : 0.03 ( 0%) usr 0.00 ( 0%) sys 0.04 ( 0%) wall expand : 2.08 ( 9%) usr 0.07 ( 4%) sys 2.51 (10%) wall varconst : 0.08 ( 0%) usr 0.02 ( 1%) sys 0.09 ( 0%) wall jump : 0.05 ( 0%) usr 0.00 ( 0%) sys 0.07 ( 0%) wall CSE : 0.58 ( 3%) usr 0.00 ( 0%) sys 0.55 ( 2%) wall loop analysis : 0.06 ( 0%) usr 0.00 ( 0%) sys 0.06 ( 0%) wall branch prediction : 0.15 ( 1%) usr 0.00 ( 0%) sys 0.13 ( 0%) wall flow analysis : 0.02 ( 0%) usr 0.01 ( 1%) sys 0.04 ( 0%) wall combiner : 0.55 ( 2%) usr 0.00 ( 0%) sys 0.64 ( 2%) wall if-conversion : 0.08 ( 0%) usr 0.00 ( 0%) sys 0.08 ( 0%) wall local alloc : 0.30 ( 1%) usr 0.00 ( 0%) sys 0.33 ( 1%) wall global alloc : 1.16 ( 5%) usr 0.01 ( 1%) sys 1.34 ( 5%) wall reload CSE regs : 0.31 ( 1%) usr 0.00 ( 0%) sys 0.28 ( 1%) wall flow 2 : 0.09 ( 0%) usr 0.00 ( 0%) sys 0.10 ( 0%) wall if-conversion 2 : 0.05 ( 0%) usr 0.00 ( 0%) sys 0.07 ( 0%) wall rename registers : 0.18 ( 1%) usr 0.00 ( 0%) sys 0.18 ( 1%) wall machine dep reorg : 0.22 ( 1%) usr 0.00 ( 0%) sys 0.17 ( 1%) wall shorten branches : 0.15 ( 1%) usr 0.00 ( 0%) sys 0.17 ( 1%) wall final : 0.26 ( 1%) usr 0.01 ( 1%) sys 0.26 ( 1%) wall symout : 0.01 ( 0%) usr 0.00 ( 0%) sys 0.01 ( 0%) wall rest of compilation : 0.14 ( 1%) usr 0.01 ( 1%) sys 0.19 ( 1%) wall TOTAL : 23.12 1.91 26.21 # cc1plus 23.13 1.93 # as 0.34 0.02 Karel -- Karel Gardas kgardas@objectsecurity.com ObjectSecurity Ltd. http://www.objectsecurity.com