From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 4972 invoked by alias); 17 Jan 2004 14:44:59 -0000 Mailing-List: contact gcc-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Archive: List-Post: List-Help: Sender: gcc-owner@gcc.gnu.org Received: (qmail 4963 invoked from network); 17 Jan 2004 14:44:55 -0000 Received: from unknown (HELO nikam.ms.mff.cuni.cz) (195.113.18.106) by sources.redhat.com with SMTP; 17 Jan 2004 14:44:55 -0000 Received: from camelot.ms.mff.cuni.cz (kampanus.ms.mff.cuni.cz [195.113.18.107]) by nikam.ms.mff.cuni.cz (Postfix) with SMTP id 281064DF4A; Sat, 17 Jan 2004 15:44:56 +0100 (CET) Received: by camelot.ms.mff.cuni.cz (sSMTP sendmail emulation); Sat, 17 Jan 2004 15:44:57 +0100 Date: Sat, 17 Jan 2004 14:44:00 -0000 From: Jan Hubicka To: Roger Sayle Cc: Jan Hubicka , Steven Bosscher , gcc@gcc.gnu.org Subject: Re: [RFC] Contributing tree-ssa to mainline Message-ID: <20040117144457.GS8121@kam.mff.cuni.cz> References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.3.28i X-SW-Source: 2004-01/txt/msg01055.txt.bz2 > > Hi Jan, > > Perhaps you could provide details of where you 2 or 3% comes from? > > My apologies. The two URLs given in your posting do indeed show > GCC 3.3 only about 6% behind Intel 7.0, with aggressive optimizations. > Indeed, this is a much fairer comparison than the "apples-and-oranges" > results linked from Andreas' site. This compare to hammer branch GCC 3.3 and 64bit to 32bit mode, so it is not also really fine comparsion. (attached) I will have 32bit results on the same hardware bu tomorrow. I already checked that mainline outperforms hammer branch at -O2, at aggressive settings we have more issues, but I am just analyzing them and at least eon I analyzed yesterday was simple defect in profiling and -fwritable-strings. In fact we misscompile following constructor: struct a{ int a; char *b; } a={1,"ahoj"}; if there are no plans to fix this deprecated feature, I would preffer to just kill it as this is important defect and it took me about 2 hours yesterday to work out why profiling is broken. Also the peak runs suffer from too large inlining limit I already fixed. another problem is too large unrolling limit I will try to trottle and produce patch too. I will post more complette results once I am done. Honza Hi, Here's the requested output from gcc-benchmark. Base Compiler: hamer branch Peak Compiler: mainline cflags base: -O2 cflags peak: -O2 Iterations: 1 Running on: vivaldi Hardware: x86_64/AMD Opteron(tm) Processor 242 at 1593.799 PDO for base and peak: No PDO for peak only: No Using gfortran instead of F77: No This run is: SPECint Please direct questions about this to aj@suse.de Size of binaries: 164.gzip: Base: 62716 bytes 164.gzip: Peak: 62604 bytes 175.vpr: Base: 167701 bytes 175.vpr: Peak: 167354 bytes 176.gcc: Base: 1612426 bytes 176.gcc: Peak: 1605449 bytes 181.mcf: Base: 25589 bytes 181.mcf: Peak: 25525 bytes 186.crafty: Base: 224570 bytes 186.crafty: Peak: 224346 bytes 197.parser: Base: 143696 bytes 197.parser: Peak: 143568 bytes 252.eon: Base: 604611 bytes 252.eon: Peak: 635561 bytes 253.perlbmk: Base: 667061 bytes 253.perlbmk: Peak: 666917 bytes 254.gap: Base: 542438 bytes 254.gap: Peak: 542358 bytes 255.vortex: Base: 678497 bytes 255.vortex: Peak: 686898 bytes 256.bzip2: Base: 53904 bytes 256.bzip2: Peak: 53888 bytes 300.twolf: Base: 225871 bytes 300.twolf: Peak: 221423 bytes ============================= Total: Base: 5009080 bytes Total: Peak: 5035891 bytes Compile times for benchmarks: 164.gzip base: 2 s 175.vpr base: 5 s 176.gcc base: 55 s 181.mcf base: 1 s 186.crafty base: 10 s 197.parser base: 5 s 252.eon base: 100 s 253.perlbmk base: 27 s 254.gap base: 20 s 255.vortex base: 19 s 256.bzip2 base: 1 s 300.twolf base: 12 s 164.gzip peak: 2 s 175.vpr peak: 6 s 176.gcc peak: 57 s 181.mcf peak: 1 s 186.crafty peak: 10 s 197.parser peak: 5 s 252.eon peak: 105 s 253.perlbmk peak: 27 s 254.gap peak: 20 s 255.vortex peak: 20 s 256.bzip2 peak: 1 s 300.twolf peak: 13 s ====================================== Total time for base compilation: 257 s Total time for peak compilation: 267 s GCC was configured as: configure --enable-threads=posix --enable-languages="c,c++" --disable-multilib GCC bootstrap times for 'make -j2 bootstrap && make install': Base compiler: 691 s Peak compiler: 662 s --_----------=_1074261898165450 Content-Disposition: inline; filename="CINT2000.052.asc" Content-Transfer-Encoding: 8bit Content-Type: text/plain; name="CINT2000.052.asc" ############################################################################## # INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN # # # # 'reportable' flag not set during run # # 254.gap base did not have enough runs! # # 256.bzip2 base did not have enough runs! # # 255.vortex base did not have enough runs! # # 175.vpr base did not have enough runs! # # 164.gzip base did not have enough runs! # # 253.perlbmk base did not have enough runs! # # 186.crafty base did not have enough runs! # # 176.gcc base did not have enough runs! # # 252.eon base did not have enough runs! # # 197.parser base did not have enough runs! # # 300.twolf base did not have enough runs! # # 181.mcf base did not have enough runs! # # 254.gap peak did not have enough runs! # # 256.bzip2 peak did not have enough runs! # # 255.vortex peak did not have enough runs! # # 175.vpr peak did not have enough runs! # # 164.gzip peak did not have enough runs! # # 253.perlbmk peak did not have enough runs! # # 186.crafty peak did not have enough runs! # # 176.gcc peak did not have enough runs! # # 252.eon peak did not have enough runs! # # 197.parser peak did not have enough runs! # # 300.twolf peak did not have enough runs! # # 181.mcf peak did not have enough runs! # # Error 252.eon: Child returned with invalid return code # # Error 252.eon: Output miscompare # # # # INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN # ############################################################################## SPEC CINT2000 Summary Unknown Unknown Tested by SuSE Linux AG Fri Jan 16 13:30:03 2004 SPEC License #1922 Test date: 2004-01-16 Hardware availability: today Tester: Andreas Jaeger, SuSE Linux AG Software availability: Now Estimated Estimated Base Base Base Peak Peak Peak Benchmarks Ref Time Run Time Ratio Ref Time Run Time Ratio ------------ -------- -------- -------- -------- -------- -------- 164.gzip 1400 180 780* 1400 179 783* 175.vpr 1400 201 698* 1400 203 689* 176.gcc 1100 129 853* 1100 128 857* 181.mcf 1800 520 346* 1800 518 347* 186.crafty 1000 70.4 1421* 1000 72.4 1381* 197.parser 1800 335 537* 1800 333 541* 252.eon 1300 110 1187* 1300 0.0285 X 253.perlbmk 1800 192 937* 1800 185 972* 254.gap 1100 151 730* 1100 147 747* 255.vortex 1900 172 1103* 1900 162 1170* 256.bzip2 1500 198 757* 1500 200 751* 300.twolf 3000 381 788* 3000 376 797* ======================================================================== 164.gzip 1400 180 780* 1400 179 783* 175.vpr 1400 201 698* 1400 203 689* 176.gcc 1100 129 853* 1100 128 857* 181.mcf 1800 520 346* 1800 518 347* 186.crafty 1000 70.4 1421* 1000 72.4 1381* 197.parser 1800 335 537* 1800 333 541* 252.eon 1300 110 1187* 253.perlbmk 1800 192 937* 1800 185 972* 254.gap 1100 151 730* 1100 147 747* 255.vortex 1900 172 1103* 1900 162 1170* 256.bzip2 1500 198 757* 1500 200 751* 300.twolf 3000 381 788* 3000 376 797* Est. SPECint_base2000 797 Est. SPECint2000 775 HARDWARE -------- Hardware Vendor: Unknown Model Name: Unknown CPU: AMD Opteron(tm) Processor 242 CPU MHz: 1593.799 FPU: Integrated CPU(s) enabled: 2 CPU(s) orderable: 1,2 Parallel: No Primary Cache: Secondary Cache: 1024 KB L3 Cache: N/A Other Cache: N/A Memory: 992 MB Disk Subsystem: Unknown Other Hardware: Ethernet SOFTWARE -------- Operating System: SuSE Linux 9.0 (x86-64) Compiler: GCC CVS File System: Linux/reiserfs System State: Multi-User NOTES ----- Base flags: -O2 Compiler: Unspecified Peak flags: -O2 Compiler: Unspecified To compile and execute eon correctly the following extra flags are used for compilation: -ffast-math -fwritable-strings. ############################################################################## # INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN # # # # 'reportable' flag not set during run # # 254.gap base did not have enough runs! # # 256.bzip2 base did not have enough runs! # # 255.vortex base did not have enough runs! # # 175.vpr base did not have enough runs! # # 164.gzip base did not have enough runs! # # 253.perlbmk base did not have enough runs! # # 186.crafty base did not have enough runs! # # 176.gcc base did not have enough runs! # # 252.eon base did not have enough runs! # # 197.parser base did not have enough runs! # # 300.twolf base did not have enough runs! # # 181.mcf base did not have enough runs! # # 254.gap peak did not have enough runs! # # 256.bzip2 peak did not have enough runs! # # 255.vortex peak did not have enough runs! # # 175.vpr peak did not have enough runs! # # 164.gzip peak did not have enough runs! # # 253.perlbmk peak did not have enough runs! # # 186.crafty peak did not have enough runs! # # 176.gcc peak did not have enough runs! # # 252.eon peak did not have enough runs! # # 197.parser peak did not have enough runs! # # 300.twolf peak did not have enough runs! # # 181.mcf peak did not have enough runs! # # Error 252.eon: Child returned with invalid return code # # Error 252.eon: Output miscompare # # # # INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN # ############################################################################## ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 1999-2001 Standard Performance Evaluation Corporation Generated on Fri Jan 16 15:04:57 2004 by SPEC CPU2000 ASCII formatter v2.1 --_----------=_1074261898165450-- Hi, Here's the requested output from gcc-benchmark. Base Compiler: hammer branch Peak Compiler: mainline cflags base: -O3 -ftracer -funroll-loops -ffast-math -fpeel-loops cflags peak: -O3 -ftracer -funroll-loops -ffast-math -fpeel-loops -fprofile-values -fvpt Iterations: 1 Running on: vivaldi Hardware: x86_64/AMD Opteron(tm) Processor 242 at 1593.799 PDO for base and peak: Yes PDO for peak only: No Using gfortran instead of F77: No This run is: SPECint Please direct questions about this to aj@suse.de Size of binaries: 164.gzip: Base: 74699 bytes 164.gzip: Peak: 76185 bytes 175.vpr: Base: 189819 bytes 175.vpr: Peak: 198472 bytes 176.gcc: Base: 2007407 bytes 176.gcc: Peak: 2080007 bytes 181.mcf: Base: 29853 bytes 181.mcf: Peak: 29821 bytes 186.crafty: Base: 247042 bytes 186.crafty: Peak: 241570 bytes 197.parser: Base: 214742 bytes 197.parser: Peak: 215766 bytes 252.eon: Base: 625272 bytes 252.eon: Peak: 655316 bytes 253.perlbmk: Base: 768347 bytes 253.perlbmk: Peak: 775934 bytes 254.gap: Base: 625415 bytes 254.gap: Peak: 616535 bytes 255.vortex: Base: 694349 bytes 255.vortex: Peak: 694612 bytes 256.bzip2: Base: 78776 bytes 256.bzip2: Peak: 66472 bytes 300.twolf: Base: 246508 bytes 300.twolf: Peak: 250179 bytes ============================= Total: Base: 5802229 bytes Total: Peak: 5900869 bytes Compile times for benchmarks: 164.gzip base: 7 s 175.vpr base: 19 s 176.gcc base: 228 s 181.mcf base: 4 s 186.crafty base: 26 s 197.parser base: 19 s 252.eon base: 221 s 253.perlbmk base: 96 s 254.gap base: 70 s 255.vortex base: 53 s 256.bzip2 base: 6 s 300.twolf base: 33 s 164.gzip peak: 7 s 175.vpr peak: 22 s 176.gcc peak: 250 s 181.mcf peak: 3 s 186.crafty peak: 26 s 197.parser peak: 21 s 252.eon peak: 242 s 253.perlbmk peak: 96 s 254.gap peak: 74 s 255.vortex peak: 52 s 256.bzip2 peak: 5 s 300.twolf peak: 38 s ====================================== Total time for base compilation: 782 s Total time for peak compilation: 836 s GCC was configured as: configure --enable-threads=posix --enable-languages="c,c++" --disable-multilib GCC bootstrap times for 'make -j2 bootstrap && make install': Base compiler: 695 s Peak compiler: 667 s --_----------=_107429000198830 Content-Disposition: inline; filename="CINT2000.058.asc" Content-Transfer-Encoding: 8bit Content-Type: text/plain; name="CINT2000.058.asc" ############################################################################## # INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN # # # # 'reportable' flag not set during run # # 254.gap base did not have enough runs! # # 256.bzip2 base did not have enough runs! # # 255.vortex base did not have enough runs! # # 175.vpr base did not have enough runs! # # 164.gzip base did not have enough runs! # # 253.perlbmk base did not have enough runs! # # 186.crafty base did not have enough runs! # # 176.gcc base did not have enough runs! # # 252.eon base did not have enough runs! # # 197.parser base did not have enough runs! # # 300.twolf base did not have enough runs! # # 181.mcf base did not have enough runs! # # 254.gap peak did not have enough runs! # # 256.bzip2 peak did not have enough runs! # # 255.vortex peak did not have enough runs! # # 175.vpr peak did not have enough runs! # # 164.gzip peak did not have enough runs! # # 253.perlbmk peak did not have enough runs! # # 186.crafty peak did not have enough runs! # # 176.gcc peak did not have enough runs! # # 252.eon peak did not have enough runs! # # 197.parser peak did not have enough runs! # # 300.twolf peak did not have enough runs! # # 181.mcf peak did not have enough runs! # # # # INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN # ############################################################################## SPEC CINT2000 Summary Unknown Unknown Tested by SuSE Linux AG Fri Jan 16 20:57:07 2004 SPEC License #1922 Test date: 2004-01-16 Hardware availability: today Tester: Andreas Jaeger, SuSE Linux AG Software availability: Now Estimated Estimated Base Base Base Peak Peak Peak Benchmarks Ref Time Run Time Ratio Ref Time Run Time Ratio ------------ -------- -------- -------- -------- -------- -------- 164.gzip 1400 167 839* 1400 164 853* 175.vpr 1400 191 735* 1400 193 726* 176.gcc 1100 122 898* 1100 123 893* 181.mcf 1800 514 350* 1800 512 352* 186.crafty 1000 72.1 1388* 1000 71.0 1409* 197.parser 1800 273 660* 1800 275 653* 252.eon 1300 90.4 1438* 1300 114 1141* 253.perlbmk 1800 174 1037* 1800 163 1101* 254.gap 1100 141 779* 1100 141 783* 255.vortex 1900 157 1209* 1900 154 1231* 256.bzip2 1500 186 805* 1500 191 787* 300.twolf 3000 366 819* 3000 374 802* ======================================================================== 164.gzip 1400 167 839* 1400 164 853* 175.vpr 1400 191 735* 1400 193 726* 176.gcc 1100 122 898* 1100 123 893* 181.mcf 1800 514 350* 1800 512 352* 186.crafty 1000 72.1 1388* 1000 71.0 1409* 197.parser 1800 273 660* 1800 275 653* 252.eon 1300 90.4 1438* 1300 114 1141* 253.perlbmk 1800 174 1037* 1800 163 1101* 254.gap 1100 141 779* 1100 141 783* 255.vortex 1900 157 1209* 1900 154 1231* 256.bzip2 1500 186 805* 1500 191 787* 300.twolf 3000 366 819* 3000 374 802* Est. SPECint_base2000 860 Est. SPECint2000 847 HARDWARE -------- Hardware Vendor: Unknown Model Name: Unknown CPU: AMD Opteron(tm) Processor 242 CPU MHz: 1593.799 FPU: Integrated CPU(s) enabled: 2 CPU(s) orderable: 1,2 Parallel: No Primary Cache: Secondary Cache: 1024 KB L3 Cache: N/A Other Cache: N/A Memory: 992 MB Disk Subsystem: Unknown Other Hardware: Ethernet SOFTWARE -------- Operating System: Linux 2.6.0-3-smp Compiler: GCC CVS File System: Linux/reiserfs System State: Multi-User NOTES ----- Base flags: -O3 -ftracer -funroll-loops -ffast-math -fpeel-loops Compiler: Unspecified Peak flags: -O3 -ftracer -funroll-loops -ffast-math -fpeel-loops -fprofile-values -fvpt Compiler: Unspecified Run with profile directed optimizations for both base and peak (first pass has -fprofile-arcs, second has -fbranch-probabilities) To compile and execute eon correctly the following extra flags are used for compilation: -ffast-math -fwritable-strings. ############################################################################## # INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN # # # # 'reportable' flag not set during run # # 254.gap base did not have enough runs! # # 256.bzip2 base did not have enough runs! # # 255.vortex base did not have enough runs! # # 175.vpr base did not have enough runs! # # 164.gzip base did not have enough runs! # # 253.perlbmk base did not have enough runs! # # 186.crafty base did not have enough runs! # # 176.gcc base did not have enough runs! # # 252.eon base did not have enough runs! # # 197.parser base did not have enough runs! # # 300.twolf base did not have enough runs! # # 181.mcf base did not have enough runs! # # 254.gap peak did not have enough runs! # # 256.bzip2 peak did not have enough runs! # # 255.vortex peak did not have enough runs! # # 175.vpr peak did not have enough runs! # # 164.gzip peak did not have enough runs! # # 253.perlbmk peak did not have enough runs! # # 186.crafty peak did not have enough runs! # # 176.gcc peak did not have enough runs! # # 252.eon peak did not have enough runs! # # 197.parser peak did not have enough runs! # # 300.twolf peak did not have enough runs! # # 181.mcf peak did not have enough runs! # # # # INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN INVALID RUN # ############################################################################## ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 1999-2001 Standard Performance Evaluation Corporation Generated on Fri Jan 16 22:53:20 2004 by SPEC CPU2000 ASCII formatter v2.1 --_----------=_107429000198830--