From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (qmail 4027 invoked by alias); 17 Jan 2009 18:12:38 -0000
Received: (qmail 3991 invoked by uid 48); 17 Jan 2009 18:12:26 -0000
Date: Sat, 17 Jan 2009 18:12:00 -0000
Message-ID: <20090117181226.3990.qmail@sourceware.org>
X-Bugzilla-Reason: CC
References:
Subject: [Bug middle-end/38846] [Graphite] 70% slower using -floop* than without graphite (gas_dyn of Polyhedron)
In-Reply-To:
Reply-To: gcc-bugzilla@gcc.gnu.org
To: gcc-bugs@gcc.gnu.org
From: "dominiq at lps dot ens dot fr"
Mailing-List: contact gcc-bugs-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
List-Id:
List-Archive:
List-Post:
List-Help:
Sender: gcc-bugs-owner@gcc.gnu.org
X-SW-Source: 2009-01/txt/msg01961.txt.bz2

------- Comment #4 from dominiq at lps dot ens dot fr  2009-01-17 18:12 -------
I have results similar to those in comment #0 on i686-apple-darwin9 (Core2),
trunk revision 143468:

================================================================================
Date & Time     : 17 Jan 2009 17:41:32
Test Name       : pbharness
Compile Command : gfc %n.f90 -m64 -O3 -ffast-math -funroll-loops -fgraphite -fgraphite-identity -floop-block -floop-strip-mine -floop-interchange -ftree-loop-linear -fomit-frame-pointer -finline-limit=600 --param min-vect-loop-bound=2 -o %n
Benchmarks      : ac aermod air capacita channel doduc fatigue gas_dyn induct linpk mdbx nf protein rnflow test_fpu tfft
Maximum Times   : 300.0
Target Error %  : 0.200
Minimum Repeats : 2
Maximum Repeats : 5

Benchmark  Compile  Executable  Ave Run   Number   Estim
     Name   (secs)     (bytes)   (secs)  Repeats   Err %
---------  -------  ----------  -------  -------  ------
       ac     4.86       42560    12.31        5  0.2625
   aermod    87.72     1270544    30.36        5  0.2338
      air     5.73       77336     8.38        5  0.0536
 capacita     4.13       72760    45.60        2  0.0055
  channel     1.69       30456     2.71        2  0.0368
    doduc    11.71      200024    42.88        2  0.0501
  fatigue     4.26       76736    12.91        2  0.0852
  gas_dyn     5.83      692200    22.24        5  0.4693
   induct    10.17      177072    34.38        2  0.1440
    linpk     1.67       42536    28.21        5  0.3051
     mdbx     3.43       73000    14.79        2  0.0068
       nf    14.82      112264    32.25        2  0.1612
  protein     9.92      114136    45.90        2  0.1961
   rnflow    11.24      171464    37.49        2  0.0960
 test_fpu     9.49      154224    13.06        2  0.1263
     tfft     1.15       26432     2.88        5  0.2609

Geometric Mean Execution Time = 18.18 seconds

================================================================================
...
Finished Testing 16 benchmarks - 16 passed, and 0 failed

compared to

================================================================================
Date & Time     : 17 Jan 2009 18:03:59
Test Name       : pbharness
Compile Command : gfc %n.f90 -m64 -O3 -ffast-math -funroll-loops -ftree-loop-linear -fomit-frame-pointer -finline-limit=600 --param min-vect-loop-bound=2 -o %n
Benchmarks      : ac aermod air capacita channel doduc fatigue gas_dyn induct linpk mdbx nf protein rnflow test_fpu tfft
Maximum Times   : 300.0
Target Error %  : 0.200
Minimum Repeats : 2
Maximum Repeats : 5

Benchmark  Compile  Executable  Ave Run   Number   Estim
     Name   (secs)     (bytes)   (secs)  Repeats   Err %
---------  -------  ----------  -------  -------  ------
       ac     2.38       42560    12.33        5  0.3327
   aermod    86.86     1270544    29.86        2  0.0151
      air     5.53       77336     8.39        5  0.2713
 capacita     3.40       72760    55.49        5  0.5426
  channel     1.98       38648     2.27        2  0.0000
    doduc    11.42      200024    42.93        2  0.1456
  fatigue     4.94       89024    10.83        5  0.2533
  gas_dyn     6.61      708584    10.38        2  0.1541
   induct     9.95      181168    34.41        2  0.0727
    linpk     1.50       42536    27.98        2  0.0804
     mdbx     3.30       73000    14.81        2  0.0911
       nf    24.30      161416    32.06        4  0.1922
  protein    10.54      126424    46.18        2  0.1646
   rnflow    10.93      179616    36.00        2  0.0014
 test_fpu    10.26      166512    12.45        2  0.0723
     tfft     1.10       26432     2.86        5  0.2793

Geometric Mean Execution Time = 17.05 seconds

================================================================================

The 70% for gas_dyn turns out to be more than a factor of 2 here (22.24 s vs.
10.38 s), and capacita is faster by almost 20% with floop-block (45.60 s vs.
55.49 s).


-- 

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38846
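
For anyone who wants to recheck the two ratios quoted above, here is a minimal Python sketch. It is not part of pbharness; the dictionary and function names are made up for illustration, and the only inputs are the "Ave Run" values for gas_dyn and capacita copied from the two tables in this comment.

```python
# Average run times in seconds, copied from the "Ave Run" column of the
# two tables in this comment. Names below are illustrative only.
with_graphite = {"gas_dyn": 22.24, "capacita": 45.60}     # first run (Graphite flags)
without_graphite = {"gas_dyn": 10.38, "capacita": 55.49}  # second run (no Graphite flags)


def ratio(name):
    """Run time with the Graphite flags divided by run time without them."""
    return with_graphite[name] / without_graphite[name]


def percent_change(name):
    """Run-time change in percent relative to the run without Graphite.

    Positive means slower with the Graphite flags, negative means faster.
    """
    base = without_graphite[name]
    return 100.0 * (with_graphite[name] - base) / base


if __name__ == "__main__":
    for name in ("gas_dyn", "capacita"):
        print(f"{name}: ratio {ratio(name):.2f}, change {percent_change(name):+.1f}%")
```

With these inputs the gas_dyn ratio comes out a little above 2, and the capacita change is a reduction of somewhat under 20%, consistent with the statement above. The percentage is taken relative to the run without Graphite; a different baseline would give a slightly different figure.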