From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 30538 invoked by alias); 1 Aug 2008 17:27:20 -0000 Received: (qmail 30526 invoked by uid 22791); 1 Aug 2008 17:27:18 -0000 X-Spam-Check-By: sourceware.org Received: from fk-out-0910.google.com (HELO fk-out-0910.google.com) (209.85.128.191) by sourceware.org (qpsmtpd/0.31) with ESMTP; Fri, 01 Aug 2008 17:26:48 +0000 Received: by fk-out-0910.google.com with SMTP id 19so893386fkr.8 for ; Fri, 01 Aug 2008 10:26:45 -0700 (PDT) Received: by 10.180.203.3 with SMTP id a3mr4133715bkg.83.1217611604392; Fri, 01 Aug 2008 10:26:44 -0700 (PDT) Received: by 10.125.16.3 with HTTP; Fri, 1 Aug 2008 10:26:44 -0700 (PDT) Message-ID: <526520db0808011026v7e91653ega6aba5719961bdf4@mail.gmail.com> Date: Fri, 01 Aug 2008 17:27:00 -0000 From: "=?UTF-8?Q?CH_Gowri_Kumar?= =?UTF-8?Q?_-_=E0=B0=97=E0=B1=8C=E0=B0=B0=E0=B1=80?= =?UTF-8?Q?_=E0=B0=95=E0=B1=81=E0=B0=AE=E0=B0=BE=E0=B0=B0=E0=B1=8D?=" To: gcc-help@gcc.gnu.org Subject: gcc-3.3.6 vs gcc-4.2.2 scimark performance degradation MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Content-Disposition: inline Mailing-List: contact gcc-help-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-help-owner@gcc.gnu.org X-SW-Source: 2008-08/txt/msg00003.txt.bz2 Hi all, We are observing a slowdown in the performance of scimark benchmark w.r.t to the gcc versions 3.3.6 vs 4.2.2. And in our actual application, we are observing a slowdown of almost 20%.. We are currently using -O2 for compilation for both the gcc versions. On our product, even with -O3 on gcc-4.2.2, we are not able to reach the performance of gcc-3.3.6 We are also observing that the size of binary has increased from 38M to 50MB. Compiling with -Os didn't yield much benefit (resultant binary was 47 MB). We were guessing that the increase in size could probably because of inlining and hence should yield a better performance. But the results are disappointing. We have done the profiling of the application code using gpror and we observe that almost 80% of the functions of gcc-4.2.2 are running slower than the ones on gcc-3.3.6 Did anyone face this sort of issue earlier (slowness 3.x vs 4.x) ? Could someone suggest the potential list of flags of 4.2.2, which can probably yield a better result or at least at par with the 3.3.6, so that we can experiment a bit and arrive at the best combination. ( we cannot use the -march flag and -ffast-math, due to strict requirement IEEE floating-point conformance ) I have searched through the gcc bugs list, and can find one bug for scimark, but it was not conclusive.(http://gcc.gnu.org/bugzilla/show_bug.cgi?id=33431) The following are the relavent details of gcc versions and scimark results. Machine ======= Linux opteron26 2.6.9-67.ELsmp #1 SMP Wed Nov 7 13:56:44 EST 2007 x86_64 x86_64 x86_64 GNU/Linux gcc-3.3.6 ======== $gcc -v Reading specs from /linux/depot/gcc-3.3.6-RHEL_4/bin/../lib/gcc-lib/x86_64-redhat-linux/3.3.6/specs Configured with: ../../src/gcc-3.3.6/configure --prefix=/depot/gcc-3.3.6-RHEL_4 --disable-shared --disable-checking --with-system-zlib --enable-threads=posix --enable-__cxa_atexit --enable-languages=c,c++,f77,objc --host=x86_64-redhat-linux Thread model: posix gcc version 3.3.6 ------------------------------------------------------------------------------------ ** ** ** SciMark2 Numeric Benchmark, see http://math.nist.gov/scimark ** ** for details. (Results can be submitted to pozo@nist.gov) ** ** ** Using 2.00 seconds min time per kenel. Composite Score : 749.67 FFT Mflops: 742.23 (N=1024) SOR Mflops: 648.81 (100 x 100) MonteCarlo: Mflops: 317.68 Sparse matmult Mflops: 942.96 (N=1000, nz=5000) LU Mflops: 1096.65 (M=100, N=100) ====================================================== gcc-4.2.2 ======= gcc -v Using built-in specs. Target: x86_64-redhat-linux Configured with: ../../src/gcc-4.2.2/configure --prefix=/depot/gcc-4.2.2-static --disable-shared --enable-threads=posix --disable-checking --with-system-zlib --enable-__cxa_atexit --disable-libunwind-exceptions --enable-languages=c,c++,objc,fortran --with-cpu=generic --host=x86_64-redhat-linux Thread model: posix gcc version 4.2.2 ** ** ** SciMark2 Numeric Benchmark, see http://math.nist.gov/scimark ** ** for details. (Results can be submitted to pozo@nist.gov) ** ** ** Using 2.00 seconds min time per kenel. Composite Score : 715.93 FFT Mflops : 676.06 (N=1024) SOR Mflops : 659.92 (100 x 100) MonteCarlo: Mflops : 349.75 Sparse matmult Mflops : 801.66 (N=1000, nz=5000) LU Mflops : 1092.27 (M=100, N=100) ===================================================== Thanks in advance. Regards, Gowri Kumar www.gowrikumar.com