public inbox for gcc-help@gcc.gnu.org
 help / color / mirror / Atom feed
* gcc-3.3.6 vs gcc-4.2.2 scimark performance degradation
@ 2008-08-01 17:27 CH Gowri Kumar - గౌరీ కుమార్
  0 siblings, 0 replies; only message in thread
From: CH Gowri Kumar - గౌరీ కుమార్ @ 2008-08-01 17:27 UTC (permalink / raw)
  To: gcc-help

Hi all,

We are observing a slowdown in the performance of scimark benchmark
w.r.t to the gcc versions 3.3.6 vs 4.2.2. And in our actual application, we
are observing a slowdown of almost 20%..

We are currently using -O2 for compilation for both the gcc versions.
On our product, even with -O3 on gcc-4.2.2, we are not able to reach the
performance of gcc-3.3.6

We are also observing that the size of binary has increased  from 38M to 50MB.
Compiling with -Os didn't yield much benefit (resultant binary was 47 MB).
We were guessing that the increase in size could probably because
of inlining and hence should yield a better performance. But the results
are  disappointing.

We have done the profiling of the application code using gpror and we
observe that
almost 80% of the functions of gcc-4.2.2 are running slower than the
ones on gcc-3.3.6

Did anyone face this sort of issue earlier (slowness 3.x vs 4.x) ?
Could someone suggest the potential list of flags of 4.2.2, which can
probably yield
a better result or at least at par with the 3.3.6, so that we can
experiment a bit
and arrive at the best combination. ( we cannot use the -march flag
and -ffast-math,
due to strict requirement IEEE floating-point conformance )

I have searched through the gcc bugs list, and can find one bug  for
scimark, but
it was not conclusive.(http://gcc.gnu.org/bugzilla/show_bug.cgi?id=33431)


The following are the relavent details of gcc versions and scimark results.
Machine
=======
Linux opteron26 2.6.9-67.ELsmp #1 SMP Wed Nov 7 13:56:44 EST 2007
x86_64 x86_64 x86_64 GNU/Linux

gcc-3.3.6
========
$gcc -v
Reading specs from
/linux/depot/gcc-3.3.6-RHEL_4/bin/../lib/gcc-lib/x86_64-redhat-linux/3.3.6/specs
Configured with: ../../src/gcc-3.3.6/configure
--prefix=/depot/gcc-3.3.6-RHEL_4 --disable-shared --disable-checking
--with-system-zlib --enable-threads=posix --enable-__cxa_atexit
--enable-languages=c,c++,f77,objc --host=x86_64-redhat-linux
Thread model: posix
gcc version 3.3.6

------------------------------------------------------------------------------------
**                                                              **
** SciMark2 Numeric Benchmark, see http://math.nist.gov/scimark **
** for details. (Results can be submitted to pozo@nist.gov)     **
**                                                              **
Using       2.00 seconds min time per kenel.
Composite Score        :    749.67
FFT                   Mflops:   742.23    (N=1024)
SOR                   Mflops:   648.81    (100 x 100)
MonteCarlo:       Mflops:   317.68
Sparse matmult  Mflops:   942.96    (N=1000, nz=5000)
LU                       Mflops:  1096.65    (M=100, N=100)
======================================================


gcc-4.2.2
=======
gcc -v
Using built-in specs.
Target: x86_64-redhat-linux
Configured with: ../../src/gcc-4.2.2/configure
--prefix=/depot/gcc-4.2.2-static --disable-shared
--enable-threads=posix --disable-checking --with-system-zlib
--enable-__cxa_atexit --disable-libunwind-exceptions
--enable-languages=c,c++,objc,fortran --with-cpu=generic
--host=x86_64-redhat-linux
Thread model: posix
gcc version 4.2.2

**                                                              **
** SciMark2 Numeric Benchmark, see http://math.nist.gov/scimark **
** for details. (Results can be submitted to pozo@nist.gov)     **
**                                                              **
Using       2.00 seconds min time per kenel.
Composite Score            :   715.93
FFT                  Mflops    :   676.06    (N=1024)
SOR                  Mflops   :   659.92    (100 x 100)
MonteCarlo:       Mflops  :   349.75
Sparse matmult  Mflops  :   801.66    (N=1000, nz=5000)
LU                      Mflops  :  1092.27    (M=100, N=100)
=====================================================

Thanks in advance.

Regards,
Gowri Kumar
www.gowrikumar.com

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2008-08-01 17:27 UTC | newest]

Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2008-08-01 17:27 gcc-3.3.6 vs gcc-4.2.2 scimark performance degradation CH Gowri Kumar - గౌరీ కుమార్

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).