public inbox for gcc-bugs@sourceware.org
* [Bug lto/44334]  New: [4.6 Regression] rnflow.f90 ~27% slower with -fwhole-program -flto after revision 159852
@ 2010-05-30 17:17 dominiq at lps dot ens dot fr
  2010-05-30 18:06 ` [Bug lto/44334] " dominiq at lps dot ens dot fr
                   ` (10 more replies)
  0 siblings, 11 replies; 12+ messages in thread
From: dominiq at lps dot ens dot fr @ 2010-05-30 17:17 UTC
  To: gcc-bugs

After revision 159852

Author: pault
Date:   Wed May 26 05:11:04 2010 UTC (4 days, 12 hours ago)
Changed paths:  4
Log Message:    
2010-05-26  Paul Thomas  <pault@gcc.gnu.org>

        PR fortran/40011
        * resolve.c (resolve_global_procedure): Resolve the gsymbol's
        namespace before trying to reorder the gsymbols.

2010-05-26  Paul Thomas  <pault@gcc.gnu.org>

        PR fortran/40011
        * gfortran.dg/whole_file_19.f90 : New test.

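the executable of the Polyhedron test rnflow.f90 is ~27% slower when compiled
with -fwhole-program -flto (32.6s vs. 25.8s user time). The transcripts below
show revision 159851 (gfcpf) first, then revision 159852 (gfcp):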

[macbook] lin/test% gfcpf -v
Using built-in specs.
COLLECT_GCC=gfcpf
COLLECT_LTO_WRAPPER=/opt/gcc/gcc4.6pf/libexec/gcc/x86_64-apple-darwin10/4.6.0/lto-wrapper
Target: x86_64-apple-darwin10
Configured with: ../p_work/configure --prefix=/opt/gcc/gcc4.6pf
--mandir=/opt/gcc/gcc4.6pf/share/man --infodir=/opt/gcc/gcc4.6pf/share/info
--build=x86_64-apple-darwin10 --host=x86_64-apple-darwin10
--target=x86_64-apple-darwin10 --enable-languages=c,fortran
--with-gmp=/opt/sw64 --with-libiconv-prefix=/opt/sw64 --with-system-zlib
--x-includes=/usr/X11R6/include --x-libraries=/usr/X11R6/lib
--with-cloog=/opt/sw64 --with-ppl=/opt/sw64 --with-mpc=/opt/sw64 --enable-lto
Thread model: posix
gcc version 4.6.0 20100526 (experimental) [trunk revision 159851] (GCC) 
[macbook] lin/test% gfcpf -O3 -ffast-math -funroll-loops -fomit-frame-pointer rnflow.f90
[macbook] lin/test% time a.out > /dev/null
25.826u 0.686s 0:26.52 99.9%    0+0k 0+0io 0pf+0w
[macbook] lin/test% gfcpf -O3 -ffast-math -funroll-loops -fomit-frame-pointer -fwhole-file -flto rnflow.f90
[macbook] lin/test% time a.out > /dev/null
25.506u 0.674s 0:26.19 99.9%    0+0k 0+0io 0pf+0w
[macbook] lin/test% gfcpf -O3 -ffast-math -funroll-loops -fomit-frame-pointer -fwhole-program -flto rnflow.f90
[macbook] lin/test% time a.out > /dev/null
25.772u 0.678s 0:26.46 99.9%    0+0k 0+0io 0pf+0w
[macbook] lin/test% gfcp -v
Using built-in specs.
COLLECT_GCC=gfcp
COLLECT_LTO_WRAPPER=/opt/gcc/gcc4.6p/libexec/gcc/x86_64-apple-darwin10/4.6.0/lto-wrapper
Target: x86_64-apple-darwin10
Configured with: ../p_work/configure --prefix=/opt/gcc/gcc4.6p
--mandir=/opt/gcc/gcc4.6p/share/man --infodir=/opt/gcc/gcc4.6p/share/info
--build=x86_64-apple-darwin10 --host=x86_64-apple-darwin10
--target=x86_64-apple-darwin10 --enable-languages=c,fortran
--with-gmp=/opt/sw64 --with-libiconv-prefix=/opt/sw64 --with-system-zlib
--x-includes=/usr/X11R6/include --x-libraries=/usr/X11R6/lib
--with-cloog=/opt/sw64 --with-ppl=/opt/sw64 --with-mpc=/opt/sw64 --enable-lto
Thread model: posix
gcc version 4.6.0 20100526 (experimental) [trunk revision 159852] (GCC) 
[macbook] lin/test% gfcp -O3 -ffast-math -funroll-loops -fomit-frame-pointer rnflow.f90
[macbook] lin/test% time a.out > /dev/null
25.841u 0.696s 0:26.54 99.9%    0+0k 0+0io 0pf+0w
[macbook] lin/test% gfcp -O3 -ffast-math -funroll-loops -fomit-frame-pointer -fwhole-file -flto rnflow.f90
[macbook] lin/test% time a.out > /dev/null
25.540u 0.677s 0:26.22 99.9%    0+0k 0+0io 0pf+0w
[macbook] lin/test% gfcp -O3 -ffast-math -funroll-loops -fomit-frame-pointer -fwhole-program -flto rnflow.f90
[macbook] lin/test% time a.out > /dev/null
32.627u 0.685s 0:33.31 99.9%    0+0k 0+0io 0pf+0w             <---  ~27% slower
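
For reference, the ~27% figure can be re-derived from the user times in the transcripts above. The following is a minimal Python sketch, not part of the original report; the names USER_TIMES and slowdown_percent are illustrative, and the numbers are copied by hand from the `time a.out` lines.

```python
# User CPU times in seconds, copied from the `time a.out` lines above.
# Each key is the flag set added to
# "-O3 -ffast-math -funroll-loops -fomit-frame-pointer";
# each value is (revision 159851, revision 159852).
USER_TIMES = {
    "(none)": (25.826, 25.841),
    "-fwhole-file -flto": (25.506, 25.540),
    "-fwhole-program -flto": (25.772, 32.627),
}


def slowdown_percent(before, after):
    """Relative increase in run time from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


# Print one line per flag set: positive values mean r159852 is slower.
for flags, (r159851, r159852) in USER_TIMES.items():
    print(f"{flags:24s} {slowdown_percent(r159851, r159852):+6.2f}%")
```

With these inputs only the -fwhole-program -flto build changes noticeably (about 26.6% slower); the other two flag sets differ by well under 0.2% between the two revisions.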

As noted previously, the executable of fatigue.f90 is ~30% faster when
compiled with -fwhole-program than with -fwhole-file -flto:

[macbook] lin/test% gfcp -O3 -ffast-math -funroll-loops -fomit-frame-pointer -fwhole-file -flto fatigue.f90
[macbook] lin/test% time a.out > /dev/null
9.031u 0.006s 0:09.04 99.8%     0+0k 0+1io 0pf+0w
[macbook] lin/test% gfcp -O3 -ffast-math -funroll-loops -fomit-frame-pointer -fwhole-program fatigue.f90
[macbook] lin/test% time a.out > /dev/null
6.448u 0.004s 0:06.47 99.5%     0+0k 0+1io 0pf+0w
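
The fatigue.f90 comparison can be checked the same way, this time reading the user time directly from the pasted `time` lines. This is a hedged sketch only: it assumes the lines follow the csh/tcsh default format ("<user>u <system>s ..."), which is what the transcript appears to show, and the helper names are hypothetical.

```python
import re

# Leading "<user>u <system>s" fields of a csh/tcsh-style `time` line
# (format assumed from the transcript above).
TIME_LINE = re.compile(r"^\s*(\d+(?:\.\d+)?)u\s+(\d+(?:\.\d+)?)s\b")


def user_seconds(line):
    """Return the user CPU time (seconds) from one `time` output line."""
    match = TIME_LINE.match(line)
    if match is None:
        raise ValueError(f"not a recognised time line: {line!r}")
    return float(match.group(1))


def time_reduction_percent(slow, fast):
    """Percentage of run time saved when going from `slow` to `fast`."""
    return (slow - fast) / slow * 100.0


# Lines copied verbatim from the fatigue.f90 runs above.
WHOLE_FILE_LTO = "9.031u 0.006s 0:09.04 99.8%     0+0k 0+1io 0pf+0w"
WHOLE_PROGRAM = "6.448u 0.004s 0:06.47 99.5%     0+0k 0+1io 0pf+0w"

slow = user_seconds(WHOLE_FILE_LTO)
fast = user_seconds(WHOLE_PROGRAM)
print(f"run time reduced by {time_reduction_percent(slow, fast):.1f}%")
print(f"speed ratio {slow / fast:.2f}x")
```

The reduction works out to roughly 28.6% of the run time, consistent with the "~30% faster" stated above.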


-- 
           Summary: [4.6 Regression] rnflow.f90 ~27% slower with -fwhole-program
                    -flto after revision 159852
           Product: gcc
           Version: 4.6.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: lto
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: dominiq at lps dot ens dot fr
 GCC build triplet: x86_64-apple-darwin10
  GCC host triplet: x86_64-apple-darwin10
GCC target triplet: x86_64-apple-darwin10


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=44334


Thread overview: 12+ messages
2010-05-30 17:17 [Bug lto/44334] New: [4.6 Regression] rnflow.f90 ~27% slower with -fwhole-program -flto after revision 159852 dominiq at lps dot ens dot fr
2010-05-30 18:06 ` [Bug lto/44334] " dominiq at lps dot ens dot fr
2010-05-30 18:09 ` [Bug fortran/44334] " rguenth at gcc dot gnu dot org
2010-05-30 18:11 ` dominiq at lps dot ens dot fr
2010-05-30 18:12 ` dominiq at lps dot ens dot fr
2010-05-30 18:31 ` dominiq at lps dot ens dot fr
2010-05-30 18:49 ` rguenth at gcc dot gnu dot org
2010-05-30 18:55 ` dominiq at lps dot ens dot fr
2010-06-05  9:52 ` dominiq at lps dot ens dot fr
2010-09-08 21:00 ` burnus at gcc dot gnu dot org
2010-09-08 21:04 ` hubicka at gcc dot gnu dot org
2010-09-09  9:01 ` burnus at gcc dot gnu dot org
