From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <gcc-return-68561-listarch-gcc=gcc.gnu.org@gcc.gnu.org>
Received: (qmail 10492 invoked by alias); 18 Feb 2003 18:13:59 -0000
Mailing-List: contact gcc-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
List-Archive: <http://gcc.gnu.org/ml/gcc/>
List-Post: <mailto:gcc@gcc.gnu.org>
List-Help: <http://gcc.gnu.org/ml/>
Sender: gcc-owner@gcc.gnu.org
Received: (qmail 10476 invoked from network); 18 Feb 2003 18:13:58 -0000
Received: from unknown (HELO egil.codesourcery.com) (66.92.14.122)
  by 172.16.49.205 with SMTP; 18 Feb 2003 18:13:58 -0000
Received: from zack by egil.codesourcery.com with local (Exim 3.36 #1 (Debian))
	id 18lCFe-0000hK-00; Tue, 18 Feb 2003 10:13:54 -0800
To: =?iso-8859-1?q?H=E5kan?= Hjort <hakan@safelogic.se>
Cc: Reza Roboubi <reza@linisoft.com>,  gcc@gcc.gnu.org
Subject: Re: optimizations
From: Zack Weinberg <zack@codesourcery.com>
Date: Tue, 18 Feb 2003 18:17:00 -0000
In-Reply-To: <20030218175524.GA8638@safelogic.se> =?iso-8859-1?q?(H=E5kan?=
 Hjort's message of "Tue, 18 Feb 2003 18:55:24 +0100")
Message-ID: <87vfzhwj6l.fsf@egil.codesourcery.com>
User-Agent: Gnus/5.090016 (Oort Gnus v0.16) Emacs/21.2
References: <Pine.LNX.4.21.0301151338040.30377-100000@mail.kloo.net>
	<3E25F2BB.BA90B2C9@linisoft.com> <20030218175524.GA8638@safelogic.se>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Transfer-Encoding: quoted-printable
X-SW-Source: 2003-02/txt/msg01210.txt.bz2

H=E5kan Hjort <hakan@safelogic.se> writes:

> For Sun's Forte compiler one gets the following:
>
> main:
>          save    %sp,-104,%sp
>          or      %g0,16,%g1
>          st      %g1,[%fp-4]
>          add     %fp,-4,%o1
>          or      %g0,1,%o0
>          call    write   ! params =3D  %o0 %o1 %o2 ! Result
>          or      %g0,1,%o2
>          ret     ! Result =3D  %i0
>          restore %g0,0,%o0
>
> I.e. it just stores '16' in k before the call to write, no trace left
> of mm() or any loop, as should be.
>
> Perhaps GCC now does the same after hoisting both the load and the store?

Unfortunately not.  On x86, with -O2, 3.4 20030211 produces

main:
        pushl   %ebp
        movl    %esp, %ebp
        subl    $24, %esp
        movl    $0, -4(%ebp)
        andl    $-16, %esp
        jmp     .L2
        .p2align 4,,7
.L9:
        incl    %eax
        movl    %eax, -4(%ebp)
.L2:
        movl    -4(%ebp), %eax
        cmpl    $16, %eax
        jne     .L9
        movl    $1, 8(%esp)
        leal    -4(%ebp), %eax
        movl    %eax, 4(%esp)
        movl    $1, (%esp)
        call    write
        leave
        xorl    %eax, %eax
        ret

so you can see that not only is the loop still present, but the memory
write has not been sunk.=20=20

What happens at -O2 -fssa -fssa-ccp -fssa-dce is interesting:

main:
        pushl   %ebp
        movl    %esp, %ebp
        subl    $24, %esp
        andl    $-16, %esp
        jmp     .L2
        .p2align 4,,7
.L9:
        incl    %eax
.L2:
        cmpl    $16, %eax
        jne     .L9
        movl    $1, 8(%esp)
        leal    -4(%ebp), %eax
        movl    %eax, 4(%esp)
        movl    $1, (%esp)
        call    write
        leave
        xorl    %eax, %eax
        ret

The unnecessary memory references are now gone, but the loop remains;
also you can see what may appear to be a bug at first glance -- %eax
is never initialized.  This is not actually a correctness bug: no
matter what value %eax happened to have before the loop, it will leave
the loop with the value 16.  However, I think you'll agree that this
is poor optimization.

RTL-SSA is, I believe, considered somewhat of a failed experiment -
the interesting work is happening on the tree-ssa branch.  I do not
have that branch checked out to experiment with.  Also, the loop
optimizer has been overhauled on the rtlopt branch, which again I do
not have to hand.

zw