public inbox for gcc-bugs@sourceware.org
* [Bug middle-end/38671] [4.3/4.4/4.5/4.6 Regression] selecting one IV instead of three
       [not found] <bug-38671-4@http.gcc.gnu.org/bugzilla/>
@ 2010-10-01 11:54 ` jakub at gcc dot gnu.org
  2011-04-16 10:44 ` [Bug middle-end/38671] [4.3/4.4/4.5/4.6/4.7 " jakub at gcc dot gnu.org
  2012-01-16 13:16 ` [Bug middle-end/38671] [4.4/4.5/4.6/4.7 " rguenth at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: jakub at gcc dot gnu.org @ 2010-10-01 11:54 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38671

Jakub Jelinek <jakub at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|4.4.5                       |4.4.6



* [Bug middle-end/38671] [4.3/4.4/4.5/4.6/4.7 Regression] selecting one IV instead of three
       [not found] <bug-38671-4@http.gcc.gnu.org/bugzilla/>
  2010-10-01 11:54 ` [Bug middle-end/38671] [4.3/4.4/4.5/4.6 Regression] selecting one IV instead of three jakub at gcc dot gnu.org
@ 2011-04-16 10:44 ` jakub at gcc dot gnu.org
  2012-01-16 13:16 ` [Bug middle-end/38671] [4.4/4.5/4.6/4.7 " rguenth at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: jakub at gcc dot gnu.org @ 2011-04-16 10:44 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38671

Jakub Jelinek <jakub at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|4.4.6                       |4.4.7



* [Bug middle-end/38671] [4.4/4.5/4.6/4.7 Regression] selecting one IV instead of three
       [not found] <bug-38671-4@http.gcc.gnu.org/bugzilla/>
  2010-10-01 11:54 ` [Bug middle-end/38671] [4.3/4.4/4.5/4.6 Regression] selecting one IV instead of three jakub at gcc dot gnu.org
  2011-04-16 10:44 ` [Bug middle-end/38671] [4.3/4.4/4.5/4.6/4.7 " jakub at gcc dot gnu.org
@ 2012-01-16 13:16 ` rguenth at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: rguenth at gcc dot gnu.org @ 2012-01-16 13:16 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38671

Richard Guenther <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
      Known to work|                            |4.7.0
         Resolution|                            |FIXED
   Target Milestone|4.4.7                       |4.7.0
      Known to fail|                            |

--- Comment #9 from Richard Guenther <rguenth at gcc dot gnu.org> 2012-01-16 12:54:21 UTC ---
We're back to exactly the code from 4.2 on trunk:

.L2:
        movaps  in2(%rax), %xmm0
        mulps   %xmm1, %xmm0
        addps   %xmm2, %xmm0
        mulps   in1(%rax), %xmm0
        movaps  %xmm0, out(%rax)
        addq    $16, %rax
        cmpq    %rdx, %rax
        jne     .L2

but I can't reproduce the originally reported assembly with 4.4.0 either
(the report lacks information on flags used besides -march=core2, so I used
both -O2 and -O3 with the same result).
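
For readers without the original report at hand, the trunk loop above uses a
single induction variable (the byte offset in %rax) to index all three arrays.
A minimal scalar C sketch of the same per-element computation follows. The
array length N, the function name scale_add_mul, the parameters n, k1 and k2,
and the non-SSE form are assumptions for illustration only; this is not the
reporter's testcase.

```c
#include <assert.h>
#include <stddef.h>

#define N 64  /* hypothetical length; the report does not state it */

/* Names mirror the symbols in the assembly; element type and size are assumed. */
float in1[N], in2[N], out[N];

/* out[i] = (in2[i] * k1 + k2) * in1[i], with one index i shared by
   all three arrays (the "one IV" form of the loop). */
void scale_add_mul(size_t n, float k1, float k2)
{
    assert(n <= N);  /* stay inside the assumed array bounds */
    for (size_t i = 0; i < n; i++)
        out[i] = (in2[i] * k1 + k2) * in1[i];
}
```

In the assembly, k1 and k2 correspond to the loop-invariant registers %xmm1
and %xmm2, and each iteration handles four floats (16 bytes) at once.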

I can confirm that for the testcase in comment #7, from 4.3.x up to
4.6.x, we generate something like

f:
.LFB0:
        .cfi_startproc
        sarl    $2, %esi
        xorl    %eax, %eax
        subl    $1, %esi
        addq    $1, %rsi
        salq    $4, %rsi
        .p2align 4,,10
        .p2align 3
.L2:
        movl    (%rdi,%rax), %ecx
        movl    %ecx, (%rdx,%rax)
        addq    $16, %rax
        cmpq    %rsi, %rax
        jne     .L2

instead of what we generated with 4.2:

f:
.LFB2:
        sarl    $2, %esi
        .p2align 4,,7
.L2:
        movl    (%rdi), %eax
        addq    $16, %rdi
        movl    %eax, (%rdx)
        addq    $16, %rdx
        subl    $1, %esi
        jne     .L2
        rep ; ret

But the 4.3.x to 4.6.x code, even though it does not use decrement-and-branch,
looks superior to me. For trunk we now create

f:
.LFB0:
        .cfi_startproc
        sarl    $2, %esi
        .p2align 4,,10
        .p2align 3
.L2:
        movl    (%rdi), %eax
        addq    $16, %rdi
        movl    %eax, (%rdx)
        addq    $16, %rdx
        subl    $1, %esi
        jne     .L2

again.

So, closing as fixed for trunk (or WORKSFORME for the original report).
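
The two loop shapes compared above differ only in which induction variables
(IVs) survive IV selection: the 4.3.x to 4.6.x code keeps one shared offset and
compares it against a bound computed before the loop, while 4.2 and trunk keep
three (two advancing pointers plus a down-counter). A rough C rendering of both
shapes, inferred from the assembly alone, might look like the following. The
function names, the parameter order (src, n, dst) and the element type are
assumptions; the comment #7 testcase itself is not reproduced here.

```c
#include <assert.h>
#include <stddef.h>

/* One IV: a single offset i indexes both arrays and is compared
   against a bound computed before the loop (4.3.x to 4.6.x shape). */
void copy_one_iv(const int *src, int n, int *dst)
{
    int count = n >> 2;              /* matches "sarl $2, %esi" */
    assert(count > 0);               /* the do-while body runs at least once */
    size_t bound = (size_t)count * 4;  /* in elements, not bytes */
    size_t i = 0;
    do {
        dst[i] = src[i];
        i += 4;                      /* 4 ints = 16 bytes per iteration */
    } while (i != bound);
}

/* Three IVs: both pointers advance and a counter runs down to zero
   (4.2 and trunk shape). */
void copy_three_ivs(const int *src, int n, int *dst)
{
    int count = n >> 2;
    assert(count > 0);
    do {
        *dst = *src;
        src += 4;
        dst += 4;
    } while (--count != 0);
}
```

Both functions copy every fourth int and leave the other elements of dst
untouched, so they are interchangeable in effect; only the bookkeeping
inside the loop differs.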



* [Bug middle-end/38671] [4.3/4.4/4.5/4.6 Regression] selecting one IV instead of three
  2008-12-30 12:58 [Bug inline-asm/38671] New: [4.4 Regression] speed regression with sse intrinsics tim at klingt dot org
@ 2010-04-30  9:25 ` jakub at gcc dot gnu dot org
  0 siblings, 0 replies; 4+ messages in thread
From: jakub at gcc dot gnu dot org @ 2010-04-30  9:25 UTC (permalink / raw)
  To: gcc-bugs



-- 

jakub at gcc dot gnu dot org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|4.4.4                       |4.4.5


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38671

