public inbox for gcc-bugs@sourceware.org
* [Bug target/49133] New: [4.6 Regression] modification of aliased __m128d miscompiles
@ 2011-05-23 22:20 kretz at kde dot org
  2011-05-24  8:54 ` [Bug target/49133] [4.6/4.7 " ubizjak at gmail dot com
                   ` (8 more replies)
  0 siblings, 9 replies; 10+ messages in thread
From: kretz at kde dot org @ 2011-05-23 22:20 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

           Summary: [4.6 Regression] modification of aliased __m128d
                    miscompiles
           Product: gcc
           Version: 4.6.1
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: target
        AssignedTo: unassigned@gcc.gnu.org
        ReportedBy: kretz@kde.org


Compile the following testcase with "g++ -msse2 -O2"

#include <cstdlib>      // abort
#include <emmintrin.h>  // SSE2 intrinsics (__m128d, _mm_*_pd)

typedef double double_a __attribute__((__may_alias__));

struct V
{
    __m128d data;
};

int main()
{
    V a;
    __m128d b;
    b      = _mm_set_pd(1., 0.);
    a.data = _mm_set_pd(1., 0.);
    a.data = _mm_add_pd(a.data, _mm_and_pd(_mm_cmpeq_pd(a.data, _mm_set1_pd(0.)),
                                           _mm_set1_pd(2.)));
    reinterpret_cast<double_a *>(&a.data)[1] += 1.;
    b = _mm_add_pd(b, _mm_and_pd(_mm_cmpeq_pd(b, _mm_set1_pd(0.)), _mm_set1_pd(1.)));
    b = _mm_add_pd(b, _mm_and_pd(_mm_cmpeq_pd(b, _mm_set1_pd(1.)), _mm_set1_pd(1.)));
    if (_mm_movemask_pd(_mm_cmpeq_pd(a.data, b)) != 0x3) {
        abort();
    }
    return 0;
}

GCC 4.6.[01] calculates the correct values for a.data[0] and a.data[1] but fails
to combine the results correctly, i.e. the resulting shufpd $0x1 is wrong. GCC
4.5.x uses unpcklpd, which gives the correct result, but emits unnecessary
stores to the stack.
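
For reference, the lane values the testcase expects can be traced with a plain
scalar model. This is only a sketch: Vec2, add_if_equal, expected_a and
expected_b are hypothetical names that do not appear in the report, and index 0
denotes the low double (the second argument of _mm_set_pd).

```cpp
#include <array>

// Hypothetical scalar stand-in for __m128d: index 0 is the low lane.
using Vec2 = std::array<double, 2>;

// Models v + (cmpeq(v, match) & addend) lane by lane: the compare mask keeps
// `addend` where v[i] == match and contributes nothing elsewhere.
Vec2 add_if_equal(const Vec2 &v, double match, double addend)
{
    Vec2 r = v;
    for (int i = 0; i < 2; ++i)
        if (v[i] == match)
            r[i] += addend;
    return r;
}

// Mirrors the statements that modify a.data in the testcase.
Vec2 expected_a()
{
    Vec2 a = {0., 1.};            // _mm_set_pd(1., 0.): low = 0, high = 1
    a = add_if_equal(a, 0., 2.);  // {2, 1}
    a[1] += 1.;                   // aliased store to the high lane: {2, 2}
    return a;
}

// Mirrors the statements that modify b.
Vec2 expected_b()
{
    Vec2 b = {0., 1.};
    b = add_if_equal(b, 0., 1.);  // {1, 1}
    b = add_if_equal(b, 1., 1.);  // {2, 2}
    return b;
}
```

Under this model both vectors end up as {2, 2}, which is why the testcase
expects _mm_movemask_pd to return 0x3.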



* [Bug target/49133] [4.6/4.7 Regression] modification of aliased __m128d miscompiles
From: ubizjak at gmail dot com @ 2011-05-24  8:54 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

Uros Bizjak <ubizjak at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Target|                            |x86-*-*
             Status|UNCONFIRMED                 |ASSIGNED
   Last reconfirmed|                            |2011.05.24 08:22:04
         AssignedTo|unassigned at gcc dot       |ubizjak at gmail dot com
                   |gnu.org                     |
   Target Milestone|---                         |4.6.1
            Summary|[4.6 Regression]            |[4.6/4.7 Regression]
                   |modification of aliased     |modification of aliased
                   |__m128d miscompiles         |__m128d miscompiles
     Ever Confirmed|0                           |1

--- Comment #1 from Uros Bizjak <ubizjak at gmail dot com> 2011-05-24 08:22:04 UTC ---
Confirmed, proposed patch to sse2_loadhpd:

Index: sse.md
===================================================================
--- sse.md    (revision 174102)
+++ sse.md    (working copy)
@@ -4284,7 +4284,7 @@
    vmovhpd\t{%2, %1, %0|%0, %1, %2}
    unpcklpd\t{%2, %0|%0, %2}
    vunpcklpd\t{%2, %1, %0|%0, %1, %2}
-   shufpd\t{$1, %1, %0|%0, %1, 1}
+   shufpd\t{$0, %1, %0|%0, %1, 0}
    #
    #
    #"



* [Bug target/49133] [4.6/4.7 Regression] modification of aliased __m128d miscompiles
From: kretz at kde dot org @ 2011-05-24 10:36 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

--- Comment #2 from Matthias Kretz <kretz at kde dot org> 2011-05-24 10:16:41 UTC ---
I applied the patch to the latest 4.6 snapshot. I confirm that it fixes the
bug. Also, there are no regressions in my testsuite.

Just for confirmation, the patched sse.md looks like this for me now (starting
from line 4952):
(define_insn "sse2_loadhpd"
  [(set (match_operand:V2DF 0 "nonimmediate_operand"     "=x,x,x,o,o,o")
    (vec_concat:V2DF
      (vec_select:DF
        (match_operand:V2DF 1 "nonimmediate_operand" " 0,0,x,0,0,0")
        (parallel [(const_int 0)]))
      (match_operand:DF 2 "nonimmediate_operand"     " m,x,0,x,*f,r")))]
  "TARGET_SSE2 && !(MEM_P (operands[1]) && MEM_P (operands[2]))"
  "@
   movhpd\t{%2, %0|%0, %2}
   unpcklpd\t{%2, %0|%0, %2}
   shufpd\t{$0, %1, %0|%0, %1, 0}
   #

Question, why not use unpcklpd instead of shufpd $0? On older CPUs unpcklpd
should be slightly faster than shufpd.
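
The equivalence behind this question can be checked with intrinsics:
_mm_unpacklo_pd(a, b) (unpcklpd) and _mm_shuffle_pd(a, b, 0) (shufpd $0) both
produce {a[0], b[0]}. A minimal sketch, assuming an x86 target with SSE2
enabled; same_low_merge is a hypothetical helper name, and the sketch says
nothing about the relative speed of the two instructions.

```cpp
#include <emmintrin.h>

// Returns true when unpcklpd and shufpd $0 agree on both lanes for the
// given inputs (each result should be {a[0], b[0]}).
bool same_low_merge(__m128d a, __m128d b)
{
    __m128d u = _mm_unpacklo_pd(a, b);    // unpcklpd: {a[0], b[0]}
    __m128d s = _mm_shuffle_pd(a, b, 0);  // shufpd $0: {a[0], b[0]}
    return _mm_movemask_pd(_mm_cmpeq_pd(u, s)) == 0x3;
}
```

Note that this only compares the two instructions for the same register
assignment; it does not show that either one matches what sse2_loadhpd
requires for a particular constraint alternative.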



* [Bug target/49133] [4.6/4.7 Regression] modification of aliased __m128d miscompiles
From: ubizjak at gmail dot com @ 2011-05-24 11:48 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

--- Comment #4 from Uros Bizjak <ubizjak at gmail dot com> 2011-05-24 10:53:10 UTC ---
(In reply to comment #3)

> > I applied the patch to the latest 4.6 snapshot. I confirm that it fixes the
> > bug. Also, there are no regressions in my testsuite.

> OTOH, it looks like this alternative is entirely wrong. The unmodified operand
> can only be passed in the lower half (operand 1 in the pattern above). GCC will
> then generate unpcklpd, as suggested.

I forgot to say that unpcklpd will be generated once the alternative referred
to above is removed.



* [Bug target/49133] [4.6/4.7 Regression] modification of aliased __m128d miscompiles
From: ubizjak at gmail dot com @ 2011-05-24 11:54 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

--- Comment #3 from Uros Bizjak <ubizjak at gmail dot com> 2011-05-24 10:51:25 UTC ---
(In reply to comment #2)
> I applied the patch to the latest 4.6 snapshot. I confirm that it fixes the
> bug. Also, there are no regressions in my testsuite.
> 
> Just for confirmation, the patched sse.md looks like this for me now (starting
> from line 4952):
> (define_insn "sse2_loadhpd"
>   [(set (match_operand:V2DF 0 "nonimmediate_operand"     "=x,x,x,o,o,o")
>     (vec_concat:V2DF
>       (vec_select:DF
>         (match_operand:V2DF 1 "nonimmediate_operand" " 0,0,x,0,0,0")
>         (parallel [(const_int 0)]))
>       (match_operand:DF 2 "nonimmediate_operand"     " m,x,0,x,*f,r")))]
>   "TARGET_SSE2 && !(MEM_P (operands[1]) && MEM_P (operands[2]))"
>   "@
>    movhpd\t{%2, %0|%0, %2}
>    unpcklpd\t{%2, %0|%0, %2}
>    shufpd\t{$0, %1, %0|%0, %1, 0}
>    #
> 
> Question, why not use unpcklpd instead of shufpd $0? On older CPUs unpcklpd
> should be slightly faster than shufpd.

OTOH, it looks like this alternative is entirely wrong. The unmodified operand
can only be passed in the lower half (operand 1 in the pattern above). GCC will
then generate unpcklpd, as suggested.
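
One way to see why changing the immediate cannot repair this alternative is to
model shufpd on scalars. The sketch below rests on a reading of the third
alternative in which operand 2 is tied to the destination register (the "0"
constraint) and operand 1 is the source register; Vec2, shufpd and
any_imm_matches are hypothetical names, and the high lane of the register
holding the DF operand is given an arbitrary value.

```cpp
#include <array>

using Vec2 = std::array<double, 2>;  // index 0 = low lane

// Scalar model of "shufpd $imm, src, dst" (AT&T operand order):
// the low result lane always comes from dst, the high lane from src.
Vec2 shufpd(const Vec2 &dst, const Vec2 &src, int imm)
{
    return {dst[(imm >> 0) & 1], src[(imm >> 1) & 1]};
}

// Returns true if some immediate yields {op1[0], op2}, the value the
// sse2_loadhpd pattern describes.
bool any_imm_matches(const Vec2 &op1, double op2, double junk)
{
    const Vec2 dst = {op2, junk};     // operand 2 lives in the destination
    const Vec2 want = {op1[0], op2};  // vec_concat(op1[0], op2)
    for (int imm = 0; imm < 4; ++imm)
        if (shufpd(dst, op1, imm) == want)
            return true;
    return false;
}
```

With distinct values, e.g. any_imm_matches({10., 20.}, 30., 40.), no immediate
produces the wanted vector, because the low result lane can never come from
operand 1. When op1[0] happens to equal op2, as with the value 2 in the
testcase, immediate 0 appears to work, which would be consistent with the
observation in comment #2 that the shufpd $0 patch made the testcase pass.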



* [Bug target/49133] [4.6/4.7 Regression] modification of aliased __m128d miscompiles
From: uros at gcc dot gnu.org @ 2011-05-24 15:58 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

--- Comment #5 from uros at gcc dot gnu.org 2011-05-24 15:31:15 UTC ---
Author: uros
Date: Tue May 24 15:31:12 2011
New Revision: 174122

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=174122
Log:
    PR target/49133
    * config/i386/sse.md (sse2_loadhpd): Remove shufpd alternative.

testsuite/ChangeLog:

    PR target/49133
    * g++.dg/other/pr49133.C: New test.


Added:
    trunk/gcc/testsuite/g++.dg/other/pr49133.C
Modified:
    trunk/gcc/ChangeLog
    trunk/gcc/config/i386/sse.md
    trunk/gcc/testsuite/ChangeLog



* [Bug target/49133] [4.6/4.7 Regression] modification of aliased __m128d miscompiles
From: uros at gcc dot gnu.org @ 2011-05-24 18:58 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

--- Comment #6 from uros at gcc dot gnu.org 2011-05-24 18:41:33 UTC ---
Author: uros
Date: Tue May 24 18:41:31 2011
New Revision: 174131

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=174131
Log:
    PR target/49133
    * config/i386/sse.md (sse2_loadhpd): Remove shufpd alternative.

testsuite/ChangeLog:

    PR target/49133
    * g++.dg/other/pr49133.C: New test.


Added:
    branches/gcc-4_6-branch/gcc/testsuite/g++.dg/other/pr49133.C
Modified:
    branches/gcc-4_6-branch/gcc/ChangeLog
    branches/gcc-4_6-branch/gcc/config/i386/sse.md
    branches/gcc-4_6-branch/gcc/testsuite/ChangeLog



* [Bug target/49133] [4.6/4.7 Regression] modification of aliased __m128d miscompiles
From: jakub at gcc dot gnu.org @ 2011-05-25  8:15 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

Jakub Jelinek <jakub at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
                 CC|                            |jakub at gcc dot gnu.org
         Resolution|                            |FIXED

--- Comment #7 from Jakub Jelinek <jakub at gcc dot gnu.org> 2011-05-25 07:42:31 UTC ---
Fixed.



* [Bug target/49133] [4.6/4.7 Regression] modification of aliased __m128d miscompiles
From: uros at gcc dot gnu.org @ 2011-05-25 13:47 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

--- Comment #8 from uros at gcc dot gnu.org 2011-05-25 13:26:46 UTC ---
Author: uros
Date: Wed May 25 13:26:42 2011
New Revision: 174195

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=174195
Log:
    PR target/49133
    * config/i386/sse.md (sse2_loadhpd): Remove shufpd alternative.

testsuite/ChangeLog:

    PR target/49133
    * g++.dg/other/pr49133.C: New test.


Added:
    branches/gcc-4_5-branch/gcc/testsuite/g++.dg/other/pr49133.C
Modified:
    branches/gcc-4_5-branch/gcc/ChangeLog
    branches/gcc-4_5-branch/gcc/config/i386/sse.md
    branches/gcc-4_5-branch/gcc/testsuite/ChangeLog



* [Bug target/49133] [4.6/4.7 Regression] modification of aliased __m128d miscompiles
From: uros at gcc dot gnu.org @ 2011-05-25 16:43 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49133

--- Comment #9 from uros at gcc dot gnu.org 2011-05-25 16:39:25 UTC ---
Author: uros
Date: Wed May 25 16:39:22 2011
New Revision: 174215

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=174215
Log:
    PR target/49133
    * config/i386/sse.md (sse2_loadhpd): Remove shufpd alternative.

testsuite/ChangeLog:

    PR target/49133
    * g++.dg/other/pr49133.C: New test.


Added:
    branches/gcc-4_4-branch/gcc/testsuite/g++.dg/other/pr49133.C
Modified:
    branches/gcc-4_4-branch/gcc/ChangeLog
    branches/gcc-4_4-branch/gcc/config/i386/sse.md
    branches/gcc-4_4-branch/gcc/testsuite/ChangeLog


