[Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS

public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed

* [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS
@ 2011-06-01 13:57 olivier_maury at mentor dot com
  2011-06-01 14:25 ` [Bug target/49257] " rguenth at gcc dot gnu.org
                   ` (17 more replies)
  0 siblings, 18 replies; 19+ messages in thread
From: olivier_maury at mentor dot com @ 2011-06-01 13:57 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

           Summary: -mfpmath=sse generates x87 instructions on 32 bits OS
           Product: gcc
           Version: 4.5.1
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: c
        AssignedTo: unassigned@gcc.gnu.org
        ReportedBy: olivier_maury@mentor.com


Created attachment 24408
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=24408
simple reproducer

Hi,

Please find attached a simple C program that shows the issue.
I compile it that way:

gcc -O0 simple.c -mfpmath=sse -msse3 -o ok.exe
./ok.exe returns the correct value

gcc -O0 simple.c -mfpmath=sse -msse3 -DRETURN_NAN -o nan.exe
./nan.exe returns a NAN

Both exe use some gcc intrinsic that change the FP stack into mmx registers. It
is expected that if I call a libc function (like printf) without restoring the
stack to its original state (using emms) then I'll have a wrong behavior. But,
here, because of the following assembly code:
.L4:
        fildq   72(%esp)
        cmpl    $0, 76(%esp)
        jns     .L3
        fldt    .LC2
        faddp   %st, %st(1)
.L3:
        fstpl   24(%esp)
        movsd   24(%esp), %xmm0
        mulsd   56(%esp), %xmm0
        movsd   %xmm0, 40(%esp)
        movl    68(%esp), %eax
        movl    4(%eax), %edx
        movl    (%eax), %eax
        movl    52(%esp), %ecx
        movl    %eax, 16(%esp)
        movl    %edx, 20(%esp)
        movq    16(%esp), %mm0
        movntq  %mm0, (%ecx)
        addl    $1, 72(%esp)
        adcl    $0, 76(%esp)
.L2:
        cmpl    $0, 76(%esp)
        jb      .L4
        cmpl    $0, 76(%esp)
        ja      .L6
        cmpl    $9, 72(%esp)
        jbe     .L4


I've got a NAN why I shouldn't !?

The instruction that generates that x87 code is the line
(double)i * coeff; where i is an unsigned long long and coeff is a double. 
It works well if for instance 'i' is a long/int/unsigned int.

Maybe that behavior is expected ?

BR

Olivier


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
@ 2011-06-01 14:25 ` rguenth at gcc dot gnu.org
  2011-06-01 17:58 ` ubizjak at gmail dot com
                   ` (16 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: rguenth at gcc dot gnu.org @ 2011-06-01 14:25 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

Richard Guenther <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Target|                            |i?86-*-*
             Status|UNCONFIRMED                 |NEW
           Keywords|                            |wrong-code
   Last reconfirmed|                            |2011.06.01 14:25:20
          Component|c                           |target
     Ever Confirmed|0                           |1
      Known to fail|                            |3.3.3, 4.5.4, 4.7.0

--- Comment #1 from Richard Guenther <rguenth at gcc dot gnu.org> 2011-06-01 14:25:20 UTC ---
Confirmed.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
  2011-06-01 14:25 ` [Bug target/49257] " rguenth at gcc dot gnu.org
@ 2011-06-01 17:58 ` ubizjak at gmail dot com
  2011-06-01 18:28 ` ubizjak at gmail dot com
                   ` (15 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2011-06-01 17:58 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #2 from Uros Bizjak <ubizjak at gmail dot com> 2011-06-01 17:57:44 UTC ---
The problem is in floatunsdidf2 expander that is currently disabled on x86_32
due to TARGET_KEEPS_VECTOR_ALIGNED_STACK. Without this expander, gcc generates
unsigned DImode->XFmode conversion using floatdixf pattern with XFmode x87
registers.

Since latest x86_32 ABI linux ABI mandates 16 byte alignment, we can perhaps:

Index: config/i386/linux.h
===================================================================
--- config/i386/linux.h    (revision 174535)
+++ config/i386/linux.h    (working copy)
@@ -24,3 +24,6 @@
 #define GLIBC_DYNAMIC_LINKER "/lib/ld-linux.so.2"

 #define MD_UNWIND_SUPPORT "config/i386/linux-unwind.h"
+
+#undef TARGET_KEEPS_VECTOR_ALIGNED_STACK
+#define TARGET_KEEPS_VECTOR_ALIGNED_STACK 1


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
  2011-06-01 14:25 ` [Bug target/49257] " rguenth at gcc dot gnu.org
  2011-06-01 17:58 ` ubizjak at gmail dot com
@ 2011-06-01 18:28 ` ubizjak at gmail dot com
  2011-06-01 20:53 ` ubizjak at gmail dot com
                   ` (14 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2011-06-01 18:28 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #3 from Uros Bizjak <ubizjak at gmail dot com> 2011-06-01 18:27:49 UTC ---
Actually, there is no DImode SSE conversion instruction for 32bit targets, so
implicit conversion generates fildq insn that uses x87 registers even with
unsigned DImode operands.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (2 preceding siblings ...)
  2011-06-01 18:28 ` ubizjak at gmail dot com
@ 2011-06-01 20:53 ` ubizjak at gmail dot com
  2011-06-01 20:55 ` ubizjak at gmail dot com
                   ` (13 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2011-06-01 20:53 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #4 from Uros Bizjak <ubizjak at gmail dot com> 2011-06-01 20:52:13 UTC ---
Created attachment 24413
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=24413
Patch that wires up ix86_expand_convert_sign_didf_sse

For some reason, ix86_expand_convert_sign_didf_sse is not wired into floatdi
expander, so we never generate sse sequence for 32bit floatdi.

Attached patch also includes my pathetic attempt to implement
ix86_expand_convert_sign_disf_sse, but surely would need some help here...


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (3 preceding siblings ...)
  2011-06-01 20:53 ` ubizjak at gmail dot com
@ 2011-06-01 20:55 ` ubizjak at gmail dot com
  2011-06-06  8:44 ` rguenth at gcc dot gnu.org
                   ` (12 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2011-06-01 20:55 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

Uros Bizjak <ubizjak at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |rguenth at gcc dot gnu.org,
                   |                            |ubizjak at gmail dot com

--- Comment #5 from Uros Bizjak <ubizjak at gmail dot com> 2011-06-01 20:54:33 UTC ---
(In reply to comment #4)

> Attached patch also includes my pathetic attempt to implement
> ix86_expand_convert_sign_disf_sse, but surely would need some help here...

Let's ask the author of SSE expanders.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (4 preceding siblings ...)
  2011-06-01 20:55 ` ubizjak at gmail dot com
@ 2011-06-06  8:44 ` rguenth at gcc dot gnu.org
  2011-06-06 15:49 ` rth at gcc dot gnu.org
                   ` (11 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: rguenth at gcc dot gnu.org @ 2011-06-06  8:44 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

Richard Guenther <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |rth at gcc dot gnu.org

--- Comment #6 from Richard Guenther <rguenth at gcc dot gnu.org> 2011-06-06 08:43:00 UTC ---
(In reply to comment #5)
> (In reply to comment #4)
> 
> > Attached patch also includes my pathetic attempt to implement
> > ix86_expand_convert_sign_disf_sse, but surely would need some help here...
> 
> Let's ask the author of SSE expanders.

It was me? ;)  I think it was the other Richard.

But anyway, ix86_expand_convert_sign_disf_sse doesn't look correct for
signed numbers.  I think we should try to come up with a bit-fiddling
implementation instead, check to what the soft-fp code expands to.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (5 preceding siblings ...)
  2011-06-06  8:44 ` rguenth at gcc dot gnu.org
@ 2011-06-06 15:49 ` rth at gcc dot gnu.org
  2011-06-06 18:34 ` ubizjak at gmail dot com
                   ` (10 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: rth at gcc dot gnu.org @ 2011-06-06 15:49 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #7 from Richard Henderson <rth at gcc dot gnu.org> 2011-06-06 15:49:06 UTC ---
Uros, the code you generate has a double-rounding error.

You can generate code inline if you have access to SSE2,
by converting the pieces to DFmode and then truncating 
to SFmode.  Since DFmode has > 2x the number of bits, we
don't get a double rounding bug.

Otherwise, we *do* have an algorithm to do DI->SF conversion
in libgcc2.c.  See the last bit of ifdeffery there; the
amount of code is really quite large.

Unfortunately, we'd need to define some new symbol in
libgcc to do this completely in SSE mode.  The default
calling conventions would use the FP stack to return the
results.

^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (6 preceding siblings ...)
  2011-06-06 15:49 ` rth at gcc dot gnu.org
@ 2011-06-06 18:34 ` ubizjak at gmail dot com
  2011-06-06 19:02 ` rth at gcc dot gnu.org
                   ` (9 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2011-06-06 18:34 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

Uros Bizjak <ubizjak at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
  Attachment #24413|0                           |1
        is obsolete|                            |
             Status|NEW                         |ASSIGNED
         AssignedTo|unassigned at gcc dot       |ubizjak at gmail dot com
                   |gnu.org                     |

--- Comment #8 from Uros Bizjak <ubizjak at gmail dot com> 2011-06-06 18:33:50 UTC ---
Created attachment 24451
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=24451
Patch in testing.

Patch that wires up ix86_expand_convert_sign_didf_sse and uses this function to
implement ix86_expand_convert_sign_disf_sse.

The patch fixes the testcase, but in addition to -msse2 -mfpmath=sse,
-mno-80387 is also needed to prevent expansion through floatdixf


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (7 preceding siblings ...)
  2011-06-06 18:34 ` ubizjak at gmail dot com
@ 2011-06-06 19:02 ` rth at gcc dot gnu.org
  2011-06-06 21:08 ` ubizjak at gmail dot com
                   ` (8 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: rth at gcc dot gnu.org @ 2011-06-06 19:02 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #9 from Richard Henderson <rth at gcc dot gnu.org> 2011-06-06 19:01:53 UTC ---
Looks good to me.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (8 preceding siblings ...)
  2011-06-06 19:02 ` rth at gcc dot gnu.org
@ 2011-06-06 21:08 ` ubizjak at gmail dot com
  2011-06-07  8:39 ` ubizjak at gmail dot com
                   ` (7 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2011-06-06 21:08 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

Uros Bizjak <ubizjak at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
  Attachment #24451|0                           |1
        is obsolete|                            |

--- Comment #10 from Uros Bizjak <ubizjak at gmail dot com> 2011-06-06 21:07:25 UTC ---
Created attachment 24453
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=24453
Updated patch

Slightly updated patch.  Unfortunately, it fails on -m32 target, configured
with -with-fpmath=sse:

Running /home/uros/gcc-svn/trunk/gcc/testsuite/gcc.dg/torture/dg-torture.exp
...
FAIL: gcc.dg/torture/fp-int-convert-float.c  -O0  execution test
FAIL: gcc.dg/torture/fp-int-convert-float.c  -O1  execution test
FAIL: gcc.dg/torture/fp-int-convert-float.c  -O2  execution test
FAIL: gcc.dg/torture/fp-int-convert-float.c  -O3 -fomit-frame-pointer 
execution test
FAIL: gcc.dg/torture/fp-int-convert-float.c  -O3 -g  execution test
FAIL: gcc.dg/torture/fp-int-convert-float.c  -Os  execution test
FAIL: gcc.dg/torture/fp-int-convert-float.c  -O2 -flto -flto-partition=none 
execution test
FAIL: gcc.dg/torture/fp-int-convert-float.c  -O2 -flto  execution test

^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (9 preceding siblings ...)
  2011-06-06 21:08 ` ubizjak at gmail dot com
@ 2011-06-07  8:39 ` ubizjak at gmail dot com
  2011-06-07 13:18 ` ubizjak at gmail dot com
                   ` (6 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2011-06-07  8:39 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #11 from Uros Bizjak <ubizjak at gmail dot com> 2011-06-07 08:38:45 UTC ---
Patched gcc fails following testcase:

--cut here--
#include <stdio.h>

float
__attribute__((noinline))
ll2f (long long i)
{
  return i;
}

long long
__attribute__((noinline))
f2ll (float f)
{
  return f;
}

int main ()
{
  static volatile long long ll = 0x4000004000000001ll;
  static volatile float f;
  static volatile long long o;

  f = ll2f (ll);
  o = f2ll (f);

  printf ("%#llx %f %#llx\n", ll, f, o);
  return 0;
}
--cut here--

gcc -m32 -O2 -msse2 -mfpmath=sse:
0x4000004000000001 4611686018427387904.000000 0x4000000000000000

gcc -m32 -O2 -msse2 -mfpmath=387:
0x4000004000000001 4611686568183201792.000000 0x4000008000000000

A bit has been lost...


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (10 preceding siblings ...)
  2011-06-07  8:39 ` ubizjak at gmail dot com
@ 2011-06-07 13:18 ` ubizjak at gmail dot com
  2011-06-07 18:41 ` rth at gcc dot gnu.org
                   ` (5 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2011-06-07 13:18 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #12 from Uros Bizjak <ubizjak at gmail dot com> 2011-06-07 13:16:37 UTC ---
(In reply to comment #11)

> A bit has been lost...

... due to additional cvtsd2ss conversion.

Any help would be appreciated here.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (11 preceding siblings ...)
  2011-06-07 13:18 ` ubizjak at gmail dot com
@ 2011-06-07 18:41 ` rth at gcc dot gnu.org
  2011-06-07 23:13 ` rth at gcc dot gnu.org
                   ` (4 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: rth at gcc dot gnu.org @ 2011-06-07 18:41 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #13 from Richard Henderson <rth at gcc dot gnu.org> 2011-06-07 18:40:48 UTC ---
I apologize.  The error you're seeing here is the sort that's
handled by the second #if section in libgcc's __floatdisf.

In particular,

  /* Protect against double-rounding error.
     Represent any low-order bits, that might be truncated by a bit that
     won't be lost.  The bit can go in anywhere below the rounding position
     of the FSTYPE.  A fixed mask and bit position handles all usual
     configurations.  */
  if (! (- ((DWtype) 1 << FSIZE) < u
         && u < ((DWtype) 1 << FSIZE)))

This is because, while DF > 2*SF bits, DF < DI bits, so we've
already lost a bit while converting to DFmode.

This doesn't appear to be supportable inline without excessive code bloat.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (12 preceding siblings ...)
  2011-06-07 18:41 ` rth at gcc dot gnu.org
@ 2011-06-07 23:13 ` rth at gcc dot gnu.org
  2011-06-08  6:14 ` jakub at gcc dot gnu.org
                   ` (3 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: rth at gcc dot gnu.org @ 2011-06-07 23:13 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

Richard Henderson <rth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
  Attachment #24453|0                           |1
        is obsolete|                            |

--- Comment #14 from Richard Henderson <rth at gcc dot gnu.org> 2011-06-07 23:12:17 UTC ---
Created attachment 24465
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=24465
Hand coded assembly for DI->SF conversion

Instead of inlining all of this, or using a "real" libcall which
would force all of the %xmm, %st, and %mm registers to the stack.

(define_insn "*floatdisf_sse_32"
  [(set (match_operand:SF 0 "register_operand" "=Yz")
        (float:SF (match_operand:DI 1 "register_operand" "A")))
   (clobber (match_scratch:SI 2 "=a")
   (clobber (match_scratch:SI 3 "=d")
   (clobber (match_scratch:SI 4 "=c")]
  "!TARGET_64BIT && SSE_FLOAT_MODE_P (SFmode) && TARGET_SSE_MATH"
  "call __floatdisf_sse"
  [(set_attr "length" "5")
   (set_attr "type" "call")])

Note that the assembly is very carefuly to avoid clobbering 
anything but eax, edx, ecx, and could probably even avoid that
if necessary.

Thoughts?


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (13 preceding siblings ...)
  2011-06-07 23:13 ` rth at gcc dot gnu.org
@ 2011-06-08  6:14 ` jakub at gcc dot gnu.org
  2011-06-08 10:32 ` ubizjak at gmail dot com
                   ` (2 subsequent siblings)
  17 siblings, 0 replies; 19+ messages in thread
From: jakub at gcc dot gnu.org @ 2011-06-08  6:14 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

Jakub Jelinek <jakub at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jakub at gcc dot gnu.org

--- Comment #15 from Jakub Jelinek <jakub at gcc dot gnu.org> 2011-06-08 06:14:19 UTC ---
It would need @plt for flag_pic, and somehow ensure that the %ebx is
initialized, plus whatever else is needed for calls on e.g. darwin etc.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (14 preceding siblings ...)
  2011-06-08  6:14 ` jakub at gcc dot gnu.org
@ 2011-06-08 10:32 ` ubizjak at gmail dot com
  2011-06-09 15:41 ` rth at gcc dot gnu.org
  2012-01-20 11:31 ` ubizjak at gmail dot com
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2011-06-08 10:32 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #16 from Uros Bizjak <ubizjak at gmail dot com> 2011-06-08 10:31:13 UTC ---
(In reply to comment #15)
> It would need @plt for flag_pic, and somehow ensure that the %ebx is
> initialized, plus whatever else is needed for calls on e.g. darwin etc.

We can probably use (parts of) i386.c,ix86_expand_call here.


^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (15 preceding siblings ...)
  2011-06-08 10:32 ` ubizjak at gmail dot com
@ 2011-06-09 15:41 ` rth at gcc dot gnu.org
  2012-01-20 11:31 ` ubizjak at gmail dot com
  17 siblings, 0 replies; 19+ messages in thread
From: rth at gcc dot gnu.org @ 2011-06-09 15:41 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

--- Comment #17 from Richard Henderson <rth at gcc dot gnu.org> 2011-06-09 15:39:33 UTC ---
The Problem here is that using the 387 for these conversions is
normally a Good Thing.  Even when we're not mixing 387 and SSE math,
the 387 can do the conversion in 1 insn.  Add a couple more for 
pushing the data between functional units and we've *still* got code
that's significantly smaller and faster than any pure SSE alternative.

Ignoring SSE and -mfpmath=sse for a moment, we've said that you
simply can't use FP math in a region that's using MMX operations
because that must needs use the FPU in 387 mode.  I'm not sure
that this stance should be altered just because -mfpmath=sse is
in effect.

Is there some Really Good Reason you're using MMX at all?  I mean,
you've explicitly said via compiler flags that the target cpu is
SSE enabled.  What can MMX do that SSE2 cannot?

We could do something with the code written in this thread in the
context of -mno-80387 -msse, but I'm not really sure it's worth it.
How often would it really be used, honestly?

I'm inclined to mark this bug as either INVALID or WONTFIX.

^ permalink raw reply	[flat|nested] 19+ messages in thread

* [Bug target/49257] -mfpmath=sse generates x87 instructions on 32 bits OS
  2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
                   ` (16 preceding siblings ...)
  2011-06-09 15:41 ` rth at gcc dot gnu.org
@ 2012-01-20 11:31 ` ubizjak at gmail dot com
  17 siblings, 0 replies; 19+ messages in thread
From: ubizjak at gmail dot com @ 2012-01-20 11:31 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49257

Uros Bizjak <ubizjak at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
         Resolution|                            |WONTFIX

--- Comment #18 from Uros Bizjak <ubizjak at gmail dot com> 2012-01-20 11:06:16 UTC ---
The only way this could work is to put __builtin_ia32_emms into the loop, after
__builtin_ia32_movntq. -mfpmath=sse does not mean that x87 is disabled, only
that equivalent arithmetic instructions use SSE instructions. If there is no
equivalent SSE insn, x87 insn is used.

IIRC, even Intel's Instruction set reference suggests to group FP and MMX insn
together and put emms after MMX block.

Since a substantial effort would be needed to fix this questionable corner
case, and the test is violating recommended practice, I'm marking this PR as
WONTFIX.


^ permalink raw reply	[flat|nested] 19+ messages in thread

end of thread, other threads:[~2012-01-20 11:08 UTC | newest]

Thread overview: 19+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-06-01 13:57 [Bug c/49257] New: -mfpmath=sse generates x87 instructions on 32 bits OS olivier_maury at mentor dot com
2011-06-01 14:25 ` [Bug target/49257] " rguenth at gcc dot gnu.org
2011-06-01 17:58 ` ubizjak at gmail dot com
2011-06-01 18:28 ` ubizjak at gmail dot com
2011-06-01 20:53 ` ubizjak at gmail dot com
2011-06-01 20:55 ` ubizjak at gmail dot com
2011-06-06  8:44 ` rguenth at gcc dot gnu.org
2011-06-06 15:49 ` rth at gcc dot gnu.org
2011-06-06 18:34 ` ubizjak at gmail dot com
2011-06-06 19:02 ` rth at gcc dot gnu.org
2011-06-06 21:08 ` ubizjak at gmail dot com
2011-06-07  8:39 ` ubizjak at gmail dot com
2011-06-07 13:18 ` ubizjak at gmail dot com
2011-06-07 18:41 ` rth at gcc dot gnu.org
2011-06-07 23:13 ` rth at gcc dot gnu.org
2011-06-08  6:14 ` jakub at gcc dot gnu.org
2011-06-08 10:32 ` ubizjak at gmail dot com
2011-06-09 15:41 ` rth at gcc dot gnu.org
2012-01-20 11:31 ` ubizjak at gmail dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).