public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse
@ 2005-01-04  8:10 uros at kss-loka dot si
  2005-01-04  8:27 ` [Bug target/19250] " pinskia at gcc dot gnu dot org
                   ` (8 more replies)
  0 siblings, 9 replies; 10+ messages in thread
From: uros at kss-loka dot si @ 2005-01-04  8:10 UTC (permalink / raw)
  To: gcc-bugs

This testcase should generate min{s,d}s, max{s,d}s SSE insn. It looks that
min?f, max?f pattern is converted to equivalent i387 insn sequence, because the
output is expected in FP reg. However, the result of min/max should be _moved_
from SSE to FP reg.

Testcase:

float minf(float a, float b) {
  return a <= b ? a : b;
}

when compiled with 'gcc -O2 -ffast-math -march=pentium4 -mfpmath=sse
-fomit-frame-pointer':

minf:
        subl    $4, %esp
        flds    8(%esp)
        movss   12(%esp), %xmm0
        movss   %xmm0, (%esp)
        flds    (%esp)
        fcomi   %st(1), %st
        fcmovnb %st(1), %st
        fstp    %st(1)
        addl    $4, %esp
        ret

Equivalent code could be something like:
minf:
	subl	$4, %esp
	movss	8(%esp), %xmm0
	minss	%xmm0, 12(%esp)
	movss	%xmm0, (%esp)
	flds	(%esp)
	addl	$4, %esp
	ret

Uros.

-- 
           Summary: minss/maxss SSE insn not generated for -mfpmath=sse
           Product: gcc
           Version: 4.0.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P2
         Component: target
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: uros at kss-loka dot si
                CC: gcc-bugs at gcc dot gnu dot org
 GCC build triplet: i686-pc-linux-gnu
  GCC host triplet: i686-pc-linux-gnu
GCC target triplet: i686-pc-linux-gnu


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/19250] minss/maxss SSE insn not generated for -mfpmath=sse
  2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
@ 2005-01-04  8:27 ` pinskia at gcc dot gnu dot org
  2005-01-04  9:05 ` aj at gcc dot gnu dot org
                   ` (7 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2005-01-04  8:27 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From pinskia at gcc dot gnu dot org  2005-01-04 08:27 -------
Confirmed but note this is a register allocator problem really.
in lreg:
(insn:HI 12 8 16 0 (parallel [
            (set (reg/v:SF 59 [ a ])
                (if_then_else:SF (lt (reg/v:SF 59 [ a ])
                        (reg/v:SF 60 [ b ]))
                    (reg/v:SF 59 [ a ])
                    (reg/v:SF 60 [ b ])))
            (clobber (reg:CC 17 flags))
        ]) 654 {*minsf_nonieee} (insn_list:REG_DEP_TRUE 6 (insn_list:REG_DEP_TRUE 7 (nil)))
    (expr_list:REG_UNUSED (reg:CC 17 flags)
        (expr_list:REG_DEAD (reg/v:SF 60 [ b ])
            (expr_list:REG_UNUSED (reg:CC 17 flags)
                (nil)))))

After global/postreload:
(insn:HI 12 32 16 0 (parallel [
            (set (reg/v:SF 8 st [orig:59 a ] [59])
                (if_then_else:SF (lt (reg/v:SF 8 st [orig:59 a ] [59])
                        (reg:SF 9 st(1)))
                    (reg/v:SF 8 st [orig:59 a ] [59])
                    (reg:SF 9 st(1))))
            (clobber (reg:CC 17 flags))
        ]) 654 {*minsf_nonieee} (insn_list:REG_DEP_TRUE 6 (insn_list:REG_DEP_TRUE 7 (nil)))
    (nil))

Note we chose the fp "registers".

-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
     Ever Confirmed|                            |1
           Keywords|                            |missed-optimization
   Last reconfirmed|0000-00-00 00:00:00         |2005-01-04 08:27:31
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/19250] minss/maxss SSE insn not generated for -mfpmath=sse
  2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
  2005-01-04  8:27 ` [Bug target/19250] " pinskia at gcc dot gnu dot org
@ 2005-01-04  9:05 ` aj at gcc dot gnu dot org
  2005-01-07 11:15 ` uros at kss-loka dot si
                   ` (6 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: aj at gcc dot gnu dot org @ 2005-01-04  9:05 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From aj at gcc dot gnu dot org  2005-01-04 09:05 -------
Btw. we do the right thing for 64-bit x86-64: 
 
$ gcc -O2 -ffast-math  -c t.c 
$ objdump -d t.o 
t.o:     file format elf64-x86-64 
 
Disassembly of section .text: 
 
0000000000000000 <minf>: 
   0:   f3 0f 5d c1             minss  %xmm1,%xmm0 
   4:   c3                      retq 
 
With -m32 I get the same ugly code. 

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/19250] minss/maxss SSE insn not generated for -mfpmath=sse
  2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
  2005-01-04  8:27 ` [Bug target/19250] " pinskia at gcc dot gnu dot org
  2005-01-04  9:05 ` aj at gcc dot gnu dot org
@ 2005-01-07 11:15 ` uros at kss-loka dot si
  2005-01-12  2:48 ` rth at gcc dot gnu dot org
                   ` (5 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: uros at kss-loka dot si @ 2005-01-07 11:15 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From uros at kss-loka dot si  2005-01-07 11:15 -------
The same trick as in http://gcc.gnu.org/ml/gcc-patches/2005-01/msg00394.html
with sqrtf applied to return value gets the code as expected.
minf:
        subl    $4, %esp
        movss   8(%esp), %xmm0
        movss   12(%esp), %xmm1
        minss   %xmm1, %xmm0
        sqrtss  %xmm0, %xmm0
        movss   %xmm0, (%esp)
        flds    (%esp)
        addl    $4, %esp
        ret


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/19250] minss/maxss SSE insn not generated for -mfpmath=sse
  2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
                   ` (2 preceding siblings ...)
  2005-01-07 11:15 ` uros at kss-loka dot si
@ 2005-01-12  2:48 ` rth at gcc dot gnu dot org
  2005-01-14  0:34 ` cvs-commit at gcc dot gnu dot org
                   ` (4 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: rth at gcc dot gnu dot org @ 2005-01-12  2:48 UTC (permalink / raw)
  To: gcc-bugs



-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
         AssignedTo|unassigned at gcc dot gnu   |rth at gcc dot gnu dot org
                   |dot org                     |
             Status|NEW                         |ASSIGNED
   Last reconfirmed|2005-01-04 08:27:31         |2005-01-12 02:48:34
               date|                            |


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/19250] minss/maxss SSE insn not generated for -mfpmath=sse
  2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
                   ` (3 preceding siblings ...)
  2005-01-12  2:48 ` rth at gcc dot gnu dot org
@ 2005-01-14  0:34 ` cvs-commit at gcc dot gnu dot org
  2005-01-14  1:03 ` rth at gcc dot gnu dot org
                   ` (3 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: cvs-commit at gcc dot gnu dot org @ 2005-01-14  0:34 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From cvs-commit at gcc dot gnu dot org  2005-01-14 00:34 -------
Subject: Bug 19250

CVSROOT:	/cvs/gcc
Module name:	gcc
Changes by:	rth@gcc.gnu.org	2005-01-14 00:33:51

Modified files:
	gcc            : ChangeLog 
	gcc/config/i386: i386-protos.h i386.c i386.md 

Log message:
	PR target/19099
	PR target/19250
	PR target/19252
	* config/i386/i386.md (cmpdf, cmpsf, bunordered, bordered, buneq,
	bunge, bungt, bunle, bunlt, bltgt): Enable for TARGET_SSE_MATH,
	not just TARGET_SSE.
	(cmpfp_i_387): Rename from cmpfp_i.  Move after sse patterns.
	(cmpfp_i_mixed): Rename from cmpfp_i_sse; use for TARGET_MIX_SSE_I387.
	(cmpfp_i_sse): Rename from cmpfp_i_sse_only; use for TARGET_SSE_MATH.
	(cmpfp_iu_mixed, cmpfp_iu_sse, cmpfp_iu_387): Similarly.
	(fp_jcc_1_mixed, fp_jcc_1_sse, fp_jcc_1_387): Similarly.
	(fp_jcc_2_mixed, fp_jcc_2_sse, fp_jcc_2_387): Similarly.
	(fp_jcc_3_387, fp_jcc_4_387, fp_jcc_5_387, fp_jcc_6_387,
	fp_jcc_7_387, fp_jcc_8_387): Rename from fp_jcc_N.
	(movdicc_c_rex64): Rename with '*'.
	(movsfcc, movdfcc): Add checks for 387 and sse math to condition.
	(movsfcc_1_sse_min, movsfcc_1_sse_max, movsfcc_1_sse): New.
	(movsfcc_1_387): Rename from movsfcc_1.
	(movdfcc_1_sse_min, movdfcc_1_sse_max, movdfcc_1_sse): New.
	(movdfcc_1, movdfcc_1_rex64): Add check for 387.
	(sminsf3, smaxsf3, smindf3, smaxdf3): New.
	(minsf3, minsf, minsf_nonieee, minsf_sse, mindf3, mindf,
	mindf_nonieee, mindf_sse, maxsf3, maxsf, maxsf_nonieee, maxsf_sse,
	maxdf3, maxdf, maxdf_nonieee, maxdf_sse, sse_movsfcc, sse_movsfcc_eq,
	sse_movdfcc, sse_movdfcc_eq, sse_movsfcc_const0_1,
	sse_movsfcc_const0_2, sse_movsfcc_const0_3, sse_movsfcc_const0_4,
	sse_movdfcc_const0_1, sse_movdfcc_const0_2, sse_movdfcc_const0_3,
	sse_movdfcc_const0_4): Remove.
	* config/i386/i386.c (ix86_expand_fp_movcc): For TARGET_SSE_MATH,
	recognize min/max early.  Update for changed sse cmove patterns.
	(ix86_split_sse_movcc): New.
	* config/i386/i386-protos.h: Update.

Patches:
http://gcc.gnu.org/cgi-bin/cvsweb.cgi/gcc/gcc/ChangeLog.diff?cvsroot=gcc&r1=2.7114&r2=2.7115
http://gcc.gnu.org/cgi-bin/cvsweb.cgi/gcc/gcc/config/i386/i386-protos.h.diff?cvsroot=gcc&r1=1.125&r2=1.126
http://gcc.gnu.org/cgi-bin/cvsweb.cgi/gcc/gcc/config/i386/i386.c.diff?cvsroot=gcc&r1=1.776&r2=1.777
http://gcc.gnu.org/cgi-bin/cvsweb.cgi/gcc/gcc/config/i386/i386.md.diff?cvsroot=gcc&r1=1.605&r2=1.606



-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/19250] minss/maxss SSE insn not generated for -mfpmath=sse
  2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
                   ` (4 preceding siblings ...)
  2005-01-14  0:34 ` cvs-commit at gcc dot gnu dot org
@ 2005-01-14  1:03 ` rth at gcc dot gnu dot org
  2005-01-14  1:05 ` pinskia at gcc dot gnu dot org
                   ` (2 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: rth at gcc dot gnu dot org @ 2005-01-14  1:03 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From rth at gcc dot gnu dot org  2005-01-14 01:03 -------
I believe the problem you ascribe to this bug is fixed.  Note that we do not
generate minss for the given example because "<=" is not the operation of the
minss instruction; it performs "<".  Which is relevant for "+0.0 < -0.0".

We *ought* to emit minss with -ffast-math, but that should happen via
noce_try_minmax invoking the sminsf3 pattern, rather than special-casing
this in the back-end.  Open a separate PR for that if you like.

But on the bright side, we at least generate a conditional move sequence:

        cmpless %xmm1, %xmm0
        andps   %xmm0, %xmm2
        andnps  %xmm1, %xmm0
        orps    %xmm2, %xmm0


-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
         Resolution|                            |FIXED


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/19250] minss/maxss SSE insn not generated for -mfpmath=sse
  2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
                   ` (5 preceding siblings ...)
  2005-01-14  1:03 ` rth at gcc dot gnu dot org
@ 2005-01-14  1:05 ` pinskia at gcc dot gnu dot org
  2005-01-14  6:37 ` uros at kss-loka dot si
  2005-01-18 12:38 ` uros at kss-loka dot si
  8 siblings, 0 replies; 10+ messages in thread
From: pinskia at gcc dot gnu dot org @ 2005-01-14  1:05 UTC (permalink / raw)
  To: gcc-bugs



-- 
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|---                         |4.0.0


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/19250] minss/maxss SSE insn not generated for -mfpmath=sse
  2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
                   ` (6 preceding siblings ...)
  2005-01-14  1:05 ` pinskia at gcc dot gnu dot org
@ 2005-01-14  6:37 ` uros at kss-loka dot si
  2005-01-18 12:38 ` uros at kss-loka dot si
  8 siblings, 0 replies; 10+ messages in thread
From: uros at kss-loka dot si @ 2005-01-14  6:37 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From uros at kss-loka dot si  2005-01-14 06:37 -------
This comment was found in fold-const.c:

  /* Try some transformations of A op B ? A : B.

     A == B? A : B    same as B
     A != B? A : B    same as A
     A >= B? A : B    same as max (A, B)
     A > B?  A : B    same as max (B, A)
     A <= B? A : B    same as min (A, B)
     A < B?  A : B    same as min (B, A)

     As above, these transformations don't work in the presence
     of signed zeros.  For example, if A and B are zeros of
     opposite sign, the first two transformations will change
     the sign of the result.  In the last four, the original
     expressions give different results for (A=+0, B=-0) and
     (A=-0, B=+0), but the transformed expressions do not.

     The first two transformations are correct if either A or B
     is a NaN.  In the first transformation, the condition will
     be false, and B will indeed be chosen.  In the case of the
     second transformation, the condition A != B will be true,
     and A will be chosen.

     The conversions to max() and min() are not correct if B is
     a number and A is not.  The conditions in the original
     expressions will be false, so all four give B.  The min()
     and max() versions would give a NaN instead.  */

The testcase is ineed a bit unfortunate. I'll open an enhancement request
regarding -ffast-math, as you suggested.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug target/19250] minss/maxss SSE insn not generated for -mfpmath=sse
  2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
                   ` (7 preceding siblings ...)
  2005-01-14  6:37 ` uros at kss-loka dot si
@ 2005-01-18 12:38 ` uros at kss-loka dot si
  8 siblings, 0 replies; 10+ messages in thread
From: uros at kss-loka dot si @ 2005-01-18 12:38 UTC (permalink / raw)
  To: gcc-bugs


------- Additional Comments From uros at kss-loka dot si  2005-01-18 12:38 -------
With current mainline, testcase from description generates minss instruction as
expected, when -ffast-math is used.

gcc -O2 -msse2 -mfpmath=sse -ffast-math:

minf:
        pushl   %ebp
        movl    %esp, %ebp
        subl    $4, %esp
        movss   12(%ebp), %xmm0
        minss   8(%ebp), %xmm0
        movss   %xmm0, -4(%ebp)
        flds    -4(%ebp)
        leave
        ret

It also works for max and all other cases from comment #6, so there is in fact
no need for a new PR.

-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=19250


^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2005-01-18 12:38 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2005-01-04  8:10 [Bug target/19250] New: minss/maxss SSE insn not generated for -mfpmath=sse uros at kss-loka dot si
2005-01-04  8:27 ` [Bug target/19250] " pinskia at gcc dot gnu dot org
2005-01-04  9:05 ` aj at gcc dot gnu dot org
2005-01-07 11:15 ` uros at kss-loka dot si
2005-01-12  2:48 ` rth at gcc dot gnu dot org
2005-01-14  0:34 ` cvs-commit at gcc dot gnu dot org
2005-01-14  1:03 ` rth at gcc dot gnu dot org
2005-01-14  1:05 ` pinskia at gcc dot gnu dot org
2005-01-14  6:37 ` uros at kss-loka dot si
2005-01-18 12:38 ` uros at kss-loka dot si

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).