* [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations
@ 2017-06-24 3:59 Andrew Pinski
2017-06-24 3:59 ` Andrew Pinski
2017-06-24 6:50 ` Marc Glisse
0 siblings, 2 replies; 7+ messages in thread
From: Andrew Pinski @ 2017-06-24 3:59 UTC (permalink / raw)
To: GCC Patches
Hi,
I saw this on llvm's review site (https://reviews.llvm.org/D34579)
and I thought why not add it to GCC. I expanded more than what was
done on the LLVM patch.
I added the following optimizations:
Transform X * (X > 0 ? 1 : -1) into ABS(X).
Transform X * (X >= 0 ? 1 : -1) into ABS(X).
Transform X * (X > 0.0 ? 1.0 : -1.0) into ABS(X).
Transform X * (X >= 0.0 ? 1.0 : -1.0) into ABS(X).
Transform X * (X > 0 ? -1 : 1) into -ABS(X).
Transform X * (X >= 0 ? -1 : 1) into -ABS(X).
Transform X * (X > 0.0 ? -1.0 : 1.0) into -ABS(X).
Transform X * (X >= 0.0 ? -1.0 : 1.0) into -ABS(X).
Transform X * (X < 0 ? 1 : -1) into -ABS(X).
Transform X * (X <= 0 ? 1 : -1) into -ABS(X).
Transform X * (X < 0.0 ? 1.0 : -1.0) into -ABS(X).
Transform X * (X <= 0.0 ? 1.0 : -1.0) into -ABS(X).
Transform X * (X < 0 ? -1 : 1) into ABS(X).
Transform X * (X <= 0 ? -1 : 1) into ABS(X).
Transform X * (X < 0.0 ? -1.0 : 1.0) into ABS(X).
Transform X * (X <= 0.0 ? -1.0 : 1.0) into ABS(X).
The floating points ones only happen when not honoring SNANS and not
honoring signed zeros.
OK? Bootstrapped and tested on aarch64-linux-gnu with no regressions.
Thanks,
Andrew Pinski
ChangeLog:
* match.pd ( X * (X >/>=/</<= 0 ? 1 : -1)): New patterns.
Testsuite/ChangeLog:
* testsuite/gcc.dg/tree-ssa/mult-abs-1.c: New testcase.
* testsuite/gcc.dg/tree-ssa/mult-abs-2.c: New testcase.
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations
2017-06-24 3:59 [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations Andrew Pinski
@ 2017-06-24 3:59 ` Andrew Pinski
2017-06-24 6:50 ` Marc Glisse
1 sibling, 0 replies; 7+ messages in thread
From: Andrew Pinski @ 2017-06-24 3:59 UTC (permalink / raw)
To: GCC Patches
[-- Attachment #1: Type: text/plain, Size: 1523 bytes --]
Forgot the patch
On Fri, Jun 23, 2017 at 8:59 PM, Andrew Pinski <pinskia@gmail.com> wrote:
> Hi,
> I saw this on llvm's review site (https://reviews.llvm.org/D34579)
> and I thought why not add it to GCC. I expanded more than what was
> done on the LLVM patch.
>
> I added the following optimizations:
> Transform X * (X > 0 ? 1 : -1) into ABS(X).
> Transform X * (X >= 0 ? 1 : -1) into ABS(X).
> Transform X * (X > 0.0 ? 1.0 : -1.0) into ABS(X).
> Transform X * (X >= 0.0 ? 1.0 : -1.0) into ABS(X).
> Transform X * (X > 0 ? -1 : 1) into -ABS(X).
> Transform X * (X >= 0 ? -1 : 1) into -ABS(X).
> Transform X * (X > 0.0 ? -1.0 : 1.0) into -ABS(X).
> Transform X * (X >= 0.0 ? -1.0 : 1.0) into -ABS(X).
> Transform X * (X < 0 ? 1 : -1) into -ABS(X).
> Transform X * (X <= 0 ? 1 : -1) into -ABS(X).
> Transform X * (X < 0.0 ? 1.0 : -1.0) into -ABS(X).
> Transform X * (X <= 0.0 ? 1.0 : -1.0) into -ABS(X).
> Transform X * (X < 0 ? -1 : 1) into ABS(X).
> Transform X * (X <= 0 ? -1 : 1) into ABS(X).
> Transform X * (X < 0.0 ? -1.0 : 1.0) into ABS(X).
> Transform X * (X <= 0.0 ? -1.0 : 1.0) into ABS(X).
>
> The floating points ones only happen when not honoring SNANS and not
> honoring signed zeros.
>
> OK? Bootstrapped and tested on aarch64-linux-gnu with no regressions.
>
> Thanks,
> Andrew Pinski
>
> ChangeLog:
> * match.pd ( X * (X >/>=/</<= 0 ? 1 : -1)): New patterns.
>
> Testsuite/ChangeLog:
> * testsuite/gcc.dg/tree-ssa/mult-abs-1.c: New testcase.
> * testsuite/gcc.dg/tree-ssa/mult-abs-2.c: New testcase.
[-- Attachment #2: mult-abs.diff.txt --]
[-- Type: text/plain, Size: 4029 bytes --]
Index: match.pd
===================================================================
--- match.pd (revision 249613)
+++ match.pd (working copy)
@@ -155,6 +155,55 @@
|| !COMPLEX_FLOAT_TYPE_P (type)))
(negate @0)))
+(for cmp (gt ge)
+ /* Transform X * (X > 0 ? 1 : -1) into ABS(X). */
+ /* Transform X * (X >= 0 ? 1 : -1) into ABS(X). */
+ (simplify
+ (mult:c @0 (cond (cmp @0 integer_zerop) integer_onep integer_all_onesp))
+ (abs @0))
+ /* Transform X * (X > 0.0 ? 1.0 : -1.0) into ABS(X). */
+ /* Transform X * (X >= 0.0 ? 1.0 : -1.0) into ABS(X). */
+ (simplify
+ (mult:c @0 (cond (cmp @0 real_zerop) real_onep real_minus_onep))
+ (if (!HONOR_SNANS (type) && !HONOR_SIGNED_ZEROS (type))
+ (abs @0)))
+ /* Transform X * (X > 0 ? -1 : 1) into -ABS(X). */
+ /* Transform X * (X >= 0 ? -1 : 1) into -ABS(X). */
+ (simplify
+ (mult:c @0 (cond (cmp @0 integer_zerop) integer_all_onesp integer_onep))
+ (negate (abs @0)))
+ /* Transform X * (X > 0.0 ? -1.0 : 1.0) into -ABS(X). */
+ /* Transform X * (X >= 0.0 ? -1.0 : 1.0) into -ABS(X). */
+ (simplify
+ (mult:c @0 (cond (cmp @0 real_zerop) real_minus_onep real_onep))
+ (if (!HONOR_SNANS (type) && !HONOR_SIGNED_ZEROS (type))
+ (negate (abs @0)))))
+
+(for cmp (lt le)
+ /* Transform X * (X < 0 ? 1 : -1) into -ABS(X). */
+ /* Transform X * (X <= 0 ? 1 : -1) into -ABS(X). */
+ (simplify
+ (mult:c @0 (cond (cmp @0 integer_zerop) integer_onep integer_all_onesp))
+ (negate (abs @0)))
+ /* Transform X * (X < 0.0 ? 1.0 : -1.0) into -ABS(X). */
+ /* Transform X * (X <= 0.0 ? 1.0 : -1.0) into -ABS(X). */
+ (simplify
+ (mult:c @0 (cond (cmp @0 real_zerop) real_onep real_minus_onep))
+ (if (!HONOR_SNANS (type) && !HONOR_SIGNED_ZEROS (type))
+ (negate (abs @0))))
+ /* Transform X * (X < 0 ? -1 : 1) into ABS(X). */
+ /* Transform X * (X <= 0 ? -1 : 1) into ABS(X). */
+ (simplify
+ (mult:c @0 (cond (cmp @0 integer_zerop) integer_all_onesp integer_onep))
+ (abs @0))
+ /* Transform X * (X < 0.0 ? -1.0 : 1.0) into ABS(X). */
+ /* Transform X * (X <= 0.0 ? -1.0 : 1.0) into ABS(X). */
+ (simplify
+ (mult:c @0 (cond (cmp @0 real_zerop) real_minus_onep real_onep))
+ (if (!HONOR_SNANS (type) && !HONOR_SIGNED_ZEROS (type))
+ (abs @0))))
+
+
/* X * 1, X / 1 -> X. */
(for op (mult trunc_div ceil_div floor_div round_div exact_div)
(simplify
Index: testsuite/gcc.dg/tree-ssa/mult-abs-1.c
===================================================================
--- testsuite/gcc.dg/tree-ssa/mult-abs-1.c (nonexistent)
+++ testsuite/gcc.dg/tree-ssa/mult-abs-1.c (working copy)
@@ -0,0 +1,35 @@
+/* { dg-options "-O2 -fdump-tree-gimple" } */
+/* { dg-do compile } */
+int f(int x)
+{
+ return x * (x > 0 ? -1 : 1);
+}
+int f1(int x)
+{
+ return x * (x > 0 ? 1 : -1);
+}
+int g(int x)
+{
+ return x * (x >= 0 ? -1 : 1);
+}
+int g1(int x)
+{
+ return x * (x >= 0 ? 1 : -1);
+}
+int h(int x)
+{
+ return x * (x < 0 ? -1 : 1);
+}
+int h1(int x)
+{
+ return x * (x < 0 ? 1 : -1);
+}
+int i(int x)
+{
+ return x * (x <= 0 ? -1 : 1);
+}
+int i1(int x)
+{
+ return x * (x <= 0 ? 1 : -1);
+}
+/* { dg-final { scan-tree-dump-times "ABS" 8 "gimple"} } */
Index: testsuite/gcc.dg/tree-ssa/mult-abs-2.c
===================================================================
--- testsuite/gcc.dg/tree-ssa/mult-abs-2.c (nonexistent)
+++ testsuite/gcc.dg/tree-ssa/mult-abs-2.c (working copy)
@@ -0,0 +1,35 @@
+/* { dg-options "-O2 -ffast-math -fdump-tree-gimple" } */
+/* { dg-do compile } */
+float f(float x)
+{
+ return x * (x > 0.f ? -1.f : 1.f);
+}
+float f1(float x)
+{
+ return x * (x > 0.f ? 1.f : -1.f);
+}
+float g(float x)
+{
+ return x * (x >= 0.f ? -1.f : 1.f);
+}
+float g1(float x)
+{
+ return x * (x >= 0.f ? 1.f : -1.f);
+}
+float h(float x)
+{
+ return x * (x < 0.f ? -1.f : 1.f);
+}
+float h1(float x)
+{
+ return x * (x < 0.f ? 1.f : -1.f);
+}
+float i(float x)
+{
+ return x * (x <= 0.f ? -1.f : 1.f);
+}
+float i1(float x)
+{
+ return x * (x <= 0.f ? 1.f : -1.f);
+}
+/* { dg-final { scan-tree-dump-times "ABS" 8 "gimple"} } */
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations
2017-06-24 3:59 [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations Andrew Pinski
2017-06-24 3:59 ` Andrew Pinski
@ 2017-06-24 6:50 ` Marc Glisse
2017-06-24 18:51 ` Andrew Pinski
2017-06-26 15:02 ` Joseph Myers
1 sibling, 2 replies; 7+ messages in thread
From: Marc Glisse @ 2017-06-24 6:50 UTC (permalink / raw)
To: Andrew Pinski; +Cc: GCC Patches
On Fri, 23 Jun 2017, Andrew Pinski wrote:
> Hi,
> I saw this on llvm's review site (https://reviews.llvm.org/D34579)
> and I thought why not add it to GCC. I expanded more than what was
> done on the LLVM patch.
>
> I added the following optimizations:
> Transform X * (X > 0 ? 1 : -1) into ABS(X).
> Transform X * (X >= 0 ? 1 : -1) into ABS(X).
> Transform X * (X > 0.0 ? 1.0 : -1.0) into ABS(X).
> Transform X * (X >= 0.0 ? 1.0 : -1.0) into ABS(X).
> Transform X * (X > 0 ? -1 : 1) into -ABS(X).
> Transform X * (X >= 0 ? -1 : 1) into -ABS(X).
> Transform X * (X > 0.0 ? -1.0 : 1.0) into -ABS(X).
> Transform X * (X >= 0.0 ? -1.0 : 1.0) into -ABS(X).
> Transform X * (X < 0 ? 1 : -1) into -ABS(X).
> Transform X * (X <= 0 ? 1 : -1) into -ABS(X).
> Transform X * (X < 0.0 ? 1.0 : -1.0) into -ABS(X).
> Transform X * (X <= 0.0 ? 1.0 : -1.0) into -ABS(X).
> Transform X * (X < 0 ? -1 : 1) into ABS(X).
> Transform X * (X <= 0 ? -1 : 1) into ABS(X).
> Transform X * (X < 0.0 ? -1.0 : 1.0) into ABS(X).
> Transform X * (X <= 0.0 ? -1.0 : 1.0) into ABS(X).
>
> The floating points ones only happen when not honoring SNANS and not
> honoring signed zeros.
Some random comments (not a review):
* if X is NaN, we may get a qNaN with the wrong sign bit. We probably
don't care much though...
* I am surprised (X<0.?-1.:1.) and copysign(1., X) remain different for
the whole optimization pipeline with -ffast-math. X*copysign(1., X) is
another candidate to become fabs(X).
* Whenever you get -ABS(X) for integers, what about the case where X is
INT_MIN?
* I guess we can't get there with an unsigned type because X>0 would have
become X!=0 .
* I wonder if we could use something like
(for cmp (gt ge lt le)
outp (convert convert negate negate)
outn (negate negate convert convert)
[...]
(outp (abs @0))
to reduce duplication or if that would be less readable.
* Some of the cases are handled by PRE turning
# iftmp.0_1 = PHI <1.0e+0(5), -1.0e+0(3)>
_3 = iftmp.0_1 * a_2(D);
into
_5 = -a_2(D);
[...]
# iftmp.0_1 = PHI <1.0e+0(2), -1.0e+0(3)>
# prephitmp_6 = PHI <a_2(D)(2), _5(3)>
which phiopt3 can handle (quite late).
* With cond, this currently (?) only affects generic, so I am not sure it
will hit very often... But it will be there if someone later writes a
match.pd->phiopt generator ;-)
--
Marc Glisse
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations
2017-06-24 6:50 ` Marc Glisse
@ 2017-06-24 18:51 ` Andrew Pinski
2017-06-24 19:47 ` Marc Glisse
2017-06-26 15:02 ` Joseph Myers
1 sibling, 1 reply; 7+ messages in thread
From: Andrew Pinski @ 2017-06-24 18:51 UTC (permalink / raw)
To: GCC Patches
On Fri, Jun 23, 2017 at 11:50 PM, Marc Glisse <marc.glisse@inria.fr> wrote:
> On Fri, 23 Jun 2017, Andrew Pinski wrote:
>
>> Hi,
>> I saw this on llvm's review site (https://reviews.llvm.org/D34579)
>> and I thought why not add it to GCC. I expanded more than what was
>> done on the LLVM patch.
>>
>> I added the following optimizations:
>> Transform X * (X > 0 ? 1 : -1) into ABS(X).
>> Transform X * (X >= 0 ? 1 : -1) into ABS(X).
>> Transform X * (X > 0.0 ? 1.0 : -1.0) into ABS(X).
>> Transform X * (X >= 0.0 ? 1.0 : -1.0) into ABS(X).
>> Transform X * (X > 0 ? -1 : 1) into -ABS(X).
>> Transform X * (X >= 0 ? -1 : 1) into -ABS(X).
>> Transform X * (X > 0.0 ? -1.0 : 1.0) into -ABS(X).
>> Transform X * (X >= 0.0 ? -1.0 : 1.0) into -ABS(X).
>> Transform X * (X < 0 ? 1 : -1) into -ABS(X).
>> Transform X * (X <= 0 ? 1 : -1) into -ABS(X).
>> Transform X * (X < 0.0 ? 1.0 : -1.0) into -ABS(X).
>> Transform X * (X <= 0.0 ? 1.0 : -1.0) into -ABS(X).
>> Transform X * (X < 0 ? -1 : 1) into ABS(X).
>> Transform X * (X <= 0 ? -1 : 1) into ABS(X).
>> Transform X * (X < 0.0 ? -1.0 : 1.0) into ABS(X).
>> Transform X * (X <= 0.0 ? -1.0 : 1.0) into ABS(X).
>>
>> The floating points ones only happen when not honoring SNANS and not
>> honoring signed zeros.
>
>
> Some random comments (not a review):
>
> * if X is NaN, we may get a qNaN with the wrong sign bit. We probably don't
> care much though...
Ok, I changed it to when not honoring NANs.
>
> * I am surprised (X<0.?-1.:1.) and copysign(1., X) remain different for the
> whole optimization pipeline with -ffast-math. X*copysign(1., X) is another
> candidate to become fabs(X).
This might be a better idea because of ...
>
> * Whenever you get -ABS(X) for integers, what about the case where X is
> INT_MIN?
This. Yes this is an issue; I guess I need to rethink the integer patterns.
>
> * I guess we can't get there with an unsigned type because X>0 would have
> become X!=0 .
No, unsigned is not an issue.
>
> * I wonder if we could use something like
>
> (for cmp (gt ge lt le)
> outp (convert convert negate negate)
> outn (negate negate convert convert)
> [...]
> (outp (abs @0))
>
> to reduce duplication or if that would be less readable.
I did thought of that but I added the lt/le parts latter on.
>
> * Some of the cases are handled by PRE turning
>
> # iftmp.0_1 = PHI <1.0e+0(5), -1.0e+0(3)>
> _3 = iftmp.0_1 * a_2(D);
>
> into
>
> _5 = -a_2(D);
> [...]
> # iftmp.0_1 = PHI <1.0e+0(2), -1.0e+0(3)>
> # prephitmp_6 = PHI <a_2(D)(2), _5(3)>
>
> which phiopt3 can handle (quite late).
>
> * With cond, this currently (?) only affects generic, so I am not sure it
> will hit very often... But it will be there if someone later writes a
> match.pd->phiopt generator ;-)
I have a start of this patch but I have not finished it yet. Both
phiopt and ifcombine should be moved over to gimple match and
simplify.
I will submit a new patch which implements some of the above by the
end of the day; I might split up the patch into two (one for the
integer case and one for the floating point case).
Thanks,
Andrew
>
> --
> Marc Glisse
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations
2017-06-24 18:51 ` Andrew Pinski
@ 2017-06-24 19:47 ` Marc Glisse
2017-06-24 20:53 ` Andrew Pinski
0 siblings, 1 reply; 7+ messages in thread
From: Marc Glisse @ 2017-06-24 19:47 UTC (permalink / raw)
To: Andrew Pinski; +Cc: GCC Patches
On Sat, 24 Jun 2017, Andrew Pinski wrote:
>> * if X is NaN, we may get a qNaN with the wrong sign bit. We probably don't
>> care much though...
>
> Ok, I changed it to when not honoring NANs.
Note that I have no idea what guarantees we give in gcc. It is quite
possible that your patch is fine without this change, I only wanted to
raise the question in case someone knows.
--
Marc Glisse
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations
2017-06-24 19:47 ` Marc Glisse
@ 2017-06-24 20:53 ` Andrew Pinski
0 siblings, 0 replies; 7+ messages in thread
From: Andrew Pinski @ 2017-06-24 20:53 UTC (permalink / raw)
To: GCC Patches
On Sat, Jun 24, 2017 at 12:47 PM, Marc Glisse <marc.glisse@inria.fr> wrote:
> On Sat, 24 Jun 2017, Andrew Pinski wrote:
>
>>> * if X is NaN, we may get a qNaN with the wrong sign bit. We probably
>>> don't
>>> care much though...
>>
>>
>> Ok, I changed it to when not honoring NANs.
>
>
> Note that I have no idea what guarantees we give in gcc. It is quite
> possible that your patch is fine without this change, I only wanted to raise
> the question in case someone knows.
So looking through, we do guarantee the sign of the NaNs except when
not honoring NaNs in the first place. So the conversion from
a>0?1.0:-1.0 to copysign will be conditional on honoring NaNs. But
the x*copysign(1.0,x) will only conditional on not honoring sNaNs.
>
> --
> Marc Glisse
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations
2017-06-24 6:50 ` Marc Glisse
2017-06-24 18:51 ` Andrew Pinski
@ 2017-06-26 15:02 ` Joseph Myers
1 sibling, 0 replies; 7+ messages in thread
From: Joseph Myers @ 2017-06-26 15:02 UTC (permalink / raw)
To: GCC Patches; +Cc: Andrew Pinski
On Sat, 24 Jun 2017, Marc Glisse wrote:
> * if X is NaN, we may get a qNaN with the wrong sign bit. We probably don't
> care much though...
The sign bit from a multiplication involving a NaN is not specified.
*But* making any of these transformations with a qNaN loses the "invalid"
exception from an ordered comparison involving a qNaN, so isn't valid in
the case of (qNaNs respected and trapping-math).
--
Joseph S. Myers
joseph@codesourcery.com
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2017-06-26 15:02 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-06-24 3:59 [PATCH] fold a * (a > 0 ? 1 : -1) to abs(a) and related optimizations Andrew Pinski
2017-06-24 3:59 ` Andrew Pinski
2017-06-24 6:50 ` Marc Glisse
2017-06-24 18:51 ` Andrew Pinski
2017-06-24 19:47 ` Marc Glisse
2017-06-24 20:53 ` Andrew Pinski
2017-06-26 15:02 ` Joseph Myers
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).