From: Andrew Pinski <pinskia@gmail.com>
To: Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
Cc: Andrew Pinski <apinski@marvell.com>, gcc-patches@gcc.gnu.org
Subject: Re: [COMMITTEDv3] tree-optimization: [PR100864] `(a&!b) | b` is not optimized to `a | b` for comparisons
Date: Wed, 2 Aug 2023 10:14:48 -0700
Message-ID: <CA+=Sn1=LxzSKbV-mGH3KeNJU+2qiNLazNJbLiNJPB20=wo_jeg@mail.gmail.com>
In-Reply-To: <CAAgBjMnW8d+AyfOxLED6DpkUHkiFO-4g_HQALrXsLP5c6Q0VQQ@mail.gmail.com>
On Wed, Aug 2, 2023 at 10:13 AM Prathamesh Kulkarni via Gcc-patches
<gcc-patches@gcc.gnu.org> wrote:
>
> On Mon, 31 Jul 2023 at 22:39, Andrew Pinski via Gcc-patches
> <gcc-patches@gcc.gnu.org> wrote:
> >
> > This is a new version of the patch.
> > Instead of matching the inverted comparison directly inside
> > the match pattern, it adds a new function (bitwise_inverted_equal_p) to do it.
> > It is very similar to bitwise_equal_p, which was added in r14-2751-g2a3556376c69a1fb,
> > but instead it checks `expr1 == ~expr2`. A follow-on patch will
> > use this function in other patterns where we try to match `@0` and `(bit_not @0)`.
> >
> > Changed the name bitwise_not_equal_p to bitwise_inverted_equal_p.
> >
> > Committed as approved after bootstrapping and testing on x86_64-linux-gnu with no regressions.
> Hi Andrew,
> Unfortunately, this patch (committed as
> 2bae476b511dc441bf61da8a49cca655575e7dd6) causes a
> segmentation fault for pr33133.c on aarch64-linux-gnu because of
> infinite recursion.
A similar issue is recorded as PR 110874, which I am debugging right now.
Thanks,
Andrew
>
> Running the test under gdb shows:
> Program received signal SIGSEGV, Segmentation fault.
> operand_compare::operand_equal_p (this=0x29dc680
> <default_compare_instance>, arg0=0xfffff7789a68, arg1=0xfffff7789f30,
> flags=16) at ../../gcc/gcc/fold-const.cc:3088
> 3088 {
> (gdb) bt
> #0 operand_compare::operand_equal_p (this=0x29dc680
> <default_compare_instance>, arg0=0xfffff7789a68, arg1=0xfffff7789f30,
> flags=16) at ../../gcc/gcc/fold-const.cc:3088
> #1 0x0000000000a90394 in operand_compare::verify_hash_value
> (this=this@entry=0x29dc680 <default_compare_instance>,
> arg0=arg0@entry=0xfffff7789a68, arg1=arg1@entry=0xfffff7789f30,
> flags=flags@entry=0, ret=ret@entry=0xfffffc000157)
> at ../../gcc/gcc/fold-const.cc:4074
> #2 0x0000000000a9351c in operand_compare::verify_hash_value
> (ret=0xfffffc000157, flags=0, arg1=0xfffff7789f30,
> arg0=0xfffff7789a68, this=0x29dc680 <default_compare_instance>) at
> ../../gcc/gcc/fold-const.cc:4072
> #3 operand_compare::operand_equal_p (this=this@entry=0x29dc680
> <default_compare_instance>, arg0=arg0@entry=0xfffff7789a68,
> arg1=arg1@entry=0xfffff7789f30, flags=flags@entry=0) at
> ../../gcc/gcc/fold-const.cc:3090
> #4 0x0000000000a9791c in operand_equal_p
> (arg0=arg0@entry=0xfffff7789a68, arg1=arg1@entry=0xfffff7789f30,
> flags=flags@entry=0) at ../../gcc/gcc/fold-const.cc:4105
> #5 0x0000000001d38dd0 in gimple_bitwise_inverted_equal_p
> (expr1=0xfffff7789a68, expr2=0xfffff7789f30, valueize=
> 0x112d698 <rpo_vn_valueize(tree_node*)>) at
> ../../gcc/gcc/gimple-match-head.cc:284
> #6 0x0000000001d38e80 in gimple_bitwise_inverted_equal_p
> (expr1=0xfffff7789a68, expr2=0xfffff77d0240,
> valueize=0x112d698 <rpo_vn_valueize(tree_node*)>) at
> ../../gcc/gcc/gimple-match-head.cc:296
> #7 0x0000000001d38e80 in gimple_bitwise_inverted_equal_p
> (expr1=0xfffff7789a68, expr2=0xfffff7789f30,
> valueize=0x112d698 <rpo_vn_valueize(tree_node*)>) at
> ../../gcc/gcc/gimple-match-head.cc:296
> #8 0x0000000001d38e80 in gimple_bitwise_inverted_equal_p
> (expr1=0xfffff7789a68, expr2=0xfffff77d0240,
> ...
>
> It seems to recurse cyclically, with expr2 alternating between
> 0xfffff7789f30 and 0xfffff77d0240, eventually leading to the segfault,
> while expr1=0xfffff7789a68 remains the same throughout the stack frames.
>
> Thanks,
> Prathamesh
> >
> > PR tree-optimization/100864
> >
> > gcc/ChangeLog:
> >
> > * generic-match-head.cc (bitwise_inverted_equal_p): New function.
> > * gimple-match-head.cc (bitwise_inverted_equal_p): New macro.
> > (gimple_bitwise_inverted_equal_p): New function.
> > * match.pd ((~x | y) & x): Use bitwise_inverted_equal_p
> > instead of direct matching bit_not.
> >
> > gcc/testsuite/ChangeLog:
> >
> > * gcc.dg/tree-ssa/bitops-3.c: New test.
> > ---
> > gcc/generic-match-head.cc | 42 ++++++++++++++
> > gcc/gimple-match-head.cc | 71 ++++++++++++++++++++++++
> > gcc/match.pd | 5 +-
> > gcc/testsuite/gcc.dg/tree-ssa/bitops-3.c | 67 ++++++++++++++++++++++
> > 4 files changed, 183 insertions(+), 2 deletions(-)
> > create mode 100644 gcc/testsuite/gcc.dg/tree-ssa/bitops-3.c
> >
> > diff --git a/gcc/generic-match-head.cc b/gcc/generic-match-head.cc
> > index a71c0727b0b..ddaf22f2179 100644
> > --- a/gcc/generic-match-head.cc
> > +++ b/gcc/generic-match-head.cc
> > @@ -121,3 +121,45 @@ bitwise_equal_p (tree expr1, tree expr2)
> > return wi::to_wide (expr1) == wi::to_wide (expr2);
> > return operand_equal_p (expr1, expr2, 0);
> > }
> > +
> > +/* Return true if EXPR1 and EXPR2 have the bitwise opposite value,
> > + but not necessarily same type.
> > + The types can differ through nop conversions. */
> > +
> > +static inline bool
> > +bitwise_inverted_equal_p (tree expr1, tree expr2)
> > +{
> > +  STRIP_NOPS (expr1);
> > +  STRIP_NOPS (expr2);
> > +  if (expr1 == expr2)
> > +    return false;
> > +  if (!tree_nop_conversion_p (TREE_TYPE (expr1), TREE_TYPE (expr2)))
> > +    return false;
> > +  if (TREE_CODE (expr1) == INTEGER_CST && TREE_CODE (expr2) == INTEGER_CST)
> > +    return wi::to_wide (expr1) == ~wi::to_wide (expr2);
> > +  if (operand_equal_p (expr1, expr2, 0))
> > +    return false;
> > +  if (TREE_CODE (expr1) == BIT_NOT_EXPR
> > +      && bitwise_equal_p (TREE_OPERAND (expr1, 0), expr2))
> > +    return true;
> > +  if (TREE_CODE (expr2) == BIT_NOT_EXPR
> > +      && bitwise_equal_p (expr1, TREE_OPERAND (expr2, 0)))
> > +    return true;
> > +  if (COMPARISON_CLASS_P (expr1)
> > +      && COMPARISON_CLASS_P (expr2))
> > +    {
> > +      tree op10 = TREE_OPERAND (expr1, 0);
> > +      tree op20 = TREE_OPERAND (expr2, 0);
> > +      if (!operand_equal_p (op10, op20))
> > +        return false;
> > +      tree op11 = TREE_OPERAND (expr1, 1);
> > +      tree op21 = TREE_OPERAND (expr2, 1);
> > +      if (!operand_equal_p (op11, op21))
> > +        return false;
> > +      if (invert_tree_comparison (TREE_CODE (expr1),
> > +                                  HONOR_NANS (op10))
> > +          == TREE_CODE (expr2))
> > +        return true;
> > +    }
> > +  return false;
> > +}
> > diff --git a/gcc/gimple-match-head.cc b/gcc/gimple-match-head.cc
> > index 5d6d26d009b..0265e55be93 100644
> > --- a/gcc/gimple-match-head.cc
> > +++ b/gcc/gimple-match-head.cc
> > @@ -263,3 +263,74 @@ gimple_bitwise_equal_p (tree expr1, tree expr2, tree (*valueize) (tree))
> > return true;
> > return false;
> > }
> > +
> > +/* Return true if EXPR1 and EXPR2 have the bitwise opposite value,
> > + but not necessarily same type.
> > + The types can differ through nop conversions. */
> > +#define bitwise_inverted_equal_p(expr1, expr2) \
> > + gimple_bitwise_inverted_equal_p (expr1, expr2, valueize)
> > +
> > +/* Helper function for bitwise_inverted_equal_p macro. */
> > +
> > +static inline bool
> > +gimple_bitwise_inverted_equal_p (tree expr1, tree expr2, tree (*valueize) (tree))
> > +{
> > +  if (expr1 == expr2)
> > +    return false;
> > +  if (!tree_nop_conversion_p (TREE_TYPE (expr1), TREE_TYPE (expr2)))
> > +    return false;
> > +  if (TREE_CODE (expr1) == INTEGER_CST && TREE_CODE (expr2) == INTEGER_CST)
> > +    return wi::to_wide (expr1) == ~wi::to_wide (expr2);
> > +  if (operand_equal_p (expr1, expr2, 0))
> > +    return false;
> > +
> > +  tree other;
> > +  if (gimple_nop_convert (expr1, &other, valueize)
> > +      && gimple_bitwise_inverted_equal_p (other, expr2, valueize))
> > +    return true;
> > +
> > +  if (gimple_nop_convert (expr2, &other, valueize)
> > +      && gimple_bitwise_inverted_equal_p (expr1, other, valueize))
> > +    return true;
> > +
> > +  if (TREE_CODE (expr1) != SSA_NAME
> > +      || TREE_CODE (expr2) != SSA_NAME)
> > +    return false;
> > +
> > +  gimple *d1 = get_def (valueize, expr1);
> > +  gassign *a1 = safe_dyn_cast <gassign *> (d1);
> > +  gimple *d2 = get_def (valueize, expr2);
> > +  gassign *a2 = safe_dyn_cast <gassign *> (d2);
> > +  if (a1
> > +      && gimple_assign_rhs_code (a1) == BIT_NOT_EXPR
> > +      && gimple_bitwise_equal_p (do_valueize (valueize,
> > +                                              gimple_assign_rhs1 (a1)),
> > +                                 expr2, valueize))
> > +    return true;
> > +  if (a2
> > +      && gimple_assign_rhs_code (a2) == BIT_NOT_EXPR
> > +      && gimple_bitwise_equal_p (expr1,
> > +                                 do_valueize (valueize,
> > +                                              gimple_assign_rhs1 (a2)),
> > +                                 valueize))
> > +    return true;
> > +
> > +  if (a1 && a2
> > +      && TREE_CODE_CLASS (gimple_assign_rhs_code (a1)) == tcc_comparison
> > +      && TREE_CODE_CLASS (gimple_assign_rhs_code (a2)) == tcc_comparison)
> > +    {
> > +      tree op10 = gimple_assign_rhs1 (a1);
> > +      tree op20 = gimple_assign_rhs1 (a2);
> > +      if (!operand_equal_p (op10, op20))
> > +        return false;
> > +      tree op11 = gimple_assign_rhs2 (a1);
> > +      tree op21 = gimple_assign_rhs2 (a2);
> > +      if (!operand_equal_p (op11, op21))
> > +        return false;
> > +      if (invert_tree_comparison (gimple_assign_rhs_code (a1),
> > +                                  HONOR_NANS (op10))
> > +          == gimple_assign_rhs_code (a2))
> > +        return true;
> > +    }
> > +  return false;
> > +}
> > diff --git a/gcc/match.pd b/gcc/match.pd
> > index ee6cef6b09d..5fc6f517ab9 100644
> > --- a/gcc/match.pd
> > +++ b/gcc/match.pd
> > @@ -1943,8 +1943,9 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT)
> > /* (~x | y) & x -> x & y */
> > /* (~x & y) | x -> x | y */
> > (simplify
> > - (bitop:c (rbitop:c (bit_not @0) @1) @0)
> > - (bitop @0 @1)))
> > + (bitop:c (rbitop:c @2 @1) @0)
> > + (if (bitwise_inverted_equal_p (@0, @2))
> > + (bitop @0 @1))))
> >
> > /* ((x | y) & z) | x -> (z & y) | x */
> > (simplify
> > diff --git a/gcc/testsuite/gcc.dg/tree-ssa/bitops-3.c b/gcc/testsuite/gcc.dg/tree-ssa/bitops-3.c
> > new file mode 100644
> > index 00000000000..bf11a129b69
> > --- /dev/null
> > +++ b/gcc/testsuite/gcc.dg/tree-ssa/bitops-3.c
> > @@ -0,0 +1,67 @@
> > +/* PR tree-optimization/100864 */
> > +
> > +/* { dg-do run } */
> > +/* { dg-options "-O1 -fdump-tree-optimized-raw" } */
> > +
> > +#define op_ne !=
> > +#define op_eq ==
> > +#define op_lt <
> > +#define op_le <=
> > +#define op_gt >
> > +#define op_ge >=
> > +
> > +#define operators(t) \
> > +t(ne) \
> > +t(eq) \
> > +t(lt) \
> > +t(le) \
> > +t(gt) \
> > +t(ge)
> > +
> > +#define cmpfunc(v, op) \
> > +__attribute__((noipa)) \
> > +_Bool func_##op##_##v(v int a, v int b, v _Bool e) \
> > +{ \
> > +  v _Bool c = (a op_##op b); \
> > +  v _Bool d = !c; \
> > +  return (e & d) | c; \
> > +}
> > +
> > +#define cmp_funcs(op) \
> > +cmpfunc(, op) \
> > +cmpfunc(volatile , op)
> > +
> > +operators(cmp_funcs)
> > +
> > +#define test(op) \
> > +if (func_##op##_ (a, b, e) != func_##op##_volatile (a, b, e)) \
> > +  __builtin_abort();
> > +
> > +int main()
> > +{
> > +  for(int a = -3; a <= 3; a++)
> > +    for(int b = -3; b <= 3; b++)
> > +      {
> > +        _Bool e = 0;
> > +        operators(test)
> > +        e = 1;
> > +        operators(test)
> > +      }
> > +  return 0;
> > +}
> > +
> > +/* Check to make sure we optimize `(a&!b) | b` -> `a | b`. */
> > +/* There are 6 different comparison operators testing here. */
> > +/* bit_not_expr and bit_and_expr should show up for each one (volatile). */
> > +/* Each operator should show up twice
> > + (except for `!=` which shows up 2*6 (each tester) + 2 (the 2 loops) extra = 16). */
> > +/* bit_ior_expr will show up for each operator twice (non-volatile and volatile). */
> > +/* { dg-final { scan-tree-dump-times "ne_expr," 16 "optimized"} } */
> > +/* { dg-final { scan-tree-dump-times "eq_expr," 2 "optimized"} } */
> > +/* { dg-final { scan-tree-dump-times "lt_expr," 2 "optimized"} } */
> > +/* { dg-final { scan-tree-dump-times "le_expr," 2 "optimized"} } */
> > +/* { dg-final { scan-tree-dump-times "gt_expr," 2 "optimized"} } */
> > +/* { dg-final { scan-tree-dump-times "ge_expr," 2 "optimized"} } */
> > +/* { dg-final { scan-tree-dump-times "bit_not_expr," 6 "optimized"} } */
> > +/* { dg-final { scan-tree-dump-times "bit_and_expr," 6 "optimized"} } */
> > +/* { dg-final { scan-tree-dump-times "bit_ior_expr," 12 "optimized"} } */
> > --
> > 2.31.1
> >