public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: Richard Biener <rguenther@suse.de>
To: Manolis Tsamis <manolis.tsamis@vrull.eu>
Cc: gcc-patches@gcc.gnu.org,
	Jiangning Liu <jiangning.liu@amperecomputing.com>,
	 Philipp Tomsich <philipp.tomsich@vrull.eu>,
	 Andrew Pinski <quic_apinski@quicinc.com>
Subject: Re: [PATCH] MATCH: Maybe expand (T)(A + C1) * C2 and (T)(A + C1) * C2 + C3 [PR109393]
Date: Thu, 2 May 2024 15:27:02 +0200 (CEST)	[thread overview]
Message-ID: <59511q86-4p5o-sn6r-43o0-5qpn4154n123@fhfr.qr> (raw)
In-Reply-To: <CAM3yNXoR6WtkiusbD2duU61BuQ=iqCXLL-NhVimEKpopkbdE7g@mail.gmail.com>

[-- Attachment #1: Type: text/plain, Size: 5361 bytes --]

On Thu, 2 May 2024, Manolis Tsamis wrote:

> On Thu, May 2, 2024 at 4:00 PM Richard Biener <rguenther@suse.de> wrote:
> >
> > On Tue, 23 Apr 2024, Manolis Tsamis wrote:
> >
> > > The original motivation for this pattern was that the following function does
> > > not fold to 'return 1':
> > >
> > > int foo(int *a, int j)
> > > {
> > >   int k = j - 1;
> > >   return a[j - 1] == a[k];
> > > }
> > >
> > > The expression ((unsigned long) (X +- C1) * C2) appears frequently as part of
> > > address calculations (e.g. arrays). These patterns help fold and simplify more
> > > expressions.
> > >
> > >       PR tree-optimization/109393
> > >
> > > gcc/ChangeLog:
> > >
> > >       * match.pd: Add new patterns for ((T)(A +- CST1)) * CST2 and
> > >         ((T)(A +- CST1)) * CST2 + CST3.
> > >
> > > gcc/testsuite/ChangeLog:
> > >
> > >       * gcc.dg/pr109393.c: New test.
> > >
> > > Signed-off-by: Manolis Tsamis <manolis.tsamis@vrull.eu>
> > > ---
> > >
> > >  gcc/match.pd                    | 30 ++++++++++++++++++++++++++++++
> > >  gcc/testsuite/gcc.dg/pr109393.c | 16 ++++++++++++++++
> > >  2 files changed, 46 insertions(+)
> > >  create mode 100644 gcc/testsuite/gcc.dg/pr109393.c
> > >
> > > diff --git a/gcc/match.pd b/gcc/match.pd
> > > index d401e7503e6..13c828ba70d 100644
> > > --- a/gcc/match.pd
> > > +++ b/gcc/match.pd
> > > @@ -3650,6 +3650,36 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT)
> > >         (plus (convert @0) (op @2 (convert @1))))))
> > >  #endif
> > >
> > > +/* ((T)(A + CST1)) * CST2 + CST3
> > > +     -> ((T)(A) * CST2) + ((T)CST1 * CST2 + CST3)
> > > +   Where (A + CST1) doesn't need to have a single use.  */
> > > +#if GIMPLE
> > > +  (for op (plus minus)
> > > +   (simplify
> > > +    (plus (mult (convert:s (op @0 INTEGER_CST@1)) INTEGER_CST@2) INTEGER_CST@3)
> > > +     (if (TREE_CODE (TREE_TYPE (@0)) == INTEGER_TYPE
> > > +       && TREE_CODE (type) == INTEGER_TYPE
> > > +       && TYPE_PRECISION (type) > TYPE_PRECISION (TREE_TYPE (@0))
> > > +       && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@0))
> > > +       && !TYPE_OVERFLOW_SANITIZED (TREE_TYPE (@0))
> > > +       && TYPE_OVERFLOW_WRAPS (type))
> > > +       (op (mult @2 (convert @0)) (plus (mult @2 (convert @1)) @3)))))
> > > +#endif
> > > +
> > > +/* ((T)(A + CST1)) * CST2 -> ((T)(A) * CST2) + ((T)CST1 * CST2)  */
> > > +#if GIMPLE
> > > +  (for op (plus minus)
> > > +   (simplify
> > > +    (mult (convert:s (op:s @0 INTEGER_CST@1)) INTEGER_CST@2)
> > > +     (if (TREE_CODE (TREE_TYPE (@0)) == INTEGER_TYPE
> >
> > Please use INTEGRAL_TYPE_P
> >
> > > +       && TREE_CODE (type) == INTEGER_TYPE
> > > +       && TYPE_PRECISION (type) > TYPE_PRECISION (TREE_TYPE (@0))
> > > +       && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@0))
> > > +       && !TYPE_OVERFLOW_SANITIZED (TREE_TYPE (@0))
> > > +       && TYPE_OVERFLOW_WRAPS (type))
> > > +       (op (mult @2 (convert @0)) (mult @2 (convert @1))))))
> >
> > (mult @2 (convert @0)) is non-canonical for no good reason if @0
> > isn't constant - constant should be 2nd, please swap operands here.
> >
> > > +#endif
> >
> > The first pattern is an extension of the second, why's the first
> > necessary at all?  The add of CST3 is unchanged (OK, you seem to
> > associate here, but that's again a different thing).
> >
> > I'd say the 2nd pattern is OK with the above changes but the first
> > looks redundant.
> >
> Hi Richard,
> 
> Thanks for the comments, I'll fix these.
> 
> The difference is that the second uses op:s while the first uses just op.
> In the second case if A + CST1 has other uses expanding the pattern
> may not be a good idea but in the first case it always is because we
> know + CST1 * CST2 will merge with + CST3.

I see.  But that pattern misses a :s on the multiplication result then, 
no?  Local pattern-matching isn't the best vehicle to handle multi-use
cases, introducing context dependent canonicalizations can lead to
SCEV analysis no longer matching up for related accesses and then
data dependence analysis failing.  It's been a trade-off here.

Richard.

> 
> Thanks,
> Manolis
> 
> > Thanks,
> > Richard.
> >
> > > +
> > >  /* (T)(A) +- (T)(B) -> (T)(A +- B) only when (A +- B) could be simplified
> > >     to a simple value.  */
> > >    (for op (plus minus)
> > > diff --git a/gcc/testsuite/gcc.dg/pr109393.c b/gcc/testsuite/gcc.dg/pr109393.c
> > > new file mode 100644
> > > index 00000000000..e9051273672
> > > --- /dev/null
> > > +++ b/gcc/testsuite/gcc.dg/pr109393.c
> > > @@ -0,0 +1,16 @@
> > > +/* PR tree-optimization/109393 */
> > > +/* { dg-do compile } */
> > > +/* { dg-options "-O2 -fdump-tree-optimized" } */
> > > +/* { dg-final { scan-tree-dump-times "return 1;" 2 "optimized" } } */
> > > +
> > > +int foo(int *a, int j)
> > > +{
> > > +  int k = j - 1;
> > > +  return a[j - 1] == a[k];
> > > +}
> > > +
> > > +int bar(int *a, int j)
> > > +{
> > > +  int k = j - 1;
> > > +  return (&a[j + 1] - 2) == &a[k];
> > > +}
> > >
> >
> > --
> > Richard Biener <rguenther@suse.de>
> > SUSE Software Solutions Germany GmbH,
> > Frankenstrasse 146, 90461 Nuernberg, Germany;
> > GF: Ivo Totev, Andrew McDonald, Werner Knoblich; (HRB 36809, AG Nuernberg)
> 

-- 
Richard Biener <rguenther@suse.de>
SUSE Software Solutions Germany GmbH,
Frankenstrasse 146, 90461 Nuernberg, Germany;
GF: Ivo Totev, Andrew McDonald, Werner Knoblich; (HRB 36809, AG Nuernberg)

  reply	other threads:[~2024-05-02 13:27 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-04-23 10:33 Manolis Tsamis
2024-05-02 13:00 ` Richard Biener
2024-05-02 13:08   ` Manolis Tsamis
2024-05-02 13:27     ` Richard Biener [this message]
2024-05-14  8:57 ` Manolis Tsamis
2024-05-16  8:15   ` Richard Biener
2024-05-17  8:23     ` Manolis Tsamis
2024-05-17  9:22       ` Richard Biener
2024-05-17 11:35         ` Manolis Tsamis
2024-05-17 11:41           ` Richard Biener

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=59511q86-4p5o-sn6r-43o0-5qpn4154n123@fhfr.qr \
    --to=rguenther@suse.de \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=jiangning.liu@amperecomputing.com \
    --cc=manolis.tsamis@vrull.eu \
    --cc=philipp.tomsich@vrull.eu \
    --cc=quic_apinski@quicinc.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).