public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: Richard Biener <rguenther@suse.de>
To: "Andre Vieira (lists)" <andre.simoesdiasvieira@arm.com>
Cc: gcc-patches@gcc.gnu.org
Subject: Re: [PATCH] tree-optimization/110221 - SLP and loop mask/len
Date: Fri, 1 Mar 2024 10:48:27 +0100 (CET)	[thread overview]
Message-ID: <86190n96-6srr-1n99-p71q-59431ror308r@fhfr.qr> (raw)
In-Reply-To: <6846f165-1cfb-415c-9a47-e620c784dc96@arm.com>

On Fri, 1 Mar 2024, Andre Vieira (lists) wrote:

> Hi,
> 
> Bootstrapped and tested the gcc-13 backport of this on gcc-12 for
> aarch64-unknown-linux-gnu and x86_64-pc-linux-gnu and no regressions.
> 
> OK to push to gcc-12 branch?

OK.

Thanks,
Richard.

> Kind regards,
> Andre Vieira
> 
> On 10/11/2023 13:16, Richard Biener wrote:
> > The following fixes the issue that when SLP stmts are internal defs
> > but appear invariant because they end up only using invariant defs
> > then they get scheduled outside of the loop.  This nice optimization
> > breaks down when loop masks or lens are applied since those are not
> > explicitly tracked as dependences.  The following makes sure to never
> > schedule internal defs outside of the vectorized loop when the
> > loop uses masks/lens.
> > 
> > Bootstrapped and tested on x86_64-unknown-linux-gnu, pushed.
> > 
> >  PR tree-optimization/110221
> >  * tree-vect-slp.cc (vect_schedule_slp_node): When loop
> >  masking / len is applied make sure to not schedule
> >  intenal defs outside of the loop.
> > 
> > 	* gfortran.dg/pr110221.f: New testcase.
> > ---
> >   gcc/testsuite/gfortran.dg/pr110221.f | 17 +++++++++++++++++
> >   gcc/tree-vect-slp.cc                 | 10 ++++++++++
> >   2 files changed, 27 insertions(+)
> >   create mode 100644 gcc/testsuite/gfortran.dg/pr110221.f
> > 
> > diff --git a/gcc/testsuite/gfortran.dg/pr110221.f
> > b/gcc/testsuite/gfortran.dg/pr110221.f
> > new file mode 100644
> > index 00000000000..8b57384313a
> > --- /dev/null
> > +++ b/gcc/testsuite/gfortran.dg/pr110221.f
> > @@ -0,0 +1,17 @@
> > +C PR middle-end/68146
> > +C { dg-do compile }
> > +C { dg-options "-O2 -w" }
> > +C { dg-additional-options "-mavx512f --param vect-partial-vector-usage=2" {
> > target avx512f } }
> > +      SUBROUTINE CJYVB(V,Z,V0,CBJ,CDJ,CBY,CYY)
> > +      IMPLICIT DOUBLE PRECISION (A,B,G,O-Y)
> > +      IMPLICIT COMPLEX*16 (C,Z)
> > +      DIMENSION CBJ(0:*),CDJ(0:*),CBY(0:*)
> > +      N=INT(V)
> > +      CALL GAMMA2(VG,GA)
> > +      DO 65 K=1,N
> > +        CBY(K)=CYY
> > +65    CONTINUE
> > +      CDJ(0)=V0/Z*CBJ(0)-CBJ(1)
> > +      DO 70 K=1,N
> > +70      CDJ(K)=-(K+V0)/Z*CBJ(K)+CBJ(K-1)
> > +      END
> > diff --git a/gcc/tree-vect-slp.cc b/gcc/tree-vect-slp.cc
> > index 3e5814c3a31..80e279d8f50 100644
> > --- a/gcc/tree-vect-slp.cc
> > +++ b/gcc/tree-vect-slp.cc
> > @@ -9081,6 +9081,16 @@ vect_schedule_slp_node (vec_info *vinfo,
> >           /* Emit other stmts after the children vectorized defs which is
> >     earliest possible.  */
> >         gimple *last_stmt = NULL;
> > +      if (auto loop_vinfo = dyn_cast <loop_vec_info> (vinfo))
> > +	if (LOOP_VINFO_FULLY_MASKED_P (loop_vinfo)
> > +	    || LOOP_VINFO_FULLY_WITH_LENGTH_P (loop_vinfo))
> > +	  {
> > +	    /* But avoid scheduling internal defs outside of the loop when
> > +	       we might have only implicitly tracked loop mask/len defs.  */
> > +	    gimple_stmt_iterator si
> > +	      = gsi_after_labels (LOOP_VINFO_LOOP (loop_vinfo)->header);
> > +	    last_stmt = *si;
> > +	  }
> >         bool seen_vector_def = false;
> >         FOR_EACH_VEC_ELT (SLP_TREE_CHILDREN (node), i, child)
> >    if (SLP_TREE_DEF_TYPE (child) == vect_internal_def)
> 

-- 
Richard Biener <rguenther@suse.de>
SUSE Software Solutions Germany GmbH,
Frankenstrasse 146, 90461 Nuernberg, Germany;
GF: Ivo Totev, Andrew McDonald, Werner Knoblich; (HRB 36809, AG Nuernberg)

      reply	other threads:[~2024-03-01  9:48 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-11-10 13:16 Richard Biener
2024-03-01  9:24 ` Andre Vieira (lists)
2024-03-01  9:48   ` Richard Biener [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=86190n96-6srr-1n99-p71q-59431ror308r@fhfr.qr \
    --to=rguenther@suse.de \
    --cc=andre.simoesdiasvieira@arm.com \
    --cc=gcc-patches@gcc.gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).