public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: Jiufu Guo <guojiufu@linux.ibm.com>
To: Richard Biener <richard.guenther@gmail.com>
Cc: Richard Biener <rguenther@suse.de>,
	GCC Patches <gcc-patches@gcc.gnu.org>,
	Segher Boessenkool <segher@kernel.crashing.org>,
	Bill Schmidt <wschmidt@linux.ibm.com>,
	David Edelsohn <dje.gcc@gmail.com>
Subject: Re: [PATCH V2] Clean up loop-closed PHIs after loop finalize
Date: Tue, 17 Nov 2020 13:58:53 +0800	[thread overview]
Message-ID: <42c93146f7c9ac873edf3d31293b17ed@linux.vnet.ibm.com> (raw)
In-Reply-To: <CAFiYyc0xhXRUyD+63z6sfrQBfviCNUFrMqQCFyByLp2LFa78mw@mail.gmail.com>

On 2020-11-16 17:35, Richard Biener wrote:
> On Mon, Nov 16, 2020 at 10:26 AM Jiufu Guo <guojiufu@linux.ibm.com> 
> wrote:
>> 
>> Jiufu Guo <guojiufu@linux.ibm.com> writes:
>> 
>> > Richard Biener <rguenther@suse.de> writes:
>> >
>> >> On Wed, 11 Nov 2020, Jiufu Guo wrote:
>> >>
>> >>>
>> >>> Thanks a lot for the suggestion from previous mails.
>> >>> The patch was updated accordingly.
>> >>>
>> >>> This updated patch propagates loop-closed PHIs them out after
>> >>> loop_optimizer_finalize under a new introduced flag.  At some cases,
>> >>> to clean up loop-closed PHIs would save efforts of optimization passes
>> >>> after loopdone.
>> >>>
>> >>> This patch passes bootstrap and regtest on ppc64le.  Is this ok for trunk?
>> >>
>> >> Comments below
>> >>
>> >>>    free_numbers_of_iterations_estimates (fn);
>> >>>
>> >>> +  if (flag_clean_up_loop_closed_phi
>> >>
>> >> Sorry if there was miscommunication but I've not meant to add a
>> >> new user-visible flag but instead a flag argument to loop_optimizer_finalize
>> >> (as said, you can default it to false to only need to change the
>> >> one in fini_loops)
>> > Sorry for misunderstand.
>> > Updated the patch.
>> 
>> Thanks for the comments!  The patch was updated accordingly.
>> 
>> This updated patch clean up loop closed PHIs in 
>> loop_optimizer_finalize,
>> which is called when some loop optimizations are done. For some cases,
>> this would save efforts of optimization passes after loopdone.
>> 
>> gcc/ChangeLog:
>> 2020-10-16  Jiufu Guo   <guojiufu@linux.ibm.com>
>> 
>>         * cfgloop.h (loop_optimizer_finalize): Add flag argument.
>>         * loop-init.c (loop_optimizer_finalize): Call 
>> clean_up_loop_closed_phi.
>>         * tree-cfgcleanup.h (clean_up_loop_closed_phi): New declare.
>>         * tree-ssa-loop.c (tree_ssa_loop_done): Call 
>> loop_optimizer_finalize
>>         with flag argument.
>>         * tree-ssa-propagate.c (clean_up_loop_closed_phi): New 
>> function.
>> 
>> gcc/testsuite/ChangeLog:
>> 2020-10-16  Jiufu Guo   <guojiufu@linux.ibm.com>
>> 
>>         * gcc.dg/tree-ssa/loopclosedphi.c: New test.
>> 
>> ---
>>  gcc/cfgloop.h                                 |  2 +-
>>  gcc/loop-init.c                               |  9 ++-
>>  gcc/testsuite/gcc.dg/tree-ssa/loopclosedphi.c | 21 +++++++
>>  gcc/tree-cfgcleanup.h                         |  1 +
>>  gcc/tree-ssa-loop.c                           |  2 +-
>>  gcc/tree-ssa-propagate.c                      | 63 
>> +++++++++++++++++++
>>  6 files changed, 95 insertions(+), 3 deletions(-)
>>  create mode 100644 gcc/testsuite/gcc.dg/tree-ssa/loopclosedphi.c
>> 
>> diff --git a/gcc/cfgloop.h b/gcc/cfgloop.h
>> index d14689dc31f..438b1f779bb 100644
>> --- a/gcc/cfgloop.h
>> +++ b/gcc/cfgloop.h
>> @@ -824,7 +824,7 @@ extern void init_set_costs (void);
>> 
>>  /* Loop optimizer initialization.  */
>>  extern void loop_optimizer_init (unsigned);
>> -extern void loop_optimizer_finalize (function *);
>> +extern void loop_optimizer_finalize (function *, bool = false);
>>  inline void
>>  loop_optimizer_finalize ()
>>  {
>> diff --git a/gcc/loop-init.c b/gcc/loop-init.c
>> index 401e5282907..0cb6f509b89 100644
>> --- a/gcc/loop-init.c
>> +++ b/gcc/loop-init.c
>> @@ -33,6 +33,7 @@ along with GCC; see the file COPYING3.  If not see
>>  #include "tree-ssa-loop-niter.h"
>>  #include "loop-unroll.h"
>>  #include "tree-scalar-evolution.h"
>> +#include "tree-cfgcleanup.h"
>> 
>> 
>>  /* Apply FLAGS to the loop state.  */
>> @@ -133,7 +134,7 @@ loop_optimizer_init (unsigned flags)
>>  /* Finalize loop structures.  */
>> 
>>  void
>> -loop_optimizer_finalize (struct function *fn)
>> +loop_optimizer_finalize (struct function *fn, bool 
>> clean_loop_closed_phi)
>>  {
>>    class loop *loop;
>>    basic_block bb;
>> @@ -145,6 +146,12 @@ loop_optimizer_finalize (struct function *fn)
>> 
>>    free_numbers_of_iterations_estimates (fn);
>> 
>> +  if (clean_loop_closed_phi && loops_state_satisfies_p (fn, 
>> LOOP_CLOSED_SSA))
>> +    {
>> +      clean_up_loop_closed_phi (fn);
>> +      loops_state_clear (fn, LOOP_CLOSED_SSA);
>> +    }
>> +
>>    /* If we should preserve loop structure, do not free it but clear
>>       flags that advanced properties are there as we are not 
>> preserving
>>       that in full.  */
>> diff --git a/gcc/testsuite/gcc.dg/tree-ssa/loopclosedphi.c 
>> b/gcc/testsuite/gcc.dg/tree-ssa/loopclosedphi.c
>> new file mode 100644
>> index 00000000000..d71b757fbca
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.dg/tree-ssa/loopclosedphi.c
>> @@ -0,0 +1,21 @@
>> +/* { dg-do compile } */
>> +/* { dg-options "-O3 -fno-tree-ch -w -fdump-tree-loopdone-details" } 
>> */
>> +
>> +void
>> +t6 (int qz, int wh)
>> +{
>> +  int jl = wh;
>> +
>> +  while (1.0 * qz / wh < 1)
>> +    {
>> +      qz = wh * (wh + 2);
>> +
>> +      while (wh < 1)
>> +        jl = 0;
>> +    }
>> +
>> +  while (qz < 1)
>> +    qz = jl * wh;
>> +}
>> +
>> +/* { dg-final { scan-tree-dump-times "Replacing" 2 "loopdone"} } */
>> diff --git a/gcc/tree-cfgcleanup.h b/gcc/tree-cfgcleanup.h
>> index 6ff6726bfe4..9e368d63709 100644
>> --- a/gcc/tree-cfgcleanup.h
>> +++ b/gcc/tree-cfgcleanup.h
>> @@ -26,5 +26,6 @@ extern bool cleanup_tree_cfg (unsigned = 0);
>>  extern bool fixup_noreturn_call (gimple *stmt);
>>  extern bool delete_unreachable_blocks_update_callgraph (cgraph_node 
>> *dst_node,
>>                                                         bool 
>> update_clones);
>> +extern unsigned clean_up_loop_closed_phi (function *);
>> 
>>  #endif /* GCC_TREE_CFGCLEANUP_H */
>> diff --git a/gcc/tree-ssa-loop.c b/gcc/tree-ssa-loop.c
>> index 5e8365d4e83..339a0c50bc8 100644
>> --- a/gcc/tree-ssa-loop.c
>> +++ b/gcc/tree-ssa-loop.c
>> @@ -529,7 +529,7 @@ tree_ssa_loop_done (void)
>>  {
>>    free_numbers_of_iterations_estimates (cfun);
>>    scev_finalize ();
>> -  loop_optimizer_finalize ();
>> +  loop_optimizer_finalize (cfun, true);
>>    return 0;
>>  }
>> 
>> diff --git a/gcc/tree-ssa-propagate.c b/gcc/tree-ssa-propagate.c
>> index 87dbf55fab9..daa1db82afc 100644
>> --- a/gcc/tree-ssa-propagate.c
>> +++ b/gcc/tree-ssa-propagate.c
>> @@ -1549,3 +1549,66 @@ propagate_tree_value_into_stmt 
>> (gimple_stmt_iterator *gsi, tree val)
>>    else
>>      gcc_unreachable ();
>>  }
>> +
>> +/* Check exits of each loop in FUN, walk over loop closed PHIs in
>> +   each exit basic block and propagate degenerate PHIs.  */
>> +
>> +unsigned
>> +clean_up_loop_closed_phi (function *fun)
>> +{
>> +  unsigned i;
>> +  edge e;
>> +  gphi *phi;
>> +  tree rhs;
>> +  tree lhs;
>> +  gphi_iterator gsi;
>> +  struct loop *loop;
>> +
>> +  /* Check dominator info before get loop-close PHIs from loop exits. 
>>  */
>> +  if (dom_info_state (CDI_DOMINATORS) != DOM_OK)
> 
> Please change this to
> 
>        /* Avoid possibly quadratic work when scanning for loop exits 
> across
>           all loops of a nest.  */
>        if (!loop_state_satisfies_p (LOOPS_HAVE_RECORDED_EXITS))
>          return 0;
> 

Great suggestion, thanks!

And, the patch for loop-init.c, is also updated a little as below: call
clean_up_loop_closed_phi before release_recorded_exits, to avoid flag
LOOPS_HAVE_RECORDED_EXITS is cleared before checked.

-----------------
diff --git a/gcc/loop-init.c b/gcc/loop-init.c
index 401e5282907..ac87dafef6e 100644
--- a/gcc/loop-init.c
+++ b/gcc/loop-init.c
@@ -33,6 +33,7 @@ along with GCC; see the file COPYING3.  If not see
  #include "tree-ssa-loop-niter.h"
  #include "loop-unroll.h"
  #include "tree-scalar-evolution.h"
+#include "tree-cfgcleanup.h"

  ^L
  /* Apply FLAGS to the loop state.  */
@@ -133,13 +134,19 @@ loop_optimizer_init (unsigned flags)
  /* Finalize loop structures.  */

  void
-loop_optimizer_finalize (struct function *fn)
+loop_optimizer_finalize (struct function *fn, bool 
clean_loop_closed_phi)
  {
    class loop *loop;
    basic_block bb;

    timevar_push (TV_LOOP_FINI);

+  if (clean_loop_closed_phi && loops_state_satisfies_p (fn, 
LOOP_CLOSED_SSA))
+    {
+      clean_up_loop_closed_phi (fn);
+      loops_state_clear (fn, LOOP_CLOSED_SSA);
+    }
+
    if (loops_state_satisfies_p (fn, LOOPS_HAVE_RECORDED_EXITS))
      release_recorded_exits (fn);
----------------
>> +    return 0;
>> +
>> +  /* Walk over loop in function.  */
>> +  FOR_EACH_LOOP_FN (fun, loop, 0)
>> +    {
>> +      /* Check each exit edege of loop.  */
>> +      auto_vec<edge> exits = get_loop_exit_edges (loop);
>> +      FOR_EACH_VEC_ELT (exits, i, e)
>> +       if (single_pred_p (e->dest))
>> +         /* Walk over loop-closed PHIs.  */
>> +         for (gsi = gsi_start_phis (e->dest); !gsi_end_p (gsi);)
>> +           {
>> +             phi = gsi.phi ();
>> +             rhs = degenerate_phi_result (phi);
> 
>   rhs = gimple_phi_arg_def (phi, 0);
Thanks, sorry for missing this, you mentioned in previous mail.

I will commit the updated patch.

Thanks!
Jiufu Guo.


> 
> OK with that changes.
> 
> Thanks,
> Richard.
> 
>> +             lhs = gimple_phi_result (phi);
>> +
>> +             if (rhs && may_propagate_copy (lhs, rhs))
>> +               {
>> +                 /* Dump details.  */
>> +                 if (dump_file && (dump_flags & TDF_DETAILS))
>> +                   {
>> +                     fprintf (dump_file, "  Replacing '");
>> +                     print_generic_expr (dump_file, lhs, dump_flags);
>> +                     fprintf (dump_file, "' with '");
>> +                     print_generic_expr (dump_file, rhs, dump_flags);
>> +                     fprintf (dump_file, "'\n");
>> +                   }
>> +
>> +                 use_operand_p use_p;
>> +                 imm_use_iterator iter;
>> +                 gimple *use_stmt;
>> +                 FOR_EACH_IMM_USE_STMT (use_stmt, iter, lhs)
>> +                   {
>> +                     FOR_EACH_IMM_USE_ON_STMT (use_p, iter)
>> +                       replace_exp (use_p, rhs);
>> +                     update_stmt (use_stmt);
>> +                   }
>> +                 remove_phi_node (&gsi, true);
>> +               }
>> +             else
>> +               gsi_next (&gsi);
>> +           }
>> +    }
>> +
>> +  return 0;
>> +}
>> --
>> 2.25.1
>> 
>> 
>> 
>> 
>> >>
>> >>> +      && loops_state_satisfies_p (fn, LOOP_CLOSED_SSA))
>> >>> +    {
>> >>> +      clean_up_loop_closed_phi (fn);
>> >>> +      loops_state_clear (fn, LOOP_CLOSED_SSA);
>> >>> +    }
>> >>> +
>> > ......
>> >>> +  gphi *phi;
>> >>> +  tree rhs;
>> >>> +  tree lhs;
>> >>> +  gphi_iterator gsi;
>> >>> +  struct loop *loop;
>> >>> +  bool cfg_altered = false;
>> >>> +
>> >>> +  /* Check dominator info before get loop-close PHIs from loop exits.  */
>> >>> +  if (dom_info_state (CDI_DOMINATORS) != DOM_OK)
>> >>
>> >> Why?
>> >>
>> > As you said, loop_optimizer_finalize is also called from where dominator
>> > info is not ready, e.g. called from: vrp(execute_vrp).
>> > At there, loop exits info is not ready, and then
>> > get_loop_exit_edges/get_loop_body_with_size function does not work.
>> >
>> >>> +  /* Walk over loop in function.  */
>> >>> +  FOR_EACH_LOOP_FN (fun, loop, 0)
>> >>> +    {
>> >>> +      /* Check each exit edege of loop.  */
>> >>> +      auto_vec<edge> exits = get_loop_exit_edges (loop);
>> >>> +      FOR_EACH_VEC_ELT (exits, i, e)
>> >>> +   if (single_pred_p (e->dest))
>> >>> +     /* Walk over loop-closed PHIs.  */
>> >>> +     for (gsi = gsi_start_phis (e->dest); !gsi_end_p (gsi);)
>> >>> +       {
>> >>> +         phi = gsi.phi ();
>> >>> +         rhs = degenerate_phi_result (phi);
>> >>
>> >> You have a single predecessor, thus the result is always degenerate.
>> >>
>> >>> +         lhs = gimple_phi_result (phi);
>> >>> +
>> >>> +         if (rhs && may_propagate_copy (lhs, rhs))
>> >>> +           {
>> >>> +             gimple_stmt_iterator psi = gsi;
>> >>> +             /* Advance the iterator before stmt is removed.  */
>> >>> +             gsi_next (&gsi);
>> >>
>> >> remove_phi_node should take care of this, you shouldn't need this
>> >> (just do not advance the iterator when you remove the PHI node).
>> >>
>> > Yeap, get it! Thanks, will update the patch accordingly.
>> >>> +
>> > ......
>> >>> +
>> >>> +             replace_uses_by (lhs, rhs);
>> >>> +             remove_phi_node (&psi, true);
>> >>> +             cfg_altered = true;
>> >>
>> >> in the end the return value is unused but I think we should avoid
>> >> altering the CFG since doing so requires it to be cleaned up for
>> >> unreachable blocks.  That means to open-code replace_uses_by as
>> >>
>> >>   imm_use_iterator imm_iter;
>> >>   use_operand_p use;
>> >>   gimple *stmt;
>> >>   FOR_EACH_IMM_USE_STMT (stmt, imm_iter, name)
>> >>     {
>> >>       FOR_EACH_IMM_USE_ON_STMT (use, imm_iter)
>> >>         replace_exp (use, val);
>> >>       update_stmt (stmt);
>> >>     }
>> >
>> > Thansk! This could also save some code in replace_uses_by.
>> >
>> > BR.
>> > Jiufu Guo
>> >>
>> >> Thanks,
>> >> Richard.
>> >>
>> >>> +           }
>> >>> +         else
>> >>> +           gsi_next (&gsi);
>> >>> +       }
>> >>> +    }
>> >>> +
>> >>> +  return cfg_altered;
>> >>> +}
>> >>>

  reply	other threads:[~2020-11-17  5:58 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-11-05 13:18 [PATCH] Clean up loop-closed PHIs at loopdone pass guojiufu
2020-11-05 13:43 ` Richard Biener
2020-11-06  7:27   ` Jiufu Guo
2020-11-06  9:55     ` Richard Biener
2020-11-11  8:10       ` [PATCH V2] Clean up loop-closed PHIs after loop finalize Jiufu Guo
2020-11-13  8:18         ` Richard Biener
2020-11-16  7:59           ` Jiufu Guo
2020-11-16  9:26             ` Jiufu Guo
2020-11-16  9:35               ` Richard Biener
2020-11-17  5:58                 ` Jiufu Guo [this message]
2020-11-17  8:09                   ` Jiufu Guo
2020-11-17 10:21                     ` Richard Biener

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=42c93146f7c9ac873edf3d31293b17ed@linux.vnet.ibm.com \
    --to=guojiufu@linux.ibm.com \
    --cc=dje.gcc@gmail.com \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=rguenther@suse.de \
    --cc=richard.guenther@gmail.com \
    --cc=segher@kernel.crashing.org \
    --cc=wschmidt@linux.ibm.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).