[Bug tree-optimization/66646] New: small loop turned into memmove because of tree ldist

public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed

* [Bug tree-optimization/66646] New: small loop turned into memmove because of tree ldist
@ 2015-06-24  2:31 amker at gcc dot gnu.org
  2015-06-24  2:32 ` [Bug tree-optimization/66646] " amker at gcc dot gnu.org
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: amker at gcc dot gnu.org @ 2015-06-24  2:31 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66646

            Bug ID: 66646
           Summary: small loop turned into memmove because of tree ldist
           Product: gcc
           Version: 6.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: tree-optimization
          Assignee: unassigned at gcc dot gnu.org
          Reporter: amker at gcc dot gnu.org
  Target Milestone: ---

Considering below small case
int foo (int flag, char *a)                                                    
{                                                                              
  short i, j;                                                                  
  short l = 0;                                                                 
  if (flag == 1)                                                               
    l = 3;                                                                     

  for (i = 0; i < 4; i++)                                                      
    {                                                                          
      for (j = l - 1; j > 0; j--)                                              
        a[j] = a[j - 1];                                                       
      a[0] = i;                                                                
    }                                                                          
}
After revision 224020, SCEV can recognize &a[j], &a[j-1].  pass ldist decides
to replace the inner loop with memmove.  Since it's a small loop, most likely
this results in peformance regression.  We need:
A) compute more accurate loop niter bound in such case so that optimizers know
it's small loop.
B) don't turn loop into mem* call if it's small loop.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug tree-optimization/66646] small loop turned into memmove because of tree ldist
  2015-06-24  2:31 [Bug tree-optimization/66646] New: small loop turned into memmove because of tree ldist amker at gcc dot gnu.org
@ 2015-06-24  2:32 ` amker at gcc dot gnu.org
  2015-06-24  7:34 ` rguenth at gcc dot gnu.org
  2021-09-05  1:01 ` pinskia at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: amker at gcc dot gnu.org @ 2015-06-24  2:32 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66646

--- Comment #1 from amker at gcc dot gnu.org ---
With Ofast optimization level.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug tree-optimization/66646] small loop turned into memmove because of tree ldist
  2015-06-24  2:31 [Bug tree-optimization/66646] New: small loop turned into memmove because of tree ldist amker at gcc dot gnu.org
  2015-06-24  2:32 ` [Bug tree-optimization/66646] " amker at gcc dot gnu.org
@ 2015-06-24  7:34 ` rguenth at gcc dot gnu.org
  2021-09-05  1:01 ` pinskia at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: rguenth at gcc dot gnu.org @ 2015-06-24  7:34 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66646

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|                            |missed-optimization
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2015-06-24
                 CC|                            |rguenth at gcc dot gnu.org
     Ever confirmed|0                           |1

--- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> ---
Well, even if it is a small loop the theory is that mem* inline expansion will
produce better code than the loop copying chars.

niter is

  <bb 2>:
  if (flag_6(D) == 1)
    goto <bb 3>;
  else
    goto <bb 4>;

  <bb 3>:

  <bb 4>:
  # prephitmp_41 = PHI <-1(OVF)(2), 2(3)>

...
  _3 = (unsigned short) prephitmp_41;
  _30 = _3 + 65535;
  _48 = (sizetype) _30;

here and the loop is guarded with

  <bb 5>:
  # i_28 = PHI <i_25(7), 0(4)>
  if (prephitmp_41 > 0)

there is a pre-existing issue of a (OVF) constant in the IL (that's a no-no)
and a missed jump-threading to expose the constant.  There is also
range info on the size argument of the memmove at expansion time:

  # RANGE [1, 2] NONZERO 3
  _47 = _48 + 1;
  __builtin_memmove (_36, _33, _47);


but we don't seem to have a target/middle-end expander for BUILT_IN_MEMMOVE.
So that's a missed optimization there.

Without loop distribution we fail to peel the inner loop as well (on the
tree level), because

Loop 2 iterates at most 32767 times.

so we fail to compute a proper upper bound.  The very same issue is
present during loop distribution so it can't know the loop iterates only
1 or 2 times.

Apart from special-casing this memmove in RTL expansion we could also enhance
the memory builtin folders (in gimple-fold.c) to honor range information
and in this case expand the memmove to something more optimal, like

  if (_47 == 1)
    *_36 = *_33;
  else
    *(unsigned short *)_36 = *(unsigned short *)_33;

of course creating control-flow here is not expected (so dealing with this
at RTL expansion time is easier).


Thus confirmed - but applies to similar loops (niter bound not precise)
before the SCEV changes.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* [Bug tree-optimization/66646] small loop turned into memmove because of tree ldist
  2015-06-24  2:31 [Bug tree-optimization/66646] New: small loop turned into memmove because of tree ldist amker at gcc dot gnu.org
  2015-06-24  2:32 ` [Bug tree-optimization/66646] " amker at gcc dot gnu.org
  2015-06-24  7:34 ` rguenth at gcc dot gnu.org
@ 2021-09-05  1:01 ` pinskia at gcc dot gnu.org
  2 siblings, 0 replies; 4+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-09-05  1:01 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66646

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|normal                      |enhancement
   Last reconfirmed|2015-06-24 00:00:00         |2021-9-4

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2021-09-05  1:01 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-06-24  2:31 [Bug tree-optimization/66646] New: small loop turned into memmove because of tree ldist amker at gcc dot gnu.org
2015-06-24  2:32 ` [Bug tree-optimization/66646] " amker at gcc dot gnu.org
2015-06-24  7:34 ` rguenth at gcc dot gnu.org
2021-09-05  1:01 ` pinskia at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).