public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug tree-optimization/98299] New: Failure to optimize sub loop into modulo-based pattern
@ 2020-12-15 18:57 gabravier at gmail dot com
  2020-12-15 19:02 ` [Bug tree-optimization/98299] " jakub at gcc dot gnu.org
                   ` (3 more replies)
  0 siblings, 4 replies; 5+ messages in thread
From: gabravier at gmail dot com @ 2020-12-15 18:57 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98299

            Bug ID: 98299
           Summary: Failure to optimize sub loop into modulo-based pattern
           Product: gcc
           Version: 11.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: tree-optimization
          Assignee: unassigned at gcc dot gnu.org
          Reporter: gabravier at gmail dot com
  Target Milestone: ---

int f1(int n)
{
    while (n >= 45)
        n -= 45;

    return n;
}

This can be optimized into a modulo-based pattern, for example this :

int f2(int n)
{
    int tmp = n > 44 ? n : 44;
    return ((tmp % 45) - tmp) + n;
}

This transformation is done by LLVM, but not by GCC.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug tree-optimization/98299] Failure to optimize sub loop into modulo-based pattern
  2020-12-15 18:57 [Bug tree-optimization/98299] New: Failure to optimize sub loop into modulo-based pattern gabravier at gmail dot com
@ 2020-12-15 19:02 ` jakub at gcc dot gnu.org
  2020-12-15 19:22 ` gabravier at gmail dot com
                   ` (2 subsequent siblings)
  3 siblings, 0 replies; 5+ messages in thread
From: jakub at gcc dot gnu.org @ 2020-12-15 19:02 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98299

Jakub Jelinek <jakub at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jakub at gcc dot gnu.org

--- Comment #1 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
The big question is if it is an optimization or not.
Perhaps the user wrote it that way because n is always or most of the time
small that it will be faster than doing modulo.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug tree-optimization/98299] Failure to optimize sub loop into modulo-based pattern
  2020-12-15 18:57 [Bug tree-optimization/98299] New: Failure to optimize sub loop into modulo-based pattern gabravier at gmail dot com
  2020-12-15 19:02 ` [Bug tree-optimization/98299] " jakub at gcc dot gnu.org
@ 2020-12-15 19:22 ` gabravier at gmail dot com
  2020-12-15 19:26 ` gabravier at gmail dot com
  2021-01-04 15:21 ` rguenth at gcc dot gnu.org
  3 siblings, 0 replies; 5+ messages in thread
From: gabravier at gmail dot com @ 2020-12-15 19:22 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98299

--- Comment #2 from Gabriel Ravier <gabravier at gmail dot com> ---
At the very least, it seems like a worthwhile pattern to recognize in -O3, even
if only to avoid vectorizing it, i.e. have similar effects to what happens if
you add `if (n >= 1000) __builtin_unreachable();` to the start of f1.

Altogether, though, it seems unlikely that the modulo would be costlier than
the loop except in very narrow cases, since it is optimized into a
multiplication and a few other operations with little cost.

Also, the transformation into a modulo seems to occur in the vectorized version
too, though it is weirdly optimized.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug tree-optimization/98299] Failure to optimize sub loop into modulo-based pattern
  2020-12-15 18:57 [Bug tree-optimization/98299] New: Failure to optimize sub loop into modulo-based pattern gabravier at gmail dot com
  2020-12-15 19:02 ` [Bug tree-optimization/98299] " jakub at gcc dot gnu.org
  2020-12-15 19:22 ` gabravier at gmail dot com
@ 2020-12-15 19:26 ` gabravier at gmail dot com
  2021-01-04 15:21 ` rguenth at gcc dot gnu.org
  3 siblings, 0 replies; 5+ messages in thread
From: gabravier at gmail dot com @ 2020-12-15 19:26 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98299

--- Comment #3 from Gabriel Ravier <gabravier at gmail dot com> ---
PS: The optimization seems to already occur for simpler cases such as powers of
2, e.g. :

int f1(int n)
{
    while (n >= 64)
        n -= 64;

    return n;
}

is optimized into `return (n <= 63) ? n : (n & 63);`.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* [Bug tree-optimization/98299] Failure to optimize sub loop into modulo-based pattern
  2020-12-15 18:57 [Bug tree-optimization/98299] New: Failure to optimize sub loop into modulo-based pattern gabravier at gmail dot com
                   ` (2 preceding siblings ...)
  2020-12-15 19:26 ` gabravier at gmail dot com
@ 2021-01-04 15:21 ` rguenth at gcc dot gnu.org
  3 siblings, 0 replies; 5+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-01-04 15:21 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98299

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2021-01-04
     Ever confirmed|0                           |1

--- Comment #4 from Richard Biener <rguenth at gcc dot gnu.org> ---
Btw for the similar division pattern kernel folks once complained ;)  But yes,
confirmed.  SCCP is doing such transform usually but we don't do it because
we consider the replacement

(n_2(D) + -45) + (int) (((unsigned int) n_2(D) + 4294967251) / 45) * -45

too expensive (guess the cast makes recognizing this as modulo not valid).

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2021-01-04 15:21 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-12-15 18:57 [Bug tree-optimization/98299] New: Failure to optimize sub loop into modulo-based pattern gabravier at gmail dot com
2020-12-15 19:02 ` [Bug tree-optimization/98299] " jakub at gcc dot gnu.org
2020-12-15 19:22 ` gabravier at gmail dot com
2020-12-15 19:26 ` gabravier at gmail dot com
2021-01-04 15:21 ` rguenth at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).