[Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options

public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed

* [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options
@ 2021-05-10 10:20 dongjianqiang2 at huawei dot com
  2021-05-10 11:13 ` [Bug tree-optimization/100499] " rguenth at gcc dot gnu.org
                   ` (45 more replies)
  0 siblings, 46 replies; 47+ messages in thread
From: dongjianqiang2 at huawei dot com @ 2021-05-10 10:20 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

            Bug ID: 100499
           Summary: Different results with -fpeel-loops
                    -ftree-loop-vectorize options
           Product: gcc
           Version: 10.2.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: tree-optimization
          Assignee: unassigned at gcc dot gnu.org
          Reporter: dongjianqiang2 at huawei dot com
  Target Milestone: ---

cat demo.c
#include<stdio.h>
typedef unsigned short uint16_t;
typedef signed int int32_t;
static uint16_t g_2823 = 0xEC75L;
static uint16_t g_116 = 0xBC07L;

static uint16_t
(safe_mul_func_uint16_t_u_u)(uint16_t ui1, uint16_t ui2 )
{

  return ((unsigned int)ui1) * ((unsigned int)ui2);
}

int main (int argc, char* argv[])
{
    uint16_t l_2815 = 65535UL;
    uint16_t *l_2821 = &g_116;
    uint16_t *l_2822 = &g_2823;

    printf ("start g_2823 = %d\n", g_2823);
lbl_2826:
    l_2815 &= 0x9DEF1EAEL;
    if (+(safe_mul_func_uint16_t_u_u(((*l_2821) = l_2815), (--(*l_2822)))))
    {
      goto lbl_2826;
    }
    printf ("end g_2823 = %d\n", g_2823);

    return 0;
}

Compiling with 

gcc demo.c -O1
./a.out
start g_2823 = 60533
end g_2823 = 32768

gcc demo.c -O1 -fpeel-loops -ftree-loop-vectorize
./a.out
start g_2823 = 60533
end g_2823 = 60526

gives different results

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
@ 2021-05-10 11:13 ` rguenth at gcc dot gnu.org
  2021-05-11  9:06 ` marxin at gcc dot gnu.org
                   ` (44 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-05-10 11:13 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
      Known to work|                            |11.1.0
     Ever confirmed|0                           |1
           Keywords|                            |needs-bisection, wrong-code
             Status|UNCONFIRMED                 |NEW
                 CC|                            |rguenth at gcc dot gnu.org
   Last reconfirmed|                            |2021-05-10
      Known to fail|                            |10.3.1, 8.4.1, 9.3.1

--- Comment #1 from Richard Biener <rguenth at gcc dot gnu.org> ---
Confirmed.  It looks like GCC 11 works (it's not vectorizing).  I wonder what
fixed it.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
  2021-05-10 11:13 ` [Bug tree-optimization/100499] " rguenth at gcc dot gnu.org
@ 2021-05-11  9:06 ` marxin at gcc dot gnu.org
  2021-05-11  9:07 ` marxin at gcc dot gnu.org
                   ` (43 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: marxin at gcc dot gnu.org @ 2021-05-11  9:06 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Martin Liška <marxin at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|needs-bisection             |
                 CC|                            |marxin at gcc dot gnu.org

--- Comment #2 from Martin Liška <marxin at gcc dot gnu.org> ---
gcc pr100499.c -O1 -fpeel-loops -ftree-loop-vectorize && ./a.out | grep 60526

succeeds since r11-2453-gc89366b12ff4f362.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
  2021-05-10 11:13 ` [Bug tree-optimization/100499] " rguenth at gcc dot gnu.org
  2021-05-11  9:06 ` marxin at gcc dot gnu.org
@ 2021-05-11  9:07 ` marxin at gcc dot gnu.org
  2021-05-11  9:09 ` marxin at gcc dot gnu.org
                   ` (42 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: marxin at gcc dot gnu.org @ 2021-05-11  9:07 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #3 from Martin Liška <marxin at gcc dot gnu.org> ---
But expected result is end g_2823 = 32768, right?
Clang returns the same result 32768.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (2 preceding siblings ...)
  2021-05-11  9:07 ` marxin at gcc dot gnu.org
@ 2021-05-11  9:09 ` marxin at gcc dot gnu.org
  2021-05-11  9:12 ` dongjianqiang2 at huawei dot com
                   ` (41 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: marxin at gcc dot gnu.org @ 2021-05-11  9:09 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Martin Liška <marxin at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |amker at gcc dot gnu.org

--- Comment #4 from Martin Liška <marxin at gcc dot gnu.org> ---
(In reply to Martin Liška from comment #3)
> But expected result is end g_2823 = 32768, right?
> Clang returns the same result 32768.

Which regresses since r7-2373-g69b806f6a60efcf1.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (3 preceding siblings ...)
  2021-05-11  9:09 ` marxin at gcc dot gnu.org
@ 2021-05-11  9:12 ` dongjianqiang2 at huawei dot com
  2021-05-11  9:42 ` rguenth at gcc dot gnu.org
                   ` (40 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: dongjianqiang2 at huawei dot com @ 2021-05-11  9:12 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #5 from John Dong <dongjianqiang2 at huawei dot com> ---
(In reply to Martin Liška from comment #3)
> But expected result is end g_2823 = 32768, right?
> Clang returns the same result 32768.

Yes, I think so.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (4 preceding siblings ...)
  2021-05-11  9:12 ` dongjianqiang2 at huawei dot com
@ 2021-05-11  9:42 ` rguenth at gcc dot gnu.org
  2021-05-11 10:40 ` amker at gcc dot gnu.org
                   ` (39 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-05-11  9:42 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
      Known to work|11.1.0                      |
      Known to fail|                            |11.1.0

--- Comment #6 from Richard Biener <rguenth at gcc dot gnu.org> ---
OK, so GCC 11 is broken as well.  Might want to bisect with
-fno-vect-cost-model

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (5 preceding siblings ...)
  2021-05-11  9:42 ` rguenth at gcc dot gnu.org
@ 2021-05-11 10:40 ` amker at gcc dot gnu.org
  2021-05-11 11:33 ` marxin at gcc dot gnu.org
                   ` (38 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: amker at gcc dot gnu.org @ 2021-05-11 10:40 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

bin cheng <amker at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Assignee|unassigned at gcc dot gnu.org      |amker at gcc dot gnu.org

--- Comment #7 from bin cheng <amker at gcc dot gnu.org> ---
(In reply to Martin Liška from comment #4)
> (In reply to Martin Liška from comment #3)
> > But expected result is end g_2823 = 32768, right?
> > Clang returns the same result 32768.
> 
> Which regresses since r7-2373-g69b806f6a60efcf1.

Hmm, that was a fix long ago.  Will investigate this.  Sorry for the breakage.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (6 preceding siblings ...)
  2021-05-11 10:40 ` amker at gcc dot gnu.org
@ 2021-05-11 11:33 ` marxin at gcc dot gnu.org
  2021-05-17  2:17 ` amker at gcc dot gnu.org
                   ` (37 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: marxin at gcc dot gnu.org @ 2021-05-11 11:33 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #8 from Martin Liška <marxin at gcc dot gnu.org> ---
(In reply to bin cheng from comment #7)
> (In reply to Martin Liška from comment #4)
> > (In reply to Martin Liška from comment #3)
> > > But expected result is end g_2823 = 32768, right?
> > > Clang returns the same result 32768.
> > 
> > Which regresses since r7-2373-g69b806f6a60efcf1.
> 
> Hmm, that was a fix long ago.  Will investigate this.  Sorry for the
> breakage.

gcc pr100499.c -O1 -fpeel-loops -ftree-loop-vectorize -fno-vect-cost-model &&
./a.out | grep 32768

also breaks with r7-2373-g69b806f6a60efcf1.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (7 preceding siblings ...)
  2021-05-11 11:33 ` marxin at gcc dot gnu.org
@ 2021-05-17  2:17 ` amker at gcc dot gnu.org
  2021-05-18  9:49 ` rguenth at gcc dot gnu.org
                   ` (36 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: amker at gcc dot gnu.org @ 2021-05-17  2:17 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #9 from bin cheng <amker at gcc dot gnu.org> ---
Seems we have a long standing bug in fold-const.c:multiple_of_p in case of
wrapping types.  Take unsigned int as an example:
  (0xfffffffc * 0x3) % 0x3 = 0x1
But multiple_of_p returns true here.

The same issue also stands for MINUS_EXPR and PLUS_EXPR.  Given multiple_of_p
is used elsewhere, the fix might break existing optimizations.  Especially,
number of loop iterations is computed in unsigned types

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (8 preceding siblings ...)
  2021-05-17  2:17 ` amker at gcc dot gnu.org
@ 2021-05-18  9:49 ` rguenth at gcc dot gnu.org
  2021-05-18  9:57 ` rguenth at gcc dot gnu.org
                   ` (35 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-05-18  9:49 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #10 from Richard Biener <rguenth at gcc dot gnu.org> ---
(In reply to bin cheng from comment #9)
> Seems we have a long standing bug in fold-const.c:multiple_of_p in case of
> wrapping types.  Take unsigned int as an example:
>   (0xfffffffc * 0x3) % 0x3 = 0x1
> But multiple_of_p returns true here.
> 
> The same issue also stands for MINUS_EXPR and PLUS_EXPR.  Given
> multiple_of_p is used elsewhere, the fix might break existing optimizations.
> Especially, number of loop iterations is computed in unsigned types

multiple_of_p is mostly used in contexts where overflow "cannot happen"
(in TYPE/DECL_SIZE computation context), and in niter analysis it seems to
be guarded similarly.  This restriction of multiple_of_p seems undocumented,
so fixing that might be good.

Now, you don't say what's the chain of events that lead to a multiple_of_p call
eventually leading to the wrong answer, but I guess it's the code added
under the

+  if (!niter->control.no_overflow
+      && (integer_onep (s) || multiple_of_p (type, c, s)))

check as !niter->control.no_overflow seems to suggest that the multiple_of_p
check is not properly guarded?

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (9 preceding siblings ...)
  2021-05-18  9:49 ` rguenth at gcc dot gnu.org
@ 2021-05-18  9:57 ` rguenth at gcc dot gnu.org
  2021-05-18 10:25 ` rguenth at gcc dot gnu.org
                   ` (34 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-05-18  9:57 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jakub at gcc dot gnu.org,
                   |                            |law at gcc dot gnu.org

--- Comment #11 from Richard Biener <rguenth at gcc dot gnu.org> ---
But there's still

int
multiple_of_p (tree type, const_tree top, const_tree bottom)
{
...
    case NOP_EXPR:
      /* Can't handle conversions from non-integral or wider integral type.  */
      if ((TREE_CODE (TREE_TYPE (TREE_OPERAND (top, 0))) != INTEGER_TYPE)
          || (TYPE_PRECISION (type)
              < TYPE_PRECISION (TREE_TYPE (TREE_OPERAND (top, 0)))))
        return 0;

      /* fall through */

    case SAVE_EXPR:
      return multiple_of_p (type, TREE_OPERAND (top, 0), bottom);

which makes guards like

      || (TYPE_OVERFLOW_UNDEFINED (type)
          && multiple_of_p (type, c, s)))

break for say multiple_of_p (int, (int)(unsigned)(x*s), s).  So it's a bit
fishy in the end ... :/

Jeff originally introduced multiple_of_p, maybe he remembers something.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (10 preceding siblings ...)
  2021-05-18  9:57 ` rguenth at gcc dot gnu.org
@ 2021-05-18 10:25 ` rguenth at gcc dot gnu.org
  2021-05-18 11:33 ` amker at gcc dot gnu.org
                   ` (33 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-05-18 10:25 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |aldyh at gcc dot gnu.org

--- Comment #12 from Richard Biener <rguenth at gcc dot gnu.org> ---
So in number_of_iterations_ne it looks like the step 's' is always constant
which makes me wonder if we can somehow use ranger to tell multiple_of_p (type,
c, s)
or at least whether, if c is x * s, the multiplication could have overflowed?

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (11 preceding siblings ...)
  2021-05-18 10:25 ` rguenth at gcc dot gnu.org
@ 2021-05-18 11:33 ` amker at gcc dot gnu.org
  2021-05-18 11:36 ` amker at gcc dot gnu.org
                   ` (32 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: amker at gcc dot gnu.org @ 2021-05-18 11:33 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #13 from bin cheng <amker at gcc dot gnu.org> ---
(In reply to Richard Biener from comment #10)
> (In reply to bin cheng from comment #9)
> > Seems we have a long standing bug in fold-const.c:multiple_of_p in case of
> > wrapping types.  Take unsigned int as an example:
> >   (0xfffffffc * 0x3) % 0x3 = 0x1
> > But multiple_of_p returns true here.
> > 
> > The same issue also stands for MINUS_EXPR and PLUS_EXPR.  Given
> > multiple_of_p is used elsewhere, the fix might break existing optimizations.
> > Especially, number of loop iterations is computed in unsigned types
> 
> multiple_of_p is mostly used in contexts where overflow "cannot happen"
> (in TYPE/DECL_SIZE computation context), and in niter analysis it seems to
> be guarded similarly.  This restriction of multiple_of_p seems undocumented,
Oh, I am not aware of this.  Actually my previous change to it seems broke this
assumption already.  Will see how to fix or revert the change.

> so fixing that might be good.
> 
> Now, you don't say what's the chain of events that lead to a multiple_of_p
> call
> eventually leading to the wrong answer, but I guess it's the code added
> under the
> 
> +  if (!niter->control.no_overflow
> +      && (integer_onep (s) || multiple_of_p (type, c, s)))
> 
> check as !niter->control.no_overflow seems to suggest that the multiple_of_p
> check is not properly guarded?

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (12 preceding siblings ...)
  2021-05-18 11:33 ` amker at gcc dot gnu.org
@ 2021-05-18 11:36 ` amker at gcc dot gnu.org
  2021-05-18 11:54 ` rguenther at suse dot de
                   ` (31 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: amker at gcc dot gnu.org @ 2021-05-18 11:36 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #14 from bin cheng <amker at gcc dot gnu.org> ---
(In reply to Richard Biener from comment #12)
> So in number_of_iterations_ne it looks like the step 's' is always constant
> which makes me wonder if we can somehow use ranger to tell multiple_of_p
> (type, c, s)
> or at least whether, if c is x * s, the multiplication could have overflowed?

Yeah, I am looking if "multiple of" can be feasibly checked in niter analysis,
with help of some basic information from multiple_of_p.

BTW, I am not following changes in "ranger", how should I used in analysis? or
similar to value range info?

Thanks

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (13 preceding siblings ...)
  2021-05-18 11:36 ` amker at gcc dot gnu.org
@ 2021-05-18 11:54 ` rguenther at suse dot de
  2021-05-18 15:46 ` amacleod at redhat dot com
                   ` (30 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenther at suse dot de @ 2021-05-18 11:54 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #15 from rguenther at suse dot de <rguenther at suse dot de> ---
On Tue, 18 May 2021, amker at gcc dot gnu.org wrote:

> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
> 
> --- Comment #14 from bin cheng <amker at gcc dot gnu.org> ---
> (In reply to Richard Biener from comment #12)
> > So in number_of_iterations_ne it looks like the step 's' is always constant
> > which makes me wonder if we can somehow use ranger to tell multiple_of_p
> > (type, c, s)
> > or at least whether, if c is x * s, the multiplication could have overflowed?
> 
> Yeah, I am looking if "multiple of" can be feasibly checked in niter analysis,
> with help of some basic information from multiple_of_p.
> 
> BTW, I am not following changes in "ranger", how should I used in analysis? or
> similar to value range info?

I'm not sure - let's see if the ranger folks have any good idea here.

Btw, there's tree_ctz which looks more conservative and could be used
for power-of-two 's'.  split_constant_offset also recently got
refactoring to avoid a plethora of overflow issues it ran into,
so we can eventually improve multiple_of_p to be correct without
pre-conditions.  But I fear that for DECL_SIZE & friends where
we "know" that multiplications by 8 to get from byte to bit size
do not overflow we cannot be too conservative here.  Maybe in the
end we need to distinguish those with new MULT_NO_OVERFLOW,
PLUS_NO_OVERFLOW, etc.  When creating those expressions we should
already be using size_{bin,un}op.  The conversion handling of course
still looks bogus to me even in this context.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (14 preceding siblings ...)
  2021-05-18 11:54 ` rguenther at suse dot de
@ 2021-05-18 15:46 ` amacleod at redhat dot com
  2021-05-19  6:31 ` rguenth at gcc dot gnu.org
                   ` (29 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: amacleod at redhat dot com @ 2021-05-18 15:46 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Andrew Macleod <amacleod at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |msebor at gcc dot gnu.org

--- Comment #16 from Andrew Macleod <amacleod at redhat dot com> ---
Aldy and I are discussing this.

Ranger itself can't do anything outside of the gimple IL, its effectively just
a GIMPLE interface to range-ops. ... but I don't think  it would be hard to add
a range-ops expression evaluator.  I notice the multiple_of_p's "top" argument
is an expression tree  ie:

g_2823_lsm.5_13 * 7854 + 57682

We could add an expression evaluator that can walk that expression, invoking
range-ops on each expression, and calling a ranger instance to evaluate a range
for any ssa_name it finds.

It would bail if there are unknown tree-codes to range-ops. 

I don't think this is a particularly difficult thing to do, but by itself
doesn't really tell you if an overflow is possible

Once we can evaluate an expression outside of the IL like this, it would also
not be difficult to then evaluate the expression in an increased precision.  
We could cast the range at each point of the evaluation to the increased
precision and invoke range-ops on that range.  We could tell at any point if an
overflow is possible because the increased precision calculation would be
different than the original.

so the original expression is in 16 bit math, and if it was evaluated as
     [0, 65535] * [7854, 7854] + [57682, 57682]
 in 32 bit precision, it would come back with the answer [57682, 514769572].
Casting the final original value to 32 bit would yield a different result, and
we could conclude than an overflow is possible.

Would this be useful?  and would it solve this problem? I'm sure there are
other details to work out related to the increased precision, but it seems like
it might work?   

I think Martin Sebor was also interested in something along these lines so I'm
CC ing him.  I think he wanted to do this within the IL for some of his
warnings.. but I think something similar is feasible with an IL walk rather
than expression walk.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (15 preceding siblings ...)
  2021-05-18 15:46 ` amacleod at redhat dot com
@ 2021-05-19  6:31 ` rguenth at gcc dot gnu.org
  2021-05-19  7:10 ` amker at gcc dot gnu.org
                   ` (28 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-05-19  6:31 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #17 from Richard Biener <rguenth at gcc dot gnu.org> ---
(In reply to Andrew Macleod from comment #16)
> Aldy and I are discussing this.
> 
> Ranger itself can't do anything outside of the gimple IL, its effectively
> just a GIMPLE interface to range-ops. ... but I don't think  it would be
> hard to add a range-ops expression evaluator.  I notice the multiple_of_p's
> "top" argument is an expression tree  ie:
> 
> g_2823_lsm.5_13 * 7854 + 57682
> 
> We could add an expression evaluator that can walk that expression, invoking
> range-ops on each expression, and calling a ranger instance to evaluate a
> range for any ssa_name it finds.
> 
> It would bail if there are unknown tree-codes to range-ops. 

Yeah, it would be similar to the existing determine_value_range () function
which does exactly do this (but not using ranger).

> I don't think this is a particularly difficult thing to do, but by itself
> doesn't really tell you if an overflow is possible

Yes, so in itself it wouldn't be enough.

> Once we can evaluate an expression outside of the IL like this, it would
> also not be difficult to then evaluate the expression in an increased
> precision.  
> We could cast the range at each point of the evaluation to the increased
> precision and invoke range-ops on that range.  We could tell at any point if
> an overflow is possible because the increased precision calculation would be
> different than the original.
> 
> so the original expression is in 16 bit math, and if it was evaluated as
>      [0, 65535] * [7854, 7854] + [57682, 57682]
>  in 32 bit precision, it would come back with the answer [57682, 514769572].
> Casting the final original value to 32 bit would yield a different result,
> and we could conclude than an overflow is possible.
> 
> Would this be useful?  and would it solve this problem? I'm sure there are
> other details to work out related to the increased precision, but it seems
> like it might work?   

Hmm, so the issue we're facing is asking whether x * 3 is a multiple of 3
but we have to consider that x * 3 might overflow.  For '3' being power-of-two
overflow doesn't change anything but for some C ((X * C) % S % C) != 0
(where S is the modulo reduction applied on overflow).

So it seems that, to be useful, this ranger expression walk would need to be
integrated with multiple_of_p itself.  Or we need to restrict calls to
multiple_of_p to those C where S is a multiple of C.  (but in principle
multiple_of_p is happy with a symbolic C - not sure if it is ever called with
one though)

> I think Martin Sebor was also interested in something along these lines so
> I'm CC ing him.  I think he wanted to do this within the IL for some of his
> warnings.. but I think something similar is feasible with an IL walk rather
> than expression walk.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (16 preceding siblings ...)
  2021-05-19  6:31 ` rguenth at gcc dot gnu.org
@ 2021-05-19  7:10 ` amker at gcc dot gnu.org
  2021-05-19  7:15 ` amker at gcc dot gnu.org
                   ` (27 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: amker at gcc dot gnu.org @ 2021-05-19  7:10 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #18 from bin cheng <amker at gcc dot gnu.org> ---
Did some experiments, there are two fallouts after explicitly returning false
for unsigned/wrapping types in MULT_EXPR/MINUS_EXPR/PLUS_EXPR.  One is the
mentioned use of multiple_of_p in number_of_iterations_ne, the other is for
alignment warning in stor-layout.c.  As pointed out, the latter case is known
not overflow/wrap.  

So I am thinking to introduce an additional parameter indicating that caller
knows "top" doesn't overfow/wrap, otherwise, try to get rid of the undocumented
assumption.  we can always improve the accuracy using ranger or other tools. 
Not sure if this is the right way to do.

As for MULT_NO_OVERFLOW/PLUS_NO_OVERFLOW, IMHO, it's not that simple?  For
example, unsigned_num(multiple of 4, and larger than 0) + 0xfffffffc is
multiple of 4, but it's overflow behavior on which we rely here.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (17 preceding siblings ...)
  2021-05-19  7:10 ` amker at gcc dot gnu.org
@ 2021-05-19  7:15 ` amker at gcc dot gnu.org
  2021-05-19  8:24 ` aldyh at redhat dot com
                   ` (26 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: amker at gcc dot gnu.org @ 2021-05-19  7:15 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #19 from bin cheng <amker at gcc dot gnu.org> ---
(In reply to bin cheng from comment #18)
> Did some experiments, there are two fallouts after explicitly returning
> false for unsigned/wrapping types in MULT_EXPR/MINUS_EXPR/PLUS_EXPR.  One is
> the mentioned use of multiple_of_p in number_of_iterations_ne, the other is
> for alignment warning in stor-layout.c.  As pointed out, the latter case is
> known not overflow/wrap.  
> 
> So I am thinking to introduce an additional parameter indicating that caller
> knows "top" doesn't overfow/wrap, otherwise, try to get rid of the
> undocumented assumption.  we can always improve the accuracy using ranger or
> other tools.  Not sure if this is the right way to do.
> 
> As for MULT_NO_OVERFLOW/PLUS_NO_OVERFLOW, IMHO, it's not that simple?  For
> example, unsigned_num(multiple of 4, and larger than 0) + 0xfffffffc is
> multiple of 4, but it's overflow behavior on which we rely here.

Hmm, 4 is special and not a correct example.  Considering:
  n (unsigned, multiple of 3, and > 0) + 0xfffffffd
It's multiple of 3, but we need to rely on wrapping to get answer.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (18 preceding siblings ...)
  2021-05-19  7:15 ` amker at gcc dot gnu.org
@ 2021-05-19  8:24 ` aldyh at redhat dot com
  2021-05-19 18:11 ` amacleod at redhat dot com
                   ` (25 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: aldyh at redhat dot com @ 2021-05-19  8:24 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #20 from Aldy Hernandez <aldyh at redhat dot com> ---
On Wed, May 19, 2021 at 8:31 AM rguenth at gcc dot gnu.org
<gcc-bugzilla@gcc.gnu.org> wrote:
>
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
>
> --- Comment #17 from Richard Biener <rguenth at gcc dot gnu.org> ---
> (In reply to Andrew Macleod from comment #16)

> > We could add an expression evaluator that can walk that expression, invoking
> > range-ops on each expression, and calling a ranger instance to evaluate a
> > range for any ssa_name it finds.
> >
> > It would bail if there are unknown tree-codes to range-ops.
>
> Yeah, it would be similar to the existing determine_value_range () function
> which does exactly do this (but not using ranger).

determine_value_range() has been calling range-ops under the covers
for quite a while, so it's half-way there.  It would require some
minor tweaks:

a) Use irange instead of value_range so as to not throw away the
higher precision range-ops calculates.

b) If we want context-aware ranges, pass it a gimple statement / edge
/ etc, and a range_query/ranger.

Oh yeah, and return a proper range, not this value_range_kind +
wide_int + wide_int business (determine_value_range_1 does this
already).

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (19 preceding siblings ...)
  2021-05-19  8:24 ` aldyh at redhat dot com
@ 2021-05-19 18:11 ` amacleod at redhat dot com
  2021-05-22 15:37 ` law at gcc dot gnu.org
                   ` (24 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: amacleod at redhat dot com @ 2021-05-19 18:11 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #21 from Andrew Macleod <amacleod at redhat dot com> ---

> > 
> > Would this be useful?  and would it solve this problem? I'm sure there are
> > other details to work out related to the increased precision, but it seems
> > like it might work?   
> 
> Hmm, so the issue we're facing is asking whether x * 3 is a multiple of 3
> but we have to consider that x * 3 might overflow.  For '3' being
> power-of-two
> overflow doesn't change anything but for some C ((X * C) % S % C) != 0
> (where S is the modulo reduction applied on overflow).
> 
> So it seems that, to be useful, this ranger expression walk would need to be
> integrated with multiple_of_p itself.  Or we need to restrict calls to
> multiple_of_p to those C where S is a multiple of C.  (but in principle
> multiple_of_p is happy with a symbolic C - not sure if it is ever called with
> one though)
>

We'll add the general expression evaluator, since I think its pretty
straightforward and useful in general.  And if we then add the ability to
indicate an expression may overflow, then multiple_of_p can decide whether that
is a check it wants to make or not, and if it is, then it can simply ask. 
multiple_of_p would simply be a client calling "may_overflow_p (expr)"

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (20 preceding siblings ...)
  2021-05-19 18:11 ` amacleod at redhat dot com
@ 2021-05-22 15:37 ` law at gcc dot gnu.org
  2021-05-26  8:14 ` aldyh at gcc dot gnu.org
                   ` (23 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: law at gcc dot gnu.org @ 2021-05-22 15:37 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #22 from Jeffrey A. Law <law at gcc dot gnu.org> ---
I have vague memories of it, but it wasn't my code.  It was actually Craig
Burley.  It's original purpose was merely to allow converting *_DIV_EXPR into
EXACT_DIV_EXPR which presumably was important for some g77 cases way back then.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (21 preceding siblings ...)
  2021-05-22 15:37 ` law at gcc dot gnu.org
@ 2021-05-26  8:14 ` aldyh at gcc dot gnu.org
  2021-05-26  8:21 ` aldyh at gcc dot gnu.org
                   ` (22 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: aldyh at gcc dot gnu.org @ 2021-05-26  8:14 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #23 from Aldy Hernandez <aldyh at gcc dot gnu.org> ---
I have an upcoming patchset that implements a range evaluator for tree
expressions (similar to determine_value_range), as well as a gimple_ranger that
evaluates expressions in a higher precision.  This combination looks like it
could provide a way for determining overflow:

// Return TRUE if evaluating EXPR in its type can produce an overflow.
// Overflow is determined by calculating the possible range for EXPR
// in its natural type and in a wider type.  If the results differ,
// evaluating EXPR may have overflowed.

bool
may_overflow_p (const_tree expr)
{
  gimple_ranger ranger;
  gimple_ranger_wider wranger (TREE_TYPE (expr));
  int_range_max r, w;

  if (!ranger.range_of_expr (r, const_cast <tree> (expr))
      || !wranger.range_of_expr (w, const_cast <tree> (expr)))
    return true;

  return r != w;
}

Of course, we'd need to adapt the above to re-use any active ranger instead of
instantiating a new one every time.

The above yields overflow for the 16-bit expression in question:

(gdb) p debug(top)
g_2823_lsm.5_6 * 7854 + 57682

(gdb) p may_overflow_p (top)
$6 = true

This is because the range of the above expression in unsigned 16-bit yields
VARYING (R), whereas in 32-bit unsigned yields [57682, 514769572] (W).

Plugging in the above into multiple_of_p:

diff --git a/gcc/fold-const.c b/gcc/fold-const.c
index 4a4358362e1..ea8ec3bbeec 100644
--- a/gcc/fold-const.c
+++ b/gcc/fold-const.c
@@ -13987,6 +13987,9 @@ multiple_of_p (tree type, const_tree top, const_tree
bottom)
   if (TREE_CODE (type) != INTEGER_TYPE)
     return 0;

+  if (may_overflow_p (top))
+    return false;
+
   switch (TREE_CODE (top))
     {
     case BIT_AND_EXPR:

...yields what I think is the correct result:

(base) abulafia:~/bld/t/gcc$ ./xgcc -B./ a.c -O1  -w
(base) abulafia:~/bld/t/gcc$ ./a.out
start g_2823 = 60533
end g_2823 = 32768
(base) abulafia:~/bld/t/gcc$ ./xgcc -B./ a.c -O1 -fpeel-loops
-ftree-loop-vectorize -w
(base) abulafia:~/bld/t/gcc$ ./a.out
start g_2823 = 60533
end g_2823 = 32768

Does this seem right?

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (22 preceding siblings ...)
  2021-05-26  8:14 ` aldyh at gcc dot gnu.org
@ 2021-05-26  8:21 ` aldyh at gcc dot gnu.org
  2021-05-26  8:34 ` rguenther at suse dot de
                   ` (21 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: aldyh at gcc dot gnu.org @ 2021-05-26  8:21 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #24 from Aldy Hernandez <aldyh at gcc dot gnu.org> ---
(In reply to Aldy Hernandez from comment #23)

> The above yields overflow for the 16-bit expression in question:
> 
> (gdb) p debug(top)
> g_2823_lsm.5_6 * 7854 + 57682
> 
> (gdb) p may_overflow_p (top)
> $6 = true
> 
> This is because the range of the above expression in unsigned 16-bit yields
> VARYING (R), whereas in 32-bit unsigned yields [57682, 514769572] (W).

Which, I will note, matches Andrew's hand calculation of:

>so the original expression is in 16 bit math, and if it was evaluated as
>     [0, 65535] * [7854, 7854] + [57682, 57682]
> in 32 bit precision, it would come back with the answer [57682, 514769572].

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (23 preceding siblings ...)
  2021-05-26  8:21 ` aldyh at gcc dot gnu.org
@ 2021-05-26  8:34 ` rguenther at suse dot de
  2021-05-26  8:40 ` aldyh at redhat dot com
                   ` (20 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenther at suse dot de @ 2021-05-26  8:34 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #25 from rguenther at suse dot de <rguenther at suse dot de> ---
On Wed, 26 May 2021, aldyh at gcc dot gnu.org wrote:

> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
> 
> --- Comment #23 from Aldy Hernandez <aldyh at gcc dot gnu.org> ---
> I have an upcoming patchset that implements a range evaluator for tree
> expressions (similar to determine_value_range), as well as a gimple_ranger that
> evaluates expressions in a higher precision.  This combination looks like it
> could provide a way for determining overflow:
> 
> // Return TRUE if evaluating EXPR in its type can produce an overflow.
> // Overflow is determined by calculating the possible range for EXPR
> // in its natural type and in a wider type.  If the results differ,
> // evaluating EXPR may have overflowed.
> 
> bool
> may_overflow_p (const_tree expr)
> {
>   gimple_ranger ranger;
>   gimple_ranger_wider wranger (TREE_TYPE (expr));
>   int_range_max r, w;
> 
>   if (!ranger.range_of_expr (r, const_cast <tree> (expr))
>       || !wranger.range_of_expr (w, const_cast <tree> (expr)))
>     return true;
> 
>   return r != w;
> }
> 
> Of course, we'd need to adapt the above to re-use any active ranger instead of
> instantiating a new one every time.
> 
> The above yields overflow for the 16-bit expression in question:
> 
> (gdb) p debug(top)
> g_2823_lsm.5_6 * 7854 + 57682
> 
> (gdb) p may_overflow_p (top)
> $6 = true
> 
> This is because the range of the above expression in unsigned 16-bit yields
> VARYING (R), whereas in 32-bit unsigned yields [57682, 514769572] (W).
> 
> Plugging in the above into multiple_of_p:
> 
> diff --git a/gcc/fold-const.c b/gcc/fold-const.c
> index 4a4358362e1..ea8ec3bbeec 100644
> --- a/gcc/fold-const.c
> +++ b/gcc/fold-const.c
> @@ -13987,6 +13987,9 @@ multiple_of_p (tree type, const_tree top, const_tree
> bottom)
>    if (TREE_CODE (type) != INTEGER_TYPE)
>      return 0;
> 
> +  if (may_overflow_p (top))
> +    return false;
> +
>    switch (TREE_CODE (top))
>      {
>      case BIT_AND_EXPR:
> 
> ...yields what I think is the correct result:
> 
> (base) abulafia:~/bld/t/gcc$ ./xgcc -B./ a.c -O1  -w
> (base) abulafia:~/bld/t/gcc$ ./a.out
> start g_2823 = 60533
> end g_2823 = 32768
> (base) abulafia:~/bld/t/gcc$ ./xgcc -B./ a.c -O1 -fpeel-loops
> -ftree-loop-vectorize -w
> (base) abulafia:~/bld/t/gcc$ ./a.out
> start g_2823 = 60533
> end g_2823 = 32768
> 
> Does this seem right?

It's probably too strict for multiple_of_p which is fine with
overflows that preserve modulo behavior.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (24 preceding siblings ...)
  2021-05-26  8:34 ` rguenther at suse dot de
@ 2021-05-26  8:40 ` aldyh at redhat dot com
  2021-05-26  8:59 ` rguenther at suse dot de
                   ` (19 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: aldyh at redhat dot com @ 2021-05-26  8:40 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #26 from Aldy Hernandez <aldyh at redhat dot com> ---
On Wed, May 26, 2021 at 10:34 AM rguenther at suse dot de
<gcc-bugzilla@gcc.gnu.org> wrote:

> It's probably too strict for multiple_of_p which is fine with
> overflows that preserve modulo behavior.

Could you provide an example?

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (25 preceding siblings ...)
  2021-05-26  8:40 ` aldyh at redhat dot com
@ 2021-05-26  8:59 ` rguenther at suse dot de
  2021-05-26 13:17 ` amacleod at redhat dot com
                   ` (18 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenther at suse dot de @ 2021-05-26  8:59 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #27 from rguenther at suse dot de <rguenther at suse dot de> ---
On Wed, 26 May 2021, aldyh at redhat dot com wrote:

> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
> 
> --- Comment #26 from Aldy Hernandez <aldyh at redhat dot com> ---
> On Wed, May 26, 2021 at 10:34 AM rguenther at suse dot de
> <gcc-bugzilla@gcc.gnu.org> wrote:
> 
> > It's probably too strict for multiple_of_p which is fine with
> > overflows that preserve modulo behavior.
> 
> Could you provide an example?

Like with DECL_SIZE being D.1234 * 8 as unsigned multiplication
and the query whether it's a multiple of 8.  Once you have no
range for 'D.1234' you will signal overflow (correctly) but
even then it's still a mutliple of 8.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (26 preceding siblings ...)
  2021-05-26  8:59 ` rguenther at suse dot de
@ 2021-05-26 13:17 ` amacleod at redhat dot com
  2021-05-26 13:23 ` rguenther at suse dot de
                   ` (17 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: amacleod at redhat dot com @ 2021-05-26 13:17 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #28 from Andrew Macleod <amacleod at redhat dot com> ---
(In reply to rguenther@suse.de from comment #27)
> On Wed, 26 May 2021, aldyh at redhat dot com wrote:
> 
> > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
> > 
> > --- Comment #26 from Aldy Hernandez <aldyh at redhat dot com> ---
> > On Wed, May 26, 2021 at 10:34 AM rguenther at suse dot de
> > <gcc-bugzilla@gcc.gnu.org> wrote:
> > 
> > > It's probably too strict for multiple_of_p which is fine with
> > > overflows that preserve modulo behavior.
> > 
> > Could you provide an example?
> 
> Like with DECL_SIZE being D.1234 * 8 as unsigned multiplication
> and the query whether it's a multiple of 8.  Once you have no
> range for 'D.1234' you will signal overflow (correctly) but
> even then it's still a mutliple of 8.

Determining whether an arbitrary expression is a multiple of some number is not
really something we can figure out via ranges. Well, that's not quite true. If
we fully fleshed out the operations you care about, things like multiply or
shift you could get some results. presumably things like multiply by 2,4,8 and
16.. if we created correct multi-ranges for those, a cast of the high precision
range back to the original precision would yield an identical non-varying
range. and for non-multiples/unsupported values we'd get varying or something
not the same as the original value?.  This would only work if the original
value doesn't come out varying.    Although if its varying, maybe you dont care
and a match is ok anyway?    We could have may_overflow_p also return the
higher precision range for inspection if its true...

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (27 preceding siblings ...)
  2021-05-26 13:17 ` amacleod at redhat dot com
@ 2021-05-26 13:23 ` rguenther at suse dot de
  2021-05-26 13:30 ` aldyh at redhat dot com
                   ` (16 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenther at suse dot de @ 2021-05-26 13:23 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #29 from rguenther at suse dot de <rguenther at suse dot de> ---
On Wed, 26 May 2021, amacleod at redhat dot com wrote:

> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
> 
> --- Comment #28 from Andrew Macleod <amacleod at redhat dot com> ---
> (In reply to rguenther@suse.de from comment #27)
> > On Wed, 26 May 2021, aldyh at redhat dot com wrote:
> > 
> > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
> > > 
> > > --- Comment #26 from Aldy Hernandez <aldyh at redhat dot com> ---
> > > On Wed, May 26, 2021 at 10:34 AM rguenther at suse dot de
> > > <gcc-bugzilla@gcc.gnu.org> wrote:
> > > 
> > > > It's probably too strict for multiple_of_p which is fine with
> > > > overflows that preserve modulo behavior.
> > > 
> > > Could you provide an example?
> > 
> > Like with DECL_SIZE being D.1234 * 8 as unsigned multiplication
> > and the query whether it's a multiple of 8.  Once you have no
> > range for 'D.1234' you will signal overflow (correctly) but
> > even then it's still a mutliple of 8.
> 
> Determining whether an arbitrary expression is a multiple of some number is not
> really something we can figure out via ranges. Well, that's not quite true. If
> we fully fleshed out the operations you care about, things like multiply or
> shift you could get some results. presumably things like multiply by 2,4,8 and
> 16.. if we created correct multi-ranges for those, a cast of the high precision
> range back to the original precision would yield an identical non-varying
> range. and for non-multiples/unsupported values we'd get varying or something
> not the same as the original value?.  This would only work if the original
> value doesn't come out varying.    Although if its varying, maybe you dont care
> and a match is ok anyway?    We could have may_overflow_p also return the
> higher precision range for inspection if its true...

I guess optimally multiple_of_p would check for each individual
operation it looks at whether the op is transparent for the
modulo query and only if not checks whether there's possible overflow
(where ranges could help).

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (28 preceding siblings ...)
  2021-05-26 13:23 ` rguenther at suse dot de
@ 2021-05-26 13:30 ` aldyh at redhat dot com
  2021-07-22  8:09 ` [Bug tree-optimization/100499] [9/10/11/12 Regression] " rguenth at gcc dot gnu.org
                   ` (15 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: aldyh at redhat dot com @ 2021-05-26 13:30 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #30 from Aldy Hernandez <aldyh at redhat dot com> ---
On 5/26/21 3:23 PM, rguenther at suse dot de wrote:
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
> 
> --- Comment #29 from rguenther at suse dot de <rguenther at suse dot de> ---
> On Wed, 26 May 2021, amacleod at redhat dot com wrote:
> 
>> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
>>
>> --- Comment #28 from Andrew Macleod <amacleod at redhat dot com> ---
>> (In reply to rguenther@suse.de from comment #27)
>>> On Wed, 26 May 2021, aldyh at redhat dot com wrote:
>>>
>>>> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
>>>>
>>>> --- Comment #26 from Aldy Hernandez <aldyh at redhat dot com> ---
>>>> On Wed, May 26, 2021 at 10:34 AM rguenther at suse dot de
>>>> <gcc-bugzilla@gcc.gnu.org> wrote:
>>>>
>>>>> It's probably too strict for multiple_of_p which is fine with
>>>>> overflows that preserve modulo behavior.
>>>>
>>>> Could you provide an example?
>>>
>>> Like with DECL_SIZE being D.1234 * 8 as unsigned multiplication
>>> and the query whether it's a multiple of 8.  Once you have no
>>> range for 'D.1234' you will signal overflow (correctly) but
>>> even then it's still a mutliple of 8.
>>
>> Determining whether an arbitrary expression is a multiple of some number is not
>> really something we can figure out via ranges. Well, that's not quite true. If
>> we fully fleshed out the operations you care about, things like multiply or
>> shift you could get some results. presumably things like multiply by 2,4,8 and
>> 16.. if we created correct multi-ranges for those, a cast of the high precision
>> range back to the original precision would yield an identical non-varying
>> range. and for non-multiples/unsupported values we'd get varying or something
>> not the same as the original value?.  This would only work if the original
>> value doesn't come out varying.    Although if its varying, maybe you dont care
>> and a match is ok anyway?    We could have may_overflow_p also return the
>> higher precision range for inspection if its true...
> 
> I guess optimally multiple_of_p would check for each individual
> operation it looks at whether the op is transparent for the
> modulo query and only if not checks whether there's possible overflow
> (where ranges could help).

My POC gimple_ranger_wider class can just be overloaded and in 
range_of_expr, you could just hijack the operations you care about 
(PLUS_EXPR, MULTIPLY_EXPR, etc) and treat them specially.  Just a thought...

Aldy

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (29 preceding siblings ...)
  2021-05-26 13:30 ` aldyh at redhat dot com
@ 2021-07-22  8:09 ` rguenth at gcc dot gnu.org
  2021-07-22  8:09 ` rguenth at gcc dot gnu.org
                   ` (14 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-07-22  8:09 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|---                         |9.5
            Summary|Different results with      |[9/10/11/12 Regression]
                   |-fpeel-loops                |Different results with
                   |-ftree-loop-vectorize       |-fpeel-loops
                   |options                     |-ftree-loop-vectorize
                   |                            |options
           Priority|P3                          |P2

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (30 preceding siblings ...)
  2021-07-22  8:09 ` [Bug tree-optimization/100499] [9/10/11/12 Regression] " rguenth at gcc dot gnu.org
@ 2021-07-22  8:09 ` rguenth at gcc dot gnu.org
  2021-07-22  8:11 ` rguenth at gcc dot gnu.org
                   ` (13 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-07-22  8:09 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |qrzhang at gatech dot edu

--- Comment #31 from Richard Biener <rguenth at gcc dot gnu.org> ---
*** Bug 101501 has been marked as a duplicate of this bug. ***

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (31 preceding siblings ...)
  2021-07-22  8:09 ` rguenth at gcc dot gnu.org
@ 2021-07-22  8:11 ` rguenth at gcc dot gnu.org
  2021-07-28  9:05 ` rguenth at gcc dot gnu.org
                   ` (12 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-07-22  8:11 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #32 from Richard Biener <rguenth at gcc dot gnu.org> ---
Testcase from PR101501

/* { dg-do run } */

unsigned char ag = 55;
unsigned i;
int main()
{
  unsigned char c;
  unsigned char a = ag;
d:
  c = a-- * 52;
  if (c)
    goto d;
  if (a != 255)
    __builtin_abort ();
  return 0;
}

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (32 preceding siblings ...)
  2021-07-22  8:11 ` rguenth at gcc dot gnu.org
@ 2021-07-28  9:05 ` rguenth at gcc dot gnu.org
  2021-07-29  7:56 ` ebotcazou at gcc dot gnu.org
                   ` (11 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2021-07-28  9:05 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |ebotcazou at gcc dot gnu.org

--- Comment #33 from Richard Biener <rguenth at gcc dot gnu.org> ---
Issues with multiple_of_p:

    case MINUS_EXPR:
      /* It is impossible to prove if op0 - op1 is multiple of bottom
         precisely, so be conservative here checking if both op0 and op1
         are multiple of bottom.  Note we check the second operand first
         since it's usually simpler.  */
      return (multiple_of_p (type, TREE_OPERAND (top, 1), bottom)
              && multiple_of_p (type, TREE_OPERAND (top, 0), bottom));

wrong for unsigned type and 0 - 3 with bottom == 3, -3u is not a multiple of 3.

    case PLUS_EXPR:
      /* The same as MINUS_EXPR, but handle cases like op0 + 0xfffffffd
         as op0 - 3 if the expression has unsigned type.  For example,
         (X / 3) + 0xfffffffd is multiple of 3, but 0xfffffffd is not.  */
      op1 = TREE_OPERAND (top, 1);
      if (TYPE_UNSIGNED (type)
          && TREE_CODE (op1) == INTEGER_CST && tree_int_cst_sign_bit (op1))
        op1 = fold_build1 (NEGATE_EXPR, type, op1);
      return (multiple_of_p (type, op1, bottom)
              && multiple_of_p (type, TREE_OPERAND (top, 0), bottom));

it probably means (X * 3) + 0xfffffffd but then for X == 0 this is wrong
again?

Of course it might hint at that multiple_of_p wants to interpret everything
as signed, thus compute whether (signed)top % (signed)bottom is zero and
maybe this is why we pass in a custom type rather than using the type of
'top'?

But then we have

    case INTEGER_CST:
      if (TREE_CODE (bottom) != INTEGER_CST
          || integer_zerop (bottom)
          || (TYPE_UNSIGNED (type)
              && (tree_int_cst_sgn (top) < 0
                  || tree_int_cst_sgn (bottom) < 0)))
        return 0;
      return wi::multiple_of_p (wi::to_widest (top), wi::to_widest (bottom),
                                SIGNED);

that doesn't agree with such interpretation.

Doing such re-interpretation might help avoid regressions for queries
on {TYPE,DECL}_SIZE top which are generally unsigned but any symbolic
computation is not expected to wrap.  We'd then pass in ssizetype
but make sure to adjust that accordingly when facing non-sign-changing
conversion?

fold_binary does

      /* Fold (X * Y) & -(1 << CST) to X * Y if Y is a constant
         multiple of 1 << CST.  */
      if (TREE_CODE (arg1) == INTEGER_CST)
        {
          wi::tree_to_wide_ref cst1 = wi::to_wide (arg1);
          wide_int ncst1 = -cst1;
          if ((cst1 & ncst1) == ncst1
              && multiple_of_p (type, arg0,
                                wide_int_to_tree (TREE_TYPE (arg1), ncst1)))
            return fold_convert_loc (loc, type, arg0);
        }

which passes in the original type of the AND but then arg0 has
nop-conversions stripped which means it can have different sign.

      /* If arg0 is a multiple of arg1, then rewrite to the fastest div
         operation, EXACT_DIV_EXPR.

         Note that only CEIL_DIV_EXPR and FLOOR_DIV_EXPR are rewritten now.
         At one time others generated faster code, it's not clear if they do
         after the last round to changes to the DIV code in expmed.c.  */
      if ((code == CEIL_DIV_EXPR || code == FLOOR_DIV_EXPR)
          && multiple_of_p (type, arg0, arg1))
        return fold_build2_loc (loc, EXACT_DIV_EXPR, type,
                                fold_convert (type, arg0),
                                fold_convert (type, arg1));

happily does the wrong thing for (unsigned * 3) / 3.  The round_{up,down}_loc
helpers have similar correctness issues.

The user I'm most worried about is stor-layout.c:place_field which
uses multiple_of_p to optimize DECL_OFFSET_ALIGN, where losing that
might have severe impact on Ada.

For each individual operation - apart from checking TYPE_OVERFLOW_UNDEFINED -
we can compute a value-range for its operands and then determine whether the
operation itself can overflow.  With ranger that should eventually be
caching, but I'm not sure about the GENERIC part where this operation would
be quadratic when we recurse down the tree.  In general that wouldn't help
the stor-layout.c case since it's looking at a series of unsigned computes
in the cases we care about.  It looks like for the purpose of sizes
we'd want sth like a EXACT_MULT_EXPR (denoting no overflow), likewise a
EXACT_PLUS_EXPR.  OTOH I remember offsets can be effectively negative in
Ada but they are still unsigned?

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (33 preceding siblings ...)
  2021-07-28  9:05 ` rguenth at gcc dot gnu.org
@ 2021-07-29  7:56 ` ebotcazou at gcc dot gnu.org
  2022-01-24 13:16 ` rguenth at gcc dot gnu.org
                   ` (10 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: ebotcazou at gcc dot gnu.org @ 2021-07-29  7:56 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #34 from Eric Botcazou <ebotcazou at gcc dot gnu.org> ---
> The user I'm most worried about is stor-layout.c:place_field which
> uses multiple_of_p to optimize DECL_OFFSET_ALIGN, where losing that
> might have severe impact on Ada.

Indeed, this might measurably impact dynamic offsets, which are very common in
Ada (but they are never negative in record/union/qualified union types).

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (34 preceding siblings ...)
  2021-07-29  7:56 ` ebotcazou at gcc dot gnu.org
@ 2022-01-24 13:16 ` rguenth at gcc dot gnu.org
  2022-01-24 13:16 ` rguenth at gcc dot gnu.org
                   ` (9 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2022-01-24 13:16 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #35 from Richard Biener <rguenth at gcc dot gnu.org> ---
Testcase from this PR, for the testsuite:

/* { dg-do run } */

typedef __UINT16_TYPE__ uint16_t;
typedef __INT32_TYPE__ int32_t;
static uint16_t g_2823 = 0xEC75L;
static uint16_t g_116 = 0xBC07L;

static uint16_t
safe_mul_func_uint16_t_u_u(uint16_t ui1, uint16_t ui2)
{
  return ((unsigned int)ui1) * ((unsigned int)ui2);
}

int main ()
{
  uint16_t l_2815 = 0xffff;
  uint16_t *l_2821 = &g_116;
  uint16_t *l_2822 = &g_2823;

lbl_2826:
  l_2815 &= 0x1eae;
  if (safe_mul_func_uint16_t_u_u(((*l_2821) = l_2815), (--(*l_2822))))
    goto lbl_2826;
  if (g_2823 != 32768)
    __builtin_abort ();
  return 0;
}

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (35 preceding siblings ...)
  2022-01-24 13:16 ` rguenth at gcc dot gnu.org
@ 2022-01-24 13:16 ` rguenth at gcc dot gnu.org
  2022-01-24 13:17 ` rguenth at gcc dot gnu.org
                   ` (8 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2022-01-24 13:16 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |stefansf at linux dot ibm.com

--- Comment #36 from Richard Biener <rguenth at gcc dot gnu.org> ---
*** Bug 103063 has been marked as a duplicate of this bug. ***

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (36 preceding siblings ...)
  2022-01-24 13:16 ` rguenth at gcc dot gnu.org
@ 2022-01-24 13:17 ` rguenth at gcc dot gnu.org
  2022-01-31  8:08 ` cvs-commit at gcc dot gnu.org
                   ` (7 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2022-01-24 13:17 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Assignee|amker at gcc dot gnu.org           |rguenth at gcc dot gnu.org
             Status|NEW                         |ASSIGNED

--- Comment #37 from Richard Biener <rguenth at gcc dot gnu.org> ---
>From PR103063:

/* { dg-do run } */

int a = 0;
unsigned char b = 0;

int main() {
  a - 6;
  for (; a >= -13; a = a - 8)
    while((unsigned char)(b-- * 6))
      ;
  if (b != 127)
    __builtin_abort();
  return 0;
}

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (37 preceding siblings ...)
  2022-01-24 13:17 ` rguenth at gcc dot gnu.org
@ 2022-01-31  8:08 ` cvs-commit at gcc dot gnu.org
  2022-01-31  8:38 ` cvs-commit at gcc dot gnu.org
                   ` (6 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2022-01-31  8:08 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #38 from CVS Commits <cvs-commit at gcc dot gnu.org> ---
The master branch has been updated by Richard Biener <rguenth@gcc.gnu.org>:

https://gcc.gnu.org/g:3c7067cc92281dcbfeaf64116d8416f2092b4c10

commit r12-6936-g3c7067cc92281dcbfeaf64116d8416f2092b4c10
Author: Richard Biener <rguenther@suse.de>
Date:   Mon Jan 24 14:49:20 2022 +0100

    Reduce multiple_of_p uses

    There are a few cases where we know we're dealing with (poly-)integer
    constants, so remove the use of multiple_of_p in those cases to make
    the PR100499 fix less impactful.

    2022-01-24  Richard Biener  <rguenther@suse.de>

            PR tree-optimization/100499
            * tree-cfg.cc (verify_gimple_assign_ternary): Use multiple_p
            on poly-ints instead of multiple_of_p.
            * tree-ssa.cc (maybe_rewrite_mem_ref_base): Likewise.
            (non_rewritable_mem_ref_base): Likewise.
            (non_rewritable_lvalue_p): Likewise.
            (execute_update_addresses_taken): Likewise.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (38 preceding siblings ...)
  2022-01-31  8:08 ` cvs-commit at gcc dot gnu.org
@ 2022-01-31  8:38 ` cvs-commit at gcc dot gnu.org
  2022-02-04 13:39 ` cvs-commit at gcc dot gnu.org
                   ` (5 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2022-01-31  8:38 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #39 from CVS Commits <cvs-commit at gcc dot gnu.org> ---
The master branch has been updated by Richard Biener <rguenth@gcc.gnu.org>:

https://gcc.gnu.org/g:625f16c798757dcbfdded841f01d7c566d15c55c

commit r12-6939-g625f16c798757dcbfdded841f01d7c566d15c55c
Author: Richard Biener <rguenther@suse.de>
Date:   Mon Jan 24 14:59:00 2022 +0100

    Fix multiple_of_p behavior with NOP_EXPR

    We were passing down the original type to recursive invocations
    of multiple_of_p for say (int)(unsigned * unsigned).

    2022-01-24  Richard Biener  <rguenther@suse.de>

            PR tree-optimization/100499
            * fold-const.cc (multiple_of_p): Pass the correct type of
            the expression to the recursive invocation of multiple_of_p
            for conversions and use CASE_CONVERT.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11/12 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (39 preceding siblings ...)
  2022-01-31  8:38 ` cvs-commit at gcc dot gnu.org
@ 2022-02-04 13:39 ` cvs-commit at gcc dot gnu.org
  2022-02-04 13:40 ` [Bug tree-optimization/100499] [9/10/11 " rguenth at gcc dot gnu.org
                   ` (4 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2022-02-04 13:39 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

--- Comment #40 from CVS Commits <cvs-commit at gcc dot gnu.org> ---
The master branch has been updated by Richard Biener <rguenth@gcc.gnu.org>:

https://gcc.gnu.org/g:0898049ad9bf6c46e510b18aaafca4946802749f

commit r12-7052-g0898049ad9bf6c46e510b18aaafca4946802749f
Author: Richard Biener <rguenther@suse.de>
Date:   Wed Jan 26 09:35:57 2022 +0100

    tree-optimization/100499 - niter analysis and multiple_of_p

    niter analysis uses multiple_of_p which currently assumes
    operations like MULT_EXPR do not wrap.  We've got to rely on this
    for optimizing size expressions like those in DECL_SIZE and those
    generally use unsigned arithmetic with no indication that they
    are not expected to wrap.  To preserve that the following adds
    a parameter to multiple_of_p, defaulted to true, indicating that
    the TOP expression is not expected to wrap for outer computations
    in TYPE.  This mostly follows a patch proposed by Bin last year
    with the conversion behavior added.

    Applying to all users the new effect is that upon type conversions
    in the TOP expression the behavior will switch to honor
    TYPE_OVERFLOW_UNDEFINED for the converted sub-expressions.

    The patch also changes the occurance in niter analysis that we
    know is problematic and we have testcases for to pass false
    to multiple_of_p.  The patch also contains a change to the
    PR72817 fix from Bin to avoid regressing gcc.dg/tree-ssa/loop-42.c.

    The intent for stage1 is to introduce a size_multiple_of_p and
    internalize the added parameter so all multiple_of_p users will
    honor TYPE_OVERFLOW_UNDEFINED and users dealing with size expressions
    need to be switched to size_multiple_of_p.

    2022-01-26  Richard Biener  <rguenther@suse.de>

            PR tree-optimization/100499
            * fold-const.h (multiple_of_p): Add nowrap parameter, defaulted
            to true.
            * fold-const.cc (multiple_of_p): Likewise.  Honor it for
            MULT_EXPR, PLUS_EXPR and MINUS_EXPR and pass it along,
            switching to false for conversions.
            * tree-ssa-loop-niter.cc (number_of_iterations_ne): Do not
            claim the outermost expression does not wrap when calling
            multiple_of_p.  Refactor the check done to check the
            original IV, avoiding a bias that might wrap.

            * gcc.dg/torture/pr100499-1.c: New testcase.
            * gcc.dg/torture/pr100499-2.c: Likewise.
            * gcc.dg/torture/pr100499-3.c: Likewise.

    Co-authored-by: Bin Cheng  <bin.cheng@linux.alibaba.com>

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (40 preceding siblings ...)
  2022-02-04 13:39 ` cvs-commit at gcc dot gnu.org
@ 2022-02-04 13:40 ` rguenth at gcc dot gnu.org
  2022-02-15 12:23 ` rguenth at gcc dot gnu.org
                   ` (3 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2022-02-04 13:40 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|[9/10/11/12 Regression]     |[9/10/11 Regression]
                   |Different results with      |Different results with
                   |-fpeel-loops                |-fpeel-loops
                   |-ftree-loop-vectorize       |-ftree-loop-vectorize
                   |options                     |options
      Known to work|                            |12.0

--- Comment #41 from Richard Biener <rguenth at gcc dot gnu.org> ---
This series, also g:24c72fb0eedfe7c67f9b15aa31b474a999cc4723, fixes the known
cases of miscompiles.

Thanks for reporting!

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [9/10/11 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (41 preceding siblings ...)
  2022-02-04 13:40 ` [Bug tree-optimization/100499] [9/10/11 " rguenth at gcc dot gnu.org
@ 2022-02-15 12:23 ` rguenth at gcc dot gnu.org
  2022-05-27  9:45 ` [Bug tree-optimization/100499] [10/11 " rguenth at gcc dot gnu.org
                   ` (2 subsequent siblings)
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2022-02-15 12:23 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499
Bug 100499 depends on bug 104519, which changed state.

Bug 104519 Summary: [12 Regression] wrong code at -Os on x86_64-linux-gnu and char as induction variable
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104519

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |RESOLVED
         Resolution|---                         |FIXED

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [10/11 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (42 preceding siblings ...)
  2022-02-15 12:23 ` rguenth at gcc dot gnu.org
@ 2022-05-27  9:45 ` rguenth at gcc dot gnu.org
  2022-06-28 10:44 ` jakub at gcc dot gnu.org
  2023-07-07 10:39 ` [Bug tree-optimization/100499] [11 " rguenth at gcc dot gnu.org
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2022-05-27  9:45 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|9.5                         |10.4

--- Comment #42 from Richard Biener <rguenth at gcc dot gnu.org> ---
GCC 9 branch is being closed

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [10/11 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (43 preceding siblings ...)
  2022-05-27  9:45 ` [Bug tree-optimization/100499] [10/11 " rguenth at gcc dot gnu.org
@ 2022-06-28 10:44 ` jakub at gcc dot gnu.org
  2023-07-07 10:39 ` [Bug tree-optimization/100499] [11 " rguenth at gcc dot gnu.org
  45 siblings, 0 replies; 47+ messages in thread
From: jakub at gcc dot gnu.org @ 2022-06-28 10:44 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Jakub Jelinek <jakub at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|10.4                        |10.5

--- Comment #43 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
GCC 10.4 is being released, retargeting bugs to GCC 10.5.

^ permalink raw reply	[flat|nested] 47+ messages in thread

* [Bug tree-optimization/100499] [11 Regression] Different results with -fpeel-loops -ftree-loop-vectorize options
  2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
                   ` (44 preceding siblings ...)
  2022-06-28 10:44 ` jakub at gcc dot gnu.org
@ 2023-07-07 10:39 ` rguenth at gcc dot gnu.org
  45 siblings, 0 replies; 47+ messages in thread
From: rguenth at gcc dot gnu.org @ 2023-07-07 10:39 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100499

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|10.5                        |11.5

--- Comment #44 from Richard Biener <rguenth at gcc dot gnu.org> ---
GCC 10 branch is being closed.

^ permalink raw reply	[flat|nested] 47+ messages in thread

end of thread, other threads:[~2023-07-07 10:39 UTC | newest]

Thread overview: 47+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-05-10 10:20 [Bug tree-optimization/100499] New: Different results with -fpeel-loops -ftree-loop-vectorize options dongjianqiang2 at huawei dot com
2021-05-10 11:13 ` [Bug tree-optimization/100499] " rguenth at gcc dot gnu.org
2021-05-11  9:06 ` marxin at gcc dot gnu.org
2021-05-11  9:07 ` marxin at gcc dot gnu.org
2021-05-11  9:09 ` marxin at gcc dot gnu.org
2021-05-11  9:12 ` dongjianqiang2 at huawei dot com
2021-05-11  9:42 ` rguenth at gcc dot gnu.org
2021-05-11 10:40 ` amker at gcc dot gnu.org
2021-05-11 11:33 ` marxin at gcc dot gnu.org
2021-05-17  2:17 ` amker at gcc dot gnu.org
2021-05-18  9:49 ` rguenth at gcc dot gnu.org
2021-05-18  9:57 ` rguenth at gcc dot gnu.org
2021-05-18 10:25 ` rguenth at gcc dot gnu.org
2021-05-18 11:33 ` amker at gcc dot gnu.org
2021-05-18 11:36 ` amker at gcc dot gnu.org
2021-05-18 11:54 ` rguenther at suse dot de
2021-05-18 15:46 ` amacleod at redhat dot com
2021-05-19  6:31 ` rguenth at gcc dot gnu.org
2021-05-19  7:10 ` amker at gcc dot gnu.org
2021-05-19  7:15 ` amker at gcc dot gnu.org
2021-05-19  8:24 ` aldyh at redhat dot com
2021-05-19 18:11 ` amacleod at redhat dot com
2021-05-22 15:37 ` law at gcc dot gnu.org
2021-05-26  8:14 ` aldyh at gcc dot gnu.org
2021-05-26  8:21 ` aldyh at gcc dot gnu.org
2021-05-26  8:34 ` rguenther at suse dot de
2021-05-26  8:40 ` aldyh at redhat dot com
2021-05-26  8:59 ` rguenther at suse dot de
2021-05-26 13:17 ` amacleod at redhat dot com
2021-05-26 13:23 ` rguenther at suse dot de
2021-05-26 13:30 ` aldyh at redhat dot com
2021-07-22  8:09 ` [Bug tree-optimization/100499] [9/10/11/12 Regression] " rguenth at gcc dot gnu.org
2021-07-22  8:09 ` rguenth at gcc dot gnu.org
2021-07-22  8:11 ` rguenth at gcc dot gnu.org
2021-07-28  9:05 ` rguenth at gcc dot gnu.org
2021-07-29  7:56 ` ebotcazou at gcc dot gnu.org
2022-01-24 13:16 ` rguenth at gcc dot gnu.org
2022-01-24 13:16 ` rguenth at gcc dot gnu.org
2022-01-24 13:17 ` rguenth at gcc dot gnu.org
2022-01-31  8:08 ` cvs-commit at gcc dot gnu.org
2022-01-31  8:38 ` cvs-commit at gcc dot gnu.org
2022-02-04 13:39 ` cvs-commit at gcc dot gnu.org
2022-02-04 13:40 ` [Bug tree-optimization/100499] [9/10/11 " rguenth at gcc dot gnu.org
2022-02-15 12:23 ` rguenth at gcc dot gnu.org
2022-05-27  9:45 ` [Bug tree-optimization/100499] [10/11 " rguenth at gcc dot gnu.org
2022-06-28 10:44 ` jakub at gcc dot gnu.org
2023-07-07 10:39 ` [Bug tree-optimization/100499] [11 " rguenth at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).