[Bug rtl-optimization/48580] New: missed optimization: integer overflow checks

public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed

* [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks
@ 2011-04-12 18:36 zackw at panix dot com
  2011-04-12 20:18 ` [Bug rtl-optimization/48580] " joseph at codesourcery dot com
                   ` (24 more replies)
  0 siblings, 25 replies; 26+ messages in thread
From: zackw at panix dot com @ 2011-04-12 18:36 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

           Summary: missed optimization: integer overflow checks
           Product: gcc
           Version: 4.6.0
            Status: UNCONFIRMED
          Severity: enhancement
          Priority: P3
         Component: rtl-optimization
        AssignedTo: unassigned@gcc.gnu.org
        ReportedBy: zackw@panix.com


To the best of my knowledge, this is the only safe way (without -fwrapv) to
check whether the product of two signed integers overflowed:

bool product_does_not_overflow(signed x, signed y)
{
  unsigned tmp = x * unsigned(y);

  return signed(tmp) > 0 && tmp / x == unsigned(y);
}

(I believe C and C++ are the same in this regard but I could be wrong.  If
there is a better way to write this test I would love to know about it.)

g++ 4.6 produces this assembly dump on x86-64:

_Z25product_does_not_overflowii:
    movl    %esi, %edx
    xorl    %eax, %eax
    imull    %edi, %edx
    testl    %edx, %edx
    jle    .L2
    movl    %edx, %eax
    xorl    %edx, %edx
    divl    %edi
    cmpl    %eax, %esi
    sete    %al
.L2:
    rep
    ret

but, if I understand the semantics of IMUL correctly, it could do this instead:

_Z25product_does_not_overflowii:
    xorl    %eax, %eax
    imull    %edi, %esi
    setno    %al
    ret

which is a pretty substantial micro-win, particularly in getting rid of a
divide.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug rtl-optimization/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
@ 2011-04-12 20:18 ` joseph at codesourcery dot com
  2011-04-12 20:40 ` zackw at panix dot com
                   ` (23 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: joseph at codesourcery dot com @ 2011-04-12 20:18 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #1 from joseph at codesourcery dot com <joseph at codesourcery dot com> 2011-04-12 20:18:13 UTC ---
On Tue, 12 Apr 2011, zackw at panix dot com wrote:

> To the best of my knowledge, this is the only safe way (without -fwrapv) to
> check whether the product of two signed integers overflowed:
> 
> bool product_does_not_overflow(signed x, signed y)
> {
>   unsigned tmp = x * unsigned(y);
> 
>   return signed(tmp) > 0 && tmp / x == unsigned(y);
> }

Two signed integers given that they are known to be positive, anyway.  
This may return unexpected results if either or both arguments are 
negative or zero.

I sort of think GCC should have built-in functions exposing C and C++ 
interfaces for: each basic arithmetic operation, defined to wrap; each 
basic arithmetic operation, defined to saturate; each basic arithmetic 
operation, defined to have undefined overflow; each basic arithmetic 
operation, with a separate overflow flag being set; each basic arithmetic 
operation, defined to trap on overflow.  All of these for both signed and 
unsigned and for any desired number of bits (up to the maximum number 
supported for arithmetic, so generally 1-64 bits on 32-bit configurations 
and 1-128 bits on 64-bit configurations); except for the defined-to-trap 
case, all would still have undefined behavior on division by 0.  You could 
then have optimizations mapping generic C idioms to such built-in 
operations where the target supports efficient code for the operations.  
But this rather relies on the no-undefined-overflow work being finished 
first so that some of the required operations actually exist inside GCC, 
before they can easily be exposed to the user.

> which is a pretty substantial micro-win, particularly in getting rid of a
> divide.

(If the function gets called with one constant operand, you can make it 
inline and use __builtin_constant_p to replace a divide with a range check 
on the other operand.  That's only useful for some cases of overflow 
checks, of course.)

^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug rtl-optimization/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
  2011-04-12 20:18 ` [Bug rtl-optimization/48580] " joseph at codesourcery dot com
@ 2011-04-12 20:40 ` zackw at panix dot com
  2011-04-12 20:52 ` joseph at codesourcery dot com
                   ` (22 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: zackw at panix dot com @ 2011-04-12 20:40 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #2 from Zack Weinberg <zackw at panix dot com> 2011-04-12 20:40:41 UTC ---
(In reply to comment #1)
> 
> Two signed integers given that they are known to be positive, anyway.  
> This may return unexpected results if either or both arguments are 
> negative or zero.
...
> (If the function gets called with one constant operand, you can make it 
> inline and use __builtin_constant_p to replace a divide with a range check 
> on the other operand.  That's only useful for some cases of overflow 
> checks, of course.)

In the code that this is cut down from, both arguments are known to be strictly
positive, but neither is constant.  (They're only signed for historical
reasons, I think, but it would be a huge amount of work to change that.)

> I sort of think GCC should have built-in functions exposing C and C++ 
> interfaces for: each basic arithmetic operation, defined to wrap; each 
> basic arithmetic operation, defined to saturate; each basic arithmetic 
> operation, defined to have undefined overflow; each basic arithmetic 
> operation, with a separate overflow flag being set; each basic arithmetic 
> operation, defined to trap on overflow.  All of these for both signed and 
> unsigned and for any desired number of bits (up to the maximum number 
> supported for arithmetic, so generally 1-64 bits on 32-bit configurations 
> and 1-128 bits on 64-bit configurations); except for the defined-to-trap 
> case, all would still have undefined behavior on division by 0.  You could 
> then have optimizations mapping generic C idioms to such built-in 
> operations where the target supports efficient code for the operations.  
> But this rather relies on the no-undefined-overflow work being finished 
> first so that some of the required operations actually exist inside GCC, 
> before they can easily be exposed to the user.

So you see this as more of a tree-level than an RTL-level missed optimization,
then?  Your plan sounds fine to me, although I might look for a less ambitious
but more likely to get done soon approach, personally.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug rtl-optimization/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
  2011-04-12 20:18 ` [Bug rtl-optimization/48580] " joseph at codesourcery dot com
  2011-04-12 20:40 ` zackw at panix dot com
@ 2011-04-12 20:52 ` joseph at codesourcery dot com
  2011-04-12 21:03 ` zackw at panix dot com
                   ` (21 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: joseph at codesourcery dot com @ 2011-04-12 20:52 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #3 from joseph at codesourcery dot com <joseph at codesourcery dot com> 2011-04-12 20:52:48 UTC ---
On Tue, 12 Apr 2011, zackw at panix dot com wrote:

> http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580
> 
> --- Comment #2 from Zack Weinberg <zackw at panix dot com> 2011-04-12 20:40:41 UTC ---
> (In reply to comment #1)
> > 
> > Two signed integers given that they are known to be positive, anyway.  
> > This may return unexpected results if either or both arguments are 
> > negative or zero.
> ...
> > (If the function gets called with one constant operand, you can make it 
> > inline and use __builtin_constant_p to replace a divide with a range check 
> > on the other operand.  That's only useful for some cases of overflow 
> > checks, of course.)
> 
> In the code that this is cut down from, both arguments are known to be strictly
> positive, but neither is constant.  (They're only signed for historical
> reasons, I think, but it would be a huge amount of work to change that.)

My point in noting the need for the integers to be positive was really 
that unless the compiler knows they are positive, the transformation 
you're asking for appears to be incorrect - the semantics of your function 
are that a product with either term 0 counts as overflowing, but using a 
processor overflow flag would report it as not overflowing.

> So you see this as more of a tree-level than an RTL-level missed optimization,
> then?  Your plan sounds fine to me, although I might look for a less ambitious
> but more likely to get done soon approach, personally.

I think it's both levels, but probably tree-level for spotting the 
patterns unless it seems likely that expansion or RTL optimizations will 
produce more instances of them.  And tree-level is where we have VRP 
information, if you want the compiler to tell that the arguments must be 
strictly positive that way.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug rtl-optimization/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (2 preceding siblings ...)
  2011-04-12 20:52 ` joseph at codesourcery dot com
@ 2011-04-12 21:03 ` zackw at panix dot com
  2011-04-12 21:04 ` zackw at panix dot com
                   ` (20 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: zackw at panix dot com @ 2011-04-12 21:03 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #4 from Zack Weinberg <zackw at panix dot com> 2011-04-12 21:03:01 UTC ---
On Tue, Apr 12, 2011 at 1:52 PM, joseph at codesourcery dot com
<gcc-bugzilla@gcc.gnu.org> wrote:
>> In the code that this is cut down from, both arguments are known to be strictly
>> positive, but neither is constant.  (They're only signed for historical
>> reasons, I think, but it would be a huge amount of work to change that.)
>
> My point in noting the need for the integers to be positive was really
> that unless the compiler knows they are positive, the transformation
> you're asking for appears to be incorrect - the semantics of your function
> are that a product with either term 0 counts as overflowing, but using a
> processor overflow flag would report it as not overflowing.

Well, if the compiler didn't know that, it could still use the
overflow flag plus an extra test for either input operand being zero,
couldn't it?  The C idiom has to test for a zero result, because e.g.
0x4000_0000U * 16 wraps to zero.

(The original code does in fact check for x or y  <= 0 in a place
where VRP would notice; I should have said that instead of "known to
be strictly positive", sorry for any confusion.)


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug rtl-optimization/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (3 preceding siblings ...)
  2011-04-12 21:03 ` zackw at panix dot com
@ 2011-04-12 21:04 ` zackw at panix dot com
  2011-04-12 21:10 ` joseph at codesourcery dot com
                   ` (19 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: zackw at panix dot com @ 2011-04-12 21:04 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #5 from Zack Weinberg <zackw at panix dot com> 2011-04-12 21:04:34 UTC ---
Addendum: what would *you* describe as the correct C idiom for
ensuring that the product of two signed integers was positive and did
not overflow the range of a same-sized signed integer, assuming
nothing about either multiplicand?


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug rtl-optimization/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (4 preceding siblings ...)
  2011-04-12 21:04 ` zackw at panix dot com
@ 2011-04-12 21:10 ` joseph at codesourcery dot com
  2011-04-12 21:16 ` joseph at codesourcery dot com
                   ` (18 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: joseph at codesourcery dot com @ 2011-04-12 21:10 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #6 from joseph at codesourcery dot com <joseph at codesourcery dot com> 2011-04-12 21:09:53 UTC ---
On Tue, 12 Apr 2011, zackw at panix dot com wrote:

> http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580
> 
> --- Comment #4 from Zack Weinberg <zackw at panix dot com> 2011-04-12 21:03:01 UTC ---
> On Tue, Apr 12, 2011 at 1:52 PM, joseph at codesourcery dot com
> <gcc-bugzilla@gcc.gnu.org> wrote:
> >> In the code that this is cut down from, both arguments are known to be strictly
> >> positive, but neither is constant.  (They're only signed for historical
> >> reasons, I think, but it would be a huge amount of work to change that.)
> >
> > My point in noting the need for the integers to be positive was really
> > that unless the compiler knows they are positive, the transformation
> > you're asking for appears to be incorrect - the semantics of your function
> > are that a product with either term 0 counts as overflowing, but using a
> > processor overflow flag would report it as not overflowing.
> 
> Well, if the compiler didn't know that, it could still use the
> overflow flag plus an extra test for either input operand being zero,
> couldn't it?  The C idiom has to test for a zero result, because e.g.
> 0x4000_0000U * 16 wraps to zero.

Yes (a check for them being <= 0, that is; I think that function will 
report any case with negative operands as overflowing as well).


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug rtl-optimization/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (5 preceding siblings ...)
  2011-04-12 21:10 ` joseph at codesourcery dot com
@ 2011-04-12 21:16 ` joseph at codesourcery dot com
  2011-04-13 12:11 ` [Bug middle-end/48580] " rguenth at gcc dot gnu.org
                   ` (17 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: joseph at codesourcery dot com @ 2011-04-12 21:16 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #7 from joseph at codesourcery dot com <joseph at codesourcery dot com> 2011-04-12 21:16:41 UTC ---
On Tue, 12 Apr 2011, zackw at panix dot com wrote:

> Addendum: what would *you* describe as the correct C idiom for
> ensuring that the product of two signed integers was positive and did
> not overflow the range of a same-sized signed integer, assuming
> nothing about either multiplicand?

I think that's bordering complicated enough that any two people are likely 
to produce different enough code that it may not be worth detecting - the 
cases where generic C becomes complicated like that are the ones providing 
the best case for built-in operations where you can just say "multiply two 
(signed) values, check whether the result fits in 31-bit unsigned and set 
an overflow flag accordingly".


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (6 preceding siblings ...)
  2011-04-12 21:16 ` joseph at codesourcery dot com
@ 2011-04-13 12:11 ` rguenth at gcc dot gnu.org
  2011-04-13 12:46 ` joseph at codesourcery dot com
                   ` (16 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: rguenth at gcc dot gnu.org @ 2011-04-13 12:11 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

Richard Guenther <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2011.04.13 12:11:47
                 CC|                            |rguenth at gcc dot gnu.org
          Component|rtl-optimization            |middle-end
     Ever Confirmed|0                           |1

--- Comment #8 from Richard Guenther <rguenth at gcc dot gnu.org> 2011-04-13 12:11:47 UTC ---
I too would see it as two pieces, pattern matching on trees to produce
a canonical builtin call and RTL expansion of that for optimal target
code generation (where I don't know whether we can do better than using
UNSPECs).  Note that usually you also want to use the result of the
multiplication (if it didn't overflow), and using just a single
multiplication might be even more complicated (if we need to go the
UNSPEC way).

For the latter reasons I think the builtins should be sth like
__builtin_smul_ovfl_p (multiplication-result, op0, op1), thus pass
in the multiplication result and keep the multiplication itself
in the IL to also allow for regular optimizations to work on them.
If the multiplication is just used as the builtin call argument
expansion can get rid of it.

The set of builtins with defined behavior is still useful, if not
only to allow mixing of wrapv/trapv/... operations in C source.
But it isn't exactly the form I'd like the operations to reside
in the IL.

Pattern-matching the multiplication overflow code can be tricky
because of the various ways the handling of zero operands can
be implemented.  Implementing this entirely on the RTL side seems
very tricky to me.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (7 preceding siblings ...)
  2011-04-13 12:11 ` [Bug middle-end/48580] " rguenth at gcc dot gnu.org
@ 2011-04-13 12:46 ` joseph at codesourcery dot com
  2011-04-13 17:44 ` svfuerst at gmail dot com
                   ` (15 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: joseph at codesourcery dot com @ 2011-04-13 12:46 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #9 from joseph at codesourcery dot com <joseph at codesourcery dot com> 2011-04-13 12:45:35 UTC ---
On Wed, 13 Apr 2011, rguenth at gcc dot gnu.org wrote:

> For the latter reasons I think the builtins should be sth like
> __builtin_smul_ovfl_p (multiplication-result, op0, op1), thus pass
> in the multiplication result and keep the multiplication itself
> in the IL to also allow for regular optimizations to work on them.
> If the multiplication is just used as the builtin call argument
> expansion can get rid of it.

My inclination is that we probably want to separate the API for users from 
the built-in functions or other operations used internally - possibly 
providing a header gccarith.h with a supported API and saying the built-in 
functions are subject to change and should not be used directly in source 
code.  Maybe (inspired by the style used on C1X stdatomic.h) one could 
have operations like

arith_mul_signed (a, b, 32, ARITH_SAT)

that multiplies a and b (integers to infinite precision, producing an 
infinite precision result), saturates the result to 32-bit signed and 
returns an implementation-defined type (signed, two's complement, at least 
32 bits), or

arith_mul_signed_check (a, b, 32, ARITH_WRAP, &overflow_p)

that stores an overflow flag in the provided _Bool *.  Or the overflow 
handling (wrap, saturate, undefined, unspecified-value, trap) could be 
part of the macro/function name (putting it as a separate operand is 
inspired by the memory_order parameters in stdatomic.h).

The _check versions with explicit overflow flags could be used with any 
overflow handling other than "undefined behavior".  (So if you don't care 
about what the value is when the overflow flag is set - if you'll just 
call some error handler - then you'd use "unspecified-value" as the 
overflow handling.)  On divide by zero, both division and modulo 
operations would have undefined behavior except for the "trap" case - but 
modulo -1 would always be defined, unlike for C INT_MIN % -1 (I'm thinking 
of these operations as all being defined on their own by producing an 
infinite precision result first, but there might also be operations such 
as divmod producing multiple results).  It may also make sense to expose 
different variations of division/modulo (rounding to 0, -infinity or 
+infinity; a true modulo operation, rounding to -infinity, is one of those 
things C doesn't make easy; GCC has these as TRUNC_DIV_EXPR, 
FLOOR_DIV_EXPR, CEIL_DIV_EXPR and corresponding *_MOD_EXPR).  Starting 
with addition then increasing the set of operations gradually probably 
makes sense.

Along with all the arithmetic operations you get range check operations - 
take a value (in any integer type), and check whether it is in the 
signed/unsigned range with a given number of bits, wrapping/saturating 
etc. and setting overflow flags as needed.  These are just like the above 
but with one fewer operands.

The explicit number of bits (required to be an integer constant 
expression) is deliberate because it is sometimes useful e.g. to saturate 
to 16 bits, but integer promotions in C make it rather fragile to rely on 
16-bit operands remaining 16-bit rather than getting implicitly promoted 
to 32 bits.  A consequence is that operations with strange numbers of bits 
can be written without involving bit-field types (since the result is 
allowed to have more than the given number of bits).  I don't expect those 
to be very efficient in general, but note that ARM for example has SSAT 
and USAT instructions to saturate a value to a given number of bits (up to 
32) and set a flag if it was out of range.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (8 preceding siblings ...)
  2011-04-13 12:46 ` joseph at codesourcery dot com
@ 2011-04-13 17:44 ` svfuerst at gmail dot com
  2011-06-20  9:47 ` jsm28 at gcc dot gnu.org
                   ` (14 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: svfuerst at gmail dot com @ 2011-04-13 17:44 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

Steven Fuerst <svfuerst at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |svfuerst at gmail dot com

--- Comment #10 from Steven Fuerst <svfuerst at gmail dot com> 2011-04-13 17:44:22 UTC ---
There are C and x86 assembly code fragments showing how to do signed and
unsigned saturating arithmetic here:
http://locklessinc.com/articles/sat_arithmetic/

Although, if there were new intrinsics that allowed direct access to the
overflow and carry flags from arithmetic instructions, they would be quite
useful.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (9 preceding siblings ...)
  2011-04-13 17:44 ` svfuerst at gmail dot com
@ 2011-06-20  9:47 ` jsm28 at gcc dot gnu.org
  2011-10-05 12:44 ` jules at gcc dot gnu.org
                   ` (13 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jsm28 at gcc dot gnu.org @ 2011-06-20  9:47 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

Joseph S. Myers <jsm28 at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |noloader at gmail dot com

--- Comment #11 from Joseph S. Myers <jsm28 at gcc dot gnu.org> 2011-06-20 09:46:59 UTC ---
*** Bug 49467 has been marked as a duplicate of this bug. ***


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (10 preceding siblings ...)
  2011-06-20  9:47 ` jsm28 at gcc dot gnu.org
@ 2011-10-05 12:44 ` jules at gcc dot gnu.org
  2011-10-05 13:08 ` jules at gcc dot gnu.org
                   ` (12 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jules at gcc dot gnu.org @ 2011-10-05 12:44 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

jules at gcc dot gnu.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jules at gcc dot gnu.org

--- Comment #12 from jules at gcc dot gnu.org 2011-10-05 12:41:55 UTC ---
I don't much like the idea of using builtins for operations as fundamental as
integer arithmetic. How about this as a straw-man suggestion: adding new
qualifiers for "fat" integers-with-flags, somewhat in the spirit of the
embedded-C fractional/saturating types? So you might have:

int x, y;

void foo (void)
{
  _Flagged int res;

  res = (_Flagged int) x + y;

  if (_Carry (res))
    printf ("sum carried\n");

  if (_Overflow (res))
    printf ("sum overflowed\n");
}

this avoids problems with global state, and allows for the programming style
which (I vaguely get the impression that) people seem to want -- performing a
"normal" integer operation, then querying for carry/overflow flags afterwards.

These types wouldn't be allowed to cross ABI boundaries (although of course you
could use the magic builtins _Carry/_Overflow to extract values to pass to
functions), and "_Overflow" would only be allowed for signed types. You could
also have "_Borrow" for subtraction maybe (whose meaning would be the inverse
of _Carry).

Signalling variants could look like, e.g.:

void bar (void)
{
  int res;

  res = (_Signalling _Flagged int) x + y;
}

Things would get awkward if you tried to use these new constructs in
more-complicated expressions, I suppose. Anyway, just an idea.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (11 preceding siblings ...)
  2011-10-05 12:44 ` jules at gcc dot gnu.org
@ 2011-10-05 13:08 ` jules at gcc dot gnu.org
  2011-10-05 15:20 ` joseph at codesourcery dot com
                   ` (11 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: jules at gcc dot gnu.org @ 2011-10-05 13:08 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #13 from jules at gcc dot gnu.org 2011-10-05 13:05:47 UTC ---
Coming to think of it, if _Sat were allowed on plain integers too, a _Flagged
_Sat int could also be queried for saturation using a similar mechanism, like:

int foo (_Sat int x, _Sat int y)
{
  return _Saturated ((_Sat _Flagged) x + y);
}

I'm probably getting ahead of myself :-).


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (12 preceding siblings ...)
  2011-10-05 13:08 ` jules at gcc dot gnu.org
@ 2011-10-05 15:20 ` joseph at codesourcery dot com
  2013-02-02 14:03 ` Martin.vGagern at gmx dot net
                   ` (10 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: joseph at codesourcery dot com @ 2011-10-05 15:20 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #14 from joseph at codesourcery dot com <joseph at codesourcery dot com> 2011-10-05 15:19:01 UTC ---
On Wed, 5 Oct 2011, jules at gcc dot gnu.org wrote:

> I don't much like the idea of using builtins for operations as fundamental as
> integer arithmetic. How about this as a straw-man suggestion: adding new
> qualifiers for "fat" integers-with-flags, somewhat in the spirit of the
> embedded-C fractional/saturating types? So you might have:

The trouble is that the nature of an operation is more a property of the 
operation than of the type - and the proliferation of types for what 
should be operations on normal types is much of the problem with what the 
embedded-C TR does.  You could have pragmas to say that a cumulative flag 
for a particular scope goes in a particular variable, and that normal 
operations have particular overflow semantics in that scope, maybe.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (13 preceding siblings ...)
  2011-10-05 15:20 ` joseph at codesourcery dot com
@ 2013-02-02 14:03 ` Martin.vGagern at gmx dot net
  2013-02-02 17:02 ` noloader at gmail dot com
                   ` (9 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: Martin.vGagern at gmx dot net @ 2013-02-02 14:03 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

Martin von Gagern <Martin.vGagern at gmx dot net> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |Martin.vGagern at gmx dot
                   |                            |net

--- Comment #15 from Martin von Gagern <Martin.vGagern at gmx dot net> 2013-02-02 14:03:12 UTC ---
(In reply to comment #7)
> […] built-in operations where you can just say "multiply two 
> (signed) values, check whether the result fits in 31-bit unsigned and set 
> an overflow flag accordingly".

Would be easier to read, easier to maintain, and less difficult to detect.
Sounds like an overall win. I'm very much in favor of builtins like these.

(In reply to comment #9)
> arith_mul_signed_check (a, b, 32, ARITH_WRAP, &overflow_p)

I'd rather make the overflow indicator the return value, and transfer the
result to a location indicated via a pointer. I.e.

overflow_p = arith_mul_signed_check (a, b, 32, ARITH_WRAP, &a_times_b)

One reason is that one usually wants to check for overflow BEFORE using the
result, i.e. the common use case would likely use the above as a condition of
some branching construct. A second reason is that this would allow for a flavor
which will not modify a_times_b in case of an overflow, which might be useful
for some scenarios, particularly if the result is stored in the same location
as one of the operands, thus overwriting the operand. So an unchecked a*=b
could be transformed into the checked construct

overflow_p = arith_mul_signed_check (a, b, 32, ARITH_CAREFUL, &a)

(In reply to comment #14)
> The trouble is that the nature of an operation is more a property of the 
> operation than of the type

I agree. The main point of all of this is optimization. And in terms of
optimization, one would want to examine some flag immediately after an
operation setting that flag. One would act upon the flag, and then discard it.
If the information were part of the type, one would require extra storage to
save those flags, which would lead to a change in the size of these types,
which in turn would severely impact many other things.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (14 preceding siblings ...)
  2013-02-02 14:03 ` Martin.vGagern at gmx dot net
@ 2013-02-02 17:02 ` noloader at gmail dot com
  2013-02-02 18:54 ` Martin.vGagern at gmx dot net
                   ` (8 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: noloader at gmail dot com @ 2013-02-02 17:02 UTC (permalink / raw)
  To: gcc-bugs


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #16 from Jeffrey Walton <noloader at gmail dot com> 2013-02-02 17:01:55 UTC ---
(In reply to comment #15)
> I agree. The main point of all of this is optimization. And in terms of
> optimization, one would want to examine some flag immediately after an
> operation setting that flag. One would act upon the flag, and then discard it.
I somewhat disagree. A program must be correct; it should be secure; and it can
be efficient.

I'm interested in "correct" and "secure". If a program silently overflows, its
surely not correct. If an adversary can do something with the errant result,
its not secure either.

What's the point in doing something wrong but doing it quickly?

Jeff


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (15 preceding siblings ...)
  2013-02-02 17:02 ` noloader at gmail dot com
@ 2013-02-02 18:54 ` Martin.vGagern at gmx dot net
  2013-02-02 21:59 ` zackw at panix dot com
                   ` (7 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: Martin.vGagern at gmx dot net @ 2013-02-02 18:54 UTC (permalink / raw)
  To: gcc-bugs


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #17 from Martin von Gagern <Martin.vGagern at gmx dot net> 2013-02-02 18:54:43 UTC ---
(In reply to comment #16)
> I somewhat disagree. A program must be correct; it should be secure;
> and it can be efficient. I'm interested in "correct" and "secure".
> If a program silently overflows, its surely not correct.

I'm not talking about silently ignoring overflows, quite the contrary. Always
doing the correct thing leads to arbitrary size integers. Checking all (signed)
arithmetic leads to -ftrapv. Checking some arithmetic might perhaps be achieved
with the signalling types from comment #12, although semantics for mixed types
might be problematic. The non-signalling versions will only improve things if
one actually checks the additional information after the operation, which might
easily be forgotten. Checking individual operations could also (and in my
opinion better) be achieved with builtins, and in this case a warning could be
issued if the return value indicating the overflow is ignored. Builtins might
even allow using specific overflow semantics for code otherwise compiled with
-ftrapv, thus increasing the usability of that flag.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (16 preceding siblings ...)
  2013-02-02 18:54 ` Martin.vGagern at gmx dot net
@ 2013-02-02 21:59 ` zackw at panix dot com
  2013-02-02 22:08 ` Martin.vGagern at gmx dot net
                   ` (6 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: zackw at panix dot com @ 2013-02-02 21:59 UTC (permalink / raw)
  To: gcc-bugs


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #18 from Zack Weinberg <zackw at panix dot com> 2013-02-02 21:59:37 UTC ---
I find it a little disappointing that what should have been a straightforward
additional optimization has gotten totally derailed into bikeshedding of an
enormous class of builtins which seem unlikely ever to gain any traction.  I
just want the code in the original bug report to be optimized.  That would be
enough.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (17 preceding siblings ...)
  2013-02-02 21:59 ` zackw at panix dot com
@ 2013-02-02 22:08 ` Martin.vGagern at gmx dot net
  2013-05-19 13:04 ` glisse at gcc dot gnu.org
                   ` (5 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: Martin.vGagern at gmx dot net @ 2013-02-02 22:08 UTC (permalink / raw)
  To: gcc-bugs


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #19 from Martin von Gagern <Martin.vGagern at gmx dot net> 2013-02-02 22:08:09 UTC ---
Bug 49467 asked about builtins, and got duped here, so small wonder people
wanting a builtin-colored bikeshed like I do end up here...


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (18 preceding siblings ...)
  2013-02-02 22:08 ` Martin.vGagern at gmx dot net
@ 2013-05-19 13:04 ` glisse at gcc dot gnu.org
  2014-08-24  7:06 ` Martin.vGagern at gmx dot net
                   ` (4 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: glisse at gcc dot gnu.org @ 2013-05-19 13:04 UTC (permalink / raw)
  To: gcc-bugs

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #20 from Marc Glisse <glisse at gcc dot gnu.org> ---
(In reply to Zack Weinberg from comment #5)
> Addendum: what would *you* describe as the correct C idiom for
> ensuring that the product of two signed integers was positive and did
> not overflow the range of a same-sized signed integer, assuming
> nothing about either multiplicand?

The most natural way to do it depends on whether you have access to a wider
type (and may rely on modular casts to signed integer types as guaranteed by
gcc). For instance if you want a non-negative result that fits an int:

  return x * (unsigned long long)(y) <= __INT_MAX__;

or if you only care about signed overflow and not the sign of the result:

  long long l = x * (long long)(y);
  return l == (int) l;

The value of the overflow flags after a multiplication doesn't seem modeled in
i386.md currently (apart from "clobbered"), so it won't be used, but the code
generated is not too horrible.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (19 preceding siblings ...)
  2013-05-19 13:04 ` glisse at gcc dot gnu.org
@ 2014-08-24  7:06 ` Martin.vGagern at gmx dot net
  2021-08-15 11:45 ` pinskia at gcc dot gnu.org
                   ` (3 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: Martin.vGagern at gmx dot net @ 2014-08-24  7:06 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #21 from Martin von Gagern <Martin.vGagern at gmx dot net> ---
(In reply to myself from comment #15)
> (In reply to comment #7)
> > […] built-in operations where you can just say "multiply two 
> > (signed) values, check whether the result fits in 31-bit unsigned and set 
> > an overflow flag accordingly".
> 
> Would be easier to read, easier to maintain, and less difficult to detect.
> Sounds like an overall win. I'm very much in favor of builtins like these.
> 
> I'd rather make the overflow indicator the return value, and transfer the
> result to a location indicated via a pointer.

Cross reference: bug #59708 is going in this direction, motivated by a
precedence set by clang.

(In reply to Zack Weinberg from comment #18)
> I just want the code in the original bug report to be optimized.

I suggested duping bug #49467 there instead, in which case this issue here
could return to the originally requested goal of detecting idioms without
relying on builtins.
>From gcc-bugs-return-459124-listarch-gcc-bugs=gcc.gnu.org@gcc.gnu.org Sun Aug 24 07:14:33 2014
Return-Path: <gcc-bugs-return-459124-listarch-gcc-bugs=gcc.gnu.org@gcc.gnu.org>
Delivered-To: listarch-gcc-bugs@gcc.gnu.org
Received: (qmail 32127 invoked by alias); 24 Aug 2014 07:14:30 -0000
Mailing-List: contact gcc-bugs-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
List-Id: <gcc-bugs.gcc.gnu.org>
List-Archive: <http://gcc.gnu.org/ml/gcc-bugs/>
List-Post: <mailto:gcc-bugs@gcc.gnu.org>
List-Help: <mailto:gcc-bugs-help@gcc.gnu.org>
Sender: gcc-bugs-owner@gcc.gnu.org
Delivered-To: mailing list gcc-bugs@gcc.gnu.org
Received: (qmail 32058 invoked by uid 48); 24 Aug 2014 07:14:19 -0000
From: "pinskia at gcc dot gnu.org" <gcc-bugzilla@gcc.gnu.org>
To: gcc-bugs@gcc.gnu.org
Subject: [Bug c/59708] clang-compatible checked arithmetic builtins
Date: Sun, 24 Aug 2014 07:14:00 -0000
X-Bugzilla-Reason: CC
X-Bugzilla-Type: changed
X-Bugzilla-Watch-Reason: None
X-Bugzilla-Product: gcc
X-Bugzilla-Component: c
X-Bugzilla-Version: unknown
X-Bugzilla-Keywords:
X-Bugzilla-Severity: enhancement
X-Bugzilla-Who: pinskia at gcc dot gnu.org
X-Bugzilla-Status: NEW
X-Bugzilla-Priority: P3
X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org
X-Bugzilla-Target-Milestone: ---
X-Bugzilla-Flags:
X-Bugzilla-Changed-Fields:
Message-ID: <bug-59708-4-5xk27qiI1v@http.gcc.gnu.org/bugzilla/>
In-Reply-To: <bug-59708-4@http.gcc.gnu.org/bugzilla/>
References: <bug-59708-4@http.gcc.gnu.org/bugzilla/>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 7bit
X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/
Auto-Submitted: auto-generated
MIME-Version: 1.0
X-SW-Source: 2014-08/txt/msg01621.txt.bz2
Content-length: 325

https://gcc.gnu.org/bugzilla/show_bug.cgi?idY708

--- Comment #8 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
I rather not have builtins as you can write portable C code that detects
overflow just fine.  GCC could have internal functions if needed which are used
when converting the pattern for detecting overflow.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (20 preceding siblings ...)
  2014-08-24  7:06 ` Martin.vGagern at gmx dot net
@ 2021-08-15 11:45 ` pinskia at gcc dot gnu.org
  2021-08-15 11:49 ` pinskia at gcc dot gnu.org
                   ` (2 subsequent siblings)
  24 siblings, 0 replies; 26+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-08-15 11:45 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #22 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
For the original testcase in comment #0 we produce (in GCC 11+):
        movl    %edi, %eax
        mull    %esi
        seto    %dl
        xorl    %r8d, %r8d
        movzbl  %dl, %edx
        testl   %eax, %eax
        jle     .L1
        testl   %edx, %edx
        sete    %r8b
.L1:
        movl    %r8d, %eax
        ret

------- CUT ----
I have a patch which I think improves the code even more.

The gimple level looks like this correctly:
  x.0_1 = (unsigned int) x_6(D);
  y.1_2 = (unsigned int) y_7(D);
  _11 = .MUL_OVERFLOW (x.0_1, y.1_2);
  tmp_8 = REALPART_EXPR <_11>;
  tmp.3_3 = (int) tmp_8;
  if (tmp.3_3 > 0)
    goto <bb 3>; [59.00%]
  else
    goto <bb 4>; [41.00%]

  <bb 3> [local count: 633507680]:
  _12 = IMAGPART_EXPR <_11>;
  _10 = _12 == 0;

  <bb 4> [local count: 1073741824]:
  # iftmp.2_5 = PHI <_10(3), 0(2)>

Notice no divide.  The _12 == 0 part really should just _12 ^ 1.

After my patch (which I need to finish up) we get:
        movl    %edi, %eax
        mull    %esi
        seto    %dl
        xorl    %r8d, %r8d
        movzbl  %dl, %edx
        xorl    $1, %edx
        testl   %eax, %eax
        cmovg   %edx, %r8d
        movl    %r8d, %eax
        ret
Which should be exactly what you wanted or very close.
There looks to be a few micro-optimizations needed still really.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (21 preceding siblings ...)
  2021-08-15 11:45 ` pinskia at gcc dot gnu.org
@ 2021-08-15 11:49 ` pinskia at gcc dot gnu.org
  2021-10-22 22:18 ` pinskia at gcc dot gnu.org
  2023-08-09 22:29 ` pinskia at gcc dot gnu.org
  24 siblings, 0 replies; 26+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-08-15 11:49 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           See Also|                            |https://gcc.gnu.org/bugzill
                   |                            |a/show_bug.cgi?id=59708
                 CC|                            |pinskia at gcc dot gnu.org

--- Comment #23 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Also the builtins have been in GCC since GCC 5; r5-4844, PR 59708.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (22 preceding siblings ...)
  2021-08-15 11:49 ` pinskia at gcc dot gnu.org
@ 2021-10-22 22:18 ` pinskia at gcc dot gnu.org
  2023-08-09 22:29 ` pinskia at gcc dot gnu.org
  24 siblings, 0 replies; 26+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-10-22 22:18 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |ASSIGNED
           Assignee|unassigned at gcc dot gnu.org      |pinskia at gcc dot gnu.org

--- Comment #24 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Mine. I had a patch which improves the "== 0" to just ^1. Let me see if I can
finish up.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* [Bug middle-end/48580] missed optimization: integer overflow checks
  2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
                   ` (23 preceding siblings ...)
  2021-10-22 22:18 ` pinskia at gcc dot gnu.org
@ 2023-08-09 22:29 ` pinskia at gcc dot gnu.org
  24 siblings, 0 replies; 26+ messages in thread
From: pinskia at gcc dot gnu.org @ 2023-08-09 22:29 UTC (permalink / raw)
  To: gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=48580

--- Comment #25 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
We have this now:
  if (tmp.3_3 > 0)
    goto <bb 3>; [59.00%]
  else
    goto <bb 4>; [41.00%]

  <bb 3> [local count: 633507679]:
  _10 = _12 == 0;

  <bb 4> [local count: 1073741824]:
  # iftmp.2_5 = PHI <_10(3), 0(2)>

I suspect what we could do is in isel change that to:
iftmp.2_5 = tmp.3_3 > 0 ? _12 == 0 : 0;
(which itself gets optimized to:
iftmp.2_5 = (tmp.3_3 > 0) & (_12 == 0)

Which should produce better code at least.

^ permalink raw reply	[flat|nested] 26+ messages in thread

end of thread, other threads:[~2023-08-09 22:29 UTC | newest]

Thread overview: 26+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-04-12 18:36 [Bug rtl-optimization/48580] New: missed optimization: integer overflow checks zackw at panix dot com
2011-04-12 20:18 ` [Bug rtl-optimization/48580] " joseph at codesourcery dot com
2011-04-12 20:40 ` zackw at panix dot com
2011-04-12 20:52 ` joseph at codesourcery dot com
2011-04-12 21:03 ` zackw at panix dot com
2011-04-12 21:04 ` zackw at panix dot com
2011-04-12 21:10 ` joseph at codesourcery dot com
2011-04-12 21:16 ` joseph at codesourcery dot com
2011-04-13 12:11 ` [Bug middle-end/48580] " rguenth at gcc dot gnu.org
2011-04-13 12:46 ` joseph at codesourcery dot com
2011-04-13 17:44 ` svfuerst at gmail dot com
2011-06-20  9:47 ` jsm28 at gcc dot gnu.org
2011-10-05 12:44 ` jules at gcc dot gnu.org
2011-10-05 13:08 ` jules at gcc dot gnu.org
2011-10-05 15:20 ` joseph at codesourcery dot com
2013-02-02 14:03 ` Martin.vGagern at gmx dot net
2013-02-02 17:02 ` noloader at gmail dot com
2013-02-02 18:54 ` Martin.vGagern at gmx dot net
2013-02-02 21:59 ` zackw at panix dot com
2013-02-02 22:08 ` Martin.vGagern at gmx dot net
2013-05-19 13:04 ` glisse at gcc dot gnu.org
2014-08-24  7:06 ` Martin.vGagern at gmx dot net
2021-08-15 11:45 ` pinskia at gcc dot gnu.org
2021-08-15 11:49 ` pinskia at gcc dot gnu.org
2021-10-22 22:18 ` pinskia at gcc dot gnu.org
2023-08-09 22:29 ` pinskia at gcc dot gnu.org

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).