Re: Question on -fwrapv and -fwrapv-pointer

public inbox for gcc@gcc.gnu.org
 help / color / mirror / Atom feed

* Re: Question on -fwrapv and -fwrapv-pointer
       [not found] <202309151107.C2F7D9A01@keescook>
@ 2023-09-16  8:36 ` Martin Uecker
  2023-09-18  7:31   ` Richard Biener
  2023-09-20 20:40   ` Kees Cook
  0 siblings, 2 replies; 9+ messages in thread
From: Martin Uecker @ 2023-09-16  8:36 UTC (permalink / raw)
  To: keescook; +Cc: gcc



(moved to gcc@)

> On Fri, Sep 15, 2023 at 08:18:28AM -0700, Andrew Pinski wrote:
> > On Fri, Sep 15, 2023 at 8:12 AM Qing Zhao <qing.zhao@oracle.com> wrote:
> > >
> > >
> > >
> > > > On Sep 15, 2023, at 3:43 AM, Xi Ruoyao <xry111@xry111.site> wrote:
> > > >
> > > > On Thu, 2023-09-14 at 21:41 +0000, Qing Zhao wrote:
> > > >>>> CLANG already provided -fsanitize=unsigned-integer-overflow. GCC
> > > >>>> might need to do the same.
> > > >>>
> > > >>> NO. There is no such thing as unsigned integer overflow. That option
> > > >>> is badly designed and the GCC community has rejected a few times now
> > > >>> having that sanitizer before. It is bad form to have a sanitizer for
> > > >>> well defined code.
> > > >>
> > > >> Even though unsigned integer overflow is well defined, it might be
> > > >> unintentional, shall we warn user about this?
> > > >
> > > > *Everything* could be unintentional and should be warned then.  GCC is a
> > > > compiler, not an advanced AI educating the programmers.
> > >
> > > Well, you are right in some sense. -:)
> > >
> > > However, overflow is one important source for security flaws, it’s important  for compilers to detect
> > > overflows in the programs in general.
> > 
> > Except it is NOT an overflow. Rather it is wrapping. That is a big
> > point here. unsigned wraps and does NOT overflow. Yes there is a major
> > difference.
> 
> Right, yes. I will try to pick my language very carefully. :)
> 
> The practical problem I am trying to solve in the 30 million lines of
> Linux kernel code is that of catching arithmetic wrap-around. The
> problem is one of evolving the code -- I can't just drop -fwrapv and
> -fwrapv-pointer because it's not possible to fix all the cases at once.
> (And we really don't want to reintroduce undefined behavior.)
> 
> So, for signed, pointer, and unsigned types, we need:
> 
> a) No arithmetic UB -- everything needs to have deterministic behavior.
>    The current solution here is "-fno-strict-overflow", which eliminates
>    the UB and makes sure everything wraps.
> 
> b) A way to run-time warn/trap on overflow/underflow/wrap-around. This
>    would work with -fsanitize=[signed-integer|pointer]-overflow except
>    due to "a)" we always wrap. And there isn't currently coverage like
>    this for unsigned (in GCC).
> 
> Our problem is that the kernel is filled with a mix of places where there
> is intended wrap-around and unintended wrap-around. We can chip away at
> fixing the intended wrap-around that we can find with static analyzers,
> etc, but at the end of the day there is a long tail of finding the places
> where intended wrap-around is hiding. But when the refactoring is
> sufficiently completely, we can move the wrap-around warning to a trap,
> and the kernel will not longer have this class of security flaw.
> 
> As a real-world example, here is a bug where a u8 wraps around causing
> an under-allocation that allowed for a heap overwrite:
> 
> https://git.kernel.org/linus/6311071a0562
> https://elixir.bootlin.com/linux/v6.5/source/net/wireless/nl80211.c#L5422
> 
> If there were more than 255 elements in a linked list, the allocation
> would be too small, and the second loop would write past the end of the
> allocation. This is a pretty classic allocation underflow and linear
> heap write overflow security flaw. (And it would be trivially stopped by
> trapping on the u8 wrap around.)
> 
> So, I want to be able to catch that at run-time. But we also have code
> doing things like "if (ulong + offset < ulong) { ... }":
> 
> https://elixir.bootlin.com/linux/v6.5/source/drivers/crypto/axis/artpec6_crypto.c#L1187
> 
> This is easy for a static analyzer to find and we can replace it with a
> non-wrapping test (e.g. __builtin_add_overflow()), but we'll not find
> them all immediately, especially for the signed and pointer cases.
> 
> So, I need to retain the "everything wraps" behavior while still being
> able to detect when it happens.


Hi Kees,

I have a couple of questions:

Currently, my thinking was that you would use signed integers
if you want the usual integer arithmetic rules we know from
elementary school and if you overflow this is clearly a bug 
you can diagnose with UBsan.

There are people who think that signed overflow should be
defined to wrap, but I think this would be a severe
mistake because then code would start to rely on it, which
makes it then difficult to differentiate between bugs and
intended uses (e.g. the unfortunate situation you have 
with the kernel).

I assume you want to combine UBSan plus wrapping for
production use?  Or only for testing?   Or in other words:
why would testing UBSan and production with wrapping
not be sufficient to find and fix all bugs? 

Wrapping would not be correct because it may lead to
logic errors or use-after-free etc.  I assume it is still
preferred because it more deterministic than whatever comes
out of the optimizer assuming that overflow has UB. Is this
the reasoning applied here?


For unsigned the intended use case is modulo arithmetic
where wrapping is the correct behavior. At least, this
is what I thought so far.. This seems also to be the 
position of the overall GCC community rejecting 
-fsanitize=unsigned-integer-overflow.

But then there are also people like Richard Seacord that
have the position that one should use  "unsigned" for 
every quantity which can not be negative, which implies
that then the modulo use case becomes the exception.

Related to this I have the following question: In the 
bug you refer to above, an unsigned variable was used
for something that is not meant for modulo arithmetic.
Is this done in the kernel to save space?  I.e. because
127 would not be enough as maximum but going to i16 takes
to much space? or is this for compatibility with some
on-the-wire protocol? 

Would an attribute for variables help that tells the
compiler that if stores to the variable wrap around
then this is not intended and this is an error? 

u8 x [[nowrap]];

x = 256; // can be diagnosed ?



Martin 







^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Question on -fwrapv and -fwrapv-pointer
  2023-09-16  8:36 ` Question on -fwrapv and -fwrapv-pointer Martin Uecker
@ 2023-09-18  7:31   ` Richard Biener
  2023-09-18  8:17     ` Martin Uecker
  2023-09-18 15:27     ` Andrew Pinski
  2023-09-20 20:40   ` Kees Cook
  1 sibling, 2 replies; 9+ messages in thread
From: Richard Biener @ 2023-09-18  7:31 UTC (permalink / raw)
  To: Martin Uecker; +Cc: keescook, gcc

On Sat, Sep 16, 2023 at 10:38 AM Martin Uecker via Gcc <gcc@gcc.gnu.org> wrote:
>
>
>
> (moved to gcc@)
>
> > On Fri, Sep 15, 2023 at 08:18:28AM -0700, Andrew Pinski wrote:
> > > On Fri, Sep 15, 2023 at 8:12 AM Qing Zhao <qing.zhao@oracle.com> wrote:
> > > >
> > > >
> > > >
> > > > > On Sep 15, 2023, at 3:43 AM, Xi Ruoyao <xry111@xry111.site> wrote:
> > > > >
> > > > > On Thu, 2023-09-14 at 21:41 +0000, Qing Zhao wrote:
> > > > >>>> CLANG already provided -fsanitize=unsigned-integer-overflow. GCC
> > > > >>>> might need to do the same.
> > > > >>>
> > > > >>> NO. There is no such thing as unsigned integer overflow. That option
> > > > >>> is badly designed and the GCC community has rejected a few times now
> > > > >>> having that sanitizer before. It is bad form to have a sanitizer for
> > > > >>> well defined code.
> > > > >>
> > > > >> Even though unsigned integer overflow is well defined, it might be
> > > > >> unintentional, shall we warn user about this?
> > > > >
> > > > > *Everything* could be unintentional and should be warned then.  GCC is a
> > > > > compiler, not an advanced AI educating the programmers.
> > > >
> > > > Well, you are right in some sense. -:)
> > > >
> > > > However, overflow is one important source for security flaws, it’s important  for compilers to detect
> > > > overflows in the programs in general.
> > >
> > > Except it is NOT an overflow. Rather it is wrapping. That is a big
> > > point here. unsigned wraps and does NOT overflow. Yes there is a major
> > > difference.
> >
> > Right, yes. I will try to pick my language very carefully. :)
> >
> > The practical problem I am trying to solve in the 30 million lines of
> > Linux kernel code is that of catching arithmetic wrap-around. The
> > problem is one of evolving the code -- I can't just drop -fwrapv and
> > -fwrapv-pointer because it's not possible to fix all the cases at once.
> > (And we really don't want to reintroduce undefined behavior.)
> >
> > So, for signed, pointer, and unsigned types, we need:
> >
> > a) No arithmetic UB -- everything needs to have deterministic behavior.
> >    The current solution here is "-fno-strict-overflow", which eliminates
> >    the UB and makes sure everything wraps.
> >
> > b) A way to run-time warn/trap on overflow/underflow/wrap-around. This
> >    would work with -fsanitize=[signed-integer|pointer]-overflow except
> >    due to "a)" we always wrap. And there isn't currently coverage like
> >    this for unsigned (in GCC).
> >
> > Our problem is that the kernel is filled with a mix of places where there
> > is intended wrap-around and unintended wrap-around. We can chip away at
> > fixing the intended wrap-around that we can find with static analyzers,
> > etc, but at the end of the day there is a long tail of finding the places
> > where intended wrap-around is hiding. But when the refactoring is
> > sufficiently completely, we can move the wrap-around warning to a trap,
> > and the kernel will not longer have this class of security flaw.
> >
> > As a real-world example, here is a bug where a u8 wraps around causing
> > an under-allocation that allowed for a heap overwrite:
> >
> > https://git.kernel.org/linus/6311071a0562
> > https://elixir.bootlin.com/linux/v6.5/source/net/wireless/nl80211.c#L5422
> >
> > If there were more than 255 elements in a linked list, the allocation
> > would be too small, and the second loop would write past the end of the
> > allocation. This is a pretty classic allocation underflow and linear
> > heap write overflow security flaw. (And it would be trivially stopped by
> > trapping on the u8 wrap around.)
> >
> > So, I want to be able to catch that at run-time. But we also have code
> > doing things like "if (ulong + offset < ulong) { ... }":
> >
> > https://elixir.bootlin.com/linux/v6.5/source/drivers/crypto/axis/artpec6_crypto.c#L1187
> >
> > This is easy for a static analyzer to find and we can replace it with a
> > non-wrapping test (e.g. __builtin_add_overflow()), but we'll not find
> > them all immediately, especially for the signed and pointer cases.
> >
> > So, I need to retain the "everything wraps" behavior while still being
> > able to detect when it happens.
>
>
> Hi Kees,
>
> I have a couple of questions:
>
> Currently, my thinking was that you would use signed integers
> if you want the usual integer arithmetic rules we know from
> elementary school and if you overflow this is clearly a bug
> you can diagnose with UBsan.
>
> There are people who think that signed overflow should be
> defined to wrap, but I think this would be a severe
> mistake because then code would start to rely on it, which
> makes it then difficult to differentiate between bugs and
> intended uses (e.g. the unfortunate situation you have
> with the kernel).
>
> I assume you want to combine UBSan plus wrapping for
> production use?  Or only for testing?   Or in other words:
> why would testing UBSan and production with wrapping
> not be sufficient to find and fix all bugs?
>
> Wrapping would not be correct because it may lead to
> logic errors or use-after-free etc.  I assume it is still
> preferred because it more deterministic than whatever comes
> out of the optimizer assuming that overflow has UB. Is this
> the reasoning applied here?
>
>
> For unsigned the intended use case is modulo arithmetic
> where wrapping is the correct behavior. At least, this
> is what I thought so far.. This seems also to be the
> position of the overall GCC community rejecting
> -fsanitize=unsigned-integer-overflow.
>
> But then there are also people like Richard Seacord that
> have the position that one should use  "unsigned" for
> every quantity which can not be negative, which implies
> that then the modulo use case becomes the exception.
>
> Related to this I have the following question: In the
> bug you refer to above, an unsigned variable was used
> for something that is not meant for modulo arithmetic.
> Is this done in the kernel to save space?  I.e. because
> 127 would not be enough as maximum but going to i16 takes
> to much space? or is this for compatibility with some
> on-the-wire protocol?
>
> Would an attribute for variables help that tells the
> compiler that if stores to the variable wrap around
> then this is not intended and this is an error?
>
> u8 x [[nowrap]];
>
> x = 256; // can be diagnosed ?

There's a problem with representing this, it's a long-standing thing standing
in the way of handling -fwrapv in the IL.  Maybe C wants to have
'nonnegative int' in addition to 'unsigned int', or 'bit int' for the
"just some two-complement bits".

For GCC we think of -fwrapv to perform signed integer arithmetic as if
promoted to a large enough type first and then truncated to the original
type in the implementation defined manner (modulo reducing).  But we
end up doing what the actual hardware does (I'm for example unsure
how that handles INT_MIN / -1 - x86 documents it as raising #DE
which it indeed does, even with -fwrapv).

volatile int a, b;
int main ()
{
  a = -__INT_MAX__ - 1;
  b = -1;
  int z = a / b;
  __builtin_printf ("%d\n", z);
}

raises SIGFPE with and without -fwrapv.  I'll note that we document
-fwrapv to only affect addition, subtraction and multiplication
(it also affects negation), matching the libgcc *v functions we have
for -ftrapv.  -fsanitize=undefined instruments division but
-fsanitize=recover doesn't help.  Maybe there's a bug about this
recorded.  The documentation of -fwrapv should likely be clarified.

For recent work I failed to spot the C standards language on
singed overflow behavior for negation, addition, subtraction
and multiplication.  I can find it for division though.  Can someone
point me to where that is (moved?)?

Richard.

>
>
>
> Martin
>
>
>
>
>
>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Question on -fwrapv and -fwrapv-pointer
  2023-09-18  7:31   ` Richard Biener
@ 2023-09-18  8:17     ` Martin Uecker
  2023-09-18  8:47       ` Richard Biener
  2023-09-18 15:27     ` Andrew Pinski
  1 sibling, 1 reply; 9+ messages in thread
From: Martin Uecker @ 2023-09-18  8:17 UTC (permalink / raw)
  To: Richard Biener; +Cc: keescook, gcc

Am Montag, dem 18.09.2023 um 09:31 +0200 schrieb Richard Biener via Gcc:
> On Sat, Sep 16, 2023 at 10:38 AM Martin Uecker via Gcc <gcc@gcc.gnu.org> wrote:
> > 
> > 
> > 
> > (moved to gcc@)
> > 
> > > On Fri, Sep 15, 2023 at 08:18:28AM -0700, Andrew Pinski wrote:
> > > > On Fri, Sep 15, 2023 at 8:12 AM Qing Zhao <qing.zhao@oracle.com> wrote:
> > > > > 
> > > > > 
> > > > > 
> > > > > > On Sep 15, 2023, at 3:43 AM, Xi Ruoyao <xry111@xry111.site> wrote:
> > > > > > 
> > > > > > On Thu, 2023-09-14 at 21:41 +0000, Qing Zhao wrote:
> > > > > > > > > CLANG already provided -fsanitize=unsigned-integer-overflow. GCC
> > > > > > > > > might need to do the same.
> > > > > > > > 
> > > > > > > > NO. There is no such thing as unsigned integer overflow. That option
> > > > > > > > is badly designed and the GCC community has rejected a few times now
> > > > > > > > having that sanitizer before. It is bad form to have a sanitizer for
> > > > > > > > well defined code.
> > > > > > > 
> > > > > > > Even though unsigned integer overflow is well defined, it might be
> > > > > > > unintentional, shall we warn user about this?
> > > > > > 
> > > > > > *Everything* could be unintentional and should be warned then.  GCC is a
> > > > > > compiler, not an advanced AI educating the programmers.
> > > > > 
> > > > > Well, you are right in some sense. -:)
> > > > > 
> > > > > However, overflow is one important source for security flaws, it’s important  for compilers to detect
> > > > > overflows in the programs in general.
> > > > 
> > > > Except it is NOT an overflow. Rather it is wrapping. That is a big
> > > > point here. unsigned wraps and does NOT overflow. Yes there is a major
> > > > difference.
> > > 
> > > Right, yes. I will try to pick my language very carefully. :)
> > > 
> > > The practical problem I am trying to solve in the 30 million lines of
> > > Linux kernel code is that of catching arithmetic wrap-around. The
> > > problem is one of evolving the code -- I can't just drop -fwrapv and
> > > -fwrapv-pointer because it's not possible to fix all the cases at once.
> > > (And we really don't want to reintroduce undefined behavior.)
> > > 
> > > So, for signed, pointer, and unsigned types, we need:
> > > 
> > > a) No arithmetic UB -- everything needs to have deterministic behavior.
> > >    The current solution here is "-fno-strict-overflow", which eliminates
> > >    the UB and makes sure everything wraps.
> > > 
> > > b) A way to run-time warn/trap on overflow/underflow/wrap-around. This
> > >    would work with -fsanitize=[signed-integer|pointer]-overflow except
> > >    due to "a)" we always wrap. And there isn't currently coverage like
> > >    this for unsigned (in GCC).
> > > 
> > > Our problem is that the kernel is filled with a mix of places where there
> > > is intended wrap-around and unintended wrap-around. We can chip away at
> > > fixing the intended wrap-around that we can find with static analyzers,
> > > etc, but at the end of the day there is a long tail of finding the places
> > > where intended wrap-around is hiding. But when the refactoring is
> > > sufficiently completely, we can move the wrap-around warning to a trap,
> > > and the kernel will not longer have this class of security flaw.
> > > 
> > > As a real-world example, here is a bug where a u8 wraps around causing
> > > an under-allocation that allowed for a heap overwrite:
> > > 
> > > https://git.kernel.org/linus/6311071a0562
> > > https://elixir.bootlin.com/linux/v6.5/source/net/wireless/nl80211.c#L5422
> > > 
> > > If there were more than 255 elements in a linked list, the allocation
> > > would be too small, and the second loop would write past the end of the
> > > allocation. This is a pretty classic allocation underflow and linear
> > > heap write overflow security flaw. (And it would be trivially stopped by
> > > trapping on the u8 wrap around.)
> > > 
> > > So, I want to be able to catch that at run-time. But we also have code
> > > doing things like "if (ulong + offset < ulong) { ... }":
> > > 
> > > https://elixir.bootlin.com/linux/v6.5/source/drivers/crypto/axis/artpec6_crypto.c#L1187
> > > 
> > > This is easy for a static analyzer to find and we can replace it with a
> > > non-wrapping test (e.g. __builtin_add_overflow()), but we'll not find
> > > them all immediately, especially for the signed and pointer cases.
> > > 
> > > So, I need to retain the "everything wraps" behavior while still being
> > > able to detect when it happens.
> > 
> > 
> > Hi Kees,
> > 
> > I have a couple of questions:
> > 
> > Currently, my thinking was that you would use signed integers
> > if you want the usual integer arithmetic rules we know from
> > elementary school and if you overflow this is clearly a bug
> > you can diagnose with UBsan.
> > 
> > There are people who think that signed overflow should be
> > defined to wrap, but I think this would be a severe
> > mistake because then code would start to rely on it, which
> > makes it then difficult to differentiate between bugs and
> > intended uses (e.g. the unfortunate situation you have
> > with the kernel).
> > 
> > I assume you want to combine UBSan plus wrapping for
> > production use?  Or only for testing?   Or in other words:
> > why would testing UBSan and production with wrapping
> > not be sufficient to find and fix all bugs?
> > 
> > Wrapping would not be correct because it may lead to
> > logic errors or use-after-free etc.  I assume it is still
> > preferred because it more deterministic than whatever comes
> > out of the optimizer assuming that overflow has UB. Is this
> > the reasoning applied here?
> > 
> > 
> > For unsigned the intended use case is modulo arithmetic
> > where wrapping is the correct behavior. At least, this
> > is what I thought so far.. This seems also to be the
> > position of the overall GCC community rejecting
> > -fsanitize=unsigned-integer-overflow.
> > 
> > But then there are also people like Richard Seacord that
> > have the position that one should use  "unsigned" for
> > every quantity which can not be negative, which implies
> > that then the modulo use case becomes the exception.
> > 
> > Related to this I have the following question: In the
> > bug you refer to above, an unsigned variable was used
> > for something that is not meant for modulo arithmetic.
> > Is this done in the kernel to save space?  I.e. because
> > 127 would not be enough as maximum but going to i16 takes
> > to much space? or is this for compatibility with some
> > on-the-wire protocol?
> > 
> > Would an attribute for variables help that tells the
> > compiler that if stores to the variable wrap around
> > then this is not intended and this is an error?
> > 
> > u8 x [[nowrap]];
> > 
> > x = 256; // can be diagnosed ?
> 
> There's a problem with representing this, it's a long-standing thing standing
> in the way of handling -fwrapv in the IL.  Maybe C wants to have
> 'nonnegative int' in addition to 'unsigned int', or 'bit int' for the
> "just some two-complement bits".
> 
> For GCC we think of -fwrapv to perform signed integer arithmetic as if
> promoted to a large enough type first and then truncated to the original
> type in the implementation defined manner (modulo reducing).  But we
> end up doing what the actual hardware does (I'm for example unsure
> how that handles INT_MIN / -1 - x86 documents it as raising #DE
> which it indeed does, even with -fwrapv).
> 
> volatile int a, b;
> int main ()
> {
>   a = -__INT_MAX__ - 1;
>   b = -1;
>   int z = a / b;
>   __builtin_printf ("%d\n", z);
> }
> 
> raises SIGFPE with and without -fwrapv.  I'll note that we document
> -fwrapv to only affect addition, subtraction and multiplication
> (it also affects negation), matching the libgcc *v functions we have
> for -ftrapv.  -fsanitize=undefined instruments division but
> -fsanitize=recover doesn't help.  Maybe there's a bug about this
> recorded.  The documentation of -fwrapv should likely be clarified.
> 
> For recent work I failed to spot the C standards language on
> singed overflow behavior for negation, addition, subtraction
> and multiplication.  I can find it for division though.  Can someone
> point me to where that is (moved?)?

I don't think it moved. It should follow from the general rule 6.5p5:

"If an exceptional condition occurs during the evaluation of an
expression (that is, if the result is not mathematically defined
or not in the range of representable values for its type), the
behavior is undefined."

Martin




^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Question on -fwrapv and -fwrapv-pointer
  2023-09-18  8:17     ` Martin Uecker
@ 2023-09-18  8:47       ` Richard Biener
  2023-09-18 15:22         ` Martin Uecker
  0 siblings, 1 reply; 9+ messages in thread
From: Richard Biener @ 2023-09-18  8:47 UTC (permalink / raw)
  To: Martin Uecker; +Cc: keescook, gcc

On Mon, Sep 18, 2023 at 10:17 AM Martin Uecker <ma.uecker@gmail.com> wrote:
>
> Am Montag, dem 18.09.2023 um 09:31 +0200 schrieb Richard Biener via Gcc:
> > On Sat, Sep 16, 2023 at 10:38 AM Martin Uecker via Gcc <gcc@gcc.gnu.org> wrote:
> > >
> > >
> > >
> > > (moved to gcc@)
> > >
> > > > On Fri, Sep 15, 2023 at 08:18:28AM -0700, Andrew Pinski wrote:
> > > > > On Fri, Sep 15, 2023 at 8:12 AM Qing Zhao <qing.zhao@oracle.com> wrote:
> > > > > >
> > > > > >
> > > > > >
> > > > > > > On Sep 15, 2023, at 3:43 AM, Xi Ruoyao <xry111@xry111.site> wrote:
> > > > > > >
> > > > > > > On Thu, 2023-09-14 at 21:41 +0000, Qing Zhao wrote:
> > > > > > > > > > CLANG already provided -fsanitize=unsigned-integer-overflow. GCC
> > > > > > > > > > might need to do the same.
> > > > > > > > >
> > > > > > > > > NO. There is no such thing as unsigned integer overflow. That option
> > > > > > > > > is badly designed and the GCC community has rejected a few times now
> > > > > > > > > having that sanitizer before. It is bad form to have a sanitizer for
> > > > > > > > > well defined code.
> > > > > > > >
> > > > > > > > Even though unsigned integer overflow is well defined, it might be
> > > > > > > > unintentional, shall we warn user about this?
> > > > > > >
> > > > > > > *Everything* could be unintentional and should be warned then.  GCC is a
> > > > > > > compiler, not an advanced AI educating the programmers.
> > > > > >
> > > > > > Well, you are right in some sense. -:)
> > > > > >
> > > > > > However, overflow is one important source for security flaws, it’s important  for compilers to detect
> > > > > > overflows in the programs in general.
> > > > >
> > > > > Except it is NOT an overflow. Rather it is wrapping. That is a big
> > > > > point here. unsigned wraps and does NOT overflow. Yes there is a major
> > > > > difference.
> > > >
> > > > Right, yes. I will try to pick my language very carefully. :)
> > > >
> > > > The practical problem I am trying to solve in the 30 million lines of
> > > > Linux kernel code is that of catching arithmetic wrap-around. The
> > > > problem is one of evolving the code -- I can't just drop -fwrapv and
> > > > -fwrapv-pointer because it's not possible to fix all the cases at once.
> > > > (And we really don't want to reintroduce undefined behavior.)
> > > >
> > > > So, for signed, pointer, and unsigned types, we need:
> > > >
> > > > a) No arithmetic UB -- everything needs to have deterministic behavior.
> > > >    The current solution here is "-fno-strict-overflow", which eliminates
> > > >    the UB and makes sure everything wraps.
> > > >
> > > > b) A way to run-time warn/trap on overflow/underflow/wrap-around. This
> > > >    would work with -fsanitize=[signed-integer|pointer]-overflow except
> > > >    due to "a)" we always wrap. And there isn't currently coverage like
> > > >    this for unsigned (in GCC).
> > > >
> > > > Our problem is that the kernel is filled with a mix of places where there
> > > > is intended wrap-around and unintended wrap-around. We can chip away at
> > > > fixing the intended wrap-around that we can find with static analyzers,
> > > > etc, but at the end of the day there is a long tail of finding the places
> > > > where intended wrap-around is hiding. But when the refactoring is
> > > > sufficiently completely, we can move the wrap-around warning to a trap,
> > > > and the kernel will not longer have this class of security flaw.
> > > >
> > > > As a real-world example, here is a bug where a u8 wraps around causing
> > > > an under-allocation that allowed for a heap overwrite:
> > > >
> > > > https://git.kernel.org/linus/6311071a0562
> > > > https://elixir.bootlin.com/linux/v6.5/source/net/wireless/nl80211.c#L5422
> > > >
> > > > If there were more than 255 elements in a linked list, the allocation
> > > > would be too small, and the second loop would write past the end of the
> > > > allocation. This is a pretty classic allocation underflow and linear
> > > > heap write overflow security flaw. (And it would be trivially stopped by
> > > > trapping on the u8 wrap around.)
> > > >
> > > > So, I want to be able to catch that at run-time. But we also have code
> > > > doing things like "if (ulong + offset < ulong) { ... }":
> > > >
> > > > https://elixir.bootlin.com/linux/v6.5/source/drivers/crypto/axis/artpec6_crypto.c#L1187
> > > >
> > > > This is easy for a static analyzer to find and we can replace it with a
> > > > non-wrapping test (e.g. __builtin_add_overflow()), but we'll not find
> > > > them all immediately, especially for the signed and pointer cases.
> > > >
> > > > So, I need to retain the "everything wraps" behavior while still being
> > > > able to detect when it happens.
> > >
> > >
> > > Hi Kees,
> > >
> > > I have a couple of questions:
> > >
> > > Currently, my thinking was that you would use signed integers
> > > if you want the usual integer arithmetic rules we know from
> > > elementary school and if you overflow this is clearly a bug
> > > you can diagnose with UBsan.
> > >
> > > There are people who think that signed overflow should be
> > > defined to wrap, but I think this would be a severe
> > > mistake because then code would start to rely on it, which
> > > makes it then difficult to differentiate between bugs and
> > > intended uses (e.g. the unfortunate situation you have
> > > with the kernel).
> > >
> > > I assume you want to combine UBSan plus wrapping for
> > > production use?  Or only for testing?   Or in other words:
> > > why would testing UBSan and production with wrapping
> > > not be sufficient to find and fix all bugs?
> > >
> > > Wrapping would not be correct because it may lead to
> > > logic errors or use-after-free etc.  I assume it is still
> > > preferred because it more deterministic than whatever comes
> > > out of the optimizer assuming that overflow has UB. Is this
> > > the reasoning applied here?
> > >
> > >
> > > For unsigned the intended use case is modulo arithmetic
> > > where wrapping is the correct behavior. At least, this
> > > is what I thought so far.. This seems also to be the
> > > position of the overall GCC community rejecting
> > > -fsanitize=unsigned-integer-overflow.
> > >
> > > But then there are also people like Richard Seacord that
> > > have the position that one should use  "unsigned" for
> > > every quantity which can not be negative, which implies
> > > that then the modulo use case becomes the exception.
> > >
> > > Related to this I have the following question: In the
> > > bug you refer to above, an unsigned variable was used
> > > for something that is not meant for modulo arithmetic.
> > > Is this done in the kernel to save space?  I.e. because
> > > 127 would not be enough as maximum but going to i16 takes
> > > to much space? or is this for compatibility with some
> > > on-the-wire protocol?
> > >
> > > Would an attribute for variables help that tells the
> > > compiler that if stores to the variable wrap around
> > > then this is not intended and this is an error?
> > >
> > > u8 x [[nowrap]];
> > >
> > > x = 256; // can be diagnosed ?
> >
> > There's a problem with representing this, it's a long-standing thing standing
> > in the way of handling -fwrapv in the IL.  Maybe C wants to have
> > 'nonnegative int' in addition to 'unsigned int', or 'bit int' for the
> > "just some two-complement bits".
> >
> > For GCC we think of -fwrapv to perform signed integer arithmetic as if
> > promoted to a large enough type first and then truncated to the original
> > type in the implementation defined manner (modulo reducing).  But we
> > end up doing what the actual hardware does (I'm for example unsure
> > how that handles INT_MIN / -1 - x86 documents it as raising #DE
> > which it indeed does, even with -fwrapv).
> >
> > volatile int a, b;
> > int main ()
> > {
> >   a = -__INT_MAX__ - 1;
> >   b = -1;
> >   int z = a / b;
> >   __builtin_printf ("%d\n", z);
> > }
> >
> > raises SIGFPE with and without -fwrapv.  I'll note that we document
> > -fwrapv to only affect addition, subtraction and multiplication
> > (it also affects negation), matching the libgcc *v functions we have
> > for -ftrapv.  -fsanitize=undefined instruments division but
> > -fsanitize=recover doesn't help.  Maybe there's a bug about this
> > recorded.  The documentation of -fwrapv should likely be clarified.
> >
> > For recent work I failed to spot the C standards language on
> > singed overflow behavior for negation, addition, subtraction
> > and multiplication.  I can find it for division though.  Can someone
> > point me to where that is (moved?)?
>
> I don't think it moved. It should follow from the general rule 6.5p5:
>
> "If an exceptional condition occurs during the evaluation of an
> expression (that is, if the result is not mathematically defined
> or not in the range of representable values for its type), the
> behavior is undefined."

Ah, and the unsigned integer type exception is then 6.2.5/9
but using "can never overflow" wording rather than "in the range
of representable values for its type".  Integer "overflow" isn't
defined anywhere and I suppose the unsigned exception also
applies to underflow for expressions like 0 - 1.

It would have been useful to repeat some of this in the sections
documenting the relevant operators.

Richard.

>
> Martin
>
>
>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Question on -fwrapv and -fwrapv-pointer
  2023-09-18  8:47       ` Richard Biener
@ 2023-09-18 15:22         ` Martin Uecker
  0 siblings, 0 replies; 9+ messages in thread
From: Martin Uecker @ 2023-09-18 15:22 UTC (permalink / raw)
  To: Richard Biener; +Cc: keescook, gcc

Am Montag, dem 18.09.2023 um 10:47 +0200 schrieb Richard Biener via Gcc:
> On Mon, Sep 18, 2023 at 10:17 AM Martin Uecker <ma.uecker@gmail.com> wrote:
> > 
> > Am Montag, dem 18.09.2023 um 09:31 +0200 schrieb Richard Biener via Gcc:
> > > On Sat, Sep 16, 2023 at 10:38 AM Martin Uecker via Gcc <gcc@gcc.gnu.org> wrote:
> > > > 
> > > > 
> > > > 
> > > > (moved to gcc@)
> > > > 
> > > > > On Fri, Sep 15, 2023 at 08:18:28AM -0700, Andrew Pinski wrote:
> > > > > > On Fri, Sep 15, 2023 at 8:12 AM Qing Zhao <qing.zhao@oracle.com> wrote:
> > > > > > > 
> > > > > > > 
> > > > > > > 
> > > > > > > > On Sep 15, 2023, at 3:43 AM, Xi Ruoyao <xry111@xry111.site> wrote:
> > > > > > > > 
> > > > > > > > On Thu, 2023-09-14 at 21:41 +0000, Qing Zhao wrote:
> > > > > > > > > > > CLANG already provided -fsanitize=unsigned-integer-overflow. GCC
> > > > > > > > > > > might need to do the same.
> > > > > > > > > > 
> > > > > > > > > > NO. There is no such thing as unsigned integer overflow. That option
> > > > > > > > > > is badly designed and the GCC community has rejected a few times now
> > > > > > > > > > having that sanitizer before. It is bad form to have a sanitizer for
> > > > > > > > > > well defined code.
> > > > > > > > > 
> > > > > > > > > Even though unsigned integer overflow is well defined, it might be
> > > > > > > > > unintentional, shall we warn user about this?
> > > > > > > > 
> > > > > > > > *Everything* could be unintentional and should be warned then.  GCC is a
> > > > > > > > compiler, not an advanced AI educating the programmers.
> > > > > > > 
> > > > > > > Well, you are right in some sense. -:)
> > > > > > > 
> > > > > > > However, overflow is one important source for security flaws, it’s important  for compilers to detect
> > > > > > > overflows in the programs in general.
> > > > > > 
> > > > > > Except it is NOT an overflow. Rather it is wrapping. That is a big
> > > > > > point here. unsigned wraps and does NOT overflow. Yes there is a major
> > > > > > difference.
> > > > > 
> > > > > Right, yes. I will try to pick my language very carefully. :)
> > > > > 
> > > > > The practical problem I am trying to solve in the 30 million lines of
> > > > > Linux kernel code is that of catching arithmetic wrap-around. The
> > > > > problem is one of evolving the code -- I can't just drop -fwrapv and
> > > > > -fwrapv-pointer because it's not possible to fix all the cases at once.
> > > > > (And we really don't want to reintroduce undefined behavior.)
> > > > > 
> > > > > So, for signed, pointer, and unsigned types, we need:
> > > > > 
> > > > > a) No arithmetic UB -- everything needs to have deterministic behavior.
> > > > >    The current solution here is "-fno-strict-overflow", which eliminates
> > > > >    the UB and makes sure everything wraps.
> > > > > 
> > > > > b) A way to run-time warn/trap on overflow/underflow/wrap-around. This
> > > > >    would work with -fsanitize=[signed-integer|pointer]-overflow except
> > > > >    due to "a)" we always wrap. And there isn't currently coverage like
> > > > >    this for unsigned (in GCC).
> > > > > 
> > > > > Our problem is that the kernel is filled with a mix of places where there
> > > > > is intended wrap-around and unintended wrap-around. We can chip away at
> > > > > fixing the intended wrap-around that we can find with static analyzers,
> > > > > etc, but at the end of the day there is a long tail of finding the places
> > > > > where intended wrap-around is hiding. But when the refactoring is
> > > > > sufficiently completely, we can move the wrap-around warning to a trap,
> > > > > and the kernel will not longer have this class of security flaw.
> > > > > 
> > > > > As a real-world example, here is a bug where a u8 wraps around causing
> > > > > an under-allocation that allowed for a heap overwrite:
> > > > > 
> > > > > https://git.kernel.org/linus/6311071a0562
> > > > > https://elixir.bootlin.com/linux/v6.5/source/net/wireless/nl80211.c#L5422
> > > > > 
> > > > > If there were more than 255 elements in a linked list, the allocation
> > > > > would be too small, and the second loop would write past the end of the
> > > > > allocation. This is a pretty classic allocation underflow and linear
> > > > > heap write overflow security flaw. (And it would be trivially stopped by
> > > > > trapping on the u8 wrap around.)
> > > > > 
> > > > > So, I want to be able to catch that at run-time. But we also have code
> > > > > doing things like "if (ulong + offset < ulong) { ... }":
> > > > > 
> > > > > https://elixir.bootlin.com/linux/v6.5/source/drivers/crypto/axis/artpec6_crypto.c#L1187
> > > > > 
> > > > > This is easy for a static analyzer to find and we can replace it with a
> > > > > non-wrapping test (e.g. __builtin_add_overflow()), but we'll not find
> > > > > them all immediately, especially for the signed and pointer cases.
> > > > > 
> > > > > So, I need to retain the "everything wraps" behavior while still being
> > > > > able to detect when it happens.
> > > > 
> > > > 
> > > > Hi Kees,
> > > > 
> > > > I have a couple of questions:
> > > > 
> > > > Currently, my thinking was that you would use signed integers
> > > > if you want the usual integer arithmetic rules we know from
> > > > elementary school and if you overflow this is clearly a bug
> > > > you can diagnose with UBsan.
> > > > 
> > > > There are people who think that signed overflow should be
> > > > defined to wrap, but I think this would be a severe
> > > > mistake because then code would start to rely on it, which
> > > > makes it then difficult to differentiate between bugs and
> > > > intended uses (e.g. the unfortunate situation you have
> > > > with the kernel).
> > > > 
> > > > I assume you want to combine UBSan plus wrapping for
> > > > production use?  Or only for testing?   Or in other words:
> > > > why would testing UBSan and production with wrapping
> > > > not be sufficient to find and fix all bugs?
> > > > 
> > > > Wrapping would not be correct because it may lead to
> > > > logic errors or use-after-free etc.  I assume it is still
> > > > preferred because it more deterministic than whatever comes
> > > > out of the optimizer assuming that overflow has UB. Is this
> > > > the reasoning applied here?
> > > > 
> > > > 
> > > > For unsigned the intended use case is modulo arithmetic
> > > > where wrapping is the correct behavior. At least, this
> > > > is what I thought so far.. This seems also to be the
> > > > position of the overall GCC community rejecting
> > > > -fsanitize=unsigned-integer-overflow.
> > > > 
> > > > But then there are also people like Richard Seacord that
> > > > have the position that one should use  "unsigned" for
> > > > every quantity which can not be negative, which implies
> > > > that then the modulo use case becomes the exception.
> > > > 
> > > > Related to this I have the following question: In the
> > > > bug you refer to above, an unsigned variable was used
> > > > for something that is not meant for modulo arithmetic.
> > > > Is this done in the kernel to save space?  I.e. because
> > > > 127 would not be enough as maximum but going to i16 takes
> > > > to much space? or is this for compatibility with some
> > > > on-the-wire protocol?
> > > > 
> > > > Would an attribute for variables help that tells the
> > > > compiler that if stores to the variable wrap around
> > > > then this is not intended and this is an error?
> > > > 
> > > > u8 x [[nowrap]];
> > > > 
> > > > x = 256; // can be diagnosed ?
> > > 
> > > There's a problem with representing this, it's a long-standing thing standing
> > > in the way of handling -fwrapv in the IL.  Maybe C wants to have
> > > 'nonnegative int' in addition to 'unsigned int', or 'bit int' for the
> > > "just some two-complement bits".
> > > 
> > > For GCC we think of -fwrapv to perform signed integer arithmetic as if
> > > promoted to a large enough type first and then truncated to the original
> > > type in the implementation defined manner (modulo reducing).  But we
> > > end up doing what the actual hardware does (I'm for example unsure
> > > how that handles INT_MIN / -1 - x86 documents it as raising #DE
> > > which it indeed does, even with -fwrapv).
> > > 
> > > volatile int a, b;
> > > int main ()
> > > {
> > >   a = -__INT_MAX__ - 1;
> > >   b = -1;
> > >   int z = a / b;
> > >   __builtin_printf ("%d\n", z);
> > > }
> > > 
> > > raises SIGFPE with and without -fwrapv.  I'll note that we document
> > > -fwrapv to only affect addition, subtraction and multiplication
> > > (it also affects negation), matching the libgcc *v functions we have
> > > for -ftrapv.  -fsanitize=undefined instruments division but
> > > -fsanitize=recover doesn't help.  Maybe there's a bug about this
> > > recorded.  The documentation of -fwrapv should likely be clarified.

Thank you for the explanation. But note that I suggested the 
opposite: "nowrap" . A compiler would not need to be changed
at all and can simply ignore it this attribute. But if it sees 
that a too large value is stored in an unsigned type that
is too small, it could warn in the frontend. And we could add 
a sanitizer mode to diagnose wraparound at run-time. (which 
would then not have the problem with false  positives as 
-funsigned-integer-overflow would have)
> > > 

> > > For recent work I failed to spot the C standards language on
> > > singed overflow behavior for negation, addition, subtraction
> > > and multiplication.  I can find it for division though.  Can someone
> > > point me to where that is (moved?)?
> > 
> > I don't think it moved. It should follow from the general rule 6.5p5:
> > 
> > "If an exceptional condition occurs during the evaluation of an
> > expression (that is, if the result is not mathematically defined
> > or not in the range of representable values for its type), the
> > behavior is undefined."
> 
> Ah, and the unsigned integer type exception is then 6.2.5/9
> but using "can never overflow" wording rather than "in the range
> of representable values for its type".
> 
The "can never overflow" is consequence of modulo
arithmetic, because then the result is in the
representable range.

>   Integer "overflow" isn't defined anywhere and 

It might be defined in ISO/IEC 2382.

> I suppose the unsigned exception also
> applies to underflow for expressions like 0 - 1.

Yes, because the operation is performed modulo 2^N.

> 
> It would have been useful to repeat some of this in the sections
> documenting the relevant operators.

I agree.

Martin




^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Question on -fwrapv and -fwrapv-pointer
  2023-09-18  7:31   ` Richard Biener
  2023-09-18  8:17     ` Martin Uecker
@ 2023-09-18 15:27     ` Andrew Pinski
  1 sibling, 0 replies; 9+ messages in thread
From: Andrew Pinski @ 2023-09-18 15:27 UTC (permalink / raw)
  To: Richard Biener; +Cc: Martin Uecker, keescook, gcc

On Mon, Sep 18, 2023 at 12:33 AM Richard Biener via Gcc <gcc@gcc.gnu.org> wrote:
>
> On Sat, Sep 16, 2023 at 10:38 AM Martin Uecker via Gcc <gcc@gcc.gnu.org> wrote:
> >
> >
> >
> > (moved to gcc@)
> >
> > > On Fri, Sep 15, 2023 at 08:18:28AM -0700, Andrew Pinski wrote:
> > > > On Fri, Sep 15, 2023 at 8:12 AM Qing Zhao <qing.zhao@oracle.com> wrote:
> > > > >
> > > > >
> > > > >
> > > > > > On Sep 15, 2023, at 3:43 AM, Xi Ruoyao <xry111@xry111.site> wrote:
> > > > > >
> > > > > > On Thu, 2023-09-14 at 21:41 +0000, Qing Zhao wrote:
> > > > > >>>> CLANG already provided -fsanitize=unsigned-integer-overflow. GCC
> > > > > >>>> might need to do the same.
> > > > > >>>
> > > > > >>> NO. There is no such thing as unsigned integer overflow. That option
> > > > > >>> is badly designed and the GCC community has rejected a few times now
> > > > > >>> having that sanitizer before. It is bad form to have a sanitizer for
> > > > > >>> well defined code.
> > > > > >>
> > > > > >> Even though unsigned integer overflow is well defined, it might be
> > > > > >> unintentional, shall we warn user about this?
> > > > > >
> > > > > > *Everything* could be unintentional and should be warned then.  GCC is a
> > > > > > compiler, not an advanced AI educating the programmers.
> > > > >
> > > > > Well, you are right in some sense. -:)
> > > > >
> > > > > However, overflow is one important source for security flaws, it’s important  for compilers to detect
> > > > > overflows in the programs in general.
> > > >
> > > > Except it is NOT an overflow. Rather it is wrapping. That is a big
> > > > point here. unsigned wraps and does NOT overflow. Yes there is a major
> > > > difference.
> > >
> > > Right, yes. I will try to pick my language very carefully. :)
> > >
> > > The practical problem I am trying to solve in the 30 million lines of
> > > Linux kernel code is that of catching arithmetic wrap-around. The
> > > problem is one of evolving the code -- I can't just drop -fwrapv and
> > > -fwrapv-pointer because it's not possible to fix all the cases at once.
> > > (And we really don't want to reintroduce undefined behavior.)
> > >
> > > So, for signed, pointer, and unsigned types, we need:
> > >
> > > a) No arithmetic UB -- everything needs to have deterministic behavior.
> > >    The current solution here is "-fno-strict-overflow", which eliminates
> > >    the UB and makes sure everything wraps.
> > >
> > > b) A way to run-time warn/trap on overflow/underflow/wrap-around. This
> > >    would work with -fsanitize=[signed-integer|pointer]-overflow except
> > >    due to "a)" we always wrap. And there isn't currently coverage like
> > >    this for unsigned (in GCC).
> > >
> > > Our problem is that the kernel is filled with a mix of places where there
> > > is intended wrap-around and unintended wrap-around. We can chip away at
> > > fixing the intended wrap-around that we can find with static analyzers,
> > > etc, but at the end of the day there is a long tail of finding the places
> > > where intended wrap-around is hiding. But when the refactoring is
> > > sufficiently completely, we can move the wrap-around warning to a trap,
> > > and the kernel will not longer have this class of security flaw.
> > >
> > > As a real-world example, here is a bug where a u8 wraps around causing
> > > an under-allocation that allowed for a heap overwrite:
> > >
> > > https://git.kernel.org/linus/6311071a0562
> > > https://elixir.bootlin.com/linux/v6.5/source/net/wireless/nl80211.c#L5422
> > >
> > > If there were more than 255 elements in a linked list, the allocation
> > > would be too small, and the second loop would write past the end of the
> > > allocation. This is a pretty classic allocation underflow and linear
> > > heap write overflow security flaw. (And it would be trivially stopped by
> > > trapping on the u8 wrap around.)
> > >
> > > So, I want to be able to catch that at run-time. But we also have code
> > > doing things like "if (ulong + offset < ulong) { ... }":
> > >
> > > https://elixir.bootlin.com/linux/v6.5/source/drivers/crypto/axis/artpec6_crypto.c#L1187
> > >
> > > This is easy for a static analyzer to find and we can replace it with a
> > > non-wrapping test (e.g. __builtin_add_overflow()), but we'll not find
> > > them all immediately, especially for the signed and pointer cases.
> > >
> > > So, I need to retain the "everything wraps" behavior while still being
> > > able to detect when it happens.
> >
> >
> > Hi Kees,
> >
> > I have a couple of questions:
> >
> > Currently, my thinking was that you would use signed integers
> > if you want the usual integer arithmetic rules we know from
> > elementary school and if you overflow this is clearly a bug
> > you can diagnose with UBsan.
> >
> > There are people who think that signed overflow should be
> > defined to wrap, but I think this would be a severe
> > mistake because then code would start to rely on it, which
> > makes it then difficult to differentiate between bugs and
> > intended uses (e.g. the unfortunate situation you have
> > with the kernel).
> >
> > I assume you want to combine UBSan plus wrapping for
> > production use?  Or only for testing?   Or in other words:
> > why would testing UBSan and production with wrapping
> > not be sufficient to find and fix all bugs?
> >
> > Wrapping would not be correct because it may lead to
> > logic errors or use-after-free etc.  I assume it is still
> > preferred because it more deterministic than whatever comes
> > out of the optimizer assuming that overflow has UB. Is this
> > the reasoning applied here?
> >
> >
> > For unsigned the intended use case is modulo arithmetic
> > where wrapping is the correct behavior. At least, this
> > is what I thought so far.. This seems also to be the
> > position of the overall GCC community rejecting
> > -fsanitize=unsigned-integer-overflow.
> >
> > But then there are also people like Richard Seacord that
> > have the position that one should use  "unsigned" for
> > every quantity which can not be negative, which implies
> > that then the modulo use case becomes the exception.
> >
> > Related to this I have the following question: In the
> > bug you refer to above, an unsigned variable was used
> > for something that is not meant for modulo arithmetic.
> > Is this done in the kernel to save space?  I.e. because
> > 127 would not be enough as maximum but going to i16 takes
> > to much space? or is this for compatibility with some
> > on-the-wire protocol?
> >
> > Would an attribute for variables help that tells the
> > compiler that if stores to the variable wrap around
> > then this is not intended and this is an error?
> >
> > u8 x [[nowrap]];
> >
> > x = 256; // can be diagnosed ?
>
> There's a problem with representing this, it's a long-standing thing standing
> in the way of handling -fwrapv in the IL.  Maybe C wants to have
> 'nonnegative int' in addition to 'unsigned int', or 'bit int' for the
> "just some two-complement bits".
>
> For GCC we think of -fwrapv to perform signed integer arithmetic as if
> promoted to a large enough type first and then truncated to the original
> type in the implementation defined manner (modulo reducing).  But we
> end up doing what the actual hardware does (I'm for example unsure
> how that handles INT_MIN / -1 - x86 documents it as raising #DE
> which it indeed does, even with -fwrapv).
>
> volatile int a, b;
> int main ()
> {
>   a = -__INT_MAX__ - 1;
>   b = -1;
>   int z = a / b;
>   __builtin_printf ("%d\n", z);
> }
>
> raises SIGFPE with and without -fwrapv.  I'll note that we document
> -fwrapv to only affect addition, subtraction and multiplication
> (it also affects negation),
One small addition, it also affects the function `abs` (and the
corresponding builtins); the same way as negation.

Thanks,
Andrew

> matching the libgcc *v functions we have
> for -ftrapv.  -fsanitize=undefined instruments division but
> -fsanitize=recover doesn't help.  Maybe there's a bug about this
> recorded.  The documentation of -fwrapv should likely be clarified.
>
> For recent work I failed to spot the C standards language on
> singed overflow behavior for negation, addition, subtraction
> and multiplication.  I can find it for division though.  Can someone
> point me to where that is (moved?)?
>
> Richard.
>
> >
> >
> >
> > Martin
> >
> >
> >
> >
> >
> >

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Question on -fwrapv and -fwrapv-pointer
  2023-09-16  8:36 ` Question on -fwrapv and -fwrapv-pointer Martin Uecker
  2023-09-18  7:31   ` Richard Biener
@ 2023-09-20 20:40   ` Kees Cook
  2023-09-20 21:16     ` Martin Uecker
  1 sibling, 1 reply; 9+ messages in thread
From: Kees Cook @ 2023-09-20 20:40 UTC (permalink / raw)
  To: Martin Uecker; +Cc: gcc

On Sat, Sep 16, 2023 at 10:36:52AM +0200, Martin Uecker wrote:
> > On Fri, Sep 15, 2023 at 08:18:28AM -0700, Andrew Pinski wrote:
> > > On Fri, Sep 15, 2023 at 8:12 AM Qing Zhao <qing.zhao@oracle.com> wrote:
> > > >
> > > >
> > > >
> > > > > On Sep 15, 2023, at 3:43 AM, Xi Ruoyao <xry111@xry111.site> wrote:
> > > > >
> > > > > On Thu, 2023-09-14 at 21:41 +0000, Qing Zhao wrote:
> > > > >>>> CLANG already provided -fsanitize=unsigned-integer-overflow. GCC
> > > > >>>> might need to do the same.
> > > > >>>
> > > > >>> NO. There is no such thing as unsigned integer overflow. That option
> > > > >>> is badly designed and the GCC community has rejected a few times now
> > > > >>> having that sanitizer before. It is bad form to have a sanitizer for
> > > > >>> well defined code.
> > > > >>
> > > > >> Even though unsigned integer overflow is well defined, it might be
> > > > >> unintentional, shall we warn user about this?
> > > > >
> > > > > *Everything* could be unintentional and should be warned then.  GCC is a
> > > > > compiler, not an advanced AI educating the programmers.
> > > >
> > > > Well, you are right in some sense. -:)
> > > >
> > > > However, overflow is one important source for security flaws, it’s important  for compilers to detect
> > > > overflows in the programs in general.
> > > 
> > > Except it is NOT an overflow. Rather it is wrapping. That is a big
> > > point here. unsigned wraps and does NOT overflow. Yes there is a major
> > > difference.
> > 
> > Right, yes. I will try to pick my language very carefully. :)
> > 
> > The practical problem I am trying to solve in the 30 million lines of
> > Linux kernel code is that of catching arithmetic wrap-around. The
> > problem is one of evolving the code -- I can't just drop -fwrapv and
> > -fwrapv-pointer because it's not possible to fix all the cases at once.
> > (And we really don't want to reintroduce undefined behavior.)
> > 
> > So, for signed, pointer, and unsigned types, we need:
> > 
> > a) No arithmetic UB -- everything needs to have deterministic behavior.
> >    The current solution here is "-fno-strict-overflow", which eliminates
> >    the UB and makes sure everything wraps.
> > 
> > b) A way to run-time warn/trap on overflow/underflow/wrap-around. This
> >    would work with -fsanitize=[signed-integer|pointer]-overflow except
> >    due to "a)" we always wrap. And there isn't currently coverage like
> >    this for unsigned (in GCC).
> > 
> > Our problem is that the kernel is filled with a mix of places where there
> > is intended wrap-around and unintended wrap-around. We can chip away at
> > fixing the intended wrap-around that we can find with static analyzers,
> > etc, but at the end of the day there is a long tail of finding the places
> > where intended wrap-around is hiding. But when the refactoring is
> > sufficiently completely, we can move the wrap-around warning to a trap,
> > and the kernel will not longer have this class of security flaw.
> > 
> > As a real-world example, here is a bug where a u8 wraps around causing
> > an under-allocation that allowed for a heap overwrite:
> > 
> > https://git.kernel.org/linus/6311071a0562
> > https://elixir.bootlin.com/linux/v6.5/source/net/wireless/nl80211.c#L5422
> > 
> > If there were more than 255 elements in a linked list, the allocation
> > would be too small, and the second loop would write past the end of the
> > allocation. This is a pretty classic allocation underflow and linear
> > heap write overflow security flaw. (And it would be trivially stopped by
> > trapping on the u8 wrap around.)
> > 
> > So, I want to be able to catch that at run-time. But we also have code
> > doing things like "if (ulong + offset < ulong) { ... }":
> > 
> > https://elixir.bootlin.com/linux/v6.5/source/drivers/crypto/axis/artpec6_crypto.c#L1187
> > 
> > This is easy for a static analyzer to find and we can replace it with a
> > non-wrapping test (e.g. __builtin_add_overflow()), but we'll not find
> > them all immediately, especially for the signed and pointer cases.
> > 
> > So, I need to retain the "everything wraps" behavior while still being
> > able to detect when it happens.
> 
> 
> Hi Kees,
> 
> I have a couple of questions:
> 
> Currently, my thinking was that you would use signed integers
> if you want the usual integer arithmetic rules we know from
> elementary school and if you overflow this is clearly a bug 
> you can diagnose with UBsan.
> 
> There are people who think that signed overflow should be
> defined to wrap, but I think this would be a severe
> mistake because then code would start to rely on it, which
> makes it then difficult to differentiate between bugs and
> intended uses (e.g. the unfortunate situation you have 
> with the kernel).

Right -- my goal is to migrate the kernel codebase into using unambiguous
arithmetic. Doing that evolution, though, is the hard part.  :)

At present, the kernel treats all signed and pointer arithmetic as
wrapping, as that makes sure that there is no UB, which causes must more
unexpected problems than universally wrapping does.

> I assume you want to combine UBSan plus wrapping for
> production use?  Or only for testing?   Or in other words:
> why would testing UBSan and production with wrapping
> not be sufficient to find and fix all bugs? 

I want UBSan to catch wrapping in production -- there is going to be
a long tail of code that may be vulnerable to having it be manipulated
into wrapping a calculation.

The stuff that _intentionally_ wraps will stand out very quickly and we
can fix those rapidly. (Many we can find today with static analyzers,
but not all from what I've seen.)

> Wrapping would not be correct because it may lead to
> logic errors or use-after-free etc.  I assume it is still
> preferred because it more deterministic than whatever comes
> out of the optimizer assuming that overflow has UB. Is this
> the reasoning applied here?

Exactly correct.

> For unsigned the intended use case is modulo arithmetic
> where wrapping is the correct behavior. At least, this
> is what I thought so far.. This seems also to be the 
> position of the overall GCC community rejecting 
> -fsanitize=unsigned-integer-overflow.

That is my understanding as well: unsigned wrap is "intended behavior".
But for the Linux kernel this behavior is still "unexpected" in the
majority of places where it may be reachable.

> But then there are also people like Richard Seacord that
> have the position that one should use  "unsigned" for 
> every quantity which can not be negative, which implies
> that then the modulo use case becomes the exception.

Sure -- this is the preferred coding style, but we both have tons of
old code that uses "int", and we still almost always would interpret
wrapping as an unwanted state.

> Related to this I have the following question: In the 
> bug you refer to above, an unsigned variable was used
> for something that is not meant for modulo arithmetic.
> Is this done in the kernel to save space?  I.e. because
> 127 would not be enough as maximum but going to i16 takes
> to much space? or is this for compatibility with some
> on-the-wire protocol? 

I don't know the original rationale -- I expect the author wanted an
unsigned variable but didn't expect to ever encounter > 255 entries.
These kinds of expectations aren't uncommon in the kernel, and sometimes
past expectations get violated when refactoring, etc. Basically I can't
trust the sanity of the codebase, so I have to depend on the compiler to
provide the runtime coverage to avoid these kinds of security flaws.

> Would an attribute for variables help that tells the
> compiler that if stores to the variable wrap around
> then this is not intended and this is an error? 
> 
> u8 x [[nowrap]];
> 
> x = 256; // can be diagnosed ?

I would need the reverse: I want to assume all math to not wrap,
excepting a handful that are designed to.

-fsanitize=nowrap

u8 modulo [[wrapping]];
u8 counter;

modulo = 256; // no diagnose
counter = 256; // diagnose

-- 
Kees Cook

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Question on -fwrapv and -fwrapv-pointer
  2023-09-20 20:40   ` Kees Cook
@ 2023-09-20 21:16     ` Martin Uecker
  2023-09-24  4:00       ` Kees Cook
  0 siblings, 1 reply; 9+ messages in thread
From: Martin Uecker @ 2023-09-20 21:16 UTC (permalink / raw)
  To: Kees Cook; +Cc: gcc

Am Mittwoch, dem 20.09.2023 um 13:40 -0700 schrieb Kees Cook via Gcc:
> On Sat, Sep 16, 2023 at 10:36:52AM +0200, Martin Uecker wrote:
> > > On Fri, Sep 15, 2023 at 08:18:28AM -0700, Andrew Pinski wrote:
> > > > On Fri, Sep 15, 2023 at 8:12 AM Qing Zhao <qing.zhao@oracle.com> wrote:
> > > > > 
> > > > > 
> > > > > 
> > > > > > On Sep 15, 2023, at 3:43 AM, Xi Ruoyao <xry111@xry111.site> wrote:
> > > > > > 
> > > > > > On Thu, 2023-09-14 at 21:41 +0000, Qing Zhao wrote:
> > > > > > > > > CLANG already provided -fsanitize=unsigned-integer-overflow. GCC
> > > > > > > > > might need to do the same.
> > > > > > > > 
> > > > > > > > NO. There is no such thing as unsigned integer overflow. That option
> > > > > > > > is badly designed and the GCC community has rejected a few times now
> > > > > > > > having that sanitizer before. It is bad form to have a sanitizer for
> > > > > > > > well defined code.
> > > > > > > 
> > > > > > > Even though unsigned integer overflow is well defined, it might be
> > > > > > > unintentional, shall we warn user about this?
> > > > > > 
> > > > > > *Everything* could be unintentional and should be warned then.  GCC is a
> > > > > > compiler, not an advanced AI educating the programmers.
> > > > > 
> > > > > Well, you are right in some sense. -:)
> > > > > 
> > > > > However, overflow is one important source for security flaws, it’s important  for compilers to detect
> > > > > overflows in the programs in general.
> > > > 
> > > > Except it is NOT an overflow. Rather it is wrapping. That is a big
> > > > point here. unsigned wraps and does NOT overflow. Yes there is a major
> > > > difference.
> > > 
> > > Right, yes. I will try to pick my language very carefully. :)
> > > 
> > > The practical problem I am trying to solve in the 30 million lines of
> > > Linux kernel code is that of catching arithmetic wrap-around. The
> > > problem is one of evolving the code -- I can't just drop -fwrapv and
> > > -fwrapv-pointer because it's not possible to fix all the cases at once.
> > > (And we really don't want to reintroduce undefined behavior.)
> > > 
> > > So, for signed, pointer, and unsigned types, we need:
> > > 
> > > a) No arithmetic UB -- everything needs to have deterministic behavior.
> > >    The current solution here is "-fno-strict-overflow", which eliminates
> > >    the UB and makes sure everything wraps.
> > > 
> > > b) A way to run-time warn/trap on overflow/underflow/wrap-around. This
> > >    would work with -fsanitize=[signed-integer|pointer]-overflow except
> > >    due to "a)" we always wrap. And there isn't currently coverage like
> > >    this for unsigned (in GCC).
> > > 
> > > Our problem is that the kernel is filled with a mix of places where there
> > > is intended wrap-around and unintended wrap-around. We can chip away at
> > > fixing the intended wrap-around that we can find with static analyzers,
> > > etc, but at the end of the day there is a long tail of finding the places
> > > where intended wrap-around is hiding. But when the refactoring is
> > > sufficiently completely, we can move the wrap-around warning to a trap,
> > > and the kernel will not longer have this class of security flaw.
> > > 
> > > As a real-world example, here is a bug where a u8 wraps around causing
> > > an under-allocation that allowed for a heap overwrite:
> > > 
> > > https://git.kernel.org/linus/6311071a0562
> > > https://elixir.bootlin.com/linux/v6.5/source/net/wireless/nl80211.c#L5422
> > > 
> > > If there were more than 255 elements in a linked list, the allocation
> > > would be too small, and the second loop would write past the end of the
> > > allocation. This is a pretty classic allocation underflow and linear
> > > heap write overflow security flaw. (And it would be trivially stopped by
> > > trapping on the u8 wrap around.)
> > > 
> > > So, I want to be able to catch that at run-time. But we also have code
> > > doing things like "if (ulong + offset < ulong) { ... }":
> > > 
> > > https://elixir.bootlin.com/linux/v6.5/source/drivers/crypto/axis/artpec6_crypto.c#L1187
> > > 
> > > This is easy for a static analyzer to find and we can replace it with a
> > > non-wrapping test (e.g. __builtin_add_overflow()), but we'll not find
> > > them all immediately, especially for the signed and pointer cases.
> > > 
> > > So, I need to retain the "everything wraps" behavior while still being
> > > able to detect when it happens.
> > 
> > 
> > Hi Kees,
> > 
> > I have a couple of questions:
> > 
> > Currently, my thinking was that you would use signed integers
> > if you want the usual integer arithmetic rules we know from
> > elementary school and if you overflow this is clearly a bug 
> > you can diagnose with UBsan.
> > 
> > There are people who think that signed overflow should be
> > defined to wrap, but I think this would be a severe
> > mistake because then code would start to rely on it, which
> > makes it then difficult to differentiate between bugs and
> > intended uses (e.g. the unfortunate situation you have 
> > with the kernel).
> 
> Right -- my goal is to migrate the kernel codebase into using unambiguous
> arithmetic. Doing that evolution, though, is the hard part.  :)
> 
> At present, the kernel treats all signed and pointer arithmetic as
> wrapping, as that makes sure that there is no UB, which causes must more
> unexpected problems than universally wrapping does.
> 
> > I assume you want to combine UBSan plus wrapping for
> > production use?  Or only for testing?   Or in other words:
> > why would testing UBSan and production with wrapping
> > not be sufficient to find and fix all bugs? 
> 
> I want UBSan to catch wrapping in production -- there is going to be
> a long tail of code that may be vulnerable to having it be manipulated
> into wrapping a calculation.
> 
> The stuff that _intentionally_ wraps will stand out very quickly and we
> can fix those rapidly. (Many we can find today with static analyzers,
> but not all from what I've seen.)
> 
> > Wrapping would not be correct because it may lead to
> > logic errors or use-after-free etc.  I assume it is still
> > preferred because it more deterministic than whatever comes
> > out of the optimizer assuming that overflow has UB. Is this
> > the reasoning applied here?
> 
> Exactly correct.
> 
> > For unsigned the intended use case is modulo arithmetic
> > where wrapping is the correct behavior. At least, this
> > is what I thought so far.. This seems also to be the 
> > position of the overall GCC community rejecting 
> > -fsanitize=unsigned-integer-overflow.
> 
> That is my understanding as well: unsigned wrap is "intended behavior".
> But for the Linux kernel this behavior is still "unexpected" in the
> majority of places where it may be reachable.
> 
> > But then there are also people like Richard Seacord that
> > have the position that one should use  "unsigned" for 
> > every quantity which can not be negative, which implies
> > that then the modulo use case becomes the exception.
> 
> Sure -- this is the preferred coding style, but we both have tons of
> old code that uses "int", and we still almost always would interpret
> wrapping as an unwanted state.
> 
> > Related to this I have the following question: In the 
> > bug you refer to above, an unsigned variable was used
> > for something that is not meant for modulo arithmetic.
> > Is this done in the kernel to save space?  I.e. because
> > 127 would not be enough as maximum but going to i16 takes
> > to much space? or is this for compatibility with some
> > on-the-wire protocol? 
> 
> I don't know the original rationale -- I expect the author wanted an
> unsigned variable but didn't expect to ever encounter > 255 entries.

I meant something else: Why wasn't an unsigned type
used in the first place?   If all "counter" variables were
signed and all "modulo" variables unsigned, one could already 
diagnose overflow reliably. 

I was trying to understand if there are generally
valid reasons for using unsigned integers for "counters".
or whether this is just a historical mistake in the
kernel.

(and thank you for your explanations)

Martin

> These kinds of expectations aren't uncommon in the kernel, and sometimes
> past expectations get violated when refactoring, etc. Basically I can't
> trust the sanity of the codebase, so I have to depend on the compiler to
> provide the runtime coverage to avoid these kinds of security flaws.
> 
> > Would an attribute for variables help that tells the
> > compiler that if stores to the variable wrap around
> > then this is not intended and this is an error? 
> > 
> > u8 x [[nowrap]];
> > 
> > x = 256; // can be diagnosed ?
> 
> I would need the reverse: I want to assume all math to not wrap,
> excepting a handful that are designed to.
> 
> -fsanitize=nowrap
> 
> u8 modulo [[wrapping]];
> u8 counter;
> 
> modulo = 256; // no diagnose
> counter = 256; // diagnose
> 


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Question on -fwrapv and -fwrapv-pointer
  2023-09-20 21:16     ` Martin Uecker
@ 2023-09-24  4:00       ` Kees Cook
  0 siblings, 0 replies; 9+ messages in thread
From: Kees Cook @ 2023-09-24  4:00 UTC (permalink / raw)
  To: Martin Uecker; +Cc: gcc

On Wed, Sep 20, 2023 at 11:16:21PM +0200, Martin Uecker wrote:
> I meant something else: Why wasn't an unsigned type
> used in the first place?   If all "counter" variables were
> signed and all "modulo" variables unsigned, one could already 
> diagnose overflow reliably. 
> 
> I was trying to understand if there are generally
> valid reasons for using unsigned integers for "counters".
> or whether this is just a historical mistake in the
> kernel.

There isn't a common guideline in the kernel coding style for this, so
it's mostly arbitrary. :(

> (and thank you for your explanations)

Sure thing! Thanks for your (and everyone's) help clarifying this whole
area. :)

-Kees

-- 
Kees Cook

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2023-09-24  4:00 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <202309151107.C2F7D9A01@keescook>
2023-09-16  8:36 ` Question on -fwrapv and -fwrapv-pointer Martin Uecker
2023-09-18  7:31   ` Richard Biener
2023-09-18  8:17     ` Martin Uecker
2023-09-18  8:47       ` Richard Biener
2023-09-18 15:22         ` Martin Uecker
2023-09-18 15:27     ` Andrew Pinski
2023-09-20 20:40   ` Kees Cook
2023-09-20 21:16     ` Martin Uecker
2023-09-24  4:00       ` Kees Cook

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).