public inbox for gcc@gcc.gnu.org
 help / color / mirror / Atom feed
From: David Malcolm <dmalcolm@redhat.com>
To: Pierrick Philippe <pierrick.philippe@irisa.fr>, gcc@gcc.gnu.org
Subject: Re: [Static Analyzer] Loop handling - False positive for malloc-sm
Date: Mon, 20 Mar 2023 19:30:20 -0400	[thread overview]
Message-ID: <3b77234afb96947c9694d375b43b3096cbd45467.camel@redhat.com> (raw)
In-Reply-To: <34efc6e0-5bd8-879c-0288-154ba28f5f05@irisa.fr>

On Mon, 2023-03-20 at 13:28 +0100, Pierrick Philippe wrote:
> Hi everyone,
> 
> I'm still playing around with the analyzer, and wanted to have a look
> at 
> loop handling.
> I'm using a build from /trunk/ branch (/20230309/).
> 
> Here is my analyzed code:
> 
> '''
> 1| #include <stdlib.h>
> 2| int main(void) {
> 3|    void * ptr = malloc(sizeof(int));
> 4|    for (int i = 0; i < 10; i++) {
> 5|        if (i == 5) free(ptr);
> 6|    }
> 7|}
> '''
> 
> And here, the malloc-sm is reporting a double-free on line 5 with a 
> quite confusing output:
> 
> '''
> ./test.c: In function ‘main’:
> ./test.c:5:21: warning: double-‘free’ of ‘ptr’ [CWE-415] 
> [-Wanalyzer-double-free]
>      5 |         if (i == 5) free(ptr);
>         |                         ^~~~~~~~~
>    ‘main’: events 1-13
>      |
>      |   3 |     void * ptr = malloc(sizeof(int));
>      |      |                        ^~~~~~~~~~~~~~~~~~~
>      |      |                        |
>      |      |                        (1) allocated here
>      |   4 |     for (int i = 0; i < 10; i++) {
>      |      |                         ~~~~  ~~~
>      |      |                         |            |
>      |      |                         |            (5) ...to here
>      |      |                         (2) following ‘true’ branch
> (when 
> ‘i <= 9’)...
>      |      |                         (6) following ‘true’ branch
> (when 
> ‘i <= 9’)...
>      |      |                         (9) following ‘true’ branch
> (when 
> ‘i <= 9’)...
>      |   5 |         if (i == 5) free(ptr);
>      |      |            ~           ~~~~~
>      |      |            |             |
>      |      |            |             (8) first ‘free’ here
>      |      |            |             (12) ...to here
>      |      |            |             (13) second ‘free’ here; first
> ‘free’ was at (8)
>      |      |            (3) ...to here
>      |      |            (4) following ‘false’ branch (when ‘i !=
> 5’)...
>      |      |            (7) ...to here
>      |      |            (10) ...to here
>      |      |            (11) following ‘true’ branch (when ‘i ==
> 5’)...
>      |
> '''
> 
> So, I'm guessing that this false positive is due to how the analyzer
> is 
> handling loops.
> Which lead to my question: how are loops handled by the analyzer?

Sadly, the answer is currently "not very well" :/

I implemented my own approach, with a "widening_svalue" subclass of
symbolic value.  This is widening in the Abstract Interpretation sense,
(as opposed to the bitwise operations sense): if I see multiple values
on successive iterations, the widening_svalue tries to simulate that we
know the start value and the direction the variable is moving in.

This doesn't work well; arguably I should rewrite it, perhaps with an
iterator_svalue, though I'm not sure how it ought to work.  Some ideas:

* reuse gcc's existing SSA-based loop analysis, which I believe can
identify SSA names that are iterator variables, figure out their
bounds, and their per-iteration increments, etc.

* rework the program_point or supergraph code to have a notion of "1st
iteration of loop", "2nd iteration of loop", "subsequent iterations",
or similar, so that the analyzer can explore those cases differently
(on the assumption that such iterations hopefully catch the most
interesting bugs)

Dave

  reply	other threads:[~2023-03-20 23:30 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-03-20 12:28 Pierrick Philippe
2023-03-20 23:30 ` David Malcolm [this message]
2023-03-21  8:21   ` Pierrick Philippe
2023-03-22 18:19     ` David Malcolm
2023-03-23  8:06       ` Pierrick Philippe
2023-03-21 10:01   ` Shengyu Huang
2023-03-22 18:34     ` David Malcolm
2023-03-21 10:12   ` Shengyu Huang

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3b77234afb96947c9694d375b43b3096cbd45467.camel@redhat.com \
    --to=dmalcolm@redhat.com \
    --cc=gcc@gcc.gnu.org \
    --cc=pierrick.philippe@irisa.fr \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).