[GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin

public inbox for gcc@gcc.gnu.org
 help / color / mirror / Atom feed

* [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin
@ 2023-03-25 19:38 Eric Feng
  2023-03-26 15:58 ` David Malcolm
  0 siblings, 1 reply; 9+ messages in thread
From: Eric Feng @ 2023-03-25 19:38 UTC (permalink / raw)
  To: gcc

Hi GCC community,

For GSoC, I am extremely interested in working on the selected project
idea with respect to extending the static analysis pass. In
particular, porting gcc-python-plugin's cpychecker to a plugin for GCC
-fanalyzer as described in
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107646. Please find an
initial draft of my proposal below and let me know if it is a
reasonable starting point. Please also correct me if I am
misunderstanding any particular tasks and let me know what areas I
should add more information for or what else I may do in preparation.

_______

Describe the project and clearly define its goals:
One pertinent use case of the gcc-python plugin is as a static
analysis tool for CPython extension modules. The main goal is to help
programmers writing extensions identify common coding errors. Broadly,
the goal of this project is to port the functionalities of cpychecker
to a -fanalyzer plugin.

Below is a brief description of the functionalities of the static
analysis tool for which I will work on porting over to a -fanalyzer
plugin. The structure of the objectives is taken from the
gcc-python-plugin documentation:

Reference count checking: Manipulation of PyObjects is done via the
CPython API and in particular with respect to the objects' reference
count. When the reference count belonging to an object drops to zero,
we should free all resources associated with it. This check helps
ensure programmers identify problems with the reference count
associated with an object. For example, memory leaks with respect to
forgetting to decrement the reference count of an object (analogous to
malloc() without corresponding free()) or perhaps object access after
the object's reference count is zero (analogous to access after
free()).

Error-handling checking: Various checks for common errors such as
dereferencing a NULL value.

Errors in exception-handling: Checks for situations in which functions
returning PyObject* that is NULL are not gracefully handled.

Format string checking: Verify that arguments to various CPython APIs
which take format strings are correct.

Associating PyTypeObject instances with compile-time-types: Verify
that the run-time type of a PyTypeObject matches its corresponding
compile-time type for inputs where both are provided.

Verification of PyMethodDef tables: Verify that the function in
PyMethodDef tables matches the calling convention of the ml_flags set.

I suspect a good starting point would be existing proof-of-concept
-fanalyzer plugins such as the CPython GIL example
(analyzer_gil_plugin). Please let me know of any additional pointers.
If there is time to spare, I think it is reasonable to extend the
capabilities of the original checker as well (more details in the
expected timeline below).

Provide an expected timeline:
I suspect the first task to take the longest since it is relatively
involved and it also includes getting the initial infrastructure of
the plugin up. From the experience of the first task, I hope the rest
of the tasks would be implemented faster. Additionally, I understand
that the current timeline outline below is too vague. I wished to
check in with the community for some feedback on whether I am in the
right ballpark before committing to more details.

Week 1 - 7: Reference counting checking
Week 8: Error-handling checking
Week 9: Errors in exception handling
Week 10: Format string checking
Week 11: Verification of PyMethodDef tables
Week 12: I am planning the last week to be safety in case any of the
above tasks take longer than initially expected. In case everything
goes smoothly and there is time to spare, I think it is reasonable to
spend the time extending the capabilities of the original pass. Some
ideas include extending the subset of CPython API that cpychecker
currently support, matching up similar traces to solve the issue of
duplicate error reports, and/or addressing any of the other caveats
currently mentioned in the cpychecker documentation. Additional ideas
are welcome of course.

Briefly introduce yourself and your skills and/or accomplishments:
I am a current Masters in Computer Science student at Columbia
University. I did my undergraduates at Bates College (B.A Math) and
Columbia University (B.S Computer Science) respectively. My interests
are primarily in systems, programming languages, and compilers.

At Columbia, I work in the group led by Professor Stephen Edwards on
the SSLANG programming language: a language built atop the Sparse
Synchronous Model. For more formal information on the Sparse
Synchronous Model, please take a look at "The Sparse Synchronous Model
on Real Hardware" (2022). Please find our repo, along with my
contributions, here: https://github.com/ssm-lang/sslang (my GitHub
handle is @efric). My main contribution to the compiler so far
involved adding a static inlining optimization pass with another
SSLANG team member. Our implementation is mostly based on the work
shown in "Secrets of the Glasgow Haskell Compiler Inliner" (2002),
with modifications as necessary to better fit our goals. The current
implementation is a work in progress and we are still working on
adding (many) more features to it. My work in this project is written
in Haskell.

I also conduct research in the Columbia Systems Lab. Specifically, my
group and I, advised by Professor Jason Nieh, work on secure
containerization with respect to untrusted software systems. Armv9-A
introduced Realms, secure execution environments that are opaque to
untrusted operating systems, as part of the Arm Confidential Compute
Architecture (CCA). Please find more information on CCA in "Design and
Verification of the Arm Confidential Compute Architecture" (2022).
Introduced together was the Realm Management Monitor (RMM), an
interface for hypervisors to allow secure virtualization utilizing
Realms and the new hardware support. Currently, the Realm isolation
boundary is at the level of entire VMs. We are working on applying
Realms to secure containers. Work in this project is mostly at the
kernel and firmware level and is written in C and ARM assembly.

Pertaining experience with compilers in addition to SSLANG, my
undergraduate education included a class on compilers that involved
writing passes for Clang/LLVM. More currently, I am taking a
graduate-level class on Types, Languages, and Compilers where my
partner and I are working on a plugin for our own small toy language
compiler which would be able to perform type inference. The plugin
would generate relevant constraints and solve them on behalf of the
compiler. This project is still in its early stages, but the idea is
to delegate type inference functionalities to a generic library given
some information instead of having to write your own constraint
solver.

_______

Thank you for reviewing my proposal!

Best,
Eric Feng

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin
  2023-03-25 19:38 [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin Eric Feng
@ 2023-03-26 15:58 ` David Malcolm
  2023-03-28 12:08   ` Eric Feng
  0 siblings, 1 reply; 9+ messages in thread
From: David Malcolm @ 2023-03-26 15:58 UTC (permalink / raw)
  To: Eric Feng, gcc

On Sat, 2023-03-25 at 15:38 -0400, Eric Feng via Gcc wrote:
> Hi GCC community,
> 
> For GSoC, I am extremely interested in working on the selected
> project
> idea with respect to extending the static analysis pass. In
> particular, porting gcc-python-plugin's cpychecker to a plugin for
> GCC
> -fanalyzer as described in
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107646.

Hi Eric, welcome to the GCC commmunity.

I'm the author/maintainer of GCC's static analysis pass.  I'm also the
author of gcc-python-plugin and its erstwhile "cpychecker" code, so I'm
pleased that you're interested in the project.

I wrote gcc-python-plugin and cpychecker over a decade ago when I was
focused on CPython development (before I switched to GCC development),
but it's heavily bitrotted over the years, as I didn't have enough
cycles to keep it compatible with changes in both GCC and CPython
whilst working on GCC itself.  In particular, the cpychecker code
stopped working a number of GCC releases ago.  However, the cpychecker
code inspired much of my work on GCC's static analysis pass and on its
diagnostics subsystem, so much of it now lives on in C++ form as core
GCC functionality.  Also, the Python community would continue to find
static analysis of CPython extension modules useful, so it would be
good to have the idea live on as a GCC plugin on top of -fanalyzer.

>  Please find an
> initial draft of my proposal below and let me know if it is a
> reasonable starting point. Please also correct me if I am
> misunderstanding any particular tasks and let me know what areas I
> should add more information for or what else I may do in preparation.

Some ideas for familiarizing yourself with the problem space:

You should try building GCC from source, and hack in a trivial warning
that emits "hello world, I'm compiling function 'foo'".  I wrote a
guide to GCC for new contributors here that should get you started:
  https://gcc-newbies-guide.readthedocs.io/en/latest/
This will help you get familiar with GCC's internals, and although the
plan is to write a plugin, I expect that you'll run into places where a
patch to GCC itself is more appropriate (bugs and missing functionality
), so having your own debug build of GCC is a good idea.

You should become familiar with CPython's extension and embedding API.
See the excellent documentation here:
  https://docs.python.org/3/extending/extending.html
It's probably a good exercise to write your own trivial CPython
extension module.

You can read the old cpychecker code inside the gcc-python-plugin
repository, and I gave a couple of talks on it as PyCon a decade ago:

PyCon2012: "Static analysis of Python extension modules using GCC"
https://pyvideo.org/pycon-us-2012/static-analysis-of-python-extension-modules-using.html

PyCon2013: "Death by a thousand leaks: what statically-analysing 370
Python extensions looks like"
https://pyvideo.org/pycon-us-2013/death-by-a-thousand-leaks-what-statically-analys.html
https://www.youtube.com/watch?v=bblvGKzZfFI

(sorry about all the "ums" and "errs"; it's fascinating and
embarrassing to watch myself from 11 years ago on this, and see how
much I've both forgotten and learned in the meantime.  Revisiting this
work, I'm ashamed to see that I was referring to the implementation as
based on "abstract interpretation" (and e.g. absinterp.py), when I now
realize it's actually based on symbolic execution (as is GCC's-
fanalyzer)

Also, this was during the transition era between Python 2 and Python 3,
whereas now we only have to care about Python 3.

There may be other caveats; I haven't fully rewatched those talks yet
:-/

Various comments inline below, throughout...

> 
> _______
> 
> Describe the project and clearly define its goals:
> One pertinent use case of the gcc-python plugin is as a static
> analysis tool for CPython extension modules.

It might be more accurate to use the past tense when referring to the
gcc-python plugin, alas.

>  The main goal is to help
> programmers writing extensions identify common coding errors.
> Broadly,
> the goal of this project is to port the functionalities of cpychecker
> to a -fanalyzer plugin.

(nods)

> 
> Below is a brief description of the functionalities of the static
> analysis tool for which I will work on porting over to a -fanalyzer
> plugin. The structure of the objectives is taken from the
> gcc-python-plugin documentation:
> 
> Reference count checking: Manipulation of PyObjects is done via the
> CPython API and in particular with respect to the objects' reference
> count. When the reference count belonging to an object drops to zero,
> we should free all resources associated with it. This check helps
> ensure programmers identify problems with the reference count
> associated with an object. For example, memory leaks with respect to
> forgetting to decrement the reference count of an object (analogous
> to
> malloc() without corresponding free()) or perhaps object access after
> the object's reference count is zero (analogous to access after
> free()).

(nods)
> 
> Error-handling checking: Various checks for common errors such as
> dereferencing a NULL value.

Yes.  This is already done by -fanalyzer, but we need some way for it
to know the outcomes of specific functions: e.g. one common pattern is
that API function "PyFoo_Bar" could either:
(a) succeed, returning a PyObject * that the caller "owns" a reference
to, or
(b) fail, returning NULL, and setting an exception on the thread-local
interpreter state object


> 
> Errors in exception-handling: Checks for situations in which
> functions
> returning PyObject* that is NULL are not gracefully handled.

Yes; detection of this would automatically happen if we implemented
known_function subclasses e.g. for the pattern above.
> 
> Format string checking: Verify that arguments to various CPython APIs
> which take format strings are correct.

Have a look at:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107017
("RFE: support printf-style formatted functions in -fanalyzer") and:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100121
("RFE: plugin support for -Wformat via __attribute__((format()))")



> 
> Associating PyTypeObject instances with compile-time-types: Verify
> that the run-time type of a PyTypeObject matches its corresponding
> compile-time type for inputs where both are provided.

(nods)

> 
> Verification of PyMethodDef tables: Verify that the function in
> PyMethodDef tables matches the calling convention of the ml_flags
> set.

(nods)

> 
> I suspect a good starting point would be existing proof-of-concept
> -fanalyzer plugins such as the CPython GIL example
> (analyzer_gil_plugin). Please let me know of any additional pointers.

Yes.

There are also two example of "teaching" the analyzer about the
behavior of specific functions via subclassing known_function in:
  analyzer_known_fns_plugin.c
and:
 analyzer_kernel_plugin.c


> If there is time to spare, I think it is reasonable to extend the
> capabilities of the original checker as well (more details in the
> expected timeline below).
> 
> Provide an expected timeline:
> I suspect the first task to take the longest since it is relatively
> involved and it also includes getting the initial infrastructure of
> the plugin up. From the experience of the first task, I hope the rest
> of the tasks would be implemented faster. Additionally, I understand
> that the current timeline outline below is too vague. I wished to
> check in with the community for some feedback on whether I am in the
> right ballpark before committing to more details.
> 
> Week 1 - 7: Reference counting checking
> Week 8: Error-handling checking
> Week 9: Errors in exception handling
> Week 10: Format string checking
> Week 11: Verification of PyMethodDef tables
> Week 12: I am planning the last week to be safety in case any of the
> above tasks take longer than initially expected. In case everything
> goes smoothly and there is time to spare, I think it is reasonable to
> spend the time extending the capabilities of the original pass. Some
> ideas include extending the subset of CPython API that cpychecker
> currently support, matching up similar traces to solve the issue of
> duplicate error reports, and/or addressing any of the other caveats
> currently mentioned in the cpychecker documentation. Additional ideas
> are welcome of course.

FWIW I think it's a very ambitious project, but you seem capable.

You don't mention testing.  I'd expect part of the project to be the
creation of a regression test suite, with each step adding test
coverage for the features it adds.  There are lots of test cases in the
existing cpychecker test suite that you could reuse  - though beware,
the test harness there is very bad - I made multiple mistakes:
- expecting "gold" outputs from test cases - specific stderr strings,
which make the tests very brittle
- external scripts associated with .c files, to tell it how to invoke
the compiler, which make the tests a pain to maintain and extend.

GCC's own test suite handles this much better using DejaGnu where:
- we test for specific properties of interest in the behavior of each
test (rather than rigidly specifying everything about the behavior of
each test)
- the tests are expressed as .c files with "magic" comments containing
directives for the test harness

That said DejaGnu is implemented in Tcl, which is a pain to deal with;
you could reuse DejaGnu, or maybe Python might be a better choice; I'm
not sure.


It might be good to add new attributes to CPython's headers so that the
function declarations become self-descriptive about e.g. refererence-
counting semantics (in a way readable both to humans and to static
analysis tools).  If so, this part of the project would involve working
with the CPython development community, perhaps writing a PEP:
  https://peps.python.org/pep-0001/
Again, this would be an ambitious goal, probably better done after
there's a working prototype.


> 
> Briefly introduce yourself and your skills and/or accomplishments:
> I am a current Masters in Computer Science student at Columbia
> University. I did my undergraduates at Bates College (B.A Math) and
> Columbia University (B.S Computer Science) respectively. My interests
> are primarily in systems, programming languages, and compilers.
> 
> At Columbia, I work in the group led by Professor Stephen Edwards on
> the SSLANG programming language: a language built atop the Sparse
> Synchronous Model. For more formal information on the Sparse
> Synchronous Model, please take a look at "The Sparse Synchronous
> Model
> on Real Hardware" (2022). Please find our repo, along with my
> contributions, here: https://github.com/ssm-lang/sslang (my GitHub
> handle is @efric). My main contribution to the compiler so far
> involved adding a static inlining optimization pass with another
> SSLANG team member. Our implementation is mostly based on the work
> shown in "Secrets of the Glasgow Haskell Compiler Inliner" (2002),
> with modifications as necessary to better fit our goals. The current
> implementation is a work in progress and we are still working on
> adding (many) more features to it. My work in this project is written
> in Haskell.
> 
> I also conduct research in the Columbia Systems Lab. Specifically, my
> group and I, advised by Professor Jason Nieh, work on secure
> containerization with respect to untrusted software systems. Armv9-A
> introduced Realms, secure execution environments that are opaque to
> untrusted operating systems, as part of the Arm Confidential Compute
> Architecture (CCA). Please find more information on CCA in "Design
> and
> Verification of the Arm Confidential Compute Architecture" (2022).
> Introduced together was the Realm Management Monitor (RMM), an
> interface for hypervisors to allow secure virtualization utilizing
> Realms and the new hardware support. Currently, the Realm isolation
> boundary is at the level of entire VMs. We are working on applying
> Realms to secure containers. Work in this project is mostly at the
> kernel and firmware level and is written in C and ARM assembly.
> 
> Pertaining experience with compilers in addition to SSLANG, my
> undergraduate education included a class on compilers that involved
> writing passes for Clang/LLVM. More currently, I am taking a
> graduate-level class on Types, Languages, and Compilers where my
> partner and I are working on a plugin for our own small toy language
> compiler which would be able to perform type inference. The plugin
> would generate relevant constraints and solve them on behalf of the
> compiler. This project is still in its early stages, but the idea is
> to delegate type inference functionalities to a generic library given
> some information instead of having to write your own constraint
> solver.

It sounds like you may know more about the theory than I do!

> 
> Thank you for reviewing my proposal!

Thanks for sending it; hope the above is helpful (and not too
intimidating!)

Dave


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin
  2023-03-26 15:58 ` David Malcolm
@ 2023-03-28 12:08   ` Eric Feng
  2023-03-28 19:14     ` David Malcolm
  0 siblings, 1 reply; 9+ messages in thread
From: Eric Feng @ 2023-03-28 12:08 UTC (permalink / raw)
  To: David Malcolm; +Cc: gcc

My apologies. Forgot to CC the mailing list in my previous e-mail.
Original reply below:

_______


Hi David,

Thank you for your feedback!

> Also, the Python community would continue to find
> static analysis of CPython extension modules useful, so it would be
> good to have the idea live on as a GCC plugin on top of -fanalyzer.


I hope so!

> You should try building GCC from source, and hack in a trivial warning
> that emits "hello world, I'm compiling function 'foo'".  I wrote a
> guide to GCC for new contributors here that should get you started:
>   https://gcc-newbies-guide.readthedocs.io/en/latest/
> This will help you get familiar with GCC's internals, and although the
> plan is to write a plugin, I expect that you'll run into places where a
> patch to GCC itself is more appropriate (bugs and missing functionality
> ), so having your own debug build of GCC is a good idea.
>
> You should become familiar with CPython's extension and embedding API.
> See the excellent documentation here:
>   https://docs.python.org/3/extending/extending.html
> It's probably a good exercise to write your own trivial CPython
> extension module.
>
> You can read the old cpychecker code inside the gcc-python-plugin
> repository, and I gave a couple of talks on it as PyCon a decade ago:


Sounds good.

> > Error-handling checking: Various checks for common errors such as
> > dereferencing a NULL value.
>
> Yes.  This is already done by -fanalyzer, but we need some way for it
> to know the outcomes of specific functions: e.g. one common pattern is
> that API function "PyFoo_Bar" could either:
> (a) succeed, returning a PyObject * that the caller "owns" a reference
> to, or
> (b) fail, returning NULL, and setting an exception on the thread-local
> interpreter state object


Sounds good. In other words, the infrastructure of this check is
already there and our job is to retrofit CPython API-specific
knowledge for this task. Please correct me if my understanding here is
wrong.

> > Errors in exception-handling: Checks for situations in which
> > functions
> > returning PyObject* that is NULL are not gracefully handled.
>
> Yes; detection of this would automatically happen if we implemented
> known_function subclasses e.g. for the pattern above.


Sounds good. I will merge this task with the previous one in the next
iteration of this proposal since it will be handled as a side effect
of implementing the previous task.


> You don't mention testing.  I'd expect part of the project to be the
> creation of a regression test suite, with each step adding test
> coverage for the features it adds.  There are lots of test cases in the
> existing cpychecker test suite that you could reuse  - though beware,
> the test harness there is very bad - I made multiple mistakes:
> - expecting "gold" outputs from test cases - specific stderr strings,
> which make the tests very brittle
> - external scripts associated with .c files, to tell it how to invoke
> the compiler, which make the tests a pain to maintain and extend.
>
> GCC's own test suite handles this much better using DejaGnu where:
> - we test for specific properties of interest in the behavior of each
> test (rather than rigidly specifying everything about the behavior of
> each test)
> - the tests are expressed as .c files with "magic" comments containing
> directives for the test harness
>
> That said DejaGnu is implemented in Tcl, which is a pain to deal with;
> you could reuse DejaGnu, or maybe Python might be a better choice; I'm
> not sure.


You're right, I forgot to mention that in the initial draft. Thank you
for pointing that out. I agree with the bottom-up approach with
respect to building a comprehensive regression test suite. In terms of
specifically what to implement the suite in, I'll explore DejaGnu/Tcl
in more detail before making a more informed decision.

> It might be good to add new attributes to CPython's headers so that the
> function declarations become self-descriptive about e.g. refererence-
> counting semantics (in a way readable both to humans and to static
> analysis tools).  If so, this part of the project would involve working
> with the CPython development community, perhaps writing a PEP:
>   https://peps.python.org/pep-0001/
> Again, this would be an ambitious goal, probably better done after
> there's a working prototype.


That would be very exciting. However, I'm not sure if I fully
understand what you mean. Can you clarify by giving an example of what
the new attributes you had in mind might look like and how they would
help (for example with respect to reference counting semantics)?

Incidentally, I forgot to mention in my previous email but I believe
the 350-hour option is the one that is more appropriate for this
project. Please let me know otherwise.

Best,
Eric

On Sun, Mar 26, 2023 at 11:58 AM David Malcolm <dmalcolm@redhat.com> wrote:
>
> On Sat, 2023-03-25 at 15:38 -0400, Eric Feng via Gcc wrote:
> > Hi GCC community,
> >
> > For GSoC, I am extremely interested in working on the selected
> > project
> > idea with respect to extending the static analysis pass. In
> > particular, porting gcc-python-plugin's cpychecker to a plugin for
> > GCC
> > -fanalyzer as described in
> > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107646.
>
> Hi Eric, welcome to the GCC commmunity.
>
> I'm the author/maintainer of GCC's static analysis pass.  I'm also the
> author of gcc-python-plugin and its erstwhile "cpychecker" code, so I'm
> pleased that you're interested in the project.
>
> I wrote gcc-python-plugin and cpychecker over a decade ago when I was
> focused on CPython development (before I switched to GCC development),
> but it's heavily bitrotted over the years, as I didn't have enough
> cycles to keep it compatible with changes in both GCC and CPython
> whilst working on GCC itself.  In particular, the cpychecker code
> stopped working a number of GCC releases ago.  However, the cpychecker
> code inspired much of my work on GCC's static analysis pass and on its
> diagnostics subsystem, so much of it now lives on in C++ form as core
> GCC functionality.  Also, the Python community would continue to find
> static analysis of CPython extension modules useful, so it would be
> good to have the idea live on as a GCC plugin on top of -fanalyzer.
>
> >  Please find an
> > initial draft of my proposal below and let me know if it is a
> > reasonable starting point. Please also correct me if I am
> > misunderstanding any particular tasks and let me know what areas I
> > should add more information for or what else I may do in preparation.
>
> Some ideas for familiarizing yourself with the problem space:
>
> You should try building GCC from source, and hack in a trivial warning
> that emits "hello world, I'm compiling function 'foo'".  I wrote a
> guide to GCC for new contributors here that should get you started:
>   https://gcc-newbies-guide.readthedocs.io/en/latest/
> This will help you get familiar with GCC's internals, and although the
> plan is to write a plugin, I expect that you'll run into places where a
> patch to GCC itself is more appropriate (bugs and missing functionality
> ), so having your own debug build of GCC is a good idea.
>
> You should become familiar with CPython's extension and embedding API.
> See the excellent documentation here:
>   https://docs.python.org/3/extending/extending.html
> It's probably a good exercise to write your own trivial CPython
> extension module.
>
> You can read the old cpychecker code inside the gcc-python-plugin
> repository, and I gave a couple of talks on it as PyCon a decade ago:
>
> PyCon2012: "Static analysis of Python extension modules using GCC"
> https://pyvideo.org/pycon-us-2012/static-analysis-of-python-extension-modules-using.html
>
> PyCon2013: "Death by a thousand leaks: what statically-analysing 370
> Python extensions looks like"
> https://pyvideo.org/pycon-us-2013/death-by-a-thousand-leaks-what-statically-analys.html
> https://www.youtube.com/watch?v=bblvGKzZfFI
>
> (sorry about all the "ums" and "errs"; it's fascinating and
> embarrassing to watch myself from 11 years ago on this, and see how
> much I've both forgotten and learned in the meantime.  Revisiting this
> work, I'm ashamed to see that I was referring to the implementation as
> based on "abstract interpretation" (and e.g. absinterp.py), when I now
> realize it's actually based on symbolic execution (as is GCC's-
> fanalyzer)
>
> Also, this was during the transition era between Python 2 and Python 3,
> whereas now we only have to care about Python 3.
>
> There may be other caveats; I haven't fully rewatched those talks yet
> :-/
>
> Various comments inline below, throughout...
>
> >
> > _______
> >
> > Describe the project and clearly define its goals:
> > One pertinent use case of the gcc-python plugin is as a static
> > analysis tool for CPython extension modules.
>
> It might be more accurate to use the past tense when referring to the
> gcc-python plugin, alas.
>
> >  The main goal is to help
> > programmers writing extensions identify common coding errors.
> > Broadly,
> > the goal of this project is to port the functionalities of cpychecker
> > to a -fanalyzer plugin.
>
> (nods)
>
> >
> > Below is a brief description of the functionalities of the static
> > analysis tool for which I will work on porting over to a -fanalyzer
> > plugin. The structure of the objectives is taken from the
> > gcc-python-plugin documentation:
> >
> > Reference count checking: Manipulation of PyObjects is done via the
> > CPython API and in particular with respect to the objects' reference
> > count. When the reference count belonging to an object drops to zero,
> > we should free all resources associated with it. This check helps
> > ensure programmers identify problems with the reference count
> > associated with an object. For example, memory leaks with respect to
> > forgetting to decrement the reference count of an object (analogous
> > to
> > malloc() without corresponding free()) or perhaps object access after
> > the object's reference count is zero (analogous to access after
> > free()).
>
> (nods)
> >
> > Error-handling checking: Various checks for common errors such as
> > dereferencing a NULL value.
>
> Yes.  This is already done by -fanalyzer, but we need some way for it
> to know the outcomes of specific functions: e.g. one common pattern is
> that API function "PyFoo_Bar" could either:
> (a) succeed, returning a PyObject * that the caller "owns" a reference
> to, or
> (b) fail, returning NULL, and setting an exception on the thread-local
> interpreter state object
>
>
> >
> > Errors in exception-handling: Checks for situations in which
> > functions
> > returning PyObject* that is NULL are not gracefully handled.
>
> Yes; detection of this would automatically happen if we implemented
> known_function subclasses e.g. for the pattern above.
> >
> > Format string checking: Verify that arguments to various CPython APIs
> > which take format strings are correct.
>
> Have a look at:
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107017
> ("RFE: support printf-style formatted functions in -fanalyzer") and:
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100121
> ("RFE: plugin support for -Wformat via __attribute__((format()))")
>
>
>
> >
> > Associating PyTypeObject instances with compile-time-types: Verify
> > that the run-time type of a PyTypeObject matches its corresponding
> > compile-time type for inputs where both are provided.
>
> (nods)
>
> >
> > Verification of PyMethodDef tables: Verify that the function in
> > PyMethodDef tables matches the calling convention of the ml_flags
> > set.
>
> (nods)
>
> >
> > I suspect a good starting point would be existing proof-of-concept
> > -fanalyzer plugins such as the CPython GIL example
> > (analyzer_gil_plugin). Please let me know of any additional pointers.
>
> Yes.
>
> There are also two example of "teaching" the analyzer about the
> behavior of specific functions via subclassing known_function in:
>   analyzer_known_fns_plugin.c
> and:
>  analyzer_kernel_plugin.c
>
>
> > If there is time to spare, I think it is reasonable to extend the
> > capabilities of the original checker as well (more details in the
> > expected timeline below).
> >
> > Provide an expected timeline:
> > I suspect the first task to take the longest since it is relatively
> > involved and it also includes getting the initial infrastructure of
> > the plugin up. From the experience of the first task, I hope the rest
> > of the tasks would be implemented faster. Additionally, I understand
> > that the current timeline outline below is too vague. I wished to
> > check in with the community for some feedback on whether I am in the
> > right ballpark before committing to more details.
> >
> > Week 1 - 7: Reference counting checking
> > Week 8: Error-handling checking
> > Week 9: Errors in exception handling
> > Week 10: Format string checking
> > Week 11: Verification of PyMethodDef tables
> > Week 12: I am planning the last week to be safety in case any of the
> > above tasks take longer than initially expected. In case everything
> > goes smoothly and there is time to spare, I think it is reasonable to
> > spend the time extending the capabilities of the original pass. Some
> > ideas include extending the subset of CPython API that cpychecker
> > currently support, matching up similar traces to solve the issue of
> > duplicate error reports, and/or addressing any of the other caveats
> > currently mentioned in the cpychecker documentation. Additional ideas
> > are welcome of course.
>
> FWIW I think it's a very ambitious project, but you seem capable.
>
> You don't mention testing.  I'd expect part of the project to be the
> creation of a regression test suite, with each step adding test
> coverage for the features it adds.  There are lots of test cases in the
> existing cpychecker test suite that you could reuse  - though beware,
> the test harness there is very bad - I made multiple mistakes:
> - expecting "gold" outputs from test cases - specific stderr strings,
> which make the tests very brittle
> - external scripts associated with .c files, to tell it how to invoke
> the compiler, which make the tests a pain to maintain and extend.
>
> GCC's own test suite handles this much better using DejaGnu where:
> - we test for specific properties of interest in the behavior of each
> test (rather than rigidly specifying everything about the behavior of
> each test)
> - the tests are expressed as .c files with "magic" comments containing
> directives for the test harness
>
> That said DejaGnu is implemented in Tcl, which is a pain to deal with;
> you could reuse DejaGnu, or maybe Python might be a better choice; I'm
> not sure.
>
>
> It might be good to add new attributes to CPython's headers so that the
> function declarations become self-descriptive about e.g. refererence-
> counting semantics (in a way readable both to humans and to static
> analysis tools).  If so, this part of the project would involve working
> with the CPython development community, perhaps writing a PEP:
>   https://peps.python.org/pep-0001/
> Again, this would be an ambitious goal, probably better done after
> there's a working prototype.
>
>
> >
> > Briefly introduce yourself and your skills and/or accomplishments:
> > I am a current Masters in Computer Science student at Columbia
> > University. I did my undergraduates at Bates College (B.A Math) and
> > Columbia University (B.S Computer Science) respectively. My interests
> > are primarily in systems, programming languages, and compilers.
> >
> > At Columbia, I work in the group led by Professor Stephen Edwards on
> > the SSLANG programming language: a language built atop the Sparse
> > Synchronous Model. For more formal information on the Sparse
> > Synchronous Model, please take a look at "The Sparse Synchronous
> > Model
> > on Real Hardware" (2022). Please find our repo, along with my
> > contributions, here: https://github.com/ssm-lang/sslang (my GitHub
> > handle is @efric). My main contribution to the compiler so far
> > involved adding a static inlining optimization pass with another
> > SSLANG team member. Our implementation is mostly based on the work
> > shown in "Secrets of the Glasgow Haskell Compiler Inliner" (2002),
> > with modifications as necessary to better fit our goals. The current
> > implementation is a work in progress and we are still working on
> > adding (many) more features to it. My work in this project is written
> > in Haskell.
> >
> > I also conduct research in the Columbia Systems Lab. Specifically, my
> > group and I, advised by Professor Jason Nieh, work on secure
> > containerization with respect to untrusted software systems. Armv9-A
> > introduced Realms, secure execution environments that are opaque to
> > untrusted operating systems, as part of the Arm Confidential Compute
> > Architecture (CCA). Please find more information on CCA in "Design
> > and
> > Verification of the Arm Confidential Compute Architecture" (2022).
> > Introduced together was the Realm Management Monitor (RMM), an
> > interface for hypervisors to allow secure virtualization utilizing
> > Realms and the new hardware support. Currently, the Realm isolation
> > boundary is at the level of entire VMs. We are working on applying
> > Realms to secure containers. Work in this project is mostly at the
> > kernel and firmware level and is written in C and ARM assembly.
> >
> > Pertaining experience with compilers in addition to SSLANG, my
> > undergraduate education included a class on compilers that involved
> > writing passes for Clang/LLVM. More currently, I am taking a
> > graduate-level class on Types, Languages, and Compilers where my
> > partner and I are working on a plugin for our own small toy language
> > compiler which would be able to perform type inference. The plugin
> > would generate relevant constraints and solve them on behalf of the
> > compiler. This project is still in its early stages, but the idea is
> > to delegate type inference functionalities to a generic library given
> > some information instead of having to write your own constraint
> > solver.
>
> It sounds like you may know more about the theory than I do!
>
> >
> > Thank you for reviewing my proposal!
>
> Thanks for sending it; hope the above is helpful (and not too
> intimidating!)
>
> Dave
>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin
  2023-03-28 12:08   ` Eric Feng
@ 2023-03-28 19:14     ` David Malcolm
  2023-04-01 23:49       ` Eric Feng
  0 siblings, 1 reply; 9+ messages in thread
From: David Malcolm @ 2023-03-28 19:14 UTC (permalink / raw)
  To: Eric Feng; +Cc: gcc

On Tue, 2023-03-28 at 08:08 -0400, Eric Feng wrote:
> My apologies. Forgot to CC the mailing list in my previous e-mail.
> Original reply below:
> 
> _______
> 
> 
> Hi David,
> 
> Thank you for your feedback!
> 
> > 
[...snip...]

> > 
> 
> > > Error-handling checking: Various checks for common errors such as
> > > dereferencing a NULL value.
> > 
> > Yes.  This is already done by -fanalyzer, but we need some way for
> > it
> > to know the outcomes of specific functions: e.g. one common pattern
> > is
> > that API function "PyFoo_Bar" could either:
> > (a) succeed, returning a PyObject * that the caller "owns" a
> > reference
> > to, or
> > (b) fail, returning NULL, and setting an exception on the thread-
> > local
> > interpreter state object
> 
> 
> Sounds good. In other words, the infrastructure of this check is
> already there and our job is to retrofit CPython API-specific
> knowledge for this task. Please correct me if my understanding here
> is
> wrong.

For the task above, I think it's almost all there, it's "just" a case
of implementing the special-case knowledge about the CPython API,
mostly via known_function subclasses.

You might need to implement a new plugin hook for reference-count
checking: IIRC cpychecker had some kind of custom check when returning
from the top-level function in the analysis path that issued a warning
about any PyObjects that had the wrong ob_refcnt.  So you might need to
add a little infrastructure for that, but hopefully that's a relative
simple task.

Some of the other tasks on the list might need other extensions of the
analyzer, some more involved than others; e.g.
- the format-string checking would need integrating the analyzer with -
Wformat (see the two bugzilla entries I posted; this could take a fair
bit of work, but of great benefit beyond just this project)
- checking of PyMethodDef tables would need an extra hook for walking
global initializers (hopefully easy),
etc.

> 
> > > Errors in exception-handling: Checks for situations in which
> > > functions
> > > returning PyObject* that is NULL are not gracefully handled.
> > 
> > Yes; detection of this would automatically happen if we implemented
> > known_function subclasses e.g. for the pattern above.
> 
> 
> Sounds good. I will merge this task with the previous one in the next
> iteration of this proposal since it will be handled as a side effect
> of implementing the previous task.

(nods)

> 
> 
> > You don't mention testing.  I'd expect part of the project to be
> > the
> > creation of a regression test suite, with each step adding test
> > coverage for the features it adds.  There are lots of test cases in
> > the
> > existing cpychecker test suite that you could reuse  - though
> > beware,
> > the test harness there is very bad - I made multiple mistakes:
> > - expecting "gold" outputs from test cases - specific stderr
> > strings,
> > which make the tests very brittle
> > - external scripts associated with .c files, to tell it how to
> > invoke
> > the compiler, which make the tests a pain to maintain and extend.
> > 
> > GCC's own test suite handles this much better using DejaGnu where:
> > - we test for specific properties of interest in the behavior of
> > each
> > test (rather than rigidly specifying everything about the behavior
> > of
> > each test)
> > - the tests are expressed as .c files with "magic" comments
> > containing
> > directives for the test harness
> > 
> > That said DejaGnu is implemented in Tcl, which is a pain to deal
> > with;
> > you could reuse DejaGnu, or maybe Python might be a better choice;
> > I'm
> > not sure.
> 
> 
> You're right, I forgot to mention that in the initial draft. Thank
> you
> for pointing that out. I agree with the bottom-up approach with
> respect to building a comprehensive regression test suite. In terms
> of
> specifically what to implement the suite in, I'll explore DejaGnu/Tcl
> in more detail before making a more informed decision.

(nods)

> 
> > It might be good to add new attributes to CPython's headers so that
> > the
> > function declarations become self-descriptive about e.g.
> > refererence-
> > counting semantics (in a way readable both to humans and to static
> > analysis tools).  If so, this part of the project would involve
> > working
> > with the CPython development community, perhaps writing a PEP:
> >   https://peps.python.org/pep-0001/
> > Again, this would be an ambitious goal, probably better done after
> > there's a working prototype.
> 
> 
> That would be very exciting. However, I'm not sure if I fully
> understand what you mean. Can you clarify by giving an example of
> what
> the new attributes you had in mind might look like and how they would
> help (for example with respect to reference counting semantics)?

In cpychecker I added some custom function attributes:
  https://gcc-python-plugin.readthedocs.io/en/latest/cpychecker.html
which were:
  __attribute__((cpychecker_returns_borrowed_ref))
  __attribute__((cpychecker_steals_reference_to_arg(n)))

I imagine that the plugin could provide similar attributes, and that
CPython could be patched so that the autoconf checks detect the
attributes, and the headers have some macros that optionally use them.
For example, consider e.g. list objects, where the API declarations
here:
https://github.com/python/cpython/blob/main/Include/listobject.h

are simply:

  PyAPI_FUNC(Py_ssize_t) PyList_Size(PyObject *);
  PyAPI_FUNC(PyObject *) PyList_GetItem(PyObject *, Py_ssize_t);

etc, as documented here:
https://docs.python.org/3/c-api/list.html

and implemented in here:
https://github.com/python/cpython/blob/main/Objects/listobject.c

Given that the implementation of both PyList_Size and PyList_GetItem
begin with:

    if (!PyList_Check(op)) {

and PyList_Check is a macro that unconditionally dereferences its
pointer operand:

#define PyList_Check(op) \
    PyType_FastSubclass(Py_TYPE(op), Py_TPFLAGS_LIST_SUBCLASS)

these declarations require non-NULL args, and thus perhaps we could add
some new macros making the above decls look like:

  PyAPI_FUNC(Py_ssize_t) PyList_Size(PyObject *)
    PyAPI_NonNullArg(1);

  PyAPI_FUNC(PyObject *) PyList_GetItem(PyObject *, Py_ssize_t)
    PyAPI_NonNullArg(1))
    PyAPI_ReturnsBorrowedRef();

or somesuch, documenting the required non-nullness of the args, and
that if PyList_GetItem returns successfully, the caller is borrowing a
reference to the object.

But exactly what these macros would look like would be a decision for
the CPython community (hence do it via PEP, based on a sample
implementation).

> 
> Incidentally, I forgot to mention in my previous email but I believe
> the 350-hour option is the one that is more appropriate for this
> project. Please let me know otherwise.

Yeah, this sounds like a big project.  Fortunately there are a lot of
possible subtasks in this one, and the project has benefits to GCC and
to CPython even if you only get a subset of the ideas done in the time
available (refcount checking being probably the highest-value subtask).

Hope this all makes sense
Dave

> 
> Best,
> Eric
> 
> On Sun, Mar 26, 2023 at 11:58 AM David Malcolm <dmalcolm@redhat.com>
> wrote:
> > 
> > On Sat, 2023-03-25 at 15:38 -0400, Eric Feng via Gcc wrote:
> > > Hi GCC community,
> > > 
> > > For GSoC, I am extremely interested in working on the selected
> > > project
> > > idea with respect to extending the static analysis pass. In
> > > particular, porting gcc-python-plugin's cpychecker to a plugin
> > > for
> > > GCC
> > > -fanalyzer as described in
> > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107646.
> > 
> > Hi Eric, welcome to the GCC commmunity.
> > 
> > I'm the author/maintainer of GCC's static analysis pass.  I'm also
> > the
> > author of gcc-python-plugin and its erstwhile "cpychecker" code, so
> > I'm
> > pleased that you're interested in the project.
> > 
> > I wrote gcc-python-plugin and cpychecker over a decade ago when I
> > was
> > focused on CPython development (before I switched to GCC
> > development),
> > but it's heavily bitrotted over the years, as I didn't have enough
> > cycles to keep it compatible with changes in both GCC and CPython
> > whilst working on GCC itself.  In particular, the cpychecker code
> > stopped working a number of GCC releases ago.  However, the
> > cpychecker
> > code inspired much of my work on GCC's static analysis pass and on
> > its
> > diagnostics subsystem, so much of it now lives on in C++ form as
> > core
> > GCC functionality.  Also, the Python community would continue to
> > find
> > static analysis of CPython extension modules useful, so it would be
> > good to have the idea live on as a GCC plugin on top of -fanalyzer.
> > 
> > >  Please find an
> > > initial draft of my proposal below and let me know if it is a
> > > reasonable starting point. Please also correct me if I am
> > > misunderstanding any particular tasks and let me know what areas
> > > I
> > > should add more information for or what else I may do in
> > > preparation.
> > 
> > Some ideas for familiarizing yourself with the problem space:
> > 
> > You should try building GCC from source, and hack in a trivial
> > warning
> > that emits "hello world, I'm compiling function 'foo'".  I wrote a
> > guide to GCC for new contributors here that should get you started:
> >   https://gcc-newbies-guide.readthedocs.io/en/latest/
> > This will help you get familiar with GCC's internals, and although
> > the
> > plan is to write a plugin, I expect that you'll run into places
> > where a
> > patch to GCC itself is more appropriate (bugs and missing
> > functionality
> > ), so having your own debug build of GCC is a good idea.
> > 
> > You should become familiar with CPython's extension and embedding
> > API.
> > See the excellent documentation here:
> >   https://docs.python.org/3/extending/extending.html
> > It's probably a good exercise to write your own trivial CPython
> > extension module.
> > 
> > You can read the old cpychecker code inside the gcc-python-plugin
> > repository, and I gave a couple of talks on it as PyCon a decade
> > ago:
> > 
> > PyCon2012: "Static analysis of Python extension modules using GCC"
> > https://pyvideo.org/pycon-us-2012/static-analysis-of-python-extension-modules-using.html
> > 
> > PyCon2013: "Death by a thousand leaks: what statically-analysing
> > 370
> > Python extensions looks like"
> > https://pyvideo.org/pycon-us-2013/death-by-a-thousand-leaks-what-statically-analys.html
> > https://www.youtube.com/watch?v=bblvGKzZfFI
> > 
> > (sorry about all the "ums" and "errs"; it's fascinating and
> > embarrassing to watch myself from 11 years ago on this, and see how
> > much I've both forgotten and learned in the meantime.  Revisiting
> > this
> > work, I'm ashamed to see that I was referring to the implementation
> > as
> > based on "abstract interpretation" (and e.g. absinterp.py), when I
> > now
> > realize it's actually based on symbolic execution (as is GCC's-
> > fanalyzer)
> > 
> > Also, this was during the transition era between Python 2 and
> > Python 3,
> > whereas now we only have to care about Python 3.
> > 
> > There may be other caveats; I haven't fully rewatched those talks
> > yet
> > :-/
> > 
> > Various comments inline below, throughout...
> > 
> > > 
> > > _______
> > > 
> > > Describe the project and clearly define its goals:
> > > One pertinent use case of the gcc-python plugin is as a static
> > > analysis tool for CPython extension modules.
> > 
> > It might be more accurate to use the past tense when referring to
> > the
> > gcc-python plugin, alas.
> > 
> > >  The main goal is to help
> > > programmers writing extensions identify common coding errors.
> > > Broadly,
> > > the goal of this project is to port the functionalities of
> > > cpychecker
> > > to a -fanalyzer plugin.
> > 
> > (nods)
> > 
> > > 
> > > Below is a brief description of the functionalities of the static
> > > analysis tool for which I will work on porting over to a -
> > > fanalyzer
> > > plugin. The structure of the objectives is taken from the
> > > gcc-python-plugin documentation:
> > > 
> > > Reference count checking: Manipulation of PyObjects is done via
> > > the
> > > CPython API and in particular with respect to the objects'
> > > reference
> > > count. When the reference count belonging to an object drops to
> > > zero,
> > > we should free all resources associated with it. This check helps
> > > ensure programmers identify problems with the reference count
> > > associated with an object. For example, memory leaks with respect
> > > to
> > > forgetting to decrement the reference count of an object
> > > (analogous
> > > to
> > > malloc() without corresponding free()) or perhaps object access
> > > after
> > > the object's reference count is zero (analogous to access after
> > > free()).
> > 
> > (nods)
> > > 
> > > Error-handling checking: Various checks for common errors such as
> > > dereferencing a NULL value.
> > 
> > Yes.  This is already done by -fanalyzer, but we need some way for
> > it
> > to know the outcomes of specific functions: e.g. one common pattern
> > is
> > that API function "PyFoo_Bar" could either:
> > (a) succeed, returning a PyObject * that the caller "owns" a
> > reference
> > to, or
> > (b) fail, returning NULL, and setting an exception on the thread-
> > local
> > interpreter state object
> > 
> > 
> > > 
> > > Errors in exception-handling: Checks for situations in which
> > > functions
> > > returning PyObject* that is NULL are not gracefully handled.
> > 
> > Yes; detection of this would automatically happen if we implemented
> > known_function subclasses e.g. for the pattern above.
> > > 
> > > Format string checking: Verify that arguments to various CPython
> > > APIs
> > > which take format strings are correct.
> > 
> > Have a look at:
> > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107017
> > ("RFE: support printf-style formatted functions in -fanalyzer")
> > and:
> > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100121
> > ("RFE: plugin support for -Wformat via __attribute__((format()))")
> > 
> > 
> > 
> > > 
> > > Associating PyTypeObject instances with compile-time-types:
> > > Verify
> > > that the run-time type of a PyTypeObject matches its
> > > corresponding
> > > compile-time type for inputs where both are provided.
> > 
> > (nods)
> > 
> > > 
> > > Verification of PyMethodDef tables: Verify that the function in
> > > PyMethodDef tables matches the calling convention of the ml_flags
> > > set.
> > 
> > (nods)
> > 
> > > 
> > > I suspect a good starting point would be existing proof-of-
> > > concept
> > > -fanalyzer plugins such as the CPython GIL example
> > > (analyzer_gil_plugin). Please let me know of any additional
> > > pointers.
> > 
> > Yes.
> > 
> > There are also two example of "teaching" the analyzer about the
> > behavior of specific functions via subclassing known_function in:
> >   analyzer_known_fns_plugin.c
> > and:
> >  analyzer_kernel_plugin.c
> > 
> > 
> > > If there is time to spare, I think it is reasonable to extend the
> > > capabilities of the original checker as well (more details in the
> > > expected timeline below).
> > > 
> > > Provide an expected timeline:
> > > I suspect the first task to take the longest since it is
> > > relatively
> > > involved and it also includes getting the initial infrastructure
> > > of
> > > the plugin up. From the experience of the first task, I hope the
> > > rest
> > > of the tasks would be implemented faster. Additionally, I
> > > understand
> > > that the current timeline outline below is too vague. I wished to
> > > check in with the community for some feedback on whether I am in
> > > the
> > > right ballpark before committing to more details.
> > > 
> > > Week 1 - 7: Reference counting checking
> > > Week 8: Error-handling checking
> > > Week 9: Errors in exception handling
> > > Week 10: Format string checking
> > > Week 11: Verification of PyMethodDef tables
> > > Week 12: I am planning the last week to be safety in case any of
> > > the
> > > above tasks take longer than initially expected. In case
> > > everything
> > > goes smoothly and there is time to spare, I think it is
> > > reasonable to
> > > spend the time extending the capabilities of the original pass.
> > > Some
> > > ideas include extending the subset of CPython API that cpychecker
> > > currently support, matching up similar traces to solve the issue
> > > of
> > > duplicate error reports, and/or addressing any of the other
> > > caveats
> > > currently mentioned in the cpychecker documentation. Additional
> > > ideas
> > > are welcome of course.
> > 
> > FWIW I think it's a very ambitious project, but you seem capable.
> > 
> > You don't mention testing.  I'd expect part of the project to be
> > the
> > creation of a regression test suite, with each step adding test
> > coverage for the features it adds.  There are lots of test cases in
> > the
> > existing cpychecker test suite that you could reuse  - though
> > beware,
> > the test harness there is very bad - I made multiple mistakes:
> > - expecting "gold" outputs from test cases - specific stderr
> > strings,
> > which make the tests very brittle
> > - external scripts associated with .c files, to tell it how to
> > invoke
> > the compiler, which make the tests a pain to maintain and extend.
> > 
> > GCC's own test suite handles this much better using DejaGnu where:
> > - we test for specific properties of interest in the behavior of
> > each
> > test (rather than rigidly specifying everything about the behavior
> > of
> > each test)
> > - the tests are expressed as .c files with "magic" comments
> > containing
> > directives for the test harness
> > 
> > That said DejaGnu is implemented in Tcl, which is a pain to deal
> > with;
> > you could reuse DejaGnu, or maybe Python might be a better choice;
> > I'm
> > not sure.
> > 
> > 
> > It might be good to add new attributes to CPython's headers so that
> > the
> > function declarations become self-descriptive about e.g.
> > refererence-
> > counting semantics (in a way readable both to humans and to static
> > analysis tools).  If so, this part of the project would involve
> > working
> > with the CPython development community, perhaps writing a PEP:
> >   https://peps.python.org/pep-0001/
> > Again, this would be an ambitious goal, probably better done after
> > there's a working prototype.
> > 
> > 
> > > 
> > > Briefly introduce yourself and your skills and/or
> > > accomplishments:
> > > I am a current Masters in Computer Science student at Columbia
> > > University. I did my undergraduates at Bates College (B.A Math)
> > > and
> > > Columbia University (B.S Computer Science) respectively. My
> > > interests
> > > are primarily in systems, programming languages, and compilers.
> > > 
> > > At Columbia, I work in the group led by Professor Stephen Edwards
> > > on
> > > the SSLANG programming language: a language built atop the Sparse
> > > Synchronous Model. For more formal information on the Sparse
> > > Synchronous Model, please take a look at "The Sparse Synchronous
> > > Model
> > > on Real Hardware" (2022). Please find our repo, along with my
> > > contributions, here: https://github.com/ssm-lang/sslang (my
> > > GitHub
> > > handle is @efric). My main contribution to the compiler so far
> > > involved adding a static inlining optimization pass with another
> > > SSLANG team member. Our implementation is mostly based on the
> > > work
> > > shown in "Secrets of the Glasgow Haskell Compiler Inliner"
> > > (2002),
> > > with modifications as necessary to better fit our goals. The
> > > current
> > > implementation is a work in progress and we are still working on
> > > adding (many) more features to it. My work in this project is
> > > written
> > > in Haskell.
> > > 
> > > I also conduct research in the Columbia Systems Lab.
> > > Specifically, my
> > > group and I, advised by Professor Jason Nieh, work on secure
> > > containerization with respect to untrusted software systems.
> > > Armv9-A
> > > introduced Realms, secure execution environments that are opaque
> > > to
> > > untrusted operating systems, as part of the Arm Confidential
> > > Compute
> > > Architecture (CCA). Please find more information on CCA in
> > > "Design
> > > and
> > > Verification of the Arm Confidential Compute Architecture"
> > > (2022).
> > > Introduced together was the Realm Management Monitor (RMM), an
> > > interface for hypervisors to allow secure virtualization
> > > utilizing
> > > Realms and the new hardware support. Currently, the Realm
> > > isolation
> > > boundary is at the level of entire VMs. We are working on
> > > applying
> > > Realms to secure containers. Work in this project is mostly at
> > > the
> > > kernel and firmware level and is written in C and ARM assembly.
> > > 
> > > Pertaining experience with compilers in addition to SSLANG, my
> > > undergraduate education included a class on compilers that
> > > involved
> > > writing passes for Clang/LLVM. More currently, I am taking a
> > > graduate-level class on Types, Languages, and Compilers where my
> > > partner and I are working on a plugin for our own small toy
> > > language
> > > compiler which would be able to perform type inference. The
> > > plugin
> > > would generate relevant constraints and solve them on behalf of
> > > the
> > > compiler. This project is still in its early stages, but the idea
> > > is
> > > to delegate type inference functionalities to a generic library
> > > given
> > > some information instead of having to write your own constraint
> > > solver.
> > 
> > It sounds like you may know more about the theory than I do!
> > 
> > > 
> > > Thank you for reviewing my proposal!
> > 
> > Thanks for sending it; hope the above is helpful (and not too
> > intimidating!)
> > 
> > Dave
> > 
> 


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin
  2023-03-28 19:14     ` David Malcolm
@ 2023-04-01 23:49       ` Eric Feng
  2023-04-02 23:28         ` David Malcolm
  0 siblings, 1 reply; 9+ messages in thread
From: Eric Feng @ 2023-04-01 23:49 UTC (permalink / raw)
  To: David Malcolm; +Cc: gcc

> For the task above, I think it's almost all there, it's "just" a case
> of implementing the special-case knowledge about the CPython API,
> mostly via known_function subclasses.

Sounds good.


> In cpychecker I added some custom function attributes:
>   https://gcc-python-plugin.readthedocs.io/en/latest/cpychecker.html
> which were:
>   __attribute__((cpychecker_returns_borrowed_ref))
>   __attribute__((cpychecker_steals_reference_to_arg(n)))
>
[...]
>
> But exactly what these macros would look like would be a decision for
> the CPython community (hence do it via PEP, based on a sample
> implementation).

Ok, I see what you mean now. Thanks for clarifying!


> Yeah, this sounds like a big project.  Fortunately there are a lot of
> possible subtasks in this one, and the project has benefits to GCC and
> to CPython even if you only get a subset of the ideas done in the time
> available (refcount checking being probably the highest-value subtask).

Sounds good.

I refactored the project description and timeline sections of the
proposal according to our conversation. Notably, I moved format string
checking to task #2 in the timeline since its subtasks are
particularly beneficial. I also suggest in the timeline section to
reach out to the CPython community via PEP about the specifics of new
attributes in week 9/10 since I think we should have a somewhat mature
prototype by that point. Let me know if you think it should be done
earlier/later. Please find the changed sections below (I omitted
unchanged sections for brevity)
_______

Describe the project and clearly define its goals:
One pertinent use case of the gcc-python plugin was as a static
analysis tool for CPython extension modules. The main goal of the
plugin was to help programmers writing extensions identify common
coding errors. The gcc-python-plugin has bitrotted over the years and,
in particular, cpychecker stopped working some GCC releases ago.
Broadly, the goal of this project is to port the functionalities of
cpychecker to a -fanalyzer plugin.

Below is a brief description of the functionalities of the static
analysis tool for which I will work on porting over to a -fanalyzer
plugin. The structure of the objectives is based on the
gcc-python-plugin documentation:

Reference count checking: <Unchanged from original proposal>

Format string checking: Some CPython APIs such as PyArgs_ParseTuple,
PyArg_ParseTupleAndKeywords, etc take format strings as arguments.
This check involves verifying that the format strings taken in by
these APIs are correct with respect to the number and types of
arguments passed in. In particular, I will work on integrating the
analyzer with -Wformat
(https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107017) and adding
plugin support for -Wformat
(https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100121) . We should then
be able to specify our own archetype which reflects the format string
syntax for the relevant CPython APIs and take advantage of the
integrated analyzer to check them.

Associating PyTypeObject instances with compile-time-types: <Unchanged
from original proposal>

Error-handling checking (including errors in exception handling):
Common errors such as dereferencing a NULL value are already checked
by the analyzer. I will extend this functionality by implementing
special-case knowledge about the CPython API.

Verification of PyMethodDef tables: <Unchanged from original proposal>

Provide an expected timeline:
Please find a rough estimate of the weekly progress in relation to the
features described below. Tasks that I expect to take longer than one
week are broken down in more detail. In addition to what’s described,
each task also involves adding test coverage pertaining its specific
feature to a regression test suite.

Week 1 - 7: Reference counting checking
    Week 1: Set up the overall infrastructure of the plugin and begin
building core functionality
    Week 1 - 6: Core reference counting functionality
    Week 7: Refine prototype
Week 8 - 10.5: Format string checking (including associating
PyTypeObject instances with compile-time-types)
    Week 8 - ~9: RFE: support printf-style formatted functions in -fanalyzer
    Week ~9 - 10.5: RFE: plugin support for -Wformat via
__attribute__((format()))
    Additionally, begin conversing with CPython community via PEP
about the exact form of new attributes on CPython headers which may be
helpful for both humans and the static analyzer. Present ideas based
on work done so far.
Week 10.5 - 12: Error-handling checking, errors in exception handling,
and verification of PyMethodDef tables
_______

Thank you!

Best,
Eric



On Tue, Mar 28, 2023 at 3:14 PM David Malcolm <dmalcolm@redhat.com> wrote:
>
> On Tue, 2023-03-28 at 08:08 -0400, Eric Feng wrote:
> > My apologies. Forgot to CC the mailing list in my previous e-mail.
> > Original reply below:
> >
> > _______
> >
> >
> > Hi David,
> >
> > Thank you for your feedback!
> >
> > >
> [...snip...]
>
> > >
> >
> > > > Error-handling checking: Various checks for common errors such as
> > > > dereferencing a NULL value.
> > >
> > > Yes.  This is already done by -fanalyzer, but we need some way for
> > > it
> > > to know the outcomes of specific functions: e.g. one common pattern
> > > is
> > > that API function "PyFoo_Bar" could either:
> > > (a) succeed, returning a PyObject * that the caller "owns" a
> > > reference
> > > to, or
> > > (b) fail, returning NULL, and setting an exception on the thread-
> > > local
> > > interpreter state object
> >
> >
> > Sounds good. In other words, the infrastructure of this check is
> > already there and our job is to retrofit CPython API-specific
> > knowledge for this task. Please correct me if my understanding here
> > is
> > wrong.
>
> For the task above, I think it's almost all there, it's "just" a case
> of implementing the special-case knowledge about the CPython API,
> mostly via known_function subclasses.
>
> You might need to implement a new plugin hook for reference-count
> checking: IIRC cpychecker had some kind of custom check when returning
> from the top-level function in the analysis path that issued a warning
> about any PyObjects that had the wrong ob_refcnt.  So you might need to
> add a little infrastructure for that, but hopefully that's a relative
> simple task.
>
> Some of the other tasks on the list might need other extensions of the
> analyzer, some more involved than others; e.g.
> - the format-string checking would need integrating the analyzer with -
> Wformat (see the two bugzilla entries I posted; this could take a fair
> bit of work, but of great benefit beyond just this project)
> - checking of PyMethodDef tables would need an extra hook for walking
> global initializers (hopefully easy),
> etc.
>
> >
> > > > Errors in exception-handling: Checks for situations in which
> > > > functions
> > > > returning PyObject* that is NULL are not gracefully handled.
> > >
> > > Yes; detection of this would automatically happen if we implemented
> > > known_function subclasses e.g. for the pattern above.
> >
> >
> > Sounds good. I will merge this task with the previous one in the next
> > iteration of this proposal since it will be handled as a side effect
> > of implementing the previous task.
>
> (nods)
>
> >
> >
> > > You don't mention testing.  I'd expect part of the project to be
> > > the
> > > creation of a regression test suite, with each step adding test
> > > coverage for the features it adds.  There are lots of test cases in
> > > the
> > > existing cpychecker test suite that you could reuse  - though
> > > beware,
> > > the test harness there is very bad - I made multiple mistakes:
> > > - expecting "gold" outputs from test cases - specific stderr
> > > strings,
> > > which make the tests very brittle
> > > - external scripts associated with .c files, to tell it how to
> > > invoke
> > > the compiler, which make the tests a pain to maintain and extend.
> > >
> > > GCC's own test suite handles this much better using DejaGnu where:
> > > - we test for specific properties of interest in the behavior of
> > > each
> > > test (rather than rigidly specifying everything about the behavior
> > > of
> > > each test)
> > > - the tests are expressed as .c files with "magic" comments
> > > containing
> > > directives for the test harness
> > >
> > > That said DejaGnu is implemented in Tcl, which is a pain to deal
> > > with;
> > > you could reuse DejaGnu, or maybe Python might be a better choice;
> > > I'm
> > > not sure.
> >
> >
> > You're right, I forgot to mention that in the initial draft. Thank
> > you
> > for pointing that out. I agree with the bottom-up approach with
> > respect to building a comprehensive regression test suite. In terms
> > of
> > specifically what to implement the suite in, I'll explore DejaGnu/Tcl
> > in more detail before making a more informed decision.
>
> (nods)
>
> >
> > > It might be good to add new attributes to CPython's headers so that
> > > the
> > > function declarations become self-descriptive about e.g.
> > > refererence-
> > > counting semantics (in a way readable both to humans and to static
> > > analysis tools).  If so, this part of the project would involve
> > > working
> > > with the CPython development community, perhaps writing a PEP:
> > >   https://peps.python.org/pep-0001/
> > > Again, this would be an ambitious goal, probably better done after
> > > there's a working prototype.
> >
> >
> > That would be very exciting. However, I'm not sure if I fully
> > understand what you mean. Can you clarify by giving an example of
> > what
> > the new attributes you had in mind might look like and how they would
> > help (for example with respect to reference counting semantics)?
>
> In cpychecker I added some custom function attributes:
>   https://gcc-python-plugin.readthedocs.io/en/latest/cpychecker.html
> which were:
>   __attribute__((cpychecker_returns_borrowed_ref))
>   __attribute__((cpychecker_steals_reference_to_arg(n)))
>
> I imagine that the plugin could provide similar attributes, and that
> CPython could be patched so that the autoconf checks detect the
> attributes, and the headers have some macros that optionally use them.
> For example, consider e.g. list objects, where the API declarations
> here:
> https://github.com/python/cpython/blob/main/Include/listobject.h
>
> are simply:
>
>   PyAPI_FUNC(Py_ssize_t) PyList_Size(PyObject *);
>   PyAPI_FUNC(PyObject *) PyList_GetItem(PyObject *, Py_ssize_t);
>
> etc, as documented here:
> https://docs.python.org/3/c-api/list.html
>
> and implemented in here:
> https://github.com/python/cpython/blob/main/Objects/listobject.c
>
> Given that the implementation of both PyList_Size and PyList_GetItem
> begin with:
>
>     if (!PyList_Check(op)) {
>
> and PyList_Check is a macro that unconditionally dereferences its
> pointer operand:
>
> #define PyList_Check(op) \
>     PyType_FastSubclass(Py_TYPE(op), Py_TPFLAGS_LIST_SUBCLASS)
>
> these declarations require non-NULL args, and thus perhaps we could add
> some new macros making the above decls look like:
>
>   PyAPI_FUNC(Py_ssize_t) PyList_Size(PyObject *)
>     PyAPI_NonNullArg(1);
>
>   PyAPI_FUNC(PyObject *) PyList_GetItem(PyObject *, Py_ssize_t)
>     PyAPI_NonNullArg(1))
>     PyAPI_ReturnsBorrowedRef();
>
> or somesuch, documenting the required non-nullness of the args, and
> that if PyList_GetItem returns successfully, the caller is borrowing a
> reference to the object.
>
> But exactly what these macros would look like would be a decision for
> the CPython community (hence do it via PEP, based on a sample
> implementation).
>
> >
> > Incidentally, I forgot to mention in my previous email but I believe
> > the 350-hour option is the one that is more appropriate for this
> > project. Please let me know otherwise.
>
> Yeah, this sounds like a big project.  Fortunately there are a lot of
> possible subtasks in this one, and the project has benefits to GCC and
> to CPython even if you only get a subset of the ideas done in the time
> available (refcount checking being probably the highest-value subtask).
>
> Hope this all makes sense
> Dave
>
> >
> > Best,
> > Eric
> >
> > On Sun, Mar 26, 2023 at 11:58 AM David Malcolm <dmalcolm@redhat.com>
> > wrote:
> > >
> > > On Sat, 2023-03-25 at 15:38 -0400, Eric Feng via Gcc wrote:
> > > > Hi GCC community,
> > > >
> > > > For GSoC, I am extremely interested in working on the selected
> > > > project
> > > > idea with respect to extending the static analysis pass. In
> > > > particular, porting gcc-python-plugin's cpychecker to a plugin
> > > > for
> > > > GCC
> > > > -fanalyzer as described in
> > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107646.
> > >
> > > Hi Eric, welcome to the GCC commmunity.
> > >
> > > I'm the author/maintainer of GCC's static analysis pass.  I'm also
> > > the
> > > author of gcc-python-plugin and its erstwhile "cpychecker" code, so
> > > I'm
> > > pleased that you're interested in the project.
> > >
> > > I wrote gcc-python-plugin and cpychecker over a decade ago when I
> > > was
> > > focused on CPython development (before I switched to GCC
> > > development),
> > > but it's heavily bitrotted over the years, as I didn't have enough
> > > cycles to keep it compatible with changes in both GCC and CPython
> > > whilst working on GCC itself.  In particular, the cpychecker code
> > > stopped working a number of GCC releases ago.  However, the
> > > cpychecker
> > > code inspired much of my work on GCC's static analysis pass and on
> > > its
> > > diagnostics subsystem, so much of it now lives on in C++ form as
> > > core
> > > GCC functionality.  Also, the Python community would continue to
> > > find
> > > static analysis of CPython extension modules useful, so it would be
> > > good to have the idea live on as a GCC plugin on top of -fanalyzer.
> > >
> > > >  Please find an
> > > > initial draft of my proposal below and let me know if it is a
> > > > reasonable starting point. Please also correct me if I am
> > > > misunderstanding any particular tasks and let me know what areas
> > > > I
> > > > should add more information for or what else I may do in
> > > > preparation.
> > >
> > > Some ideas for familiarizing yourself with the problem space:
> > >
> > > You should try building GCC from source, and hack in a trivial
> > > warning
> > > that emits "hello world, I'm compiling function 'foo'".  I wrote a
> > > guide to GCC for new contributors here that should get you started:
> > >   https://gcc-newbies-guide.readthedocs.io/en/latest/
> > > This will help you get familiar with GCC's internals, and although
> > > the
> > > plan is to write a plugin, I expect that you'll run into places
> > > where a
> > > patch to GCC itself is more appropriate (bugs and missing
> > > functionality
> > > ), so having your own debug build of GCC is a good idea.
> > >
> > > You should become familiar with CPython's extension and embedding
> > > API.
> > > See the excellent documentation here:
> > >   https://docs.python.org/3/extending/extending.html
> > > It's probably a good exercise to write your own trivial CPython
> > > extension module.
> > >
> > > You can read the old cpychecker code inside the gcc-python-plugin
> > > repository, and I gave a couple of talks on it as PyCon a decade
> > > ago:
> > >
> > > PyCon2012: "Static analysis of Python extension modules using GCC"
> > > https://pyvideo.org/pycon-us-2012/static-analysis-of-python-extension-modules-using.html
> > >
> > > PyCon2013: "Death by a thousand leaks: what statically-analysing
> > > 370
> > > Python extensions looks like"
> > > https://pyvideo.org/pycon-us-2013/death-by-a-thousand-leaks-what-statically-analys.html
> > > https://www.youtube.com/watch?v=bblvGKzZfFI
> > >
> > > (sorry about all the "ums" and "errs"; it's fascinating and
> > > embarrassing to watch myself from 11 years ago on this, and see how
> > > much I've both forgotten and learned in the meantime.  Revisiting
> > > this
> > > work, I'm ashamed to see that I was referring to the implementation
> > > as
> > > based on "abstract interpretation" (and e.g. absinterp.py), when I
> > > now
> > > realize it's actually based on symbolic execution (as is GCC's-
> > > fanalyzer)
> > >
> > > Also, this was during the transition era between Python 2 and
> > > Python 3,
> > > whereas now we only have to care about Python 3.
> > >
> > > There may be other caveats; I haven't fully rewatched those talks
> > > yet
> > > :-/
> > >
> > > Various comments inline below, throughout...
> > >
> > > >
> > > > _______
> > > >
> > > > Describe the project and clearly define its goals:
> > > > One pertinent use case of the gcc-python plugin is as a static
> > > > analysis tool for CPython extension modules.
> > >
> > > It might be more accurate to use the past tense when referring to
> > > the
> > > gcc-python plugin, alas.
> > >
> > > >  The main goal is to help
> > > > programmers writing extensions identify common coding errors.
> > > > Broadly,
> > > > the goal of this project is to port the functionalities of
> > > > cpychecker
> > > > to a -fanalyzer plugin.
> > >
> > > (nods)
> > >
> > > >
> > > > Below is a brief description of the functionalities of the static
> > > > analysis tool for which I will work on porting over to a -
> > > > fanalyzer
> > > > plugin. The structure of the objectives is taken from the
> > > > gcc-python-plugin documentation:
> > > >
> > > > Reference count checking: Manipulation of PyObjects is done via
> > > > the
> > > > CPython API and in particular with respect to the objects'
> > > > reference
> > > > count. When the reference count belonging to an object drops to
> > > > zero,
> > > > we should free all resources associated with it. This check helps
> > > > ensure programmers identify problems with the reference count
> > > > associated with an object. For example, memory leaks with respect
> > > > to
> > > > forgetting to decrement the reference count of an object
> > > > (analogous
> > > > to
> > > > malloc() without corresponding free()) or perhaps object access
> > > > after
> > > > the object's reference count is zero (analogous to access after
> > > > free()).
> > >
> > > (nods)
> > > >
> > > > Error-handling checking: Various checks for common errors such as
> > > > dereferencing a NULL value.
> > >
> > > Yes.  This is already done by -fanalyzer, but we need some way for
> > > it
> > > to know the outcomes of specific functions: e.g. one common pattern
> > > is
> > > that API function "PyFoo_Bar" could either:
> > > (a) succeed, returning a PyObject * that the caller "owns" a
> > > reference
> > > to, or
> > > (b) fail, returning NULL, and setting an exception on the thread-
> > > local
> > > interpreter state object
> > >
> > >
> > > >
> > > > Errors in exception-handling: Checks for situations in which
> > > > functions
> > > > returning PyObject* that is NULL are not gracefully handled.
> > >
> > > Yes; detection of this would automatically happen if we implemented
> > > known_function subclasses e.g. for the pattern above.
> > > >
> > > > Format string checking: Verify that arguments to various CPython
> > > > APIs
> > > > which take format strings are correct.
> > >
> > > Have a look at:
> > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107017
> > > ("RFE: support printf-style formatted functions in -fanalyzer")
> > > and:
> > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100121
> > > ("RFE: plugin support for -Wformat via __attribute__((format()))")
> > >
> > >
> > >
> > > >
> > > > Associating PyTypeObject instances with compile-time-types:
> > > > Verify
> > > > that the run-time type of a PyTypeObject matches its
> > > > corresponding
> > > > compile-time type for inputs where both are provided.
> > >
> > > (nods)
> > >
> > > >
> > > > Verification of PyMethodDef tables: Verify that the function in
> > > > PyMethodDef tables matches the calling convention of the ml_flags
> > > > set.
> > >
> > > (nods)
> > >
> > > >
> > > > I suspect a good starting point would be existing proof-of-
> > > > concept
> > > > -fanalyzer plugins such as the CPython GIL example
> > > > (analyzer_gil_plugin). Please let me know of any additional
> > > > pointers.
> > >
> > > Yes.
> > >
> > > There are also two example of "teaching" the analyzer about the
> > > behavior of specific functions via subclassing known_function in:
> > >   analyzer_known_fns_plugin.c
> > > and:
> > >  analyzer_kernel_plugin.c
> > >
> > >
> > > > If there is time to spare, I think it is reasonable to extend the
> > > > capabilities of the original checker as well (more details in the
> > > > expected timeline below).
> > > >
> > > > Provide an expected timeline:
> > > > I suspect the first task to take the longest since it is
> > > > relatively
> > > > involved and it also includes getting the initial infrastructure
> > > > of
> > > > the plugin up. From the experience of the first task, I hope the
> > > > rest
> > > > of the tasks would be implemented faster. Additionally, I
> > > > understand
> > > > that the current timeline outline below is too vague. I wished to
> > > > check in with the community for some feedback on whether I am in
> > > > the
> > > > right ballpark before committing to more details.
> > > >
> > > > Week 1 - 7: Reference counting checking
> > > > Week 8: Error-handling checking
> > > > Week 9: Errors in exception handling
> > > > Week 10: Format string checking
> > > > Week 11: Verification of PyMethodDef tables
> > > > Week 12: I am planning the last week to be safety in case any of
> > > > the
> > > > above tasks take longer than initially expected. In case
> > > > everything
> > > > goes smoothly and there is time to spare, I think it is
> > > > reasonable to
> > > > spend the time extending the capabilities of the original pass.
> > > > Some
> > > > ideas include extending the subset of CPython API that cpychecker
> > > > currently support, matching up similar traces to solve the issue
> > > > of
> > > > duplicate error reports, and/or addressing any of the other
> > > > caveats
> > > > currently mentioned in the cpychecker documentation. Additional
> > > > ideas
> > > > are welcome of course.
> > >
> > > FWIW I think it's a very ambitious project, but you seem capable.
> > >
> > > You don't mention testing.  I'd expect part of the project to be
> > > the
> > > creation of a regression test suite, with each step adding test
> > > coverage for the features it adds.  There are lots of test cases in
> > > the
> > > existing cpychecker test suite that you could reuse  - though
> > > beware,
> > > the test harness there is very bad - I made multiple mistakes:
> > > - expecting "gold" outputs from test cases - specific stderr
> > > strings,
> > > which make the tests very brittle
> > > - external scripts associated with .c files, to tell it how to
> > > invoke
> > > the compiler, which make the tests a pain to maintain and extend.
> > >
> > > GCC's own test suite handles this much better using DejaGnu where:
> > > - we test for specific properties of interest in the behavior of
> > > each
> > > test (rather than rigidly specifying everything about the behavior
> > > of
> > > each test)
> > > - the tests are expressed as .c files with "magic" comments
> > > containing
> > > directives for the test harness
> > >
> > > That said DejaGnu is implemented in Tcl, which is a pain to deal
> > > with;
> > > you could reuse DejaGnu, or maybe Python might be a better choice;
> > > I'm
> > > not sure.
> > >
> > >
> > > It might be good to add new attributes to CPython's headers so that
> > > the
> > > function declarations become self-descriptive about e.g.
> > > refererence-
> > > counting semantics (in a way readable both to humans and to static
> > > analysis tools).  If so, this part of the project would involve
> > > working
> > > with the CPython development community, perhaps writing a PEP:
> > >   https://peps.python.org/pep-0001/
> > > Again, this would be an ambitious goal, probably better done after
> > > there's a working prototype.
> > >
> > >
> > > >
> > > > Briefly introduce yourself and your skills and/or
> > > > accomplishments:
> > > > I am a current Masters in Computer Science student at Columbia
> > > > University. I did my undergraduates at Bates College (B.A Math)
> > > > and
> > > > Columbia University (B.S Computer Science) respectively. My
> > > > interests
> > > > are primarily in systems, programming languages, and compilers.
> > > >
> > > > At Columbia, I work in the group led by Professor Stephen Edwards
> > > > on
> > > > the SSLANG programming language: a language built atop the Sparse
> > > > Synchronous Model. For more formal information on the Sparse
> > > > Synchronous Model, please take a look at "The Sparse Synchronous
> > > > Model
> > > > on Real Hardware" (2022). Please find our repo, along with my
> > > > contributions, here: https://github.com/ssm-lang/sslang (my
> > > > GitHub
> > > > handle is @efric). My main contribution to the compiler so far
> > > > involved adding a static inlining optimization pass with another
> > > > SSLANG team member. Our implementation is mostly based on the
> > > > work
> > > > shown in "Secrets of the Glasgow Haskell Compiler Inliner"
> > > > (2002),
> > > > with modifications as necessary to better fit our goals. The
> > > > current
> > > > implementation is a work in progress and we are still working on
> > > > adding (many) more features to it. My work in this project is
> > > > written
> > > > in Haskell.
> > > >
> > > > I also conduct research in the Columbia Systems Lab.
> > > > Specifically, my
> > > > group and I, advised by Professor Jason Nieh, work on secure
> > > > containerization with respect to untrusted software systems.
> > > > Armv9-A
> > > > introduced Realms, secure execution environments that are opaque
> > > > to
> > > > untrusted operating systems, as part of the Arm Confidential
> > > > Compute
> > > > Architecture (CCA). Please find more information on CCA in
> > > > "Design
> > > > and
> > > > Verification of the Arm Confidential Compute Architecture"
> > > > (2022).
> > > > Introduced together was the Realm Management Monitor (RMM), an
> > > > interface for hypervisors to allow secure virtualization
> > > > utilizing
> > > > Realms and the new hardware support. Currently, the Realm
> > > > isolation
> > > > boundary is at the level of entire VMs. We are working on
> > > > applying
> > > > Realms to secure containers. Work in this project is mostly at
> > > > the
> > > > kernel and firmware level and is written in C and ARM assembly.
> > > >
> > > > Pertaining experience with compilers in addition to SSLANG, my
> > > > undergraduate education included a class on compilers that
> > > > involved
> > > > writing passes for Clang/LLVM. More currently, I am taking a
> > > > graduate-level class on Types, Languages, and Compilers where my
> > > > partner and I are working on a plugin for our own small toy
> > > > language
> > > > compiler which would be able to perform type inference. The
> > > > plugin
> > > > would generate relevant constraints and solve them on behalf of
> > > > the
> > > > compiler. This project is still in its early stages, but the idea
> > > > is
> > > > to delegate type inference functionalities to a generic library
> > > > given
> > > > some information instead of having to write your own constraint
> > > > solver.
> > >
> > > It sounds like you may know more about the theory than I do!
> > >
> > > >
> > > > Thank you for reviewing my proposal!
> > >
> > > Thanks for sending it; hope the above is helpful (and not too
> > > intimidating!)
> > >
> > > Dave
> > >
> >
>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin
  2023-04-01 23:49       ` Eric Feng
@ 2023-04-02 23:28         ` David Malcolm
  2023-04-03 14:29           ` Eric Feng
  0 siblings, 1 reply; 9+ messages in thread
From: David Malcolm @ 2023-04-02 23:28 UTC (permalink / raw)
  To: Eric Feng; +Cc: gcc

On Sat, 2023-04-01 at 19:49 -0400, Eric Feng wrote:
> > For the task above, I think it's almost all there, it's "just" a
> > case
> > of implementing the special-case knowledge about the CPython API,
> > mostly via known_function subclasses.
> 
> Sounds good.
> 
> 
> > In cpychecker I added some custom function attributes:
> >  
> > https://gcc-python-plugin.readthedocs.io/en/latest/cpychecker.html
> > which were:
> >   __attribute__((cpychecker_returns_borrowed_ref))
> >   __attribute__((cpychecker_steals_reference_to_arg(n)))
> > 
> [...]
> > 
> > But exactly what these macros would look like would be a decision
> > for
> > the CPython community (hence do it via PEP, based on a sample
> > implementation).
> 
> Ok, I see what you mean now. Thanks for clarifying!
> 
> 
> > Yeah, this sounds like a big project.  Fortunately there are a lot
> > of
> > possible subtasks in this one, and the project has benefits to GCC
> > and
> > to CPython even if you only get a subset of the ideas done in the
> > time
> > available (refcount checking being probably the highest-value
> > subtask).
> 
> Sounds good.
> 
> I refactored the project description and timeline sections of the
> proposal according to our conversation. Notably, I moved format
> string
> checking to task #2 in the timeline since its subtasks are
> particularly beneficial. I also suggest in the timeline section to
> reach out to the CPython community via PEP about the specifics of new
> attributes in week 9/10 since I think we should have a somewhat
> mature
> prototype by that point. Let me know if you think it should be done
> earlier/later. Please find the changed sections below (I omitted
> unchanged sections for brevity)
> _______
> 
> Describe the project and clearly define its goals:
> One pertinent use case of the gcc-python plugin was as a static
> analysis tool for CPython extension modules. The main goal of the
> plugin was to help programmers writing extensions identify common
> coding errors. The gcc-python-plugin has bitrotted over the years
> and,
> in particular, cpychecker stopped working some GCC releases ago.
> Broadly, the goal of this project is to port the functionalities of
> cpychecker to a -fanalyzer plugin.
> 
> Below is a brief description of the functionalities of the static
> analysis tool for which I will work on porting over to a -fanalyzer
> plugin. The structure of the objectives is based on the
> gcc-python-plugin documentation:
> 
> Reference count checking: <Unchanged from original proposal>
> 
> Format string checking: Some CPython APIs such as PyArgs_ParseTuple,
> PyArg_ParseTupleAndKeywords, etc take format strings as arguments.
> This check involves verifying that the format strings taken in by
> these APIs are correct with respect to the number and types of
> arguments passed in. In particular, I will work on integrating the
> analyzer with -Wformat
> (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107017) and adding
> plugin support for -Wformat
> (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100121) . We should
> then
> be able to specify our own archetype which reflects the format string
> syntax for the relevant CPython APIs and take advantage of the
> integrated analyzer to check them.
> 
> Associating PyTypeObject instances with compile-time-types:
> <Unchanged
> from original proposal>
> 
> Error-handling checking (including errors in exception handling):
> Common errors such as dereferencing a NULL value are already checked
> by the analyzer. I will extend this functionality by implementing
> special-case knowledge about the CPython API.
> 
> Verification of PyMethodDef tables: <Unchanged from original
> proposal>
> 
> Provide an expected timeline:
> Please find a rough estimate of the weekly progress in relation to
> the
> features described below. Tasks that I expect to take longer than one
> week are broken down in more detail. In addition to what’s described,
> each task also involves adding test coverage pertaining its specific
> feature to a regression test suite.
> 
> Week 1 - 7: Reference counting checking
>     Week 1: Set up the overall infrastructure of the plugin and begin
> building core functionality
>     Week 1 - 6: Core reference counting functionality
>     Week 7: Refine prototype
> Week 8 - 10.5: Format string checking (including associating
> PyTypeObject instances with compile-time-types)
>     Week 8 - ~9: RFE: support printf-style formatted functions in -
> fanalyzer
>     Week ~9 - 10.5: RFE: plugin support for -Wformat via
> __attribute__((format()))
>     Additionally, begin conversing with CPython community via PEP
> about the exact form of new attributes on CPython headers which may
> be
> helpful for both humans and the static analyzer. Present ideas based
> on work done so far.
> Week 10.5 - 12: Error-handling checking, errors in exception
> handling,
> and verification of PyMethodDef tables
> 

Sounds great.

Note that the deadline for submitting proposals to the official GSoC
website is April 4 - 18:00 UTC (i.e. this coming Tuesday) and that
Google are very strict about that deadline; see:
https://developers.google.com/open-source/gsoc/timeline

Please include the biographical detail on yourself in the proposal that
you posted on the list, and if you can, link to C++ code you've
written.


I don't know if you saw the emails from Sun Steven, but they're also
interested in this project, perhaps as a collaboration with you.  Given
that the project is large and could be chopped up into several
components that might be a possibility - but don't feel like you need
to do that yourself in your proposal; as noted in the email I just
sent, we don't know how many slots we'll get from the GSoC program.

Good luck
Dave


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin
  2023-04-02 23:28         ` David Malcolm
@ 2023-04-03 14:29           ` Eric Feng
  0 siblings, 0 replies; 9+ messages in thread
From: Eric Feng @ 2023-04-03 14:29 UTC (permalink / raw)
  To: David Malcolm; +Cc: gcc

Thanks for bringing this to my attention Dave! I’m happy to
collaborate on this project with Steven. I will reply in more detail
in the other thread.

Best,
Eric


On Sun, Apr 2, 2023 at 7:28 PM David Malcolm <dmalcolm@redhat.com> wrote:
>
> On Sat, 2023-04-01 at 19:49 -0400, Eric Feng wrote:
> > > For the task above, I think it's almost all there, it's "just" a
> > > case
> > > of implementing the special-case knowledge about the CPython API,
> > > mostly via known_function subclasses.
> >
> > Sounds good.
> >
> >
> > > In cpychecker I added some custom function attributes:
> > >
> > > https://gcc-python-plugin.readthedocs.io/en/latest/cpychecker.html
> > > which were:
> > >   __attribute__((cpychecker_returns_borrowed_ref))
> > >   __attribute__((cpychecker_steals_reference_to_arg(n)))
> > >
> > [...]
> > >
> > > But exactly what these macros would look like would be a decision
> > > for
> > > the CPython community (hence do it via PEP, based on a sample
> > > implementation).
> >
> > Ok, I see what you mean now. Thanks for clarifying!
> >
> >
> > > Yeah, this sounds like a big project.  Fortunately there are a lot
> > > of
> > > possible subtasks in this one, and the project has benefits to GCC
> > > and
> > > to CPython even if you only get a subset of the ideas done in the
> > > time
> > > available (refcount checking being probably the highest-value
> > > subtask).
> >
> > Sounds good.
> >
> > I refactored the project description and timeline sections of the
> > proposal according to our conversation. Notably, I moved format
> > string
> > checking to task #2 in the timeline since its subtasks are
> > particularly beneficial. I also suggest in the timeline section to
> > reach out to the CPython community via PEP about the specifics of new
> > attributes in week 9/10 since I think we should have a somewhat
> > mature
> > prototype by that point. Let me know if you think it should be done
> > earlier/later. Please find the changed sections below (I omitted
> > unchanged sections for brevity)
> > _______
> >
> > Describe the project and clearly define its goals:
> > One pertinent use case of the gcc-python plugin was as a static
> > analysis tool for CPython extension modules. The main goal of the
> > plugin was to help programmers writing extensions identify common
> > coding errors. The gcc-python-plugin has bitrotted over the years
> > and,
> > in particular, cpychecker stopped working some GCC releases ago.
> > Broadly, the goal of this project is to port the functionalities of
> > cpychecker to a -fanalyzer plugin.
> >
> > Below is a brief description of the functionalities of the static
> > analysis tool for which I will work on porting over to a -fanalyzer
> > plugin. The structure of the objectives is based on the
> > gcc-python-plugin documentation:
> >
> > Reference count checking: <Unchanged from original proposal>
> >
> > Format string checking: Some CPython APIs such as PyArgs_ParseTuple,
> > PyArg_ParseTupleAndKeywords, etc take format strings as arguments.
> > This check involves verifying that the format strings taken in by
> > these APIs are correct with respect to the number and types of
> > arguments passed in. In particular, I will work on integrating the
> > analyzer with -Wformat
> > (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107017) and adding
> > plugin support for -Wformat
> > (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100121) . We should
> > then
> > be able to specify our own archetype which reflects the format string
> > syntax for the relevant CPython APIs and take advantage of the
> > integrated analyzer to check them.
> >
> > Associating PyTypeObject instances with compile-time-types:
> > <Unchanged
> > from original proposal>
> >
> > Error-handling checking (including errors in exception handling):
> > Common errors such as dereferencing a NULL value are already checked
> > by the analyzer. I will extend this functionality by implementing
> > special-case knowledge about the CPython API.
> >
> > Verification of PyMethodDef tables: <Unchanged from original
> > proposal>
> >
> > Provide an expected timeline:
> > Please find a rough estimate of the weekly progress in relation to
> > the
> > features described below. Tasks that I expect to take longer than one
> > week are broken down in more detail. In addition to what’s described,
> > each task also involves adding test coverage pertaining its specific
> > feature to a regression test suite.
> >
> > Week 1 - 7: Reference counting checking
> >     Week 1: Set up the overall infrastructure of the plugin and begin
> > building core functionality
> >     Week 1 - 6: Core reference counting functionality
> >     Week 7: Refine prototype
> > Week 8 - 10.5: Format string checking (including associating
> > PyTypeObject instances with compile-time-types)
> >     Week 8 - ~9: RFE: support printf-style formatted functions in -
> > fanalyzer
> >     Week ~9 - 10.5: RFE: plugin support for -Wformat via
> > __attribute__((format()))
> >     Additionally, begin conversing with CPython community via PEP
> > about the exact form of new attributes on CPython headers which may
> > be
> > helpful for both humans and the static analyzer. Present ideas based
> > on work done so far.
> > Week 10.5 - 12: Error-handling checking, errors in exception
> > handling,
> > and verification of PyMethodDef tables
> >
>
> Sounds great.
>
> Note that the deadline for submitting proposals to the official GSoC
> website is April 4 - 18:00 UTC (i.e. this coming Tuesday) and that
> Google are very strict about that deadline; see:
> https://developers.google.com/open-source/gsoc/timeline
>
> Please include the biographical detail on yourself in the proposal that
> you posted on the list, and if you can, link to C++ code you've
> written.
>
>
> I don't know if you saw the emails from Sun Steven, but they're also
> interested in this project, perhaps as a collaboration with you.  Given
> that the project is large and could be chopped up into several
> components that might be a possibility - but don't feel like you need
> to do that yourself in your proposal; as noted in the email I just
> sent, we don't know how many slots we'll get from the GSoC program.
>
> Good luck
> Dave
>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin
  2023-04-02 17:24 Sun Steven
@ 2023-04-02 23:14 ` David Malcolm
  0 siblings, 0 replies; 9+ messages in thread
From: David Malcolm @ 2023-04-02 23:14 UTC (permalink / raw)
  To: Sun Steven, gcc

On Sun, 2023-04-02 at 17:24 +0000, Sun Steven via Gcc wrote:
> Hi, Eric, Malcom,

Hi - and welcome to the GCC community.

> 
> Sorry that I didn't check this thread before.
> 
> It sounds like there are a lot of things to do. I want to offer some
> help.
> 
> Let me add some backgrounds of memory management in python here.
> 
> 
> ## Intro (for people unfamiliar with CPython)
> 
> Unlike programs written in C++, where the compiler automatically adds
> destructors on all exit paths, CPython requires manual memory
> management
> on PyObject*.
> 
> The current CPython has 2 major memory management mechanisms,
> including reference counting and a mark-and-sweep gc for cyclic
> references.
> The former acts as the major mechanism. PyObject gets destructed when
> the refcount drops to zero.

(nods)

FWIW I wrote the original cpychecker code because I was sick of buggy
extension modules crashing /usr/bin/python... and all the bug reports
about it that were landing in my inbox (I was the maintainer of Python
within Fedora/RHEL back then).

> 
> ## CPython has made great efforts to reduce memory errors.
> 
> With specific compile flags on, the CPython interpreter records the
> total
> refcount, also it aborts when refcount drops below zero (being double
> freed).
> This helps to discover memory leaks. PEP 683 (implemented in 3.12)
> also
> introduced "immortal objects" with initial refcount 999999999,
> prevent it from
> being accidentally freed (such as small integers).

That feature is new to me (I last worked on CPython internals back in
the 2 to 3 transition era); thanks!

[...snip...]

> 
> ## Why I want to participate in?
> 
> I am currently working on the initial implementations of PEP 701 (a
> new
> f-string parser). During the testing, I discovered (and fixed) 3
> memory leaks.
> As you can see, even the most experienced CPython developers
> sometimes
> forget to properly decrease refs. I think it will be inspiring if a
> new analysis
> tool was made available as a compiler builtin. It will lead to a
> better CPython.
> 
> 
> I do not know if GSoC allows collaborations. Maybe the headcount is
> limited,
> or maybe I am too senior for GSoC. But I think I am still a rookie in
> front of
> GCC.

I'd be up for a collaboration between you and Eric (assuming Eric is,
of course), as the project is very large and there are several logical
components to it that you could carve up between the two of you.  That
said it may be out of our hands, it depends on:

(a) how many slots we get allocated to us from Google

(b) GSoC is meant for newcomers to open source development; it sounds
like you might already have significant experience.  I don't know what
GSoC's threshold is.

> I want to contribute, no matter the forms.

That's great.  It sounds like you have considerable knowledge of
CPython internals (a decade more up-to-date than mine!), and hopefully
you have recent contacts within the CPython community in terms of
getting traction there.

Some notes about a GSoC application:

Do you have a link to your github so we can see your CPython
contributions?

How's your C++?  Have you tried building GCC from source yet?  FWIW I
felt intimidated back when I first started working on GCC itself; I
wrote this guide to help people get started:
  https://gcc-newbies-guide.readthedocs.io/en/latest/

Note that the deadline for submitting proposals to the official GSoC
website is April 4 - 18:00 UTC (i.e. this coming Tuesday) and that
Google are very strict about that deadline; see:
https://developers.google.com/open-source/gsoc/timeline

Hope this is helpful
Dave

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin
@ 2023-04-02 17:24 Sun Steven
  2023-04-02 23:14 ` David Malcolm
  0 siblings, 1 reply; 9+ messages in thread
From: Sun Steven @ 2023-04-02 17:24 UTC (permalink / raw)
  To: gcc

[-- Attachment #1: Type: text/plain, Size: 2008 bytes --]

Hi, Eric, Malcom,

Sorry that I didn't check this thread before.

It sounds like there are a lot of things to do. I want to offer some help.

Let me add some backgrounds of memory management in python here.

## Intro (for people unfamiliar with CPython)

Unlike programs written in C++, where the compiler automatically adds
destructors on all exit paths, CPython requires manual memory management
on PyObject*.

The current CPython has 2 major memory management mechanisms,
including reference counting and a mark-and-sweep gc for cyclic references.
The former acts as the major mechanism. PyObject gets destructed when
the refcount drops to zero.

## CPython has made great efforts to reduce memory errors.

With specific compile flags on, the CPython interpreter records the total
refcount, also it aborts when refcount drops below zero (being double freed).
This helps to discover memory leaks. PEP 683 (implemented in 3.12) also
introduced "immortal objects" with initial refcount 999999999, prevent it from
being accidentally freed (such as small integers).

Even with these features, CPython extension management is still a problem,
since most errors occur on "error-handling path", which is less likely to be
covered. And most users will not use a debug-build cpython, making the error
more under the surface.

## Why I want to participate in?

I am currently working on the initial implementations of PEP 701 (a new
f-string parser). During the testing, I discovered (and fixed) 3 memory leaks.
As you can see, even the most experienced CPython developers sometimes
forget to properly decrease refs. I think it will be inspiring if a new analysis
tool was made available as a compiler builtin. It will lead to a better CPython.

I do not know if GSoC allows collaborations. Maybe the headcount is limited,
or maybe I am too senior for GSoC. But I think I am still a rookie in front of
GCC.

I want to contribute, no matter the forms.

Yours

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2023-04-03 14:30 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-03-25 19:38 [GSoC] Interest and initial proposal for project on reimplementing cpychecker as -fanalyzer plugin Eric Feng
2023-03-26 15:58 ` David Malcolm
2023-03-28 12:08   ` Eric Feng
2023-03-28 19:14     ` David Malcolm
2023-04-01 23:49       ` Eric Feng
2023-04-02 23:28         ` David Malcolm
2023-04-03 14:29           ` Eric Feng
2023-04-02 17:24 Sun Steven
2023-04-02 23:14 ` David Malcolm

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).