RE: [PATCH v5 09/10] btrace, python: Enable ptwrite filter registration.

public inbox for gdb-patches@sourceware.org
 help / color / mirror / Atom feed

From: "Willgerodt, Felix" <felix.willgerodt@intel.com>
To: "Metzger, Markus T" <markus.t.metzger@intel.com>,
	"gdb-patches@sourceware.org" <gdb-patches@sourceware.org>
Subject: RE: [PATCH v5 09/10] btrace, python: Enable ptwrite filter registration.
Date: Mon, 11 Jul 2022 12:48:07 +0000	[thread overview]
Message-ID: <MN2PR11MB4566FC635ABD832B02A4DEB58E879@MN2PR11MB4566.namprd11.prod.outlook.com> (raw)
In-Reply-To: <DM8PR11MB574965EBD7D522B4FCF3AB23DEB89@DM8PR11MB5749.namprd11.prod.outlook.com>

Hi Markus,

Thanks for your feedback. I added some responses below,
that we probably should agree on before the next revision.

Thanks,
Felix

> -----Original Message-----
> From: Metzger, Markus T <markus.t.metzger@intel.com>
> Sent: Dienstag, 28. Juni 2022 15:59
> To: Willgerodt, Felix <felix.willgerodt@intel.com>; gdb-
> patches@sourceware.org
> Subject: RE: [PATCH v5 09/10] btrace, python: Enable ptwrite filter
> registration.
> 
> Hello Felix,
> 
> >With this patch a default ptwrite filter is registered upon start of GDB.
> >It prints the plain ptwrite payload as hex.  The default filter can be
> >overwritten by registering a custom filter in python or by registering
> >"None", for no output at all.  Registering a filter function creates per
> >thread copies to allow unique internal states per thread.
> >---
> > gdb/btrace.c                   |   3 +
> > gdb/btrace.h                   |   9 +++
> > gdb/data-directory/Makefile.in |   1 +
> > gdb/extension-priv.h           |   5 ++
> > gdb/extension.c                |  13 ++++
> > gdb/extension.h                |   3 +
> > gdb/guile/guile.c              |   1 +
> > gdb/python/lib/gdb/ptwrite.py  |  86 +++++++++++++++++++++++++
> > gdb/python/py-record-btrace.c  | 111
> +++++++++++++++++++++++++++++++++
> > gdb/python/py-record-btrace.h  |   8 +++
> > gdb/python/python-internal.h   |   3 +
> > gdb/python/python.c            |   2 +
> > 12 files changed, 245 insertions(+)
> > create mode 100644 gdb/python/lib/gdb/ptwrite.py
> 
> 
> > /* For maintenance commands.  */
> >@@ -1317,6 +1318,8 @@ ftrace_add_pt (struct btrace_thread_info *btinfo,
> >   uint64_t offset;
> >   int status;
> >
> >+  apply_ext_lang_ptwrite_filter (btinfo);
> 
> A comment would be nice, here, to explain what this does.

I added one in the next version.

> 
> >+  /* Function pointer to the ptwrite callback.  Returns the string returned
> >+     by the ptwrite filter function or nullptr if no string is supposed to
> >+     be printed.  */
> >+  std::string (*ptw_callback_fun) (const uint64_t payload, const uint64_t ip,
> >+				   const void *ptw_filter);
> 
> The comment doesn't match the code.  It cannot return nullptr.

Will be fixed in the next revision.

> Why to we call the void * parameter ptw_filter instead of the usual context?
> We probably want to call the callback itself ptw_filter and the void *
> argument
> context.
> 
> We also seem to mix the terms ptwrite callback and ptwrite filter.

I think the problem here is that both are callbacks. The first one (ptw_callback_fun)
is used in btrace.c to call python/py-record-btrace.c:recpy_call_filter, or another
extension language that would provide this functionality,
see extension.c:apply_ext_lang_ptwrite_filter.
The second one (ptw_filter) is what recpy_call_filter will use to do the python call via
PyObject_CallFunctionObjArgs().

So we call a callback with another callback as the argument (which is the actual ptw_filter).
Therefore the current naming seems correct to me. As the ptw_filter to me clearly is
the void *. I am open to suggestions, but calling the actual filter function context and the
gdb internal callback the filter seems wrong to me.

I could add to the comment, that the callback_fun can be provided by any extension
language. Or rename it somehow.

> >+
> >+  /* Function pointer to the ptwrite filter function.  */
> >+  void *ptw_filter = nullptr;
> 
> Is this the function or some context for the function?
> 

See above. This is the actual python filter object GDB calls in the end.

> 
> >diff --git a/gdb/guile/guile.c b/gdb/guile/guile.c
> >index 14b191ded62..86f92a476af 100644
> >--- a/gdb/guile/guile.c
> >+++ b/gdb/guile/guile.c
> >@@ -124,6 +124,7 @@ static const struct extension_language_ops
> >guile_extension_ops =
> >   gdbscm_apply_val_pretty_printer,
> >
> >   NULL, /* gdbscm_apply_frame_filter, */
> >+  NULL, /* gdbscm_load_ptwrite_listener, */
> 
> This should probably be called ptwrite_filter.

Fixed locally.

> 
> >+def default_filter(payload, ip):
> >+    """Default filter that is active upon starting GDB."""
> >+    return "{:x}".format(payload)
> >+
> >+# This dict contains the per thread copies of the filter function and the
> >+# global template filter, from which the copies are created.
> >+_ptwrite_filter = {"global" : default_filter}
> 
> Why those leading underscores?

GDB follows the PEP8 coding standards:
https://sourceware.org/gdb/wiki/Internals%20GDB-Python-Coding-Standards
These say that the leading underscore is a "weak internal user indicator":
https://pep8.org/#descriptive-naming-styles
See also:
https://docs.python.org/3/tutorial/classes.html#private-variables

Other files in GDB use the style as well. It is supposed to mean that
the variable shouldn't be used directly.

> 
> >+def _update_filter_dict(thread_list):
> >+    """Helper function to update the filter dict.
> >+
> >+    Discards filter copies of threads that already exited and registers
> >+    copies of the filter for new threads."""
> >+    # thread_list[x].ptid returns the tuple (pid, lwp, tid)
> >+    lwp_list = [i.ptid[1] for i in thread_list]
> >+
> >+    # clean-up old filters
> >+    for key in _ptwrite_filter.keys():
> >+      if key not in lwp_list and key != "global":
> >+        _ptwrite_filter.pop(key)
> >+
> >+    # Register filter for new threads
> >+    for key in lwp_list:
> >+        if key not in _ptwrite_filter.keys():
> >+            _ptwrite_filter[key] = deepcopy(_ptwrite_filter["global"])
> 
> This function is called two times: once after we cleared all filters, and
> once when looking up the filter for a given thread.  The first time, we
> know that there are no existing filters; the second time, we are really
> only interested in a single filter.
>
> Wouldn't it suffice to lookup the filter in get_filter() and, if it doesn't
> exist, create a new one?

Yes, we could get rid of the call to _update_filter_dict() in register_filter().
The main reason I added it was to clean the obsolete filters whenever
possible. I don't see a clear performance advantage if we would remove
the call (without having a thread exit notification).
We need to clean up the same amount of filters at some point.

> That leaves removing obsolete filters.  Could this be done with some
> thread notification?

IIRC, you suggested this previously. I replied that there is no python API
that I am aware of that can do this. The python events API doesn't expose
thread exited events.

The notification approach would have the disadvantage that GDB might
make a lot of calls to python to clean up single filters. Of course, the
current approach has the disadvantage that it manually keeps the thread
list and filter dict in sync.

> 
> >+def _clear_traces(thread_list):
> >+    """Helper function to clear the trace of all threads in THREAD_LIST."""
> >+    current_thread = gdb.selected_thread()
> >+
> >+    recording = gdb.current_recording()
> >+
> >+    if (recording is not None):
> 
> Let's remove the empty line to group the statements.

Will be done.

> 
> >+/* Helper function that calls the ptwrite filter PTW_FILTER with
> >+   PAYLOAD and IP as arguments.  Returns a pointer to the string that will
> >+   be printed or nullptr if nothing should be printed.  IP can be nullptr,
> >+   PAYLOAD must point to a valid integer.  */
> >+std::string
> >+recpy_call_filter (const uint64_t payload, const uint64_t ip,
> >+		   const void *ptw_filter)
> 
> The comment doesn't match the code.  It cannot return nullptr.  Also,
> IP and PAYLOAD are integers, not pointers.
> 
> I think we can shorten "calls the ptwrite filter PTW_FILTER" to just
> "calls PTW_FILTER".

Right, I missed that. Will be fixed, thanks.

> 
> >+{
> >+  std::string result;
> >+
> >+  if ((PyObject *) ptw_filter == Py_None)
> >+    return result;
> >+  else if ((PyObject *) ptw_filter == nullptr)
> >+    error (_("No valid ptwrite filter."));
> 
> No need for else.  Should we check nullptr first?

Will be fixed.

> 
> >+/* Helper function returning the current ptwrite filter.  Returns nullptr
> >+   in case of errors.  */
> >+
> >+PyObject *
> >+get_ptwrite_filter ()
> >+{
> >+  PyObject *module = PyImport_ImportModule ("gdb.ptwrite");
> >+
> >+  if (PyErr_Occurred ())
> >+  {
> >+    gdbpy_print_stack ();
> >+    return nullptr;
> >+  }
> 
> Do we want to print the stack without throwing an error?
> 

I added an error (and unified the errors of the two functions a bit).

> >+/* Used for registering the default ptwrite filter to the current thread.  A
> 
> What does 'default' mean, here?  We're registering the callback that btrace
> calls on PTW.  From btrace's perspective, this is the ptwrite filter.  From
> python's perspective, this is maybe a proxy for the python filter.
> 
> >+   pointer to this function is stored in the python extension interface.  */
> >+
> >+void
> >+gdbpy_load_ptwrite_filter (const struct extension_language_defn
> *extlang,
> >+			   struct btrace_thread_info *btinfo)
> >+{
> >+  if (!gdb_python_initialized || btinfo == nullptr)
> >+    return;
> 
> Isn't this an error or even an internal error case?

gdb_python_initialized is actually checked by gdbpy_enter, so I removed it.
I added an assert for btinfo.

> 
> >+/* Callback function for the ptwrite filter.  */
> >+extern std::string recpy_call_filter (const uint64_t payload,
> >+				      const uint64_t ip,
> >+				      const void *ptw_filter);
> 
> Should the comment say something like 'proxy for the python
> ptwrite filter'?

See discussion above. In the extension language interface in GDB this is
a callback. How about:

-/* Callback function for the ptwrite filter.  */
+/* Helper function to call the ptwrite filter.  */

That makes it more in line with other python/*.h files.
Intel Deutschland GmbH
Registered Address: Am Campeon 10, 85579 Neubiberg, Germany
Tel: +49 89 99 8853-0, www.intel.de <http://www.intel.de>
Managing Directors: Christin Eisenschmid, Sharon Heck, Tiffany Doon Silva  
Chairperson of the Supervisory Board: Nicole Lau
Registered Office: Munich
Commercial Register: Amtsgericht Muenchen HRB 186928

next prev parent reply	other threads:[~2022-07-11 12:48 UTC|newest]

Thread overview: 32+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-06-22 11:43 [PATCH v5 00/10] Extensions for PTWRITE Felix Willgerodt
2022-06-22 11:43 ` [PATCH v5 01/10] btrace: Introduce auxiliary instructions Felix Willgerodt
2022-06-28  9:10   ` Metzger, Markus T
2022-06-22 11:43 ` [PATCH v5 02/10] btrace: Enable auxiliary instructions in record instruction-history Felix Willgerodt
2022-06-28  9:10   ` Metzger, Markus T
2022-06-28 11:28     ` Willgerodt, Felix
2022-06-29 10:43       ` Metzger, Markus T
2022-06-22 11:43 ` [PATCH v5 03/10] btrace: Enable auxiliary instructions in record function-call-history Felix Willgerodt
2022-06-28  9:10   ` Metzger, Markus T
2022-09-19  8:59     ` Willgerodt, Felix
2022-06-22 11:43 ` [PATCH v5 04/10] btrace: Handle stepping and goto for auxiliary instructions Felix Willgerodt
2022-06-28  9:11   ` Metzger, Markus T
2022-06-22 11:43 ` [PATCH v5 05/10] python: Introduce gdb.RecordAuxiliary class Felix Willgerodt
2022-06-28  9:11   ` Metzger, Markus T
2022-07-11 12:48     ` Willgerodt, Felix
2022-06-22 11:43 ` [PATCH v5 06/10] python: Add clear() to gdb.Record Felix Willgerodt
2022-06-28  9:11   ` Metzger, Markus T
2022-06-22 11:43 ` [PATCH v5 07/10] btrace, gdbserver: Add ptwrite to btrace_config_pt Felix Willgerodt
2022-06-28  9:11   ` Metzger, Markus T
2022-06-22 11:43 ` [PATCH v5 08/10] btrace, linux: Enable ptwrite packets Felix Willgerodt
2022-06-28  9:12   ` Metzger, Markus T
2022-06-22 11:43 ` [PATCH v5 09/10] btrace, python: Enable ptwrite filter registration Felix Willgerodt
2022-06-28 13:59   ` Metzger, Markus T
2022-07-11 12:48     ` Willgerodt, Felix [this message]
2022-07-12 12:23       ` Metzger, Markus T
2022-07-13  8:49         ` Willgerodt, Felix
2022-07-13 15:20           ` Metzger, Markus T
2022-07-26 14:08             ` Willgerodt, Felix
2022-09-14  8:37               ` Metzger, Markus T
2022-06-22 11:43 ` [PATCH v5 10/10] btrace: Extend ptwrite event decoding Felix Willgerodt
2022-06-29 13:35   ` Metzger, Markus T
2022-09-19  8:59     ` Willgerodt, Felix

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=MN2PR11MB4566FC635ABD832B02A4DEB58E879@MN2PR11MB4566.namprd11.prod.outlook.com \
    --to=felix.willgerodt@intel.com \
    --cc=gdb-patches@sourceware.org \
    --cc=markus.t.metzger@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).