public inbox for gdb-patches@sourceware.org
 help / color / mirror / Atom feed
* [PATCH] [python] Optimize python_string_to_host_string() for Python 2.
@ 2015-01-14  6:38 Yichun Zhang (agentzh)
  2015-01-19 22:08 ` Yichun Zhang (agentzh)
                   ` (3 more replies)
  0 siblings, 4 replies; 8+ messages in thread
From: Yichun Zhang (agentzh) @ 2015-01-14  6:38 UTC (permalink / raw)
  To: gdb-patches; +Cc: Yichun Zhang (agentzh)

Hi,

I have been writing quite some advanced GDB Python tools for LuaJIT
and NGINX in the nginx-gdb-utils project [1] and have been suffering
from serious performance issues even for relatively small working sets.

I have noticed from a typical on-CPU C-land Flame Graph [2]
for my tools that python_string_to_host_string() is painfully slow
when using Python 2 (2.7 to be more specific) due to the
expensive and meaningless unicode conversions there (involved with
hot function calls like utf_8_decode).

With the following patch, my most complicated Python tools
finally have comparable performance between Python 2 and
Python 3. In the case of Python 2, some real-world Python scripts'
overall speedup can be as big as 34% (for my "lgcpath" command [3])
or even 57% (for my "lgcstat" command [4]). And from the new Flame
Graph [5], we can see that the corresponding function frames are
indeed gone.

Comments are welcome!

[1] https://github.com/openresty/nginx-gdb-utils#readme
[2] http://agentzh.org/misc/flamegraph/gdb-py-lgcstat-2015-0113.svg
[3] https://github.com/openresty/nginx-gdb-utils#lgcpath
[4] https://github.com/openresty/nginx-gdb-utils#lgcstat
[5] http://agentzh.org/misc/flamegraph/patched-gdb-py-lgcstat-2015-0113.svg

gdb/ChangeLog:

    * python/py-utils.c (python_string_to_host_string): use
    xstrdup directly for Python 2.
    * python/py-utils.c (gdbpy_obj_to_string): use
    python_string_to_host_string for both Python 2 and 3.
---
 gdb/python/py-utils.c | 9 ++++-----
 1 file changed, 4 insertions(+), 5 deletions(-)

diff --git a/gdb/python/py-utils.c b/gdb/python/py-utils.c
index 58a5934..ed7be3b 100644
--- a/gdb/python/py-utils.c
+++ b/gdb/python/py-utils.c
@@ -209,6 +209,7 @@ python_string_to_target_python_string (PyObject *obj)
 char *
 python_string_to_host_string (PyObject *obj)
 {
+#ifdef IS_PY3K
   PyObject *str;
   char *result;
 
@@ -219,6 +220,9 @@ python_string_to_host_string (PyObject *obj)
   result = unicode_to_encoded_string (str, host_charset ());
   Py_DECREF (str);
   return result;
+#else
+  return xstrdup (PyString_AsString (obj));
+#endif
 }
 
 /* Return true if OBJ is a Python string or unicode object, false
@@ -245,12 +249,7 @@ gdbpy_obj_to_string (PyObject *obj)
 
   if (str_obj != NULL)
     {
-#ifdef IS_PY3K
       char *msg = python_string_to_host_string (str_obj);
-#else
-      char *msg = xstrdup (PyString_AsString (str_obj));
-#endif
-
       Py_DECREF (str_obj);
       return msg;
     }
-- 
1.8.5.6

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] [python] Optimize python_string_to_host_string() for Python 2.
  2015-01-14  6:38 [PATCH] [python] Optimize python_string_to_host_string() for Python 2 Yichun Zhang (agentzh)
@ 2015-01-19 22:08 ` Yichun Zhang (agentzh)
  2015-01-20 11:13   ` Phil Muldoon
  2015-02-01 23:33 ` Yichun Zhang (agentzh)
                   ` (2 subsequent siblings)
  3 siblings, 1 reply; 8+ messages in thread
From: Yichun Zhang (agentzh) @ 2015-01-19 22:08 UTC (permalink / raw)
  To: gdb-patches

Hello!

On Tue, Jan 13, 2015 at 10:38 PM, Yichun Zhang (agentzh) wrote:
> With the following patch, my most complicated Python tools
> finally have comparable performance between Python 2 and
> Python 3. In the case of Python 2, some real-world Python scripts'
> overall speedup can be as big as 34% (for my "lgcpath" command [3])
> or even 57% (for my "lgcstat" command [4]). And from the new Flame
> Graph [5], we can see that the corresponding function frames are
> indeed gone.
>
> Comments are welcome!
>

Does the silence in the last 6 days mean that this patch is completely wrong?

Actually I have a few more patches to submit to optimize this
gdb/python part even further. The silence here makes me hesitate ;)

Please let me know if I'm doing something wrong here. Very much appreciated!

Thanks!
-agentzh

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] [python] Optimize python_string_to_host_string() for Python 2.
  2015-01-19 22:08 ` Yichun Zhang (agentzh)
@ 2015-01-20 11:13   ` Phil Muldoon
  2015-01-20 13:13     ` Joel Brobecker
  0 siblings, 1 reply; 8+ messages in thread
From: Phil Muldoon @ 2015-01-20 11:13 UTC (permalink / raw)
  To: Yichun Zhang (agentzh), gdb-patches

On 19/01/15 22:08, Yichun Zhang (agentzh) wrote:
> Hello!
>
> On Tue, Jan 13, 2015 at 10:38 PM, Yichun Zhang (agentzh) wrote:
>> With the following patch, my most complicated Python tools
>> finally have comparable performance between Python 2 and
>> Python 3. In the case of Python 2, some real-world Python scripts'
>> overall speedup can be as big as 34% (for my "lgcpath" command [3])
>> or even 57% (for my "lgcstat" command [4]). And from the new Flame
>> Graph [5], we can see that the corresponding function frames are
>> indeed gone.
>>
>> Comments are welcome!
>>
>
> Does the silence in the last 6 days mean that this patch is completely wrong?

Not at all.  I think the post Christmas catch-up has affected patch
review and there is some lag.  I am sure the maintainers will review
your patch soon.

> Actually I have a few more patches to submit to optimize this
> gdb/python part even further. The silence here makes me hesitate ;)

Please submit them. There is no hesitation on commenting on patches
constructively. For my part, I find no issue with people indicating
what might be wrong (or right) with my patches!

> Please let me know if I'm doing something wrong here. Very much appreciated!

You are doing nothing wrong.  Keep pinging your patches. The usual
wait time for pinging a patch is seven days.

Cheers

Phil


^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] [python] Optimize python_string_to_host_string() for Python 2.
  2015-01-20 11:13   ` Phil Muldoon
@ 2015-01-20 13:13     ` Joel Brobecker
  2015-01-20 22:19       ` Yichun Zhang (agentzh)
  0 siblings, 1 reply; 8+ messages in thread
From: Joel Brobecker @ 2015-01-20 13:13 UTC (permalink / raw)
  To: Phil Muldoon; +Cc: Yichun Zhang (agentzh), gdb-patches

In addition to Phil's answer (Thank You, Phil!), feel free to ping us
about pending patches like you just did on a weekly basis. We're all
volunteers, so we do not always have the time to review right away.

-- 
Joel

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] [python] Optimize python_string_to_host_string() for Python 2.
  2015-01-20 13:13     ` Joel Brobecker
@ 2015-01-20 22:19       ` Yichun Zhang (agentzh)
  0 siblings, 0 replies; 8+ messages in thread
From: Yichun Zhang (agentzh) @ 2015-01-20 22:19 UTC (permalink / raw)
  To: Joel Brobecker; +Cc: Phil Muldoon, gdb-patches

Hello!

I see. Thank you for the replies, Phil and Joel!

Oh yeah, as the maintainer of many opensource projects myself, I
totally understand the situation especially given the relatively high
traffic volume on this list :)

That's cool and I'll keep publishing more patches here :)

Best regards,
-agentzh

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] [python] Optimize python_string_to_host_string() for Python 2.
  2015-01-14  6:38 [PATCH] [python] Optimize python_string_to_host_string() for Python 2 Yichun Zhang (agentzh)
  2015-01-19 22:08 ` Yichun Zhang (agentzh)
@ 2015-02-01 23:33 ` Yichun Zhang (agentzh)
  2015-02-22 18:01 ` Yichun Zhang (agentzh)
  2015-03-03 16:12 ` Joel Brobecker
  3 siblings, 0 replies; 8+ messages in thread
From: Yichun Zhang (agentzh) @ 2015-02-01 23:33 UTC (permalink / raw)
  To: gdb-patches

Hi

On Tue, Jan 13, 2015 at 10:38 PM, Yichun Zhang (agentzh) wrote:
> With the following patch, my most complicated Python tools
> finally have comparable performance between Python 2 and
> Python 3. In the case of Python 2, some real-world Python scripts'
> overall speedup can be as big as 34% (for my "lgcpath" command [3])
> or even 57% (for my "lgcstat" command [4]). And from the new Flame
> Graph [5], we can see that the corresponding function frames are
> indeed gone.

ping :)

-agentzh

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] [python] Optimize python_string_to_host_string() for Python 2.
  2015-01-14  6:38 [PATCH] [python] Optimize python_string_to_host_string() for Python 2 Yichun Zhang (agentzh)
  2015-01-19 22:08 ` Yichun Zhang (agentzh)
  2015-02-01 23:33 ` Yichun Zhang (agentzh)
@ 2015-02-22 18:01 ` Yichun Zhang (agentzh)
  2015-03-03 16:12 ` Joel Brobecker
  3 siblings, 0 replies; 8+ messages in thread
From: Yichun Zhang (agentzh) @ 2015-02-22 18:01 UTC (permalink / raw)
  To: gdb-patches

Hello!

On Tue, Jan 13, 2015 at 10:38 PM, Yichun Zhang (agentzh) wrote:
> With the following patch, my most complicated Python tools
> finally have comparable performance between Python 2 and
> Python 3. In the case of Python 2, some real-world Python scripts'
> overall speedup can be as big as 34% (for my "lgcpath" command [3])
> or even 57% (for my "lgcstat" command [4]). And from the new Flame
> Graph [5], we can see that the corresponding function frames are
> indeed gone.
>

ping

Regards,
-agentzh

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] [python] Optimize python_string_to_host_string() for Python 2.
  2015-01-14  6:38 [PATCH] [python] Optimize python_string_to_host_string() for Python 2 Yichun Zhang (agentzh)
                   ` (2 preceding siblings ...)
  2015-02-22 18:01 ` Yichun Zhang (agentzh)
@ 2015-03-03 16:12 ` Joel Brobecker
  3 siblings, 0 replies; 8+ messages in thread
From: Joel Brobecker @ 2015-03-03 16:12 UTC (permalink / raw)
  To: Yichun Zhang (agentzh); +Cc: gdb-patches

Hello,

First of all, sorry about the delay in review this. I was hoping
that someone more knowledgeable than I would be able to review it.
Thanks for ping us, and for your patience!

> I have been writing quite some advanced GDB Python tools for LuaJIT
> and NGINX in the nginx-gdb-utils project [1] and have been suffering
> from serious performance issues even for relatively small working sets.
> 
> I have noticed from a typical on-CPU C-land Flame Graph [2]
> for my tools that python_string_to_host_string() is painfully slow
> when using Python 2 (2.7 to be more specific) due to the
> expensive and meaningless unicode conversions there (involved with
> hot function calls like utf_8_decode).
[...]
> gdb/ChangeLog:
> 
>     * python/py-utils.c (python_string_to_host_string): use
>     xstrdup directly for Python 2.
>     * python/py-utils.c (gdbpy_obj_to_string): use
>     python_string_to_host_string for both Python 2 and 3.

I believe you, but being a fairly complete dummy in this area,
can you explain why the unicode conversion is meaningless with
Python 2? In fact, if you have that explanation, I think it would
be useful to have that as a comment inside the Python-2 part of
python_string_to_host_string.

Thanks,
-- 
Joel

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2015-03-03 16:12 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-01-14  6:38 [PATCH] [python] Optimize python_string_to_host_string() for Python 2 Yichun Zhang (agentzh)
2015-01-19 22:08 ` Yichun Zhang (agentzh)
2015-01-20 11:13   ` Phil Muldoon
2015-01-20 13:13     ` Joel Brobecker
2015-01-20 22:19       ` Yichun Zhang (agentzh)
2015-02-01 23:33 ` Yichun Zhang (agentzh)
2015-02-22 18:01 ` Yichun Zhang (agentzh)
2015-03-03 16:12 ` Joel Brobecker

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).