public inbox for systemtap@sourceware.org
 help / color / mirror / Atom feed
* [Bug runtime/17461] New: probing process.end crashes on busy systems
@ 2014-10-06 20:49 jlebon at redhat dot com
  2014-10-06 20:50 ` [Bug runtime/17461] " jlebon at redhat dot com
                   ` (4 more replies)
  0 siblings, 5 replies; 6+ messages in thread
From: jlebon at redhat dot com @ 2014-10-06 20:49 UTC (permalink / raw)
  To: systemtap

https://sourceware.org/bugzilla/show_bug.cgi?id=17461

            Bug ID: 17461
           Summary: probing process.end crashes on busy systems
           Product: systemtap
           Version: unspecified
            Status: NEW
          Severity: normal
          Priority: P2
         Component: runtime
          Assignee: systemtap at sourceware dot org
          Reporter: jlebon at redhat dot com

Created attachment 7816
  --> https://sourceware.org/bugzilla/attachment.cgi?id=7816&action=edit
crash_testcase.exp

Running the following simple script on a busy system (where many processes are
created/destroyed quickly) eventually causes the system to lock up. It takes a
while sometimes to occur (e.g. 1-2 hours), but it always does. I haven't been
able so far to determine the cause of the issue, although the backtraces might
implicate utrace.

-- 
You are receiving this mail because:
You are the assignee for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug runtime/17461] probing process.end crashes on busy systems
  2014-10-06 20:49 [Bug runtime/17461] New: probing process.end crashes on busy systems jlebon at redhat dot com
@ 2014-10-06 20:50 ` jlebon at redhat dot com
  2014-10-06 20:52 ` jlebon at redhat dot com
                   ` (3 subsequent siblings)
  4 siblings, 0 replies; 6+ messages in thread
From: jlebon at redhat dot com @ 2014-10-06 20:50 UTC (permalink / raw)
  To: systemtap

https://sourceware.org/bugzilla/show_bug.cgi?id=17461

--- Comment #1 from Jonathan Lebon <jlebon at redhat dot com> ---
Created attachment 7817
  --> https://sourceware.org/bugzilla/attachment.cgi?id=7817&action=edit
dmesg.log

-- 
You are receiving this mail because:
You are the assignee for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug runtime/17461] probing process.end crashes on busy systems
  2014-10-06 20:49 [Bug runtime/17461] New: probing process.end crashes on busy systems jlebon at redhat dot com
  2014-10-06 20:50 ` [Bug runtime/17461] " jlebon at redhat dot com
@ 2014-10-06 20:52 ` jlebon at redhat dot com
  2014-10-06 21:12 ` dsmith at redhat dot com
                   ` (2 subsequent siblings)
  4 siblings, 0 replies; 6+ messages in thread
From: jlebon at redhat dot com @ 2014-10-06 20:52 UTC (permalink / raw)
  To: systemtap

https://sourceware.org/bugzilla/show_bug.cgi?id=17461

--- Comment #2 from Jonathan Lebon <jlebon at redhat dot com> ---
Forgot to add, this happened on f20 3.16.2-200 on git stap at least as of
commit 3525152, but also earlier (including prior to the rt patches). Will try
to do a bisect.

-- 
You are receiving this mail because:
You are the assignee for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug runtime/17461] probing process.end crashes on busy systems
  2014-10-06 20:49 [Bug runtime/17461] New: probing process.end crashes on busy systems jlebon at redhat dot com
  2014-10-06 20:50 ` [Bug runtime/17461] " jlebon at redhat dot com
  2014-10-06 20:52 ` jlebon at redhat dot com
@ 2014-10-06 21:12 ` dsmith at redhat dot com
  2015-05-13 20:31 ` jlebon at redhat dot com
  2021-05-04  1:06 ` fche at redhat dot com
  4 siblings, 0 replies; 6+ messages in thread
From: dsmith at redhat dot com @ 2014-10-06 21:12 UTC (permalink / raw)
  To: systemtap

https://sourceware.org/bugzilla/show_bug.cgi?id=17461

David Smith <dsmith at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |dsmith at redhat dot com

--- Comment #3 from David Smith <dsmith at redhat dot com> ---
I'd certainly suspect utrace, especially since I see utrace_free() in your
dmesg output. However, I also see _raw_spin_lock, and that's got me confused.
We added some patches recently to add support for realtime kernels, but we
shouldn't be using raw spinlocks anywhere but realtime kernels.

The only real utrace change lately was the following:

====
commit d9d07e99777c6e7aaaa8db0049c5fd5e5a2f01b0                                 
Author: David Smith <dsmith@redhat.com>                                         
Date:   Fri Jul 18 15:49:39 2014 -0500                                          

    Fixed PR17181 by making utrace handle interrupting processes better.        
====

-- 
You are receiving this mail because:
You are the assignee for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug runtime/17461] probing process.end crashes on busy systems
  2014-10-06 20:49 [Bug runtime/17461] New: probing process.end crashes on busy systems jlebon at redhat dot com
                   ` (2 preceding siblings ...)
  2014-10-06 21:12 ` dsmith at redhat dot com
@ 2015-05-13 20:31 ` jlebon at redhat dot com
  2021-05-04  1:06 ` fche at redhat dot com
  4 siblings, 0 replies; 6+ messages in thread
From: jlebon at redhat dot com @ 2015-05-13 20:31 UTC (permalink / raw)
  To: systemtap

https://sourceware.org/bugzilla/show_bug.cgi?id=17461

--- Comment #4 from Jonathan Lebon <jlebon at redhat dot com> ---
Created attachment 8312
  --> https://sourceware.org/bugzilla/attachment.cgi?id=8312&action=edit
dmesg.log

This is still an issue on the latest f20 3.19.5 with the latest git stap.
Interestingly, adding debug statements in utrace_free() confirms that the crash
does not happen there, but the rest of the stack is still very similar (showing
a backtrace coming from exit() related calls).

-- 
You are receiving this mail because:
You are the assignee for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Bug runtime/17461] probing process.end crashes on busy systems
  2014-10-06 20:49 [Bug runtime/17461] New: probing process.end crashes on busy systems jlebon at redhat dot com
                   ` (3 preceding siblings ...)
  2015-05-13 20:31 ` jlebon at redhat dot com
@ 2021-05-04  1:06 ` fche at redhat dot com
  4 siblings, 0 replies; 6+ messages in thread
From: fche at redhat dot com @ 2021-05-04  1:06 UTC (permalink / raw)
  To: systemtap

https://sourceware.org/bugzilla/show_bug.cgi?id=17461

Frank Ch. Eigler <fche at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
                 CC|                            |fche at redhat dot com
         Resolution|---                         |WORKSFORME

--- Comment #5 from Frank Ch. Eigler <fche at redhat dot com> ---
running this test on a rawhide (5.13-rc0 kernel, 4.5-rc stap), it's solid.

-- 
You are receiving this mail because:
You are the assignee for the bug.

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2021-05-04  1:06 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2014-10-06 20:49 [Bug runtime/17461] New: probing process.end crashes on busy systems jlebon at redhat dot com
2014-10-06 20:50 ` [Bug runtime/17461] " jlebon at redhat dot com
2014-10-06 20:52 ` jlebon at redhat dot com
2014-10-06 21:12 ` dsmith at redhat dot com
2015-05-13 20:31 ` jlebon at redhat dot com
2021-05-04  1:06 ` fche at redhat dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).