* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
@ 2008-10-16 17:43 ` eteo at redhat dot com
2008-10-16 19:21 ` wcohen at redhat dot com
` (15 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: eteo at redhat dot com @ 2008-10-16 17:43 UTC (permalink / raw)
To: systemtap
------- Additional Comments From eteo at redhat dot com 2008-10-16 17:42 -------
(In reply to comment #0)
> The current snapshot of systemtap on f9 x86_64 machine running 2.6.26.5-45.fc9
> x86_64 kernel can be crashed with the following short script:
I'm able to reproduce this on a x86 host as well.
Eugene
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
2008-10-16 17:43 ` [Bug runtime/6964] " eteo at redhat dot com
@ 2008-10-16 19:21 ` wcohen at redhat dot com
2008-10-16 21:53 ` wcohen at redhat dot com
` (14 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: wcohen at redhat dot com @ 2008-10-16 19:21 UTC (permalink / raw)
To: systemtap
------- Additional Comments From wcohen at redhat dot com 2008-10-16 19:20 -------
Removing the "-c ls" will allow this to work:
stap -e 'probe process.syscall, process.end \
{printf("%s %d %s\n", execname(), pid(), pp())}'
The problem is still present on the 2.6.26.6-75.fc9 kernel.
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
2008-10-16 17:43 ` [Bug runtime/6964] " eteo at redhat dot com
2008-10-16 19:21 ` wcohen at redhat dot com
@ 2008-10-16 21:53 ` wcohen at redhat dot com
2008-10-17 20:06 ` roland at gnu dot org
` (13 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: wcohen at redhat dot com @ 2008-10-16 21:53 UTC (permalink / raw)
To: systemtap
------- Additional Comments From wcohen at redhat dot com 2008-10-16 21:51 -------
Did some additional experiments to gather more information about what is going on.
Suggested that things run with "strace -f". The machine did not crash when the
command was prepended with this command.
Found that crashes still occurred with "-c ls" for:
probe process.syscall {}
and:
probe process.end {}
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (2 preceding siblings ...)
2008-10-16 21:53 ` wcohen at redhat dot com
@ 2008-10-17 20:06 ` roland at gnu dot org
2008-10-17 21:00 ` wcohen at redhat dot com
` (12 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: roland at gnu dot org @ 2008-10-17 20:06 UTC (permalink / raw)
To: systemtap
------- Additional Comments From roland at gnu dot org 2008-10-17 20:05 -------
Please show the crash details for a current kernel (f9 or f10).
Ideal would be to reduce this to an example utrace module that demonstrates the
bug without using stap. If it's triggered by ptrace, then a simple test program
using ptrace alone (for the ptrace-tests suite).
--
What |Removed |Added
----------------------------------------------------------------------------
CC| |roland at redhat dot com
Status|NEW |WAITING
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (3 preceding siblings ...)
2008-10-17 20:06 ` roland at gnu dot org
@ 2008-10-17 21:00 ` wcohen at redhat dot com
2008-10-17 21:34 ` wcohen at redhat dot com
` (11 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: wcohen at redhat dot com @ 2008-10-17 21:00 UTC (permalink / raw)
To: systemtap
------- Additional Comments From wcohen at redhat dot com 2008-10-17 20:58 -------
Created an attachment (id=3004)
--> (http://sourceware.org/bugzilla/attachment.cgi?id=3004&action=view)
Kernel oops message for 2.6.26.6-75.fc9.x86_64 kernel
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (4 preceding siblings ...)
2008-10-17 21:00 ` wcohen at redhat dot com
@ 2008-10-17 21:34 ` wcohen at redhat dot com
2008-10-18 5:58 ` fche at redhat dot com
` (10 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: wcohen at redhat dot com @ 2008-10-17 21:34 UTC (permalink / raw)
To: systemtap
------- Additional Comments From wcohen at redhat dot com 2008-10-17 21:33 -------
Compiled the instrumentation one line 'probe process.end {}' separately into
module then ran with:
staprun -vv 6964f.ko -c ls
This caused a crash right after the following line:
stapio:stp_main_loop:372 detaching pid 30260
Looks like things are going wrong after when the following line from
systemtap/runtime/staprun/mainloop.c is attempted:
int rc = ptrace (PTRACE_DETACH, target_pid, 0, 0);
Is there a ptrace tests that register utrace engines similar to what systemtap
generates in the instrumentation module?
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (5 preceding siblings ...)
2008-10-17 21:34 ` wcohen at redhat dot com
@ 2008-10-18 5:58 ` fche at redhat dot com
2008-10-18 18:30 ` fche at redhat dot com
` (9 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: fche at redhat dot com @ 2008-10-18 5:58 UTC (permalink / raw)
To: systemtap
--
What |Removed |Added
----------------------------------------------------------------------------
Status|WAITING |NEW
Last reconfirmed|0000-00-00 00:00:00 |2008-10-18 05:57:44
date| |
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (6 preceding siblings ...)
2008-10-18 5:58 ` fche at redhat dot com
@ 2008-10-18 18:30 ` fche at redhat dot com
2008-10-20 2:06 ` wenji dot huang at oracle dot com
` (8 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: fche at redhat dot com @ 2008-10-18 18:30 UTC (permalink / raw)
To: systemtap
------- Additional Comments From fche at redhat dot com 2008-10-18 18:28 -------
Reported "upstream": https://bugzilla.redhat.com/show_bug.cgi?id=467568
--
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |SUSPENDED
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (7 preceding siblings ...)
2008-10-18 18:30 ` fche at redhat dot com
@ 2008-10-20 2:06 ` wenji dot huang at oracle dot com
2008-10-20 6:53 ` wenji dot huang at oracle dot com
` (7 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: wenji dot huang at oracle dot com @ 2008-10-20 2:06 UTC (permalink / raw)
To: systemtap
------- Additional Comments From wenji dot huang at oracle dot com 2008-10-20 02:05 -------
(In reply to comment #1)
Reproduced it on 2.6.27+latest utrace patch, both x86 and x86_64.
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (8 preceding siblings ...)
2008-10-20 2:06 ` wenji dot huang at oracle dot com
@ 2008-10-20 6:53 ` wenji dot huang at oracle dot com
2008-10-21 3:10 ` wenji dot huang at oracle dot com
` (6 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: wenji dot huang at oracle dot com @ 2008-10-20 6:53 UTC (permalink / raw)
To: systemtap
------- Additional Comments From wenji dot huang at oracle dot com 2008-10-20 06:51 -------
Related code in utrace_get_signal:
if (unlikely(report.killed)) {
/* COMMENT */
sigset_t sigkill_only;
siginitsetinv(&sigkill_only, sigmask(SIGKILL));
spin_lock_irq(&task->sighand->siglock);
signr = dequeue_signal(task, &sigkill_only, info);
BUG_ON(signr != SIGKILL);
*return_ka = task->sighand->action[SIGKILL - 1];
return signr;
}
The BUG_ON is triggered and I debug the signr value which is equal to 0, not
SIGKILL.
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (9 preceding siblings ...)
2008-10-20 6:53 ` wenji dot huang at oracle dot com
@ 2008-10-21 3:10 ` wenji dot huang at oracle dot com
2008-10-21 18:34 ` jan dot kratochvil at redhat dot com
` (5 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: wenji dot huang at oracle dot com @ 2008-10-21 3:10 UTC (permalink / raw)
To: systemtap
------- Additional Comments From wenji dot huang at oracle dot com 2008-10-21 03:09 -------
Did some tests to narrow the scope.
It works fine for:
stap -vve 'probe kernel.function("sys_open"){}' -c ls
stap -vve 'probe process("./test").syscall {}' -c ls
only got crashed when
stap -vve 'probe process.syscall {}' -c ls
I guess the reason is the spawn process by stap is being hitted. Some signals
are delayed/messed so that BUG_ON in utrace_get_signal is triggered.
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (10 preceding siblings ...)
2008-10-21 3:10 ` wenji dot huang at oracle dot com
@ 2008-10-21 18:34 ` jan dot kratochvil at redhat dot com
2008-10-22 9:09 ` wenji dot huang at oracle dot com
` (4 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: jan dot kratochvil at redhat dot com @ 2008-10-21 18:34 UTC (permalink / raw)
To: systemtap
--
What |Removed |Added
----------------------------------------------------------------------------
CC| |jan dot kratochvil at redhat
| |dot com
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (11 preceding siblings ...)
2008-10-21 18:34 ` jan dot kratochvil at redhat dot com
@ 2008-10-22 9:09 ` wenji dot huang at oracle dot com
2008-10-29 2:53 ` fche at redhat dot com
` (3 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: wenji dot huang at oracle dot com @ 2008-10-22 9:09 UTC (permalink / raw)
To: systemtap
------- Additional Comments From wenji dot huang at oracle dot com 2008-10-22 09:08 -------
Created an attachment (id=3015)
--> (http://sourceware.org/bugzilla/attachment.cgi?id=3015&action=view)
workaround for the bug
This patch temporarily disables the PTRACE_TRACEME/DETACH when executing
command. The command will be running once child forked. No kernel panic any
more.
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (12 preceding siblings ...)
2008-10-22 9:09 ` wenji dot huang at oracle dot com
@ 2008-10-29 2:53 ` fche at redhat dot com
2008-11-03 8:53 ` mjw at redhat dot com
` (2 subsequent siblings)
16 siblings, 0 replies; 18+ messages in thread
From: fche at redhat dot com @ 2008-10-29 2:53 UTC (permalink / raw)
To: systemtap
------- Additional Comments From fche at redhat dot com 2008-10-29 02:52 -------
> This patch temporarily disables the PTRACE_TRACEME/DETACH when executing
> command.
Thanks, committed, for I hope not too long :-(
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (13 preceding siblings ...)
2008-10-29 2:53 ` fche at redhat dot com
@ 2008-11-03 8:53 ` mjw at redhat dot com
2008-11-11 21:36 ` mjw at redhat dot com
2008-11-11 22:06 ` fche at redhat dot com
16 siblings, 0 replies; 18+ messages in thread
From: mjw at redhat dot com @ 2008-11-03 8:53 UTC (permalink / raw)
To: systemtap
------- Additional Comments From mjw at redhat dot com 2008-11-03 08:52 -------
This workaround is causing some confusion for people (me included) who use
systemtap git trunk since -c doesn't work reliably now. Is there a timeline to
get this fixed for real?
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (14 preceding siblings ...)
2008-11-03 8:53 ` mjw at redhat dot com
@ 2008-11-11 21:36 ` mjw at redhat dot com
2008-11-11 22:06 ` fche at redhat dot com
16 siblings, 0 replies; 18+ messages in thread
From: mjw at redhat dot com @ 2008-11-11 21:36 UTC (permalink / raw)
To: systemtap
------- Additional Comments From mjw at redhat dot com 2008-11-11 21:35 -------
Currently make installcheck will not really work because the systemtap.syscall
tests all use -c. It often just hangs with this workaround applied.
--
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread
* [Bug runtime/6964] process probes cause kernel crash on f9
2008-10-16 17:10 [Bug runtime/6964] New: process probes cause kernel crash on f9 wcohen at redhat dot com
` (15 preceding siblings ...)
2008-11-11 21:36 ` mjw at redhat dot com
@ 2008-11-11 22:06 ` fche at redhat dot com
16 siblings, 0 replies; 18+ messages in thread
From: fche at redhat dot com @ 2008-11-11 22:06 UTC (permalink / raw)
To: systemtap
------- Additional Comments From fche at redhat dot com 2008-11-11 22:05 -------
I'll fix the -c synchronization logic.
--
What |Removed |Added
----------------------------------------------------------------------------
AssignedTo|systemtap at sources dot |fche at redhat dot com
|redhat dot com |
Status|SUSPENDED |ASSIGNED
http://sourceware.org/bugzilla/show_bug.cgi?id=6964
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
^ permalink raw reply [flat|nested] 18+ messages in thread