Date: Tue, 24 Apr 2007 19:07:00 -0000
From: "wcohen at redhat dot com"
To: systemtap@sources.redhat.com
Message-ID: <20070424200707.4420.wcohen@redhat.com>
Reply-To: sourceware-bugzilla@sourceware.org
Subject: [Bug kprobes/4420] New: systemtap.samples/lket.exp test crashing RHEL4U4 machine
X-Bugzilla-Reason: AssignedTo

Looking into why a RHEL4U4 i686 machine (2.6.9-42.0.10.EL i686 kernel) is dying during the snapshot testing. Some of the kernel error messages look similar to an earlier closed bug, 2726. However, the tests from 2726 work. The problem needs to be narrowed down further.

Looking through the systemtap.log for the testsuite, the test that is crashing the machine is:

Running /home/wcohen/stap_testing_200704240830/src/testsuite/systemtap.samples/lket.exp ...

From systemtap.log:

Running /home/wcohen/stap_testing_200704240830/src/testsuite/systemtap.samples/lket.exp ...
Pass 1: parsed user script and 54 library script(s) in 740usr/30sys/805real ms.
Pass 2: analyzed script: 857 probe(s), 310 function(s), 24 embed(s), 132 global(s) in 70000usr/170sys/70548real ms.
Pass 3: translated to C into "/tmp/stapfJv2vF/stap_997dc0e24dfee70b8a1d0811d8a016a9_641435.c" in 810usr/20sys/832real ms.
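As a reading aid for the pass timings quoted from systemtap.log, here is a minimal Python sketch that pulls out the usr/sys/real milliseconds per pass. It is not part of the original report: the function name parse_pass_times and the regular expression are assumptions inferred only from the three "Pass N" lines quoted here, and other stap versions may format these lines differently.

```python
import re

# Assumed line shape, inferred only from the log lines quoted in this report:
#   "Pass N: <description> in <usr>usr/<sys>sys/<real>real ms."
PASS_RE = re.compile(r"Pass (\d+): .*? in (\d+)usr/(\d+)sys/(\d+)real ms\.")

# The three pass lines copied from the systemtap.log excerpt in this report.
LOG = """\
Pass 1: parsed user script and 54 library script(s) in 740usr/30sys/805real ms.
Pass 2: analyzed script: 857 probe(s), 310 function(s), 24 embed(s), 132 global(s) in 70000usr/170sys/70548real ms.
Pass 3: translated to C into "/tmp/stapfJv2vF/stap_997dc0e24dfee70b8a1d0811d8a016a9_641435.c" in 810usr/20sys/832real ms.
"""


def parse_pass_times(log_text):
    """Return {pass_number: (usr_ms, sys_ms, real_ms)} for every matching line."""
    times = {}
    for match in PASS_RE.finditer(log_text):
        number, usr_ms, sys_ms, real_ms = (int(g) for g in match.groups())
        times[number] = (usr_ms, sys_ms, real_ms)
    return times


if __name__ == "__main__":
    times = parse_pass_times(LOG)
    # Pick the pass with the largest wall-clock ("real") time.
    slowest = max(times, key=lambda n: times[n][2])
    print(slowest, times[slowest])  # prints: 2 (70000, 170, 70548)
```

On the quoted lines this singles out Pass 2 (about 70 seconds of wall-clock time for 857 probes), which fits the later observation that the generated module is large.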

Taking a look at the directory:

$ ls -l /tmp/stapfJv2vF/
total 9656
-rw-r--r-- 1 wcohen wcohen     795 Apr 24 04:58 Makefile
-rw-r--r-- 1 wcohen wcohen 4378845 Apr 24 04:58 stap_997dc0e24dfee70b8a1d0811d8a016a9_641435.c
-rw-r--r-- 1 wcohen wcohen 2722463 Apr 24 04:59 stap_997dc0e24dfee70b8a1d0811d8a016a9_641435.ko
-rw-r--r-- 1 wcohen wcohen    3067 Apr 24 04:59 stap_997dc0e24dfee70b8a1d0811d8a016a9_641435.mod.c
-rw-r--r-- 1 wcohen wcohen   34604 Apr 24 04:59 stap_997dc0e24dfee70b8a1d0811d8a016a9_641435.mod.o
-rw-r--r-- 1 wcohen wcohen 2688948 Apr 24 04:59 stap_997dc0e24dfee70b8a1d0811d8a016a9_641435.o

Pretty big module. The following caused the kernel to crash:

sudo /home/wcohen/stap_testing_200704240830/install/bin/staprun /tmp/stapfJv2vF/stap_997dc0e24dfee70b8a1d0811d8a016a9_641435.ko

Oops output on console.

slingshot.devel.redhat.com login: Kernel panic - not syncing: kernel/module.c:24
<0>Kernel panic - not syncing: kernel/sched.c:2430: spin_lock(kernel/sched.c:c5
Badness in panic at kernel/panic.c:118
 [] panic+0x135/0x142
 [] scheduler_tick+0x21d/0x4aa
 [] do_timer+0x29/0xb5
 [] timer_interrupt+0x165/0x25a
 [] handle_IRQ_event+0x25/0x4f
 [] do_IRQ+0x18a/0x2bf
 =======================
 [] search_extable+0x1f/0x36
 [] common_interrupt+0x18/0x20
 [] search_extable+0x1f/0x36
 [] panic+0xef/0x142
 [] search_extable+0x1f/0x36
 [] search_module_extables+0x6d/0x13b
 [] search_extable+0x1f/0x36
 [] search_exception_tables+0x1f/0x21
 [] fixup_exception+0xb/0x20
 [] kprobe_exceptions_notify+0x187/0x19b
 [] notifier_call_chain+0x17/0x2e
 [] do_page_fault+0x0/0x4dc
 [] do_page_fault+0x52/0x4dc
 [] __cond_resched+0x14/0x3b
 [] __getblk+0x2b/0x49
 [] ext3_get_inode_loc+0x4f/0x223 [ext3]
 [] do_IRQ+0x2b3/0x2bf
 [] common_interrupt+0x18/0x20
 [] do_page_fault+0x0/0x4dc
 [] error_code+0x2f/0x38
 []<0>Kernel panic - not syncing: kernel/module.c:2114: spin_lock(ker4

--
           Summary: systemtap.samples/lket.exp test crashing RHEL4U4 machine
           Product: systemtap
           Version: unspecified
            Status: NEW
          Severity: critical
          Priority: P2
         Component: kprobes
        AssignedTo: systemtap at sources dot redhat dot com
        ReportedBy: wcohen at redhat dot com
GCC target triplet: i386-linux

http://sourceware.org/bugzilla/show_bug.cgi?id=4420

------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.