public inbox for systemtap@sourceware.org
 help / color / mirror / Atom feed
* [Bug runtime/20820] New: another "soft lockup" BUG on RHEL7 ppc64
@ 2016-11-14 18:57 dsmith at redhat dot com
  2016-11-16 21:26 ` [Bug runtime/20820] " dsmith at redhat dot com
                   ` (19 more replies)
  0 siblings, 20 replies; 21+ messages in thread
From: dsmith at redhat dot com @ 2016-11-14 18:57 UTC (permalink / raw)
  To: systemtap

https://sourceware.org/bugzilla/show_bug.cgi?id=20820

            Bug ID: 20820
           Summary: another "soft lockup" BUG on RHEL7 ppc64
           Product: systemtap
           Version: unspecified
            Status: NEW
          Severity: normal
          Priority: P2
         Component: runtime
          Assignee: systemtap at sourceware dot org
          Reporter: dsmith at redhat dot com
  Target Milestone: ---

When running the testsuite in parallel mode, I'm fairly consistently seeing the
following kernel BUG on 3.10.0-327.el7.ppc64:

========
Nov 11 14:48:47 kernel: BUG: soft lockup - CPU#15 stuck for 22s! [stapio:12395]
Nov 11 14:48:47 kernel: Modules linked in:
stap_26acea073d2390b6b4b93e91ac0f9692_13862(OE)
stap_0c9ef08fe0dae436a4433f8136bb310_13790(OE)
stap_bc2a857b167e703bc87d6cae0b9ba9d_13731(OE)
stap_b1cc2deb9fb8eb377283fff95e42020_12899(OE)
stap_0bc821bb685c8518da835f4f413c3a9e_12842(OE) PROCFS_BUFFER8(OE)
stap_6764e915e63f55da6e9c6bb630e1f83_12395(OE)
stap_4d4f80217a831e70ed5d551c209bfdf_12296(OE)
stap_1fee09fddaf3e217bc1be5bca0b707ca_21658(OE) binfmt_misc tun sg pseries_rng
nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs libcrc32c sd_mod
crc_t10dif crct10dif_generic crct10dif_common ibmvscsi scsi_transport_srp
ibmveth scsi_tgt dm_mirror dm_region_hash dm_log dm_mod [last unloaded:
stap_f9cfa97efba72cc8292ebcc4094674e_11936]
Nov 11 14:48:47 kernel: CPU: 15 PID: 12395 Comm: stapio Tainted: G           OE
 ------------   3.10.0-327.el7.ppc64 #1
Nov 11 14:48:47 kernel: task: c0000005f04be910 ti: c0000005eaf6c000 task.ti:
c0000005eaf6c000
Nov 11 14:48:47 kernel: NIP: d000000005dc3b80 LR: d000000005dc6e6c CTR:
c0000000004b9440
Nov 11 14:48:47 kernel: REGS: c0000005eaf6f710 TRAP: 0901   Tainted: G         
 OE  ------------    (3.10.0-327.el7.ppc64)
Nov 11 14:48:47 kernel: MSR: 8000000000009032 <SF,EE,ME,IR,DR,RI>  CR: 88004428
 XER: 00000000
Nov 11 14:48:47 kernel: CFAR: d000000005dc3ba0 SOFTE: 1
                                                             GPR00:
d000000005dc6e6c c0000005eaf6f990 d000000005dd6a58 d000000007230000
                                                             GPR04:
000000000000bcd7 00000000000001e2 f9b5fab84824369d ffffffffc2b2ae35
                                                             GPR08:
000000000001ffff d000000007844910 000000000003029b d000000005dca4b0
                                                             GPR12:
c0000000004b9440 c000000007b38700
Nov 11 14:48:47 kernel: NIP [d000000005dc3b80] ._stp_map_set_ii+0x100/0x250
[stap_6764e915e63f55da6e9c6bb630e1f83_12395]
Nov 11 14:48:47 kernel: LR [d000000005dc6e6c] .probe_2893+0x11c/0x1b00
[stap_6764e915e63f55da6e9c6bb630e1f83_12395]
Nov 11 14:48:47 kernel: Call Trace:
Nov 11 14:48:47 kernel: [c0000005eaf6f990] [d000000005dc3bdc]
._stp_map_set_ii+0x15c/0x250 [stap_6764e915e63f55da6e9c6bb630e1f83_12395]
(unreliable)
Nov 11 14:48:47 kernel: [c0000005eaf6fa30] [d000000005dc6e6c]
.probe_2893+0x11c/0x1b00 [stap_6764e915e63f55da6e9c6bb630e1f83_12395]
Nov 11 14:48:47 kernel: [c0000005eaf6fb30] [d000000005dc674c]
.enter_be_probe+0x14c/0x240 [stap_6764e915e63f55da6e9c6bb630e1f83_12395]
Nov 11 14:48:47 kernel: [c0000005eaf6fbd0] [d000000005dc9a8c]
.systemtap_module_init+0x7fc/0x850 [stap_6764e915e63f55da6e9c6bb630e1f83_12395]
Nov 11 14:48:47 kernel: [c0000005eaf6fcd0] [d000000005dc9ccc]
._stp_ctl_write_cmd+0x1ec/0x644 [stap_6764e915e63f55da6e9c6bb630e1f83_12395]
Nov 11 14:48:47 kernel: [c0000005eaf6fd80] [c0000000002f089c]
.SyS_write+0x14c/0x400
Nov 11 14:48:47 kernel: [c0000005eaf6fe30] [c00000000000a17c]
system_call+0x38/0xb4
Nov 11 14:48:47 kernel: Instruction dump:
Nov 11 14:48:47 kernel: 7f9c4038 7b9c0020 3b9c000c 7b9c1f24 7d3ee02a 7f5ee214
2fa90000 419e0104
Nov 11 14:48:47 kernel: 3929fff0 60000000 60000000 60420000 <2fa90000> 419e0020
e9490020 7fbf5000
========

This typically happens once per testsuite run. I don't know yet which test (or
set of tests) trigger this problem.

-- 
You are receiving this mail because:
You are the assignee for the bug.

^ permalink raw reply	[flat|nested] 21+ messages in thread

end of thread, other threads:[~2017-02-08 17:05 UTC | newest]

Thread overview: 21+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-11-14 18:57 [Bug runtime/20820] New: another "soft lockup" BUG on RHEL7 ppc64 dsmith at redhat dot com
2016-11-16 21:26 ` [Bug runtime/20820] " dsmith at redhat dot com
2016-11-18 17:22 ` dsmith at redhat dot com
2016-11-18 19:01 ` dsmith at redhat dot com
2016-11-18 20:15 ` mcermak at redhat dot com
2016-11-24 16:06 ` mcermak at redhat dot com
2016-11-24 17:30 ` fche at redhat dot com
2016-11-28 16:55 ` dsmith at redhat dot com
2016-11-28 17:48 ` mcermak at redhat dot com
2016-11-29 13:47 ` mcermak at redhat dot com
2016-11-29 13:48 ` mcermak at redhat dot com
2016-11-30 19:10 ` dsmith at redhat dot com
2016-12-01 13:04 ` mcermak at redhat dot com
2016-12-01 16:11 ` dsmith at redhat dot com
2016-12-01 16:44 ` dsmith at redhat dot com
2016-12-01 19:29 ` jistone at redhat dot com
2016-12-01 20:38 ` dsmith at redhat dot com
2016-12-01 20:52 ` jistone at redhat dot com
2016-12-02 15:01 ` mcermak at redhat dot com
2016-12-02 17:30 ` dsmith at redhat dot com
2017-02-08 17:05 ` dsmith at redhat dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).