From: "dsmith at redhat dot com"
To: systemtap@sourceware.org
Subject: [Bug runtime/19345] New: RHEL 7.0 s390x crash in check.exp
Date: Tue, 08 Dec 2015 18:57:00 -0000

https://sourceware.org/bugzilla/show_bug.cgi?id=19345

            Bug ID: 19345
           Summary: RHEL 7.0 s390x crash in check.exp
           Product: systemtap
           Version: unspecified
            Status: NEW
          Severity: normal
          Priority: P2
         Component: runtime
          Assignee: systemtap at sourceware dot org
          Reporter: dsmith at redhat dot com
  Target Milestone: ---

During some testing on RHEL 7.0, 7.1, and 7.2, I found that the check.exp test
case (which tests all the systemtap examples) causes a crash on RHEL 7.0
(3.10.0-123.el7.s390x). The same test case passes on 7.1 (3.10.0-229.el7.s390x)
and newer kernels.

The crash looks like:

====
[ 5232.627933] Unable to handle kernel pointer dereference at virtual kernel address 000000f440a2e000
[ 5232.627984] Oops: 003b [#1] SMP
[ 5232.627987] Modules linked in: stap_e4e6b8a268df981fe9186b8096f3569c__19634(OF) binfmt_misc sg qeth_l2 vmur nfsd auth_rpcgss nfs_acl lockd sunrpc xfs libcrc32c dasd_fba_mod dasd_eckd_mod dasd_mod qeth lcs ctcm qdio fsm ccwgroup dm_mirror dm_region_hash dm_log dm_mod [last unloaded: stap_9903d024d134f28f45f6801901192f4d__19622]
[ 5232.628012] CPU: 1 PID: 748 Comm: systemd-journal Tainted: GF          O-------------- 3.10.0-123.el7.s390x #1
[ 5232.628016] task: 00000000359ca440 ti: 0000000036914000 task.ti: 0000000036914000
[ 5232.628019] Krnl PSW : 0704c00180000000 000000000026fc8c (mem_cgroup_update_page_stat+0x3c/0xa0)
[ 5232.628030] R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 EA:3\x0aKrnl GPRS: 0000000000936dd2 0000000000878640 000000f440a2e328 0000000000000000
[ 5232.628039] 0000000000000000 0000000001e68000 0000000034e5e160 0000000000000000
[ 5232.628050] 000003fffcd2c000 0000000000000200 000003d10004e7c0 0000000035338270
[ 5232.628051] 0000000000000000 0000000000000001 000000000026fc7c 0000000036917c18
[ 5232.628059] Krnl Code: 000000000026fc7c: c010003044e2\x09larl\x09%r1,878640\x0a 000000000026fc82: e33010540012\x09lt\x09%r3,84(%r1)\x0a #000000000026fc88: a774002a\x09\x09brc\x097,26fcdc\x0a >000000000026fc8c: e33020080002\x09ltg\x09%r3,8(%r2)\x0a 000000000026fc92: a7840025\x09\x09brc\x098,26fcdc\x0a 000000000026fc96: e31020070090\x09llgc\x09%r1,7(%r2)\x0a 000000000026fc9c: a7110002\x09\x09tmll\x09%r1,2\x0a 000000000026fca0: a784001e\x09\x09brc\x098,26fcdc
[ 5232.628071] Call Trace:
[ 5232.628072] ([<0000000000000200>] 0x200)
[ 5232.628074] [<0000000000241ec0>] page_add_file_rmap+0xa0/0xd0
[ 5232.628078] [<000000000023274a>] __do_fault+0x182/0x5f8
[ 5232.628081] [<00000000002377ba>] handle_mm_fault+0x462/0xe98
[ 5232.628082] [<00000000005b3998>] do_dat_exception+0x1d8/0x358
[ 5232.628087] [<00000000005b1de6>] pgm_check_handler+0x17a/0x17e
[ 5232.628088] [<000000008001db0c>] 0x8001db0c
[ 5232.628090] Last Breaking-Event-Address:
[ 5232.628090] [<0000000000271f6a>] lookup_page_cgroup+0x42/0x48
[ 5232.628092]
[ 5232.628093] Kernel panic - not syncing: Fatal exception: panic_on_oops
====

Here's another:

====
[ 4501.794419] Unable to handle kernel pointer dereference at virtual kernel address 000000f440b1a000
[ 4501.794465] Oops: 003b [#1] SMP
[ 4501.794467] Modules linked in: stap_e4e6b8a268df981fe9186b8096f3569c__64377(OF) binfmt_misc sg qeth_l2 vmur nfsd auth_rpcgss nfs_acl lockd sunrpc xfs libcrc32c dasd_fba_mod dasd_eckd_mod dasd_mod lcs qeth ctcm fsm qdio ccwgroup dm_mirror dm_region_hash dm_log dm_mod [last unloaded: stap_9903d024d134f28f45f6801901192f4d__64365]
[ 4501.794493] CPU: 1 PID: 745 Comm: systemd-journal Tainted: GF          O-------------- 3.10.0-123.el7.s390x #1
[ 4501.794496] task: 000000000144f5d0 ti: 0000000034e14000 task.ti: 0000000034e14000
[ 4501.794499] Krnl PSW : 0404e00180000000 000000000026cea6 (mem_cgroup_page_lruvec+0x5e/0xc0)
[ 4501.794510] R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:2 PM:0 EA:3\x0aKrnl GPRS: 0000000000936dd3 0000000000000048 000000f440b1a0a8 000000000000f440
[ 4501.794518] 0000000000000000 0000000001e68000 0000000000000000 00000000002156a0
[ 4501.794522] 0000000000000000 000000000279a0e0 0700000035be0000 000003d100c763c0
[ 4501.794531] 000003d100c763c0 00000000008d2c00 000000000026cea0 0000000034e179b0
[ 4501.794539] Krnl Code: 000000000026ce94: f0a0000407f4\x09srp\x094(11,%r0),2036,0\x0a 000000000026ce9a: c0e500002847\x09brasl\x09%r14,271f28\x0a #000000000026cea0: e310c0070090\x09llgc\x09%r1,7(%r12)\x0a >000000000026cea6: e33020080004\x09lg\x09%r3,8(%r2)\x0a 000000000026ceac: a7110020\x09\x09tmll\x09%r1,32\x0a 000000000026ceb0: a7740014\x09\x09brc\x097,26ced8\x0a 000000000026ceb4: e31020070090\x09llgc\x09%r1,7(%r2)\x0a 000000000026ceba: a7110002\x09\x09tmll\x09%r1,2
[ 4501.794551] Call Trace:
[ 4501.794552] ([<0000000034e17a98>] 0x34e17a98)
[ 4501.794554] [<0000000000216af0>] pagevec_lru_move_fn+0xf8/0x1a8
[ 4501.794559] [<0000000000216cae>] __lru_cache_add+0x9e/0xb8
[ 4501.794560] [<000000000024b3c0>] read_swap_cache_async+0x108/0x1b8
[ 4501.794562] [<000000000024b4fe>] swapin_readahead+0x8e/0xd8
[ 4501.794563] [<0000000000222006>] shmem_getpage_gfp+0x5de/0x848
[ 4501.794565] [<00000000002222da>] shmem_fault+0x6a/0x118
[ 4501.794567] [<000000000023264a>] __do_fault+0x82/0x5f8
[ 4501.794570] [<00000000002377ba>] handle_mm_fault+0x462/0xe98
[ 4501.794572] [<00000000005b3998>] do_dat_exception+0x1d8/0x358
[ 4501.794576] [<00000000005b1de6>] pgm_check_handler+0x17a/0x17e
[ 4501.794578] [<000003fffd10c776>] 0x3fffd10c776
[ 4501.794579] Last Breaking-Event-Address:
[ 4501.794580] [<0000000000271f6a>] lookup_page_cgroup+0x42/0x48
[ 4501.794582]
[ 4501.794583] Kernel panic - not syncing: Fatal exception: panic_on_oops
====

When this crash happens, the last thing in systemtap.log is:

====
attempting command stap -w functioncallcount.stp "*@mm/*.c" -c "sleep 1"
====

Executing the same command by hand seems to cause the crash. This crash happens
every time the functioncallcount.stp example is run (it is not intermittent).
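
A small cross-check of the two oopses, offered only as a sketch: in both cases
the failing instruction (the one marked ">" in "Krnl Code") reads 8(%r2), and
the page containing that operand appears to be the reported faulting address.
The snippet assumes that "Krnl GPRS" lists %r0 through %r15 in order, four per
row, and that the page size is 4096 bytes; neither is stated in the report.
The helper names are made up for illustration.

```python
# Sketch only: checks that each reported faulting address is the page that
# contains the operand of the failing instruction. All values are copied from
# the two oopses; PAGE_SIZE = 4096 is an assumption, not taken from the report.
PAGE_SIZE = 4096


def operand_address(base_reg: int, displacement: int) -> int:
    """Effective address of a base+displacement operand such as 8(%r2)."""
    return base_reg + displacement


def page_base(address: int) -> int:
    """Round an address down to the start of its page."""
    return address & ~(PAGE_SIZE - 1)


# (failing instruction, %r2 from "Krnl GPRS", displacement, reported address)
OOPSES = [
    ("ltg %r3,8(%r2)", 0x000000F440A2E328, 8, 0x000000F440A2E000),
    ("lg %r3,8(%r2)", 0x000000F440B1A0A8, 8, 0x000000F440B1A000),
]

for insn, r2, disp, reported in OOPSES:
    operand = operand_address(r2, disp)
    print(
        f"{insn}: operand {operand:#x}, page {page_base(operand):#x}, "
        f"matches reported address: {page_base(operand) == reported}"
    )
```

If that reading is right, both crashes dereference whatever pointer %r2 holds
at that point. Both oopses also list lookup_page_cgroup as the last breaking
event, which would be consistent with %r2 holding its return value, though
that is an inference from the trace rather than something confirmed here.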