From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 28100 invoked by alias); 3 Apr 2006 19:21:12 -0000 Received: (qmail 28090 invoked by uid 22791); 3 Apr 2006 19:21:12 -0000 X-Spam-Status: No, hits=-2.6 required=5.0 tests=AWL,BAYES_00 X-Spam-Check-By: sourceware.org Received: from mga02.intel.com (HELO orsmga101-1.jf.intel.com) (134.134.136.20) by sourceware.org (qpsmtpd/0.31) with ESMTP; Mon, 03 Apr 2006 19:21:10 +0000 Received: from orsmga001.jf.intel.com ([10.7.209.18]) by orsmga101-1.jf.intel.com with ESMTP; 03 Apr 2006 12:21:09 -0700 Received: from scsmsx332.sc.intel.com (HELO scsmsx332.amr.corp.intel.com) ([10.3.90.6]) by orsmga001.jf.intel.com with ESMTP; 03 Apr 2006 12:21:09 -0700 X-IronPort-AV: i="4.03,159,1141632000"; d="scan'208"; a="18737247:sNHT38262357" Received: from scsmsx403.amr.corp.intel.com ([10.3.90.18]) by scsmsx332.amr.corp.intel.com with Microsoft SMTPSVC(6.0.3790.211); Mon, 3 Apr 2006 12:21:08 -0700 X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Subject: RE: Stress testing - all functions Date: Mon, 03 Apr 2006 19:21:00 -0000 Message-ID: X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: Stress testing - all functions Thread-Index: AcZVNkFLUH9d7lB8S4W82nXRtTx1HgCFKnKA From: "Stone, Joshua I" To: "Frank Ch. Eigler" Cc: X-OriginalArrivalTime: 03 Apr 2006 19:21:08.0873 (UTC) FILETIME=[C2CC9F90:01C65753] X-IsSubscribed: yes Mailing-List: contact systemtap-help@sourceware.org; run by ezmlm Precedence: bulk List-Subscribe: List-Post: List-Help: , Sender: systemtap-owner@sourceware.org X-SW-Source: 2006-q2/txt/msg00020.txt.bz2 Frank Ch. Eigler wrote: >> d*: system reboots - there was no OOPS or BUG or anything printed to >> the console. >=20 > Given time, one can try d[a-m]* and d[n-z]* to recursively subdivide > the namespace. It would be great if this were done automatically. (A > full search would of course take lots of time due to reboots.) Don't forget [A-Z0-9_] in legal identifiers as well. I had forgotten the capitals at first, so I will update the test accordingly. What you suggest is somewhat painful to implement since stap uses glob matching, not regex. However, given that testing success is cheap and failure is expensive (reboot), a linear sweep of da*, db*, etc. suffices. In this way you isolate one more letter for each reboot you endure. I narrowed this crash down to 'do_*', and then I was fortunate to get a stack trace on 'do_d*'. The trace shows a cycle of do_debug->do_page_fault->error_code->do_debug->(loop). I've been unable to reproduce this with probing either do_debug or 'do_d*'. Probing 'do_*' always reboots immediately though, completely reproducible. >> The two that failed in a* and c* are both functions that are >> decorated with __exit. Perhaps the translator needs to disallow >> __exit functions, just as it disallows __init? >=20 > That's right, for routines that are linked into vmlinux (not in a > module). In all, fourteen letters of the alphabet fail on inserting probes on __exit functions, so this is a blocking issue for doing more complete tests. I will file a bugzilla on this, and if it's not difficult I'll go ahead and try to fix it. Josh (BTW, these results are all from i386 RHEL4 U3, w/ Anil's backported patches)