From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 19255 invoked by alias); 14 Nov 2006 02:17:30 -0000 Received: (qmail 19242 invoked by uid 22791); 14 Nov 2006 02:17:29 -0000 X-Spam-Status: No, hits=-1.1 required=5.0 tests=AWL,BAYES_40,SUBJ_HAS_UNIQ_ID X-Spam-Check-By: sourceware.org Received: from mga03.intel.com (HELO mga03.intel.com) (143.182.124.21) by sourceware.org (qpsmtpd/0.31) with ESMTP; Tue, 14 Nov 2006 02:17:24 +0000 Received: from azsmga001.ch.intel.com ([10.2.17.19]) by mga03.intel.com with ESMTP; 13 Nov 2006 18:17:23 -0800 Received: from scsmsx332.sc.intel.com (HELO scsmsx332.amr.corp.intel.com) ([10.3.90.6]) by azsmga001.ch.intel.com with ESMTP; 13 Nov 2006 18:17:22 -0800 X-ExtLoop1: 1 X-IronPort-AV: i="4.09,419,1157353200"; d="scan'208"; a="145723943:sNHT19926830" Received: from scsmsx413.amr.corp.intel.com ([10.3.90.32]) by scsmsx332.amr.corp.intel.com with Microsoft SMTPSVC(6.0.3790.211); Mon, 13 Nov 2006 18:17:22 -0800 X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Subject: RE: Summary of nightly tests 20061109 Date: Tue, 14 Nov 2006 12:49:00 -0000 Message-ID: X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: Summary of nightly tests 20061109 Thread-Index: AccEM/cNzhyNwWH9RX6xvmHiPWCJowDWsfVA From: "Stone, Joshua I" To: "William Cohen" , "Martin Hunt" Cc: "SystemTAP" X-OriginalArrivalTime: 14 Nov 2006 02:17:22.0544 (UTC) FILETIME=[04CC6700:01C70793] X-IsSubscribed: yes Mailing-List: contact systemtap-help@sourceware.org; run by ezmlm Precedence: bulk List-Subscribe: List-Post: List-Help: , Sender: systemtap-owner@sourceware.org X-SW-Source: 2006-q4/txt/msg00418.txt.bz2 On Thursday, November 09, 2006 11:19 AM, William Cohen wrote: > RHLE4 U4 ia64 > Kernel: Linux 2.6.9-42.0.3.EL #1 SMP Mon Sep 25 17:14:34 EDT 2006 > ia64 ia64 ia64 GNU/Linux > [...] > The earliest record of staprun causing unaligned accesses on the ia64 > machine is Nov 5. See things like the following in the systemtap.log > output of dejagnu. > [...] > staprun(9115): unaligned access to 0x2000000000338014, ip=3D0x4000000000007c01 > staprun(9115): unaligned access to 0x200000000033801c, ip=3D0x4000000000007c10 > staprun(9115): unaligned access to 0x2000000000338024, ip=3D0x4000000000007c01 > staprun(9115): unaligned access to 0x200000000033802c, ip=3D0x4000000000007c10 In do_kernel_symbols (runtime/stpd/symbols.c), there are these lines: 144: sym_base =3D malloc(MAX_SYMBOLS*sizeof(struct _stp_symbol)+sizeof(int)); [...] 154: *(int *)sym_base =3D STP_SYMBOLS; 155: syms =3D (struct _stp_symbol *)(sym_base + sizeof(int)); [...] 178: syms[i].addr =3D addr; 179: syms[i].symbol =3D (char *)(dataptr - data); The ips 7c01 and 7c10 correspond to lines 178 and 179. Line 144 will return an 8-byte aligned pointer, and then 155 adds 4 to it. From then on, all access to 8-byte fields through 'syms' will be misaligned. There's a logging rate limiter that's saving us from seeing more than four of the same message, thankfully. One potential solution is to make the transport command a long instead of int. Such a fix would require changes to the whole transport layer, though, so I'll leave it to Martin to decide what to do... Josh