From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from esa5.fujitsucc.c3s2.iphmx.com (esa5.fujitsucc.c3s2.iphmx.com [68.232.159.76]) by sourceware.org (Postfix) with ESMTPS id 3B0BD3858D3C for ; Tue, 21 Sep 2021 01:27:19 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 3B0BD3858D3C X-IronPort-AV: E=McAfee;i="6200,9189,10113"; a="39585202" X-IronPort-AV: E=Sophos;i="5.85,309,1624287600"; d="scan'208";a="39585202" Received: from mail-ty1jpn01lp2050.outbound.protection.outlook.com (HELO JPN01-TY1-obe.outbound.protection.outlook.com) ([104.47.93.50]) by ob1.fujitsucc.c3s2.iphmx.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 21 Sep 2021 10:27:17 +0900 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=O0KdVioXnhblfZTcSsLux4j+DUm03lMsjMpGL6RQSzBQ7jr2UsqcxGhdJ+S0dOh6Hfp2d52ux3eyMWFzoUmY7EMsai/IiHPzst+DYwEGqKGowgwxIVU+jbeq8fvY+VVWoNc/1dmGTYR/NiA27f2rBVCG5Moog9G3+R006vbLFnGVzH6/+qmZlIG/se2tlZMpmxAxYZ1zOKLd714FvnNdRo39pyWj6RBxhKmiq/s4o+rDX3W47dIOBi7p3Kq/CdpFp1Z+6NIud0Iud7ZgA7JjA9wiiwLs5kkzlw99BUbO1yEX9D+DSVUpt4egfe8EHX5UWhLYYsEjCAsZ3yB/52TkwA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version; bh=wp/d65bdpP8Zq3j+1VvHLN2IZg5lHQgN1CASFs31pgc=; b=Tr2WroZLp845D4IdlK9CW+NNx0gUvXDq2HQrjHBqfUCc7F18rUBRCHhDvZ+KSceWt1G3RnbqnuKOoYPTTvl53hVQUXyiRldVLia0Q/17cVYew1s4Uqe6VXx3+by+Tp7R1esUPHxp2zChzHph/mdMCOFWeJ36lJWJsO6IV/xdtOVk+cTb7hqjqa9nOPPK9kMufzdyxPILdwgiZ0x/3kzOE+jrxB4mYMKvNbyI4IG7VFTHC+Zm32VEJ6vw85i7axFPh1Yarz8/f1YgbDRx411GPRnIX7ZtEBSm3Dxi6CrBc54ZMgBq4z4Li7r9I5qk0tuK2lcrre8DQnoJT8NvixmUQw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=fujitsu.com; dmarc=pass action=none header.from=fujitsu.com; dkim=pass header.d=fujitsu.com; arc=none Received: from TYAPR01MB6025.jpnprd01.prod.outlook.com (2603:1096:402:36::13) by TYAPR01MB6444.jpnprd01.prod.outlook.com (2603:1096:400:a1::5) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4523.18; Tue, 21 Sep 2021 01:27:14 +0000 Received: from TYAPR01MB6025.jpnprd01.prod.outlook.com ([fe80::253b:6b27:fc5c:204e]) by TYAPR01MB6025.jpnprd01.prod.outlook.com ([fe80::253b:6b27:fc5c:204e%5]) with mapi id 15.20.4523.018; Tue, 21 Sep 2021 01:27:14 +0000 From: "naohirot@fujitsu.com" To: Wilco Dijkstra , "'Lucas A. M. Magalhaes'" , Noah Goldstein , "libc-alpha@sourceware.org" Subject: RE: [PATCH v3 2/5] benchtests: Add memset zero fill benchtest Thread-Topic: [PATCH v3 2/5] benchtests: Add memset zero fill benchtest Thread-Index: AQHXic6jy6USnJyx5kO37MI1gLA1squd9IkAgANfZaCAAOkzAIAArxWggADifACAAMeSUIAJZF/A Date: Tue, 21 Sep 2021 01:27:13 +0000 Message-ID: References: <20210805074733.433430-1-naohirot@fujitsu.com> <20210805075053.433538-1-naohirot@fujitsu.com> <163130642274.404689.6991051609396665932@localhost.localdomain> <163154191414.705584.12050866556951422556@localhost.localdomain> In-Reply-To: Accept-Language: en-001, ja-JP, en-US Content-Language: ja-JP X-MS-Has-Attach: X-MS-TNEF-Correlator: x-securitypolicycheck: OK by SHieldMailChecker v2.6.3 x-shieldmailcheckermailid: 5ecf2b97a09b49ce9b10e01c131a617f msip_labels: MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_Enabled=True; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_SiteId=a19f121d-81e1-4858-a9d8-736e267fd4c7; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_SetDate=2021-09-15T08:24:55.298Z; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_Name=FUJITSU-RESTRICTED; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_ContentBits=0; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_Method=Standard; x-ms-publictraffictype: Email x-ms-office365-filtering-correlation-id: fd6aec04-8e3d-4c88-d3db-08d97c9ef115 x-ms-traffictypediagnostic: TYAPR01MB6444: x-ms-exchange-transport-forked: True x-microsoft-antispam-prvs: x-ms-oob-tlc-oobclassifiers: OLM:10000; x-ms-exchange-senderadcheck: 1 x-ms-exchange-antispam-relay: 0 x-microsoft-antispam: BCL:0; x-microsoft-antispam-message-info: qbLAdbW4KLxLNhkgBhKhi6KOOpD6WONehx9wtR82lxPPOWJZGbVVuGmyrrIYkwSmfonGyXpF7U9L7nFRMCJq2pNIaopjuDRdcP9H9A/Cn6nidQMJLEuxvcpycRLjaYm+e1m8zhLlc/96a9KHKtNcqjB87zsRpelFQxTgo6MXAaNxSh/9FaCDeVTDImyPvMkeIN3te052LzSlHWP7g4CocrlSMdu/8LBOxa3RJ70oTKPDRPxqisfJ3nKq+RmkuSxSoVoLYn1a5BAL0txPe1+8DmP24+/6EbghYYvfnjx8r0n3TsrcJdrRimfH9RzXT675Yb1vFgb16ZzdaV9fI/XE4AzVAqkkifMgMy2Szdn2MfLk3O2rCN/ThrBLs9lxwrG9Lvr6ka6TL/JQCa4hjPnpZiSnCtNPQTipY3o1y6NsXpjZ4NPk5H1FA/A7sKAQSOdhImqEvlR2RQfUp4eULJyWDCkuxXqogCxK18l/wBJmxOZa3zf6j33f9EKIOgXCUxdzshmGe9d/8EdWTqNHLhMF7gB90FVIDBOzllM77nzSan4S/6gEHi6Zd9J+7i/nmlVJdi7QZbKIRWYj8hbCDQ0E8vEJZJxuRls7/BVPMQgnByh2tGSR5UkZicQvEeLEGLhGV8drncW8AAejBCLoYDJqWEf4LD2RVY7XL1hKVsgQgYqpkCCguEafcf9+Mm+w26/hAo/q+OKnOCHtiWOCQ/hiOOooj/mxnBdqwtLkKlD7oUvLQXcwJKiXRZz4eUGWlKuJZ/zKlASaGKWSyK2ioaAdxo4nH7wDcFdYMMenU3TRRypuPmAMtoDDStEh3zttdrwCBGoIAKKnFp+LBVPYA/lOeA== x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:TYAPR01MB6025.jpnprd01.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(366004)(508600001)(83380400001)(8676002)(8936002)(26005)(66556008)(66446008)(5660300002)(66946007)(7696005)(6506007)(9686003)(33656002)(55016002)(53546011)(52536014)(64756008)(66476007)(71200400001)(316002)(38100700002)(110136005)(76116006)(86362001)(966005)(186003)(85182001)(38070700005)(122000001)(2906002); DIR:OUT; SFP:1101; x-ms-exchange-antispam-messagedata-chunkcount: 1 x-ms-exchange-antispam-messagedata-0: =?iso-2022-jp?B?cjNocEs0VzhTaFdhK2tuSGRHY2Q4OTZlZzMzUGgxMmxjTkZuYk43NkJX?= =?iso-2022-jp?B?OWczckUrUmNtQWRnQVZSMTMxSGlZRWlvMmluaUNKUmQ4TEV2bTRpOTEx?= =?iso-2022-jp?B?MUxEY3BXVjdVcFRuRVFJNm1TY2ZxN2FrYmdYS2dFQW0yTHFoR28xdDB0?= =?iso-2022-jp?B?QXlhbUNoZERneVdwR0xneUJBK3RBN3g2NnIySWx2SG5wSzhFMDAxM1d2?= =?iso-2022-jp?B?YS9ITThrS0o1aXpqYitFQlhMK25rZzl2S0hlc2c0REcxWERsM1NDSTlo?= =?iso-2022-jp?B?Y0hubXJkWnZ2MnZXR3BBaFEvQ05jVWtIbFIxcjlHa05Lamp4VjFQcmMv?= =?iso-2022-jp?B?SzEwdmVPUzRtUENnVlU3OXVYTHRrL0FCVHdWTEJtQ1F2MndoUDRsUWwv?= =?iso-2022-jp?B?M3pLOGRCdm9tbXAzdFBKc3ZQM2V3RVRIRktZMFFIeVJuYU9qUEsySEMy?= =?iso-2022-jp?B?Wjc1anVNQkRMeFAycHkzSkpkVC9pWURjWnpGdUh0clJyMHZCV2cxY1Bk?= =?iso-2022-jp?B?N29aalNHZ0dlQzdYYkk5dEVPRUJmeFAvSytqZGdBL09SN1l4NVRDaWd1?= =?iso-2022-jp?B?enpNRzBqcWpXMXlCcGZBd0Fxcnl2QTZvWkw3ZHRUYmtpMlUzTC91VVM4?= =?iso-2022-jp?B?ZW1uQ1hZUFhrL0dQNTBoNFNJVkt1Q3lsMzZ5bGFPUmd6N2VSTW9sSDBa?= =?iso-2022-jp?B?b29sMFcvK0FyMXQ3c0V4K1pZTkxNaHlEY0dLSlhEQU5SaTVHcmFWc2hu?= =?iso-2022-jp?B?MHQ5Qnl2QmcyYWVWZUc3aHVmbnQzNEptbG93QU5OR2QwQXRMSkRzQm5a?= =?iso-2022-jp?B?akVTai9ZSkRSTnZ0OU9XeFJyMzdNNVg0SVJTOG1wcWNYOFR3dDBlUkdH?= =?iso-2022-jp?B?d1M1OTdzamFGd3FDNDU2ZERGVlhoejh3bm8vZUFNUmlVaHhtQTl2cXNQ?= =?iso-2022-jp?B?ODdYUFRBbWNNL05aZTR4RnNLUjhoVUUzQ1MzUXdGdldhbHczdEtlbC82?= =?iso-2022-jp?B?N0h2eGlSLzJCd3FwUHF2UlhpVEJXa1NHMld1elI5RzRueS9HcnlIeWI0?= =?iso-2022-jp?B?THY2SVNaeUFGc0E5UmI0Q0JoSXA5anljWVhyS1VXUXF1ZFpBbnV0RVVs?= =?iso-2022-jp?B?V2Jia2RJb1ZxS3hLdFE0SU1yNmEzVTZuSmdYQ1JhSWNTQTlMRUU3SmpQ?= =?iso-2022-jp?B?NEIrS3h3Q044U1UwSGZRSUJYRTJyV0Z6NHRVd1NBek5hK0pUMGdyREpa?= =?iso-2022-jp?B?OHJuSU5GZjk0d1luQmNrNEZCRG80OTkxRVNNR2Fvak9rTW5DckRWK0t5?= =?iso-2022-jp?B?RTVhc01EcWNQUGJFQU4wWTlYVGFCdUZYL1M3NG5iV0dQYW5uUHRiVXJR?= =?iso-2022-jp?B?SUJSc1VRek1vOW9wMmVEYzdXZHZPWUtWckhpczQvTjZpdVJ0ckN2endh?= =?iso-2022-jp?B?MDJpbFU4eDlqSE5pYWxoNEtENnlDandHdHBkbE5nOFFVYmFVTERsR3Zz?= =?iso-2022-jp?B?T25yMWh2L2ovN2JXWlZPMUNGVzlvTHo0MUlWaGRKdjQ5UUpiVWxmUm10?= =?iso-2022-jp?B?QThNdVBTa1hIcWpQREZDTHlPT085eXpZMjRyeHVkc3hWYUg0Z2kzTmc3?= =?iso-2022-jp?B?ZUtXeVhGbjdWb0ZPTDVXWSt1VEVpMlpZb2RkT0lpYmRObi9lbHNUQ1lx?= =?iso-2022-jp?B?aHRXTjRPQkNZa0lXZTBJOUwrVG5KQmlnd3QvbGVSOTJsSlRzbGhMTDI2?= =?iso-2022-jp?B?VEhEZ2pIckI0SDVVa0tUditGOEF6MWVndm91azhwNTZtdU9ybGFsWWc1?= =?iso-2022-jp?B?L1ZhRVBKNVhoTzBaSURIV3MwZDJkTDJpL0tiMW1jaXVzNjBlOUNEclFh?= =?iso-2022-jp?B?SmtnK2laK2ZpMmRjM0kvaFJUY3Z2VGZqMzI2WUhOenJFUkQ3RWdEWmVD?= Content-Type: text/plain; charset="iso-2022-jp" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginatorOrg: fujitsu.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: TYAPR01MB6025.jpnprd01.prod.outlook.com X-MS-Exchange-CrossTenant-Network-Message-Id: fd6aec04-8e3d-4c88-d3db-08d97c9ef115 X-MS-Exchange-CrossTenant-originalarrivaltime: 21 Sep 2021 01:27:13.9907 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: a19f121d-81e1-4858-a9d8-736e267fd4c7 X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: EXHkY3Q+Q42g4mJmRolSn8/PJ8SGiNiNcewxD/kjOP0zg05qKs8KWk87Vlswflmn0FWwmzgTAMODXVo398zUlA== X-MS-Exchange-Transport-CrossTenantHeadersStamped: TYAPR01MB6444 X-Spam-Status: No, score=-3.2 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, KAM_DMARC_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_NONE, TXREP autolearn=no autolearn_force=no version=3.4.4 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 21 Sep 2021 01:27:22 -0000 Hi Wilco, Let me ping you regarding the start size. > -----Original Message----- > From: Tamura, Naohiro/=1B$BEDB<=1B(B =1B$BD>9-=1B(B > Sent: Wednesday, September 15, 2021 5:25 PM > To: Wilco Dijkstra ; 'Lucas A. M. Magalhaes' ; Noah Goldstein > ; libc-alpha@sourceware.org > Subject: RE: [PATCH v3 2/5] benchtests: Add memset zero fill benchtest >=20 > Hi Wilco, >=20 > Thank you for the comment. > I understood your concerns about the start size and the end size. >=20 > > From: Wilco Dijkstra > > Sent: Tuesday, September 14, 2021 11:03 PM > > > > I had a quick go at running the new benchmark. The main problem is that= it doesn't > > give repeatable results - there are huge variations from run to run of = about 50% for > > the smaller sizes. This is a fundamental problem due to the timing loop= , and the only > > way to reduce it is to increase the time taken by memset, ie. start at = a much larger > > size (say at 16KB). >=20 > In terms of the start size, 256B is chosen because __memset_generic > (sysdeps/aarch64/memset.S) calls DC ZVA for zero fill from 256B, which > code you committed [1]. > And I reported an interesting insight in the mail [2] that DC ZVA is > slower than store instruction from 256B to 16KB on A64FX [3]. > So it seems valuable to measure the range from 256B to 16KB to see > each CPU's behavior. > What do you think? >=20 > [1] https://sourceware.org/git/?p=3Dglibc.git&h=3Da8c5a2a9521e105da6e96ea= f4029b8e4d595e4f5 > [2] https://sourceware.org/pipermail/libc-alpha/2021-August/129805.html > [3] https://drive.google.com/file/d/1fonjDDlF4LPLfZY9-z22DGn-yaSpGN4g/vie= w >=20 > > It also takes a long time to run - generally it's best to ensure a benc= hmark takes less > > than 10 seconds on a typical modern system (remember there will be many= that are > > slower!). It should be feasible to reduce the iteration count for large= sizes, but you > > could go up to 16MB rather than 64MB. >=20 > OK, I'll change the end size to 16MB. >=20 > Thanks. > Naohiro