From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from EUR01-HE1-obe.outbound.protection.outlook.com (mail-eopbgr130042.outbound.protection.outlook.com [40.107.13.42]) by sourceware.org (Postfix) with ESMTPS id ADC413858C27 for ; Tue, 20 Apr 2021 11:39:55 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org ADC413858C27 Received: from DB6PR07CA0171.eurprd07.prod.outlook.com (2603:10a6:6:43::25) by AM6PR08MB5217.eurprd08.prod.outlook.com (2603:10a6:20b:cc::12) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4042.16; Tue, 20 Apr 2021 11:39:53 +0000 Received: from DB5EUR03FT042.eop-EUR03.prod.protection.outlook.com (2603:10a6:6:43:cafe::44) by DB6PR07CA0171.outlook.office365.com (2603:10a6:6:43::25) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4065.8 via Frontend Transport; Tue, 20 Apr 2021 11:39:53 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 63.35.35.123) smtp.mailfrom=arm.com; sourceware.org; dkim=pass (signature was verified) header.d=armh.onmicrosoft.com;sourceware.org; dmarc=pass action=none header.from=arm.com; Received-SPF: Pass (protection.outlook.com: domain of arm.com designates 63.35.35.123 as permitted sender) receiver=protection.outlook.com; client-ip=63.35.35.123; helo=64aa7808-outbound-1.mta.getcheckrecipient.com; Received: from 64aa7808-outbound-1.mta.getcheckrecipient.com (63.35.35.123) by DB5EUR03FT042.mail.protection.outlook.com (10.152.21.123) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4042.16 via Frontend Transport; Tue, 20 Apr 2021 11:39:53 +0000 Received: ("Tessian outbound b610e7b4d771:v90"); Tue, 20 Apr 2021 11:39:53 +0000 X-CheckRecipientChecked: true X-CR-MTA-CID: caa63aa8f5dca2aa X-CR-MTA-TID: 64aa7808 Received: from d318b4c53ebf.2 by 64aa7808-outbound-1.mta.getcheckrecipient.com id C18118DE-2244-4425-B32F-460BE66A4D31.1; Tue, 20 Apr 2021 11:39:44 +0000 Received: from EUR01-HE1-obe.outbound.protection.outlook.com by 64aa7808-outbound-1.mta.getcheckrecipient.com with ESMTPS id d318b4c53ebf.2 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384); Tue, 20 Apr 2021 11:39:44 +0000 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=k2s06/5ZiApjTA9eLvKxd8hbnd9dHfbXRf28o7F13/eN8CLNzQsDGcDiT4Rcnai68iNSYtjSmgPcWkSNGqgMdXVY5LVlJt91ZxT0MJxZ2DDqwbknGuUgLx2VsjBfw826LKyLXCJrnaDdFfA4/FGC8xvoyywdgDrFI4K0jVPyfCzRlxjVfbhBznNG1FfgpqTNiZkVHCwem7ipasX9BPBpp1kP3zHoRmYlY9Kbm9wvZP98HlfRPQJfyD0WFb/WRPqZV/QLEITTyX3JQF7UbBsT5GZW6iEdeZ3LX8Fq1KqXl4R/hl8jr1u9otxnQxiidC+r1G8kARdFF4PppNRHTS3obA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=aVqQNf2OknJYg1XcIs5TJMUJ4dqV9kyMwZqmI6xvJgg=; b=KXeqeTxEcvPL/B+uBxFxGeqXZxwPd07VQ+JXNIUKIm37RSk99OXy+11GlUK+A2pmFeuic9Jx8EdUHX6NfR0bP6B3EYCl+zyeOLdgAL8H3E0+PSCs4K9OCf8OarlC2yhMxWwemEub0eE2JHiFPrrACOSBLz0epatJBqIvBitaqJx/sn32gM5ymZvnPvh1W7cscZFMVKD96m7b3WJX8hJmo6flDp9WGs5IrAFy0sShHqUaswDDRScf4UZiu8szbS1qVwpdfLS5zT6ZDLce3ayyb+gBQUL80SuJgvqkDnEVRNFMsZ/cPRSlNQ/KmmRUe9RKg0Up/bMZv+7B9Jd4vTHRQw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=arm.com; dmarc=pass action=none header.from=arm.com; dkim=pass header.d=arm.com; arc=none Received: from VE1PR08MB5599.eurprd08.prod.outlook.com (2603:10a6:800:1a1::12) by VI1PR0801MB1760.eurprd08.prod.outlook.com (2603:10a6:800:51::15) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4042.19; Tue, 20 Apr 2021 11:39:43 +0000 Received: from VE1PR08MB5599.eurprd08.prod.outlook.com ([fe80::385c:f8ff:ee16:3a4d]) by VE1PR08MB5599.eurprd08.prod.outlook.com ([fe80::385c:f8ff:ee16:3a4d%6]) with mapi id 15.20.4042.024; Tue, 20 Apr 2021 11:39:41 +0000 From: Wilco Dijkstra To: "naohirot@fujitsu.com" CC: 'GNU C Library' , Szabolcs Nagy Subject: Re: [PATCH 0/5] Added optimized memcpy/memmove/memset for A64FX Thread-Topic: [PATCH 0/5] Added optimized memcpy/memmove/memset for A64FX Thread-Index: AQHXL5Jyw0P1gKwhEk6/DkVDv1IPJaqyCeTQgAIMP+uACM8uEIAAbRm9 Date: Tue, 20 Apr 2021 11:39:41 +0000 Message-ID: References: , , In-Reply-To: Accept-Language: en-GB, en-US Content-Language: en-GB X-MS-Has-Attach: X-MS-TNEF-Correlator: Authentication-Results-Original: fujitsu.com; dkim=none (message not signed) header.d=none;fujitsu.com; dmarc=none action=none header.from=arm.com; x-originating-ip: [82.24.249.100] x-ms-publictraffictype: Email X-MS-Office365-Filtering-Correlation-Id: 980d651d-bbf4-4604-0e96-08d903f103c9 x-ms-traffictypediagnostic: VI1PR0801MB1760:|AM6PR08MB5217: x-ms-exchange-transport-forked: True X-Microsoft-Antispam-PRVS: x-checkrecipientrouted: true nodisclaimer: true x-ms-oob-tlc-oobclassifiers: OLM:576;OLM:576; X-MS-Exchange-SenderADCheck: 1 X-Microsoft-Antispam-Untrusted: BCL:0; X-Microsoft-Antispam-Message-Info-Original: mJs2xmxCsIxyI8GuAaZYk40iGtJvY1RUtRAjPxu99ZWHruJOfwROJYZp69Gc/eJSPfw+n9OTBMhUIpvzOHWLfpVYP2rO/dvbo+cFN+4spr4ogACHWaXQGCDl03MS585ztyfETY7ulbUJQhb8aITu3CfrZzyQ9v8ZL+tq4QSmaRMWcdzT4N4tFvij7KIo9A4QIdNVpL4AU8NYRfO5RRhVRB5Skx7/JBKevCoci3HlDERYp1UiTmlI5DhCX48cA1stqDCR6SkrkbIYVNKUdLdLSavWlttJ+fHecBOQJfaFhc08gTqWn58BqfRg06CWD8VEHxHleSlgzKHzuDUHKU5q8xwvZvOh7JCLrXliVJP5ZC6t6A6KshDUH1R2gM69zQ0hgoG8QidSFeLxL71xc5UjSsQXyOJLRcmpQHASmn/5qLX5HlYzGSnF2tOmE3Em6MBRa8MHv2/j0+peTEoOlSPrUu90ECgvPD84AJk3LtNa9eD6U8+Ueub172mWijXfb6w/Ik5IpBYxsooOZjse0LkPEjQDCSmgwNrQ9KvGQsTkhjazXxa548dGxHFVKbr+BC/VlcWWCkCsKlM5YLOERYzaBxAOSsBVZqY+YLbayP9SjtXmC39pwxbT/XcvfbquKPnDb/NVDZLvBk8oubwwMBTA3KtUERlJACVx4jpPm6KbXQzpGmcivjYl/5j0yTPBrGc5 X-Forefront-Antispam-Report-Untrusted: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR08MB5599.eurprd08.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(366004)(4744005)(5660300002)(966005)(33656002)(4326008)(6506007)(186003)(26005)(8676002)(66556008)(2906002)(7696005)(52536014)(498600001)(64756008)(66946007)(86362001)(55016002)(9686003)(66446008)(66476007)(76116006)(54906003)(38100700002)(91956017)(6916009)(8936002)(71200400001)(122000001); DIR:OUT; SFP:1101; x-ms-exchange-antispam-messagedata: =?iso-8859-1?Q?CQNwKjqkOHet7J4JFYHQKKLXFlLeMa/tVtUH+pShcnsRXaHkUR92XICgpq?= =?iso-8859-1?Q?qFECd/Wfj7TqLdxG6+jiH9mPpAMMayH5ntTCdkq73mZHpSXduXKkFHti0q?= =?iso-8859-1?Q?6Klg6hjEfnZtmrSgda6XWNHoN7tfbWVk7UxsbGO/Zah/9DzKolGgwUEBw6?= =?iso-8859-1?Q?IXl/x+Uq/ymuhemNoO7V8BvsslD+kZKho2ACJX3Nbik0+QrG/eqwA/Bv05?= =?iso-8859-1?Q?czr3/OImRZrTxr22SBM01h7PfFZGSkvRCObn9N5gYPk5jgji1X44NE2fNL?= =?iso-8859-1?Q?s9Vyy9ssqsu9dFxHpXcDQK4qKBThmXeQ9wijE+mtWCnh7CW2zua8cYPi/n?= =?iso-8859-1?Q?5Y4WDd+HcGpeQgSLJb0dMH4DzwDxS2wEqqSsWwQEpuHRJ3yFO6hQo9MhnF?= =?iso-8859-1?Q?GdJuOLtpIDLzEoLYYNH5rRkr+IXkMfCTwrvPfnCZB/PYH46MfSkRxeRemA?= =?iso-8859-1?Q?X9Rq4l2bPAoMCf7x71j1hjtEzwD0ZBO0acq6vu+nuQ/UXQsXhRH2g50tPw?= =?iso-8859-1?Q?IOfKUcOszD43Jfk9sBP91o3LTkkWiLJPYCtwOtkGon39JDW1Mj1vDwpbzS?= =?iso-8859-1?Q?sEc55tlj8a7z9mHw4JlfM5ddoUysPwsILVqpTuXhEaelBkFeCf0jm+5wXl?= =?iso-8859-1?Q?+euJAXy5I/x4uJQNJEXhYkTMFq1I8mIDeOyHJc/NtJGMlDIY02WQ3Hl2Zu?= =?iso-8859-1?Q?dyErzq2d3/Nq0ClkB3itEczJvSn07TsZnH+6DhiLGFSL5KZpgTXupfFoH9?= =?iso-8859-1?Q?1Ero7wOfa2OFHAx8ZTlpXJp1SCswL6WtU+JKKvTKGPZaM9C8n61akJlOU8?= =?iso-8859-1?Q?ofWFNvu09u3PXAzeSjhmUNQ9GrVYzTiH7bUQL6e2uGM9Me4ep1nk+LcuFu?= =?iso-8859-1?Q?rUtYGLXP9EnqmUe9ra5XyMQkVL3iGECGTrdhf+s9GgugJXH4hxgWUA0kRr?= =?iso-8859-1?Q?ZfNDiYnbajW8nvqd3GA3jbcKNEPhLPLr5z4CoQ7shbWilVqdlI1oSBPHHv?= =?iso-8859-1?Q?c48Cl8Vz73nz3SY2Av6PH00HZIxKL5J++s2ArHGtZsaUg0OpI3bIWBq8q+?= =?iso-8859-1?Q?GGaXJSrD9u3T61Pj9feLCZNGrYyc02lwywIXIAsASzgJRSbNJmDN+TY+ys?= =?iso-8859-1?Q?/fI40IssJgYNTVbtBl5qfprGQ0gKi/sRolKc22FdB1fTzRDkgqp2x9Of8N?= =?iso-8859-1?Q?kSg8SO+0NMXCbvUrDqg4xRR0UgqrJG/7aNpBYDawhIZkGITKa47IcQC4n+?= =?iso-8859-1?Q?C+Qv/XTsWzSbt+PM9gRSwfH6Ihhm6FgoVRFJGbFG6wsvdm1/kmjk+riHmX?= =?iso-8859-1?Q?0j3L+tbiIdMPmEDdEyTWtlM0d6mXRQqoMC2XTmzaiQcjc/c=3D?= Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-MS-Exchange-Transport-CrossTenantHeadersStamped: VI1PR0801MB1760 Original-Authentication-Results: fujitsu.com; dkim=none (message not signed) header.d=none;fujitsu.com; dmarc=none action=none header.from=arm.com; X-EOPAttributedMessage: 0 X-MS-Exchange-Transport-CrossTenantHeadersStripped: DB5EUR03FT042.eop-EUR03.prod.protection.outlook.com X-MS-Office365-Filtering-Correlation-Id-Prvs: ccc4dffd-0a5c-48e0-7e63-08d903f0fcdb X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: zC0UaNGyrhrrfXQJtRW1qz0t/9gifwp/fMu5LGFisxVHEebRrUxaffcsDGihvxXIj4CUDYDpaNZzodP5zJTVJZCLo8mPz/tgS6HiyLF1X4X2qUcGPps3D1FynBlZVlKFvoRromzfkT6mgJLxujjB5DyKh2btipQGxAmVm9sgCv7V+dnR5nKjV6Fnuz9TlIqGfXvS18oW2taz6v37dp0p2RyjqRgb4SDKbNHHt+Aa3fEaew/PH9m+RGFDF57jGI+ksi5culvapV3dm0+dHoK6eW3L5F5fkSfu3eJzay1T8vqZC8ga348Ohacao4hA+YOIzVllm8jGDLmH8YOYDBGs5XgUSl1lNHfVj2Ib6ra7at4AHckAIjZekt6YDEjngMPm/9ITWfTb1rXi12m3wpzg93Dk9Vl8TUaVu+EsV1eNIbFqV5CZRX95uubqRzozwe9jN/6hqdoQTF9kU4F2lAYp851VAZXEC9yKZKPXVfgb3y5DirLWfp8s0oSAwsQe/Bo2Cgz4K1CJXyd03L216RelvQp/PkvJEN2VdHVXp04xKC7DFG3tUWZnVIbn3dPpHCvNqUrrs6ysHZbDIdVSHaHFvrz3neTNmOyNFiVpsSVYHfY2E6EG41wBgwBPKTi8qnQ2ML4MRHSN5npIEJrLF8Q/zys8rGDpOMrrT94/HnniLeTmTk7WmnsHsM6mSdYXJnP8GPkQb+yLHATfdu+kMGJbbzi6ASCFaiy4brBoMn3P9V4= X-Forefront-Antispam-Report: CIP:63.35.35.123; CTRY:IE; LANG:en; SCL:1; SRV:; IPV:CAL; SFV:NSPM; H:64aa7808-outbound-1.mta.getcheckrecipient.com; PTR:ec2-63-35-35-123.eu-west-1.compute.amazonaws.com; CAT:NONE; SFS:(4636009)(36840700001)(46966006)(9686003)(498600001)(86362001)(6506007)(8676002)(52536014)(36860700001)(55016002)(70206006)(186003)(6862004)(966005)(2906002)(70586007)(81166007)(356005)(4744005)(4326008)(26005)(82310400003)(336012)(33656002)(8936002)(54906003)(7696005)(47076005)(5660300002); DIR:OUT; SFP:1101; X-OriginatorOrg: arm.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 20 Apr 2021 11:39:53.5070 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 980d651d-bbf4-4604-0e96-08d903f103c9 X-MS-Exchange-CrossTenant-Id: f34e5979-57d9-4aaa-ad4d-b122a662184d X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=f34e5979-57d9-4aaa-ad4d-b122a662184d; Ip=[63.35.35.123]; Helo=[64aa7808-outbound-1.mta.getcheckrecipient.com] X-MS-Exchange-CrossTenant-AuthSource: DB5EUR03FT042.eop-EUR03.prod.protection.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM6PR08MB5217 X-Spam-Status: No, score=-6.5 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS, TXREP, UNPARSEABLE_RELAY autolearn=ham autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 20 Apr 2021 11:40:06 -0000 Hi Haohiro,=0A= =0A= > I removed redundant instructions using cbz and prfm offset address [1][2]= .=0A= >=0A= > [1] https://github.com/NaohiroTamura/glibc/commit/94363b4ab2e5b4b29843a47= a6970b9645a8e4eeb=0A= > [2] https://github.com/NaohiroTamura/glibc/commit/4648eb559e46d978ded65d4= 0c6bf8c38dd2519d7=0A= =0A= For the first 2 CBZ cases in both [1] and [2] the fastest option is to use = ANDS+BEQ. ANDS only=0A= requires 1 ALU operation while AND+CBZ uses 2 ALU operations on A64FX.=0A= =0A= Wilco=