From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from esa9.fujitsucc.c3s2.iphmx.com (esa9.fujitsucc.c3s2.iphmx.com [68.232.159.90]) by sourceware.org (Postfix) with ESMTPS id DCCC93858004 for ; Tue, 20 Apr 2021 05:49:06 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org DCCC93858004 Authentication-Results: sourceware.org; dmarc=none (p=none dis=none) header.from=fujitsu.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=naohirot@fujitsu.com IronPort-SDR: 6jt6tH2mrBAw+8zdj+5mqrYal0uz2vYy8cRG8ps5Y8dLm0DpuPfBy96v+qPmJr1Y9MywVsi72h spAm7aGKj6YxqSP58q+kJa5v8Wg2KJkjCKly8ttJoSklMMtsI9kJsQKlPQkmpoVQCq+ssoxvBG ZIa5T3NwxRKiRKsgKec42CoyTEnD+VAXJcukMHN4vdoLjWkrxm8615fHqXdwvtavVnbKPvTvfT NL2vHP4eTkIBGyIrfuiKMTkGpEV1MpBdt8MIsgCPBKSAGng1BYr2Mkwti+tfSFNA9pgLb5Xen1 ExA= X-IronPort-AV: E=McAfee;i="6200,9189,9959"; a="30087622" X-IronPort-AV: E=Sophos;i="5.82,236,1613401200"; d="scan'208";a="30087622" Received: from mail-ty1jpn01lp2055.outbound.protection.outlook.com (HELO JPN01-TY1-obe.outbound.protection.outlook.com) ([104.47.93.55]) by ob1.fujitsucc.c3s2.iphmx.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 20 Apr 2021 14:49:05 +0900 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=W5zx6SSEbM+MdJoYODgWYvRfYZC/1O0pDgciFsU235b3R0qA7bTNi9G/KS28/8Ndgyq/5/62DEh1QxMomH6f9EWBl4vJ8pWsGF8uY7554KYndyVdcxBs7Dl5RZ78VA6b1XXQM7GXm4DhAh9CIrc2QU+EdR9xWpb6RxjgdRoS2iq1Kmjar3bnMZYwucEyud2NsLZ3bQE845wOkjIJR0DqKgs7yf+saz1dIUOJxVskE1ulKHmLEuHa0QsvodHhh9oPPQra0hLxB3z69v0ZkGUpPUCgPgt6Yf2Pv0xzuNDwGzS8f6Xtr6XkK3gCRSIKLJMe+k1JFWxqKPQsI1d06Q8gEw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=ADKKv5Q5ZX0w3rHxwe4JylhoHQS1H8H+hyRBsQf2P1Y=; 
b=QWzBWwi444wc+SmdBFAhKN/QzjGQxH7Bg+QtPhCGsq5+P1sqiTycGR6ivXDLfkDrYoSOuipaMXy6/PErCvbtKeqafNvA177WXs5H+qCzKzfqXKCfC+VaoWqnbt1/jvCbQ25pbgCm+hn5jZ7r3LXcOfTQUlR6xQdCwjzgVaEbuSx0y8N74E0uyKO1hC9lghb6yaBhEAnZ+HD4HxDMnco7Xrs0c3C6hCFTabIgnVH6cOifIMIWKVmSjFZ/79O/lvzRxeD1j0iZMOPUhQ6vzvHxMi1kg9fqgMX+Vt7b/qRAhqbThrSKkldQMKUvoRRoVKWtS57lbXlA2qVQQ6IwZXpzvg== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=fujitsu.com; dmarc=pass action=none header.from=fujitsu.com; dkim=pass header.d=fujitsu.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fujitsu.onmicrosoft.com; s=selector2-fujitsu-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=ADKKv5Q5ZX0w3rHxwe4JylhoHQS1H8H+hyRBsQf2P1Y=; b=Duwb95s2LVLnDwyhKC6+IVw8m5kDN/nwkwczW9FRprYnDblOYzjb7xuGZ+JTNsRYwhHBh+UQkTLKB8XhtRM82zAJCmjgfqjKzFWHeKX3JAzlb/bf+KYKqKle+fU1UIYvDQLjyX4d/Fjn8xZeZk1MHKq0PLQDOHI9cc/RvSX+bK0= Received: from TYAPR01MB6025.jpnprd01.prod.outlook.com (2603:1096:402:36::13) by TYAPR01MB4399.jpnprd01.prod.outlook.com (2603:1096:404:12f::22) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4042.16; Tue, 20 Apr 2021 05:49:02 +0000 Received: from TYAPR01MB6025.jpnprd01.prod.outlook.com ([fe80::c8de:7917:af16:588b]) by TYAPR01MB6025.jpnprd01.prod.outlook.com ([fe80::c8de:7917:af16:588b%6]) with mapi id 15.20.4042.024; Tue, 20 Apr 2021 05:49:02 +0000 From: "naohirot@fujitsu.com" To: 'Wilco Dijkstra' CC: 'GNU C Library' , Szabolcs Nagy Subject: RE: [PATCH 0/5] Added optimized memcpy/memmove/memset for A64FX Thread-Topic: [PATCH 0/5] Added optimized memcpy/memmove/memset for A64FX Thread-Index: AQHXL5Jyw0P1gKwhEk6/DkVDv1IPJaqyCeTQgAIMP+uACM8uEA== Date: Tue, 20 Apr 2021 05:49:02 +0000 Message-ID: References: , In-Reply-To: Accept-Language: en-001, ja-JP, en-US Content-Language: ja-JP X-MS-Has-Attach: X-MS-TNEF-Correlator: x-securitypolicycheck: OK by SHieldMailChecker v2.6.3 
x-shieldmailcheckermailid: 01cd1f44004f417d9ca4a81cee5e373a authentication-results: arm.com; dkim=none (message not signed) header.d=none;arm.com; dmarc=none action=none header.from=fujitsu.com; x-originating-ip: [218.44.52.180] x-ms-publictraffictype: Email x-ms-office365-filtering-correlation-id: 96075cdb-933c-413b-f1dc-08d903c00040 x-ms-traffictypediagnostic: TYAPR01MB4399: x-ms-exchange-transport-forked: True x-microsoft-antispam-prvs: x-ms-oob-tlc-oobclassifiers: OLM:2201; x-ms-exchange-senderadcheck: 1 x-microsoft-antispam: BCL:0; x-microsoft-antispam-message-info: u23sHN0ed7ifctJ+nPc0C/vA2FEL1EjnYzInQveMdPnUSdsXPMLdydydigIf3fdcypwTvIDBh7KpFynfF+1VSUnMK0+qEQJUMQMaGjLm6Y8bd1mHNy08VyUmqK33W9d53BgI6s/baQ36/y35Xp5CSFjtJHLRSDYUm7+WP2MydxcrDH0pa9mUOY+arxVqU0Akq4OumnSkYvAbQRf8C+b+zDxkdyepdSnDpej4fQxhtRcRlAW0DNu7CFHWkI6zRXkzWs756RzLduZUu9BXPBs2BnGdc+nN+Td7MpDyk0WSmoCNTUZgsNuVONo2vp8pdWbHUznBXCXIU6Yj467/8/LrYysPIHiYJ5L3oyUuKdmfVExaXz6coFHDVHc1i9Sh2pyZIGm+XVezxGN7cnNDBh0cATjfTqOdFgUiaJ7SdDEkp7K7iEo1KIuYccHqjpXTmcRvlxQP0L7PoypFJOXntzeX5USjw7pRV2ALe1IgXKRLbFrZHFbKszrc5IEEPxcVEMastAaoMImR5wFaYlecxu8ffQOVt6bQGQ4XLDHLd3TtMxi6dwUkFpCg0408JU9W+lJd1Pbp4gtWzREJnqToy1IK29MxAmTXOMJXqjz9jAFeegwpGLPCkWljBdWRkapEgsoCZAqBaHnjoBI/onwcFO0EHI2L+3OmlDAPNP4vmXpg7LCnD2a/Paw16c4ANp9GZvZz x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:TYAPR01MB6025.jpnprd01.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(346002)(396003)(39860400002)(366004)(136003)(376002)(316002)(54906003)(66476007)(33656002)(186003)(6506007)(71200400001)(4744005)(86362001)(4326008)(26005)(85182001)(8676002)(55016002)(122000001)(38100700002)(6916009)(7696005)(8936002)(2906002)(478600001)(966005)(52536014)(9686003)(64756008)(66446008)(66946007)(66556008)(76116006)(5660300002); DIR:OUT; SFP:1101; x-ms-exchange-antispam-messagedata: =?iso-2022-jp?B?NisxZTlWWG1QSUIwREhpT0RGVHBSdEtKRmd5OE56S2c1OTZwcmdKRTNP?= 
=?iso-2022-jp?B?UFRoNGhvUWR3NzNIdUJsNDlWaHVoZ0RLd2hRQ29YNFhFQmZCNEIzclNp?= =?iso-2022-jp?B?ZFZMZmhDNlBuR3NqZ1B0SHNnQU55R0lkbFFYRmxiWFZTUDRDYy9SZHlR?= =?iso-2022-jp?B?Z2NyN1I2Qm8xK3pMUGY0emlBV0ZaTWg3QjdlTUNEdGZHWExzczdaZGhP?= =?iso-2022-jp?B?YVhWcldNZm01UFQ3T01IZzBGZlVWUkVFcFZpWVliQlBnV3hwb2dZOFFD?= =?iso-2022-jp?B?UlRCdld4N0ZuN2VIT1RuVGdraDExN3RPMW9adjhZTWhIay82c2VRbDFM?= =?iso-2022-jp?B?OGw0anBHRnJzcmcyWWR0M3A2UHdNNG9QckFPTHFFYVczaDVvWHVUK0Fv?= =?iso-2022-jp?B?aFZheEdDNE1EbFRSdGUzUjk0a0psclYrS1Q1S0hQODVNaDc4TXArbDNy?= =?iso-2022-jp?B?cmVFazZ1MWhKaEhDMXQrZDdCbUVzRWx3WmhQNFV5aWR2dDBQTlBTZ1A1?= =?iso-2022-jp?B?UWlCVDRoK3o3dXJFUFNlYmlTOCtoTG53RnZQQk54NFJhUzdJcWtXYVZY?= =?iso-2022-jp?B?ekJ3bGJFcWNwNGo0U2hIaGpXbmlyaGFSWWRkb25MdVBPTFRSekE3c0ZV?= =?iso-2022-jp?B?Mmw0OUh2anl1bnhRUjZQY3JnRGdOL2dqbEhvVGlZaWFNbEZDcHdsaDkz?= =?iso-2022-jp?B?SE9vZmJlekhsb2M0ay9yUWI4SUJ5QkN4U1lMNktBVkhLMjRzT0FPa29V?= =?iso-2022-jp?B?VUloMnNCUDNDZHNmUkNLb1BYM2xPWmNFRVU5ZE5FazJUVmxMMWtiN0FS?= =?iso-2022-jp?B?TDZ0S0lBamwxb3llc2NQUHBTVW1QTElzcWtvNERIUzFySUxRVVlrUHVY?= =?iso-2022-jp?B?RnpkU0k1djFldTNsbHJDRnlxdUtQd2QxdXZaYmJLUVVXOFR5MWJyNUtU?= =?iso-2022-jp?B?cytOdnJiM0xWOGh2QjkzK1Fya0ZMNW1wV1lCbzRJdHNFWC9TQjNScVFx?= =?iso-2022-jp?B?Y3hOU1VmeVMzVVZXQmVIemhYOHR0YXVIZUx1Q0F1aU5qRWp5S1dGOUhC?= =?iso-2022-jp?B?bnNEN1VFOEVEN1U4RStTZVBTYTdLR0dZWTZhRWxPZXFUekhRbWNsT3NX?= =?iso-2022-jp?B?UFAwTFk0WC9ZMzBFVXBrbmZrK04vNlRvZ1UzWmFzNm9jeEgyaEtOd2NU?= =?iso-2022-jp?B?M1BGYUhnZTlZKzdrR21RUys2aDVidkNjdjJaYzJKL0ZBbmdkcGJlU1dF?= =?iso-2022-jp?B?T1NYeExOODBtV0Y5ZFpmNVl0YVlkZFZ2NFhla0pJUm5VTmhXa1RrOXIr?= =?iso-2022-jp?B?ZWdDUytSazNiSHRtb0RuUFhLMXhBelZBNjRBTk5ua0MycFZUV0FtOXht?= =?iso-2022-jp?B?S2VZRmwyWlJHcThYejhKc0hqQ0V2bVA5VUw2VFMxdUN1OWRrcmpjVVdn?= =?iso-2022-jp?B?M3Eva0Z5czUzVThIN0FFYVdObzF5MFNzSWNnd0lhNVVKdFdqZVo1Smla?= =?iso-2022-jp?B?akd2dVRmNm1QNEs5V2VYWUVwWUwzZDhIUDdUNzhIdjhaWXQ4TmViOGoy?= =?iso-2022-jp?B?ZlBvVnkrS1R3WGpHVE1yQzBTcVlnb2xYVUtqR2ttNkZWZkprVCtmOWJU?= 
=?iso-2022-jp?B?dm5QeTFLUU1EcDk0bC9nbEdZY0dsdkhIRkhpUk5FdjE4WXlSSDJEMERF?= =?iso-2022-jp?B?M2RDeEZuT2U2Yi9ESkt5L3lSaXdkNCtFbFh2VGg2d1lYaVE5NlRKVlRn?= =?iso-2022-jp?B?eE1SR0NiaXR1Mk9NK2N1WTdxM045dDVaTHNydjBaeE5yZXF5QjQwNkVk?= =?iso-2022-jp?B?RXhHUUJJenpMUkpBa2JaME8vMTJnZWlLOC9UKzBTSkw4cUNWZmgvZ1Jo?= =?iso-2022-jp?B?bkw5am8yVVpMNVNUY1hQR1RuaXovVW0rKzZVUzcrVSs0YWh6TkdVNzJU?= Content-Type: text/plain; charset="iso-2022-jp" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginatorOrg: fujitsu.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: TYAPR01MB6025.jpnprd01.prod.outlook.com X-MS-Exchange-CrossTenant-Network-Message-Id: 96075cdb-933c-413b-f1dc-08d903c00040 X-MS-Exchange-CrossTenant-originalarrivaltime: 20 Apr 2021 05:49:02.1291 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: a19f121d-81e1-4858-a9d8-736e267fd4c7 X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: PMj7EVIYdvvVgqd1TdGtquj2cO4eMlRfO6v28AISAzEm37b/RcLWvCiCPLUc6qELhsWJ5Ek9Iz0E17nCD7WGbw== X-MS-Exchange-Transport-CrossTenantHeadersStamped: TYAPR01MB4399 X-Spam-Status: No, score=-2.5 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, RCVD_IN_MSPIKE_H4, RCVD_IN_MSPIKE_WL, SPF_HELO_PASS, SPF_PASS, TXREP autolearn=ham autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 20 Apr 2021 05:49:11 -0000

Hi Wilco-san,

Let me focus on removing redundant instructions in this mail.

> From: Wilco Dijkstra
> It is also possible to remove a lot of unnecessary code - eg. rather than use 2
> instructions per prefetch, merge the constant offset in the prefetch instruction
> itself (since they allow up to 32KB offset). There are also lots of branches that
> skip a few instructions if a value is zero, this is often counterproductive due to
> adding branch mispredictions.

I removed the redundant instructions by using cbz and prfm with an offset address [1][2].

[1] https://github.com/NaohiroTamura/glibc/commit/94363b4ab2e5b4b29843a47a6970b9645a8e4eeb
[2] https://github.com/NaohiroTamura/glibc/commit/4648eb559e46d978ded65d40c6bf8c38dd2519d7

Thanks.
Naohiro
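
P.S. For readers who have not looked at the commits, here is a rough C sketch of the two ideas discussed above. It is not taken from the A64FX patch: the function name copy_with_prefetch, the 64-byte block size, and the prefetch distance of 256 are all made up for illustration. The constant prefetch offset is written directly into the address so that a compiler targeting AArch64 has the chance to emit a single prfm with an immediate offset instead of an add followed by prfm, and the tail copy is done without a "skip if zero" branch. Whether the compiler actually produces that code depends on the compiler and options.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Assumed values, chosen only for this illustration. */
#define BLOCK_SIZE 64
#define PREFETCH_DISTANCE 256

/* Hypothetical helper: copies n bytes in fixed-size blocks and prefetches
   a constant distance ahead of the current source position.  */
void copy_with_prefetch(unsigned char *dst, const unsigned char *src, size_t n)
{
    size_t i = 0;

    for (; i + BLOCK_SIZE <= n; i += BLOCK_SIZE) {
        /* The offset is a compile-time constant added to the current
           address, so it can be folded into the prefetch instruction's
           immediate field.  The address is formed through uintptr_t so
           that no out-of-bounds pointer arithmetic is performed; the
           prefetch itself is only a hint.  */
        __builtin_prefetch((const void *)((uintptr_t)(src + i) + PREFETCH_DISTANCE), 0, 3);
        memcpy(dst + i, src + i, BLOCK_SIZE);
    }

    /* Tail: no "if (n - i != 0)" guard.  A zero-length memcpy does
       nothing, so the guard would only add a branch that can be
       mispredicted.  */
    memcpy(dst + i, src + i, n - i);
}
```

A call such as copy_with_prefetch(dst, src, 200) copies three full blocks plus an 8-byte tail. __builtin_prefetch is a GCC/Clang builtin, so this sketch assumes one of those compilers.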