From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from esa9.fujitsucc.c3s2.iphmx.com (esa9.fujitsucc.c3s2.iphmx.com [68.232.159.90]) by sourceware.org (Postfix) with ESMTPS id 7E6043835828 for ; Tue, 27 Apr 2021 11:03:54 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org 7E6043835828 Authentication-Results: sourceware.org; dmarc=none (p=none dis=none) header.from=fujitsu.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=naohirot@fujitsu.com IronPort-SDR: wuMaHrxqCniOov5Ng2gf+gTD7C9AKtDL52DvUCoW0u/cqhQ6aiNyQpmuuVr2jfuyYKAOsU7qJn fgS8pihopF5jyVsDZc6eQoRcKTperpM+V6uf6wgXBQsQCWnQmr95T1hs+29uAdwNmhBG/piHG7 GbR4RFLCbFr0hMIO6VeJS7V7JC+sMkT2snuW2AA/8I1F9CEKPWIyD5xQUO9sFkVy/wx/CXYhEc WfohKFOXgm0xq90c9afNZ9RyTRTiqXn54SbWaIBnJvJTuF9JQAD/syHxw1Iy4m3rCL/O3EdI4U pqc= X-IronPort-AV: E=McAfee;i="6200,9189,9966"; a="30519396" X-IronPort-AV: E=Sophos;i="5.82,254,1613401200"; d="scan'208";a="30519396" Received: from mail-hk2apc01lp2056.outbound.protection.outlook.com (HELO APC01-HK2-obe.outbound.protection.outlook.com) ([104.47.124.56]) by ob1.fujitsucc.c3s2.iphmx.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 27 Apr 2021 20:03:52 +0900 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=HuT3iJeMEsRzsEdc5cnxXUxtPzSNwZjmKF1a6jRYMQQvx5cDSX+OhyP0kHVLgdh5NnOIVhI4It0oJAzVJsArxckFcrrLLPJ6MN0zktID0S/reCF+TwuhtCJYda8FMOMHYtOpAjMsi70kfHXYRBcwrxsmtVdjIFTyk0aZjr2OfOz0DMSXddwESPjCLAut7w2vzLKVG5iDHX9YoHYMYPZBikpDC8E+WfeqwdLXJxGwSDjyo3NzBYoS6ZGtFFx2GZszvGzW7KB4hOCXkWBV0TUN/w60btYD24tb7wESbDww6ICUq1+I4SdecKsZHah5MUSmWh9lR+/S6bJCDifoqVpjmw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=Di5OON64TcjWTjV+n0cUok82/vUCYl2QWKU7FfV6YcQ=; b=nP4cXe/WOKpl9HUNJrVcnxqx0gVv6GbLOHlNioE4uc2HDi3K53DCHTeHPFfOjSS7agFyFzWU2oSbxUdYZusoa76vAEby7YeEy/4XMJAbkV4KjT9BOdiuw6U7L3lo2NGHdp7hphEkP1sSXF41imbw68peUyc5IkVxAFGHhC+TORAm1C53lHkTS9sh/7IL8oDj5ckQaKmLIc0Q/gSeGRmHjp3K+HaQ8MUSal2/4EAXNAD9ErtL329SUE54EtmSOzgr1LDBo2hvZvQhtKDCGIZ1p3IgyIEBVk8Nhj1jV6V4rdIhARSoNRc/g936G/1GEHNc9dI/TtaEreSU0us/d9yVuw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=fujitsu.com; dmarc=pass action=none header.from=fujitsu.com; dkim=pass header.d=fujitsu.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fujitsu.onmicrosoft.com; s=selector2-fujitsu-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=Di5OON64TcjWTjV+n0cUok82/vUCYl2QWKU7FfV6YcQ=; b=MghEvEj5/xeci+UsrTwgpuadPiPQFj0teilAfTlnK5eNDM0vHdn4x+mKODf0bQseqJBJrLCMJ1QAg8yTyvexbvYebHafupS8iUFYlRIq0H7m7oTrNBI3fzKIT+OwiOEKSb1z68Cbswzl7l5VMyKJ5B2TfTTfDE/086Lh1pPgZXI= Received: from TYAPR01MB6025.jpnprd01.prod.outlook.com (2603:1096:402:36::13) by TY2PR01MB2972.jpnprd01.prod.outlook.com (2603:1096:404:75::16) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4065.25; Tue, 27 Apr 2021 11:03:48 +0000 Received: from TYAPR01MB6025.jpnprd01.prod.outlook.com ([fe80::2422:2c7:39a3:5283]) by TYAPR01MB6025.jpnprd01.prod.outlook.com ([fe80::2422:2c7:39a3:5283%6]) with mapi id 15.20.4065.027; Tue, 27 Apr 2021 11:03:48 +0000 From: "naohirot@fujitsu.com" To: 'Wilco Dijkstra' CC: 'GNU C Library' , Szabolcs Nagy Subject: RE: [PATCH 0/5] Added optimized memcpy/memmove/memset for A64FX Thread-Topic: [PATCH 0/5] Added optimized memcpy/memmove/memset for A64FX Thread-Index: AQHXL5Jyw0P1gKwhEk6/DkVDv1IPJaqyCeTQgAIMP+uACM8uEIAAbRm9gArx0OA= Date: Tue, 27 Apr 2021 11:03:48 +0000 Message-ID: References: , , In-Reply-To: Accept-Language: en-001, ja-JP, en-US Content-Language: ja-JP X-MS-Has-Attach: X-MS-TNEF-Correlator: x-securitypolicycheck: OK by SHieldMailChecker v2.6.3 x-shieldmailcheckermailid: 0cc8f4d0090144069af0bde7eebeea51 authentication-results: arm.com; dkim=none (message not signed) header.d=none;arm.com; dmarc=none action=none header.from=fujitsu.com; x-originating-ip: [218.44.52.175] x-ms-publictraffictype: Email x-ms-office365-filtering-correlation-id: 06ed4617-6600-4b59-336e-08d9096c2282 x-ms-traffictypediagnostic: TY2PR01MB2972: x-ms-exchange-transport-forked: True x-microsoft-antispam-prvs: x-ms-oob-tlc-oobclassifiers: OLM:972; x-ms-exchange-senderadcheck: 1 x-microsoft-antispam: BCL:0; x-microsoft-antispam-message-info: 9n801IMFsOpc4/LoIm7iN0xAekC0fOd+UIlo5aJ2jzrQVkA2nfmLVO+WOTfHJvFGdRjS3t+Khy1wp9kaXC88sXt92cvkPjWgqWoBPhbQI9/sQVelbHtPa3ydswfmWLPv4MwbH5lKf/4rimbwRM6wn3jXko9YVCaWumviC2jmjcbwxOml4i+DvSUKlMZUHIg1F8/0eu2EU3hZjfBTEY+OWxB0jJWKpiq3sP/nzn4r3TCfxJdJloLNeC70KGJ5zHTUf/uURxNpnChkDkfisZKfWdwc92Rl/zsUx1v6aXqEbEXpJtOr4HMIHcazZqBOLV5n8YUkcdHHo9hLnBR8jORwLnVYdn7GbvtzrchiKLAhMaA6VqScjzTPcbTf89rLolY6VP5y7hljMeMQfYWt2ZRFAxGLlOc7hXZWafwM7ILEbd2lmQPrVYCis42JlGnDpvY36O9eG4j+mRmteUmVTRL5/A7q/q4hxcjZioLC1f4LxJ83lIxdOF9KCIANQXsYseFQwX3Ug6iocQI2qCv8HJlobaTv4Pra5OpNvFmIuYwoIqeK64CKDKmvPpEj79wKuHiTpAqFjEpJLth3JFHpHLaWcWAN573V1E7KSxXfTMZHdJ6VRw3U0jXeChbK7ObXVl9QN0MbUsTIVTrbvk+AScRX2mcYumNAFz1yVYrawKUkrfzW2RFOI5AHJS4XJdK5Jhu/ x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:TYAPR01MB6025.jpnprd01.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(396003)(376002)(39860400002)(346002)(366004)(136003)(8676002)(54906003)(5660300002)(6506007)(52536014)(122000001)(4744005)(85182001)(9686003)(38100700002)(66556008)(66476007)(4326008)(76116006)(66446008)(55016002)(66946007)(83380400001)(478600001)(33656002)(7696005)(6916009)(316002)(186003)(8936002)(64756008)(2906002)(966005)(71200400001)(26005)(86362001); DIR:OUT; SFP:1101; x-ms-exchange-antispam-messagedata: =?iso-2022-jp?B?ZzE0WTlNT1R3WXRudzZkNExkVmFLcHkxNVJWNy9yQ0JNc2ZwNmJGZnpw?= =?iso-2022-jp?B?dVVjRUxsVEF0Wko2RW81VWJMd1V6VElURm9IMGdWV3BodlI5VUNVYnhV?= =?iso-2022-jp?B?aGYvS29OTmhnTWFTdFQycXpRY0lvVjNwelR4RUZDNFRCWW9JcUZzMlhv?= =?iso-2022-jp?B?a1BVQzZqdHNGZVpjSkg4MENKVFZPM1ZZWCtGUEdzeGg2VGtiQmwyNTBS?= =?iso-2022-jp?B?aWJoS2cxcnpib2NEbVoyYU41N2t0VVNXTWZOOG1iTHNkZ1N5cTJzZExk?= =?iso-2022-jp?B?TzhKM0F6VWp3NmkrNWFCZUhERUk0MHdXSnJDZ1hMMVRqc2VQRWxMS3BP?= =?iso-2022-jp?B?ZlVFV3BuYjNGelF1UUNrbFphcEdXdjUzU1pzSGExMnRKdmlBKytNTjhl?= =?iso-2022-jp?B?K3Y2c0dUZzEyZ2ZyZFNzNWRkM3llQ01tbkFPRStDR0lsL3drMHlEVWc1?= =?iso-2022-jp?B?Ym9aY3RrYzl6Z3kzczlIcGRmQk56bEhjeFBKcFdHT2ZRU0RZODBJSC92?= =?iso-2022-jp?B?di95cVpBNGVmK0tybWJ0cnN3TGRmYlJ6QXg1aGw0NGgvVUhicDlza3ZT?= =?iso-2022-jp?B?a2FjWDNjallCT3dZYWZ3N3l5UHAwd2VrcFpyT0w3RGhmQTRJQ1hEMUxT?= =?iso-2022-jp?B?aFBpWUY5THRMekZleWNFYjV4VXJQam1GVWliMHh2aEI5ZVk0Y3pPNmh4?= =?iso-2022-jp?B?ZG1SMmx6aENEUm8wa09aR3AvTzlEdnNtY2VnMExKaFFzRWw1RDVHekt0?= =?iso-2022-jp?B?a2FWQ1I5aXFNRnIvdzR5VjFyeUlJS3Nnb3Yydm9UN2h5S0Z4bjFWREt6?= =?iso-2022-jp?B?dkUzbUtiNm52V1JlcjM4R2NPcGV3eDd6cU1BMDIvb1NOa0ExYWJNM1By?= =?iso-2022-jp?B?OHJxZ2FobGpoNHhIcEgzU1UwMmdHVzgraG9mYnM3K0paM3hDd0xyQzcx?= =?iso-2022-jp?B?NFkvOUtRRDRMeGE0MkpuQ3NsTTUvWXIza29Qa20xeUUrMGRMRVp3Z2Jn?= =?iso-2022-jp?B?YnBld2V2S1lESWNqaFpQczg3bHk4VW54cjRlQVJRbDZVVWt0bndHaGl4?= =?iso-2022-jp?B?by9JKytqRzlvbHQwcTZkZVh5cXBocXBhUkZWclY2RmtZcDVPTENZYVNs?= =?iso-2022-jp?B?b1F1ZTVNVjN0UWI4bjVDUC9HK0VuNndaUnA0NHA0bDVBL1FSYkNsWEFw?= =?iso-2022-jp?B?SHA0clVtT1E4VlFiVXBBc1ljUCt6NWdHUGR5R2JjZ2Qyc1drc01RUy9O?= =?iso-2022-jp?B?STNXZ25sM3dzdWZhV3QyaHdiT01mQjlBV2M2cG5uRlpVWGVPRHk3TmRJ?= =?iso-2022-jp?B?L1pzT0YwRmROM1hydXRVMUNFekhYSEVTeXNiNldPN0IzdjFEUlptcXky?= =?iso-2022-jp?B?emtTamthcGorNCt6eGtGTGpUS1VSQUcvc2JPNXkzQWJtMmxzK1l2SVVh?= =?iso-2022-jp?B?aEdUQ09JMHFyYXVuSnBVQ2gxRUdrT1FDRkl5bDVRSGs0dWRtc1d5ZlJS?= =?iso-2022-jp?B?QnQvYjVtZGU5Kzc0WHNabTlQOURldWZTa2d2RmpzVnhhZDlKaGxlSXpK?= =?iso-2022-jp?B?M0MrL2tndDRsNGJxZFgyWExFNU1TUkRzVUlvZUVwekVPMTJySlRwRDFG?= =?iso-2022-jp?B?ekRHSnJpR2FzdURpajRONjhSNWNzZGpBRW1IWWNkcVFqamFjNEtWSWRo?= =?iso-2022-jp?B?dzEyVUl2TVM3QW9aN2kxVVVkcTQ1RU9hY244Ykk4bURDVmY5aHVtZ0FX?= =?iso-2022-jp?B?Ukk2a2xhVkFQd2hCaEhxRldRNHpUWGRGZGErOHI4YVpDcHJZYnV4NVhy?= =?iso-2022-jp?B?WlVmODYyZXpVZ2trS2VtelBDSzlaMks2RVlibkw3eENjWFVlK1JkQUo4?= =?iso-2022-jp?B?TXQxcEtObEtWeVZ5NlpBYVRSdkNKMHhLZmora3VOVEM5T005Nk1VZ0Vz?= Content-Type: text/plain; charset="iso-2022-jp" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginatorOrg: fujitsu.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: TYAPR01MB6025.jpnprd01.prod.outlook.com X-MS-Exchange-CrossTenant-Network-Message-Id: 06ed4617-6600-4b59-336e-08d9096c2282 X-MS-Exchange-CrossTenant-originalarrivaltime: 27 Apr 2021 11:03:48.9024 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: a19f121d-81e1-4858-a9d8-736e267fd4c7 X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: JMLCS1741O6JhxjFxejk/+qrWBSw5V3eGDbWDU/195Leqx4bc7FAie7ZmULP6T8ElcHWuSqosldA7MOi8i+brA== X-MS-Exchange-Transport-CrossTenantHeadersStamped: TY2PR01MB2972 X-Spam-Status: No, score=-2.7 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, JMQ_SPF_NEUTRAL, RCVD_IN_MSPIKE_H4, RCVD_IN_MSPIKE_WL, SPF_HELO_PASS, SPF_PASS, TXREP autolearn=no autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 27 Apr 2021 11:03:57 -0000 Hi Wilco-san, This mail is a continuation of removing redundant instructions. > From: Wilco Dijkstra =20 > For the first 2 CBZ cases in both [1] and [2] the fastest option is to us= e > ANDS+BEQ. ANDS only requires 1 ALU operation while AND+CBZ uses 2 ALU > operations on A64FX. I see, I haven't used ANDS before. Thanks for the advice. I updated memcpy[1] and memset[2]. [1] https://github.com/NaohiroTamura/glibc/commit/fca2c1cf1fd80ec7ecb93f7cd= 08be9aab9ca9412 [2] https://github.com/NaohiroTamura/glibc/commit/5004e34c35a20faf3e12e6ce9= 15845a75b778cbf Thanks. Naohiro