From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path: <>
Received: from fx302.security-mail.net (mxout.security-mail.net [85.31.212.42]) by sourceware.org (Postfix) with ESMTPS id 3035E3953075 for ; Tue, 10 Aug 2021 09:45:56 +0000 (GMT)
Authentication-Results: sourceware.org; dkim=permerror (bad message/signature format)
Received: by fx302.security-mail.net (Postfix) id 656263D3B13A; Tue, 10 Aug 2021 11:45:55 +0200 (CEST)
Date: Tue, 10 Aug 2021 11:45:55 +0200 (CEST)
From: MAILER-DAEMON (Mail Delivery System)
Subject: Undelivered Mail Returned to Sender
To: libc-alpha@sourceware.org
Auto-Submitted: auto-replied
MIME-Version: 1.0
Content-Type: multipart/report; report-type=delivery-status; boundary="24FDE3D3B138.1628588755/fx302.security-mail.net"
Message-Id: <20210810094555.656263D3B13A@fx302.security-mail.net>
X-Spam-Status: No, score=-11.2 required=5.0 tests=BAYES_00, GIT_PATCH_0, HTML_MESSAGE, KAM_DMARC_STATUS, KAM_LOTSOFHASH, RCVD_IN_DNSWL_LOW, SPF_HELO_NONE, TXREP autolearn=ham autolearn_force=no version=3.4.4
X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org
X-Content-Filtered-By: Mailman/MimeDel 2.1.29
X-BeenThere: libc-alpha@sourceware.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Libc-alpha mailing list
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
X-List-Received-Date: Tue, 10 Aug 2021 09:46:06 -0000

This is a MIME-encapsulated message.

--24FDE3D3B138.1628588755/fx302.security-mail.net
Content-Description: Notification
Content-Type: text/plain; charset=us-ascii

This is the mail system at host fx302.security-mail.net.

I'm sorry to have to inform you that your message could not
be delivered to one or more recipients. It's attached below.

For further assistance, please send mail to postmaster.

If you do so, please include this problem report. You can
delete your own text from the attached returned message.

The mail system

: host zimbra2.kalray.eu[195.135.97.26] said: 550 5.1.1 : Recipient address rejected: User unknown in virtual mailbox table (in reply to RCPT TO command)

--24FDE3D3B138.1628588755/fx302.security-mail.net
Content-Description: Delivery report
Content-Type: message/delivery-status

Reporting-MTA: dns; fx302.security-mail.net
X-Postfix-Queue-ID: 24FDE3D3B138
X-Postfix-Sender: rfc822; libc-alpha@sourceware.org
Arrival-Date: Tue, 10 Aug 2021 11:45:55 +0200 (CEST)

Final-Recipient: rfc822; mpoulhies@kalray.eu
Original-Recipient: rfc822;mpoulhies@kalray.eu
Action: failed
Status: 5.1.1
Remote-MTA: dns; zimbra2.kalray.eu
Diagnostic-Code: smtp; 550 5.1.1 : Recipient address rejected: User unknown in virtual mailbox table

--24FDE3D3B138.1628588755/fx302.security-mail.net
Content-Description: Undelivered Message
Content-Type: message/rfc822

Return-Path:
Received: from sourceware.org (ip-8-43-85-97.sourceware.org [8.43.85.97]) by fx302.security-mail.net (Postfix) with ESMTPS id 5EF873D3B0FF for ; Tue, 10 Aug 2021 11:45:52 +0200 (CEST)
Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 570363951C63 for ; Tue, 10 Aug 2021 09:45:51 +0000 (GMT)
Received: from EUR01-VE1-obe.outbound.protection.outlook.com (mail-eopbgr140043.outbound.protection.outlook.com [40.107.14.43]) by sourceware.org (Postfix) with ESMTPS id 804A23953C37 for ; Tue, 10 Aug 2021 09:44:40 +0000 (GMT)
Received: from DB7PR03CA0077.eurprd03.prod.outlook.com (2603:10a6:10:72::18) by AM7PR08MB5301.eurprd08.prod.outlook.com (2603:10a6:20b:dd::14) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.21; Tue, 10 Aug 2021 09:44:38 +0000
Received: from DB5EUR03FT045.eop-EUR03.prod.protection.outlook.com (2603:10a6:10:72:cafe::63) by DB7PR03CA0077.outlook.office365.com (2603:10a6:10:72::18) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4415.13 via Frontend
Transport; Tue, 10 Aug 2021 09:44:38 +0000
Received: from 64aa7808-outbound-1.mta.getcheckrecipient.com (63.35.35.123) by DB5EUR03FT045.mail.protection.outlook.com (10.152.21.164) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.16 via Frontend Transport; Tue, 10 Aug 2021 09:44:38 +0000
Received: ("Tessian outbound efa8a7456a86:v101"); Tue, 10 Aug 2021 09:44:38 +0000
Received: from 3a96d47ecb6c.2 by 64aa7808-outbound-1.mta.getcheckrecipient.com id 7F81C20A-07BA-425F-86C8-C5784ADA2065.1; Tue, 10 Aug 2021 09:44:32 +0000
Received: from EUR02-VE1-obe.outbound.protection.outlook.com by 64aa7808-outbound-1.mta.getcheckrecipient.com with ESMTPS id 3a96d47ecb6c.2 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384); Tue, 10 Aug 2021 09:44:32 +0000
Received: from PA4PR08MB6320.eurprd08.prod.outlook.com (2603:10a6:102:e5::9) by PAXPR08MB6445.eurprd08.prod.outlook.com (2603:10a6:102:159::13) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.15; Tue, 10 Aug 2021 09:44:31 +0000
Received: from PA4PR08MB6320.eurprd08.prod.outlook.com ([fe80::cd22:a583:c97c:72a6]) by PA4PR08MB6320.eurprd08.prod.outlook.com ([fe80::cd22:a583:c97c:72a6%7]) with mapi id 15.20.4415.013; Tue, 10 Aug 2021 09:44:31 +0000
Received: from arm.com (217.140.106.49) by LO2P265CA0254.GBRP265.PROD.OUTLOOK.COM (2603:10a6:600:8a::26) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.16 via Frontend Transport; Tue, 10 Aug 2021 09:44:30 +0000
X-Quarantine-ID:
X-Virus-Scanned: E-securemail, by Secumail
X-Spam-Status: No, score=2.705 tagged_above=-1000 required=7.5 tests=[AB_ENVFROM_LONG_40=0.5, AB_IN_REPLY_TO_EXISTS=-1, AB_LONG_SUBJ_30=0.001, AB_UNPARSEABLE_RELAY_ENHANCE=1.5, DKIM_SIGNED=0.1, DKIM_VALID=-1, DKIM_VALID_AU=-0.1, FAKE_REPLY_SURE_A=1, FAKE_REPLY_SURE_B=1, FSL_RCVD_EX_GT_5=1, FSL_RCVD_UT_GT_5=0.01, HEAD_NEWS=-0.5, MISSING_MID=0.14, MM_ENVFROM_BOUNCE=1, RCVD_IN_DNSWL_MED=-1.3, RDNS_DYNAMIC=0.363, SINGLE_HEADER_2K=1, S_FROM_GREY_MINUS_2=-2, TEST_REPLY_C=1, T_RP_MATCHES_RCVD=-0.01, UNPARSEABLE_RELAY=0.001] autolearn=disabled
Authentication-Results: fx302.security-mail.net (amavisd-new); dkim=pass (1024-bit key) header.d=sourceware.org
Secumail-id: <115e1.61124ad0.1ab59.0>
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 570363951C63
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sourceware.org; s=default; t=1628588751; bh=x5XkVGd+EAFBDn5q2tDjhZuF+SUSDLGnflhAVhyXOVw=; h=Date:To:Subject:References:In-Reply-To:List-Id:List-Unsubscribe: List-Archive:List-Post:List-Help:List-Subscribe:From:Reply-To:Cc: From; b=u03JKo4NdOQPh8lXoN/BlD1q37TQJ7edIDELjrjlH6SYTZ/pnpqnKFM4p7oh471SB 09jfdk8yyId8JxI/1c1RDN3cGAByRqflOg4fFNeWhtOqydXphKR2RfQQqHasY7KScI N2VVNyU1swF6mTKi/38RTvv1mHSGObtjORpSHkBI=
X-Original-To: libc-alpha@sourceware.org
Delivered-To: libc-alpha@sourceware.org
DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 804A23953C37
X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 63.35.35.123) smtp.mailfrom=arm.com; sourceware.org; dkim=pass (signature was verified) header.d=armh.onmicrosoft.com;sourceware.org; dmarc=pass action=none header.from=arm.com;
Received-SPF: Pass (protection.outlook.com: domain of arm.com designates 63.35.35.123 as permitted sender) receiver=protection.outlook.com; client-ip=63.35.35.123; helo=64aa7808-outbound-1.mta.getcheckrecipient.com;
X-CheckRecipientChecked: true
X-CR-MTA-CID: 42b271e0dbe3f9e4
X-CR-MTA-TID: 64aa7808
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Dcgv08iBX+bEdHdsp9yZYIKZgX9dT/sdnl0g4j80Mijn0GE7VsAHerOP7uL55WbsUkFu3lG26qcsYdiTbCzHF78lwrgqE24ExBWzy0FeStu/J9lplcO6begV/Lb8PRzSKGvJCCgARwagtc6qWcGzD9K44T7Q37QM/B8XkcWcfGdwWEKDfyMxnVEDxpuFjNvh2i5KKDIpcUqBBcjSXD7Takz0lwaElr9u+vP6jUXuwjZ7OVwf7j7+qygHucHdxRpxOIc8hZUJTRIwhhgAxc0yl9pioPEncYm95HjGlKCML8xj9jpDMbLHMZMoGvjupUUfw+QzCMyLPHtuTTIBznefbg==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=x5XkVGd+EAFBDn5q2tDjhZuF+SUSDLGnflhAVhyXOVw=; b=W3afO34engL18TsC8HZ8V6kjh+9srVhSgizaHW3XhRD6RT+VrqGydL1anmnOTdKSZTsO3VV+slAcyzOdny0LribthBB60axsspw+lZCI/rYPGxwUDw3ZSTh2Ual2QlOc8GK1v+oyBzOP8ZgXLXfjSH5+u8fXefsubb+S3dOveAy0zow56BMUKpTHuZj9mR4bAR8bksZBqYcX1e1CXhJeSPLSzUQULbYfIKMEZ95NKC+k/Hag6uGt51zwvwMAb3z3OQ9m/jIVw8mEMcFlwVxYxS9hX9o6x+Vo1wSLNlApDSLI+vPt5ZPUVS13RXtrE0F7CPXLJOLhIpdP7uQuizxLmg==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=arm.com; dmarc=pass action=none header.from=arm.com; dkim=pass header.d=arm.com; arc=none
Authentication-Results-Original: arm.com; dkim=none (message not signed) header.d=none;arm.com; dmarc=none action=none header.from=arm.com;
Date: Tue, 10 Aug 2021 10:44:29 +0100
To: Wilco Dijkstra
Subject: Re: [PATCH v4 5/5] AArch64: Improve A64FX memset medium loops
Message-ID: <20210810094428.GG20410@arm.com>
References:
In-Reply-To:
User-Agent: Mutt/1.9.4 (2018-02-28)
X-ClientProxiedBy: LO2P265CA0254.GBRP265.PROD.OUTLOOK.COM (2603:10a6:600:8a::26) To PA4PR08MB6320.eurprd08.prod.outlook.com (2603:10a6:102:e5::9)
MIME-Version: 1.0
X-MS-Exchange-MessageSentRepresentingType: 1
X-MS-PublicTrafficType: Email
X-MS-Office365-Filtering-Correlation-Id: 40dc973e-dd7c-48e0-0e50-08d95be3787a
X-MS-TrafficTypeDiagnostic: PAXPR08MB6445:|AM7PR08MB5301:
X-MS-Exchange-Transport-Forked: True
X-Microsoft-Antispam-PRVS:
x-checkrecipientrouted: true
NoDisclaimer: true
X-MS-Oob-TLC-OOBClassifiers: OLM:962;OLM:962;
X-MS-Exchange-SenderADCheck: 1
X-MS-Exchange-AntiSpam-Relay: 0
X-Microsoft-Antispam-Untrusted: BCL:0;
X-Forefront-Antispam-Report-Untrusted: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:PA4PR08MB6320.eurprd08.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(346002)(39850400004)(396003)(136003)(376002)(366004)(4326008)(478600001)(66476007)(36756003)(6862004)(66946007)(6636002)(66556008)(26005)(2906002)(54906003)(7696005)(52116002)(44832011)(37006003)(86362001)(316002)(2616005)(956004)(1076003)(33656002)(186003)(8676002)(55016002)(5660300002)(38100700002)(8886007)(38350700002)(8936002)(473944003)(357404004); DIR:OUT; SFP:1101;
X-MS-Exchange-Transport-CrossTenantHeadersStamped: PAXPR08MB6445, AM7PR08MB5301
Original-Authentication-Results: arm.com; dkim=none (message not signed) header.d=none;arm.com; dmarc=none action=none header.from=arm.com;
X-EOPAttributedMessage: 0
X-MS-Exchange-Transport-CrossTenantHeadersStripped: DB5EUR03FT045.eop-EUR03.prod.protection.outlook.com
X-MS-Office365-Filtering-Correlation-Id-Prvs: 954e81a2-427c-41e4-8bfd-08d95be373d1
X-Microsoft-Antispam: BCL:0;
X-Forefront-Antispam-Report: CIP:63.35.35.123; CTRY:IE; LANG:en; SCL:1; SRV:; IPV:CAL; SFV:NSPM; H:64aa7808-outbound-1.mta.getcheckrecipient.com; PTR:ec2-63-35-35-123.eu-west-1.compute.amazonaws.com; CAT:NONE; SFS:(4636009)(346002)(136003)(376002)(39850400004)(396003)(36840700001)(46966006)(8676002)(26005)(36860700001)(36756003)(44832011)(186003)(316002)(2906002)(7696005)(55016002)(356005)(82310400003)(81166007)(70586007)(70206006)(478600001)(6862004)(37006003)(1076003)(82740400003)(336012)(47076005)(8936002)(33656002)(956004)(2616005)(6636002)(4326008)(54906003)(86362001)(5660300002)(8886007)(473944003)(357404004); DIR:OUT; SFP:1101;
X-OriginatorOrg: arm.com
X-MS-Exchange-CrossTenant-OriginalArrivalTime: 10 Aug 2021 09:44:38.6504 (UTC)
X-MS-Exchange-CrossTenant-Network-Message-Id: 40dc973e-dd7c-48e0-0e50-08d95be3787a
X-MS-Exchange-CrossTenant-Id: f34e5979-57d9-4aaa-ad4d-b122a662184d
X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=f34e5979-57d9-4aaa-ad4d-b122a662184d; Ip=[63.35.35.123]; Helo=[64aa7808-outbound-1.mta.getcheckrecipient.com]
X-MS-Exchange-CrossTenant-AuthSource: DB5EUR03FT045.eop-EUR03.prod.protection.outlook.com
X-MS-Exchange-CrossTenant-AuthAs: Anonymous
X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem
X-BeenThere: libc-alpha@sourceware.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Libc-alpha mailing list
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
From: Szabolcs Nagy via Libc-alpha
Reply-To: Szabolcs Nagy
Cc: 'GNU C Library'
Errors-To: libc-alpha-bounces+mpoulhies=kalray.eu@sourceware.org
Sender: Libc-alpha
X-ALTERMIMEV2_in: done
Content-Type: text/plain; charset=utf-8

The 08/09/2021 13:15, Wilco Dijkstra via Libc-alpha wrote:
> v4: minor loop change
>
> Simplify the code for memsets smaller than L1. Improve the unroll8 and L1_prefetch loops.

OK to commit, but keep

Reviewed-by: Naohiro Tamura

(further tweaks can go into follow-up commits.)

>
> ---
>
> diff --git a/sysdeps/aarch64/multiarch/memset_a64fx.S b/sysdeps/aarch64/multiarch/memset_a64fx.S
> index 89dba912588c243e67a9527a56b4d3a44659d542..318c6350a31e0fad788b5f2139de645ddc51493f 100644
> --- a/sysdeps/aarch64/multiarch/memset_a64fx.S
> +++ b/sysdeps/aarch64/multiarch/memset_a64fx.S
> @@ -30,7 +30,6 @@
>  #define L2_SIZE (8*1024*1024) // L2 8MB
>  #define CACHE_LINE_SIZE 256
>  #define PF_DIST_L1 (CACHE_LINE_SIZE * 16) // Prefetch distance L1
> -#define rest x2
>  #define vector_length x9
>  
>  #if HAVE_AARCH64_SVE_ASM
> @@ -89,29 +88,19 @@ ENTRY (MEMSET)
>  
>  	.p2align 4
>  L(vl_agnostic): // VL Agnostic
> -	mov rest, count
>  	mov dst, dstin
> -	add dstend, dstin, count
> -	// if rest >= L2_SIZE && vector_length == 64 then L(L2)
> -	mov tmp1, 64
> -	cmp rest, L2_SIZE
> -	ccmp vector_length, tmp1, 0, cs
> -	b.eq L(L2)
> -	// if rest >= L1_SIZE && vector_length == 64 then L(L1_prefetch)
> -	cmp rest, L1_SIZE
> -	ccmp vector_length, tmp1, 0, cs
> -	b.eq L(L1_prefetch)
> -
> +	cmp count, L1_SIZE
> +	b.hi L(L1_prefetch)
>  
> +	// count >= 8 * vector_length
>  L(unroll8):
> -	lsl tmp1, vector_length, 3
> -	.p2align 3
> -1:	cmp rest, tmp1
> -	b.cc L(last)
> -	st1b_unroll
> +	sub count, count, tmp1
> +	.p2align 4
> +1:	st1b_unroll 0, 7
>  	add dst, dst, tmp1
> -	sub rest, rest, tmp1
> -	b 1b
> +	subs count, count, tmp1
> +	b.hi 1b
> +	add count, count, tmp1
>  
>  L(last):
>  	cmp count, vector_length, lsl 1
> @@ -129,18 +118,22 @@ L(last):
>  	st1b z0.b, p0, [dstend, -1, mul vl]
>  	ret
>  
> -L(L1_prefetch): // if rest >= L1_SIZE
> +	// count >= L1_SIZE
>  	.p2align 3
> +L(L1_prefetch):
> +	cmp count, L2_SIZE
> +	b.hs L(L2)
> +	cmp vector_length, 64
> +	b.ne L(unroll8)
>  1:	st1b_unroll 0, 3
>  	prfm pstl1keep, [dst, PF_DIST_L1]
>  	st1b_unroll 4, 7
>  	prfm pstl1keep, [dst, PF_DIST_L1 + CACHE_LINE_SIZE]
>  	add dst, dst, CACHE_LINE_SIZE * 2
> -	sub rest, rest, CACHE_LINE_SIZE * 2
> -	cmp rest, L1_SIZE
> -	b.ge 1b
> -	cbnz rest, L(unroll8)
> -	ret
> +	sub count, count, CACHE_LINE_SIZE * 2
> +	cmp count, PF_DIST_L1
> +	b.hs 1b
> +	b L(unroll8)
>  
>  	// count >= L2_SIZE
>  	.p2align 3
>

-- 
To declare a filtering error, please use the following link : https://www.security-mail.net/reporter.php?mid=115e1.61124ad0.1ab59.0&r=mpoulhies%40kalray.eu&s=libc-alpha-bounces%2Bmpoulhies%3Dkalray.eu%40sourceware.org&o=Re%3A+%5BPATCH+v4+5%2F5%5D+AArch64%3A+Improve+A64FX+memset+medium+loops&verdict=C&c=f39d9cf9bc2bf4b6b9ed72b252e2bb740f50c51b

--24FDE3D3B138.1628588755/fx302.security-mail.net--