From: Wilco Dijkstra
To: Richard Earnshaw, Kyrylo Tkachov, GCC Patches
CC: Richard Sandiford, Richard Earnshaw
Subject: Re: [PATCH v2] AArch64: Cleanup memset expansion
Date: Tue, 14 Nov 2023 16:56:08 +0000
References: <372b9689-24b5-41f4-a990-5aee0226e15f@foss.arm.com> <61c6e268-188c-4b35-956d-bd8927d705f2@foss.arm.com>

Hi Richard,

> +/* Maximum bytes set for an inline memset expansion.  With -Os use 3 STP
> +   and 1 MOVI/DUP (same size as a call).  */
> +#define MAX_SET_SIZE(speed) (speed ? 256 : 96)

> So it looks like this assumes we have AdvSIMD.  What about
> -mgeneral-regs-only?

After my strictalign bugfix
(https://gcc.gnu.org/pipermail/gcc-patches/2023-November/635309.html)
aarch64_expand_setmem starts with:

  /* Variable-sized or strict-align memset may use the MOPS expansion.  */
  if (!CONST_INT_P (operands[1]) || !TARGET_SIMD
      || (STRICT_ALIGNMENT && align < 16))
    return aarch64_expand_setmem_mops (operands);

Generating perfect code for every STRICT_ALIGNMENT x TARGET_SIMD
x AARCH64_EXTRA_TUNE_NO_LDP_STP_QREGS x speed/size combination
would require a huge rewrite - and that's not the goal of this patch.

Cheers,
Wilco
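
For illustration only, the guard quoted above can be modelled as a small standalone C sketch. The function defers_to_mops_path and its boolean parameters are hypothetical stand-ins for CONST_INT_P (operands[1]), TARGET_SIMD, STRICT_ALIGNMENT and align; this is not the GCC implementation, just the same predicate written out so individual cases can be checked by hand. The 3 * 32 figure assumes each STP stores a pair of 16-byte Q registers, which is presumably what the "3 STP" in the patch comment refers to.

```c
#include <assert.h>
#include <stdbool.h>

/* Same limits as the macro quoted from the patch.  */
#define MAX_SET_SIZE(speed) ((speed) ? 256 : 96)

/* Hypothetical model of the early exit in aarch64_expand_setmem.
   The parameters stand in for CONST_INT_P (operands[1]), TARGET_SIMD,
   STRICT_ALIGNMENT and align; they are not GCC's real interfaces.  */
bool
defers_to_mops_path (bool size_is_const, bool target_simd,
                     bool strict_alignment, unsigned align)
{
  return !size_is_const || !target_simd
         || (strict_alignment && align < 16);
}

/* A few cases worked through by hand.  */
void
check_examples (void)
{
  /* No AdvSIMD (as with -mgeneral-regs-only): the guard fires.  */
  assert (defers_to_mops_path (true, false, false, 16));
  /* Constant size, AdvSIMD available, no strict alignment: the guard
     does not fire, so the inline expansion is considered.  */
  assert (!defers_to_mops_path (true, true, false, 1));
  /* Strict alignment with align below 16: the guard fires.  */
  assert (defers_to_mops_path (true, true, true, 8));
  /* Assumed arithmetic behind the -Os limit: 3 STP x 32 bytes.  */
  assert (MAX_SET_SIZE (false) == 3 * 32);
}
```

In this model, a build without AdvSIMD never reaches the code that uses MAX_SET_SIZE, which is why the limit can be sized in terms of Q-register stores.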