From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from EUR04-HE1-obe.outbound.protection.outlook.com (mail-he1eur04on2075.outbound.protection.outlook.com [40.107.7.75]) by sourceware.org (Postfix) with ESMTPS id C0C2F385828D for ; Mon, 19 Jun 2023 07:20:16 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org C0C2F385828D Authentication-Results: sourceware.org; dmarc=pass (p=quarantine dis=none) header.from=suse.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=suse.com ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=eR9zhOnXmL2kdURcnWOHDixyJWj0PgZnnQWaZGlRoCTl7cwR5fbTnudPF4/61oZ8nP6qiC2H2TtMTr77r1JSi3UbnXma9pU1I5RF6qXQKBMaVWO7U2SZSW7dm7zqSF0HHqx5OQIPJba/VVZilBxKMUnyMUzl+olGUQ5KWLWxqZGK0i9PDFtjYIM2iD+r7olmpDoGwyPFyWY2k/kUbBpyRUj0MqfEvoZOOHxNZy7Jo5grpYVV4e0SbjTMryJlbuY4fKcmSnH194RYGPpwjGUwuVwIoXJBx7wyDSjsjJ1vPhdMbXnFmAh2eUEiuAugsLbOVZuaOuH3WqXihwmqRYDtHQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=UfrzrkvqJMimVhviOCxYjLpT7PuvgpWloGEobVx2QUY=; b=haxXOBUvX0Qb6T2rilIU+bwxXvc52siyAdjIRwfqSAMkXyqtOExvLHD3KiRGa2owGkTuJ2Fh6cbGf2h8mI2pcWhliYWLudm61mdRDjA4gR6DJTJTDhW1Za/9Jp+BMTCzrWX1K357qHpF9X2xpTQWJR8yHu2smqzJpRBQMpuKmLRLwSLCLTc/G9ddvuQwD+GUHbK32L+tYvHVG4W6/oL4KLPNBAePpgHDwtV5lgun0eY3Q2ZIpbT6qPbCkKkM8qgTeex9kPe6gwv+ANxe34d97UsyR83a8uwOCzUNdHVAsEDrUQtRbzX9LEd/WKG5Mg28o9WM+YeGgkYjGadfuF5wlA== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=UfrzrkvqJMimVhviOCxYjLpT7PuvgpWloGEobVx2QUY=; b=ox1/Te17YoJnUB5vO9x3deQ6Uh9yJt1itEvrg8QnQPSdS+bTSae+EVVRSSjhjmNQy4aFO7OoP/p8a1enrNd+BjzGYC/N52VcZMnaRBHQa6YRrDi91+PPOw7qO9nJl1hPb3EdpFAhRBWn6wnaFr3WhplXK7iOs+gvF4pVmWPViIFR67mRQILZYUh5fCxf8PzCqVsjkGoudckr6JKumzzAJV5E0CGdX7JB15Vl/WGrmL0X22Ua+gDtEXLKnJOkYqvl8J+KcxeFX8Y16lrtVaQlczRbFGzqQwO8Ezn5c0+9qE+6KHlei+ubLmx6gutbeSz11KXZugK1iNWLlXBjgWN1Sw== Authentication-Results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=suse.com; Received: from VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) by DBBPR04MB7772.eurprd04.prod.outlook.com (2603:10a6:10:1e3::8) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6500.36; Mon, 19 Jun 2023 07:20:12 +0000 Received: from VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c]) by VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c%5]) with mapi id 15.20.6500.031; Mon, 19 Jun 2023 07:20:12 +0000 Message-ID: <1643ea25-70ea-b9f4-df17-84607e82f5fb@suse.com> Date: Mon, 19 Jun 2023 09:20:13 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Subject: Re: [PATCH 4/4] x86: provide a 128-bit VBROADCASTSD pseudo Content-Language: en-US To: "H.J. Lu" Cc: Binutils References: <08bf9dc9-5616-7dce-a094-d2ea799c92bf@suse.com> From: Jan Beulich In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-ClientProxiedBy: FR2P281CA0029.DEUP281.PROD.OUTLOOK.COM (2603:10a6:d10:14::16) To VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: VE1PR04MB6560:EE_|DBBPR04MB7772:EE_ X-MS-Office365-Filtering-Correlation-Id: 1e44bb26-53e9-43e7-e6f0-08db70959f08 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: bozvWytwDneuUAgtm0Xr7P3K/jrXdy79GeB9P4ej7861OOnWwFt7yEKVfJCp/g+73Fq1jukn4sRWcDCKZsVEk6I8mp4e6l0Gi7oqSW1NzPIacINDF5n+DivzC1ip5meWfak7ow6VHtrguYFO3MM/e5x2a3GwlEKoxvMnJhvDfISSRpRxErO4X5m1dYhiLzpYQUqQu9BC+Gxdl19WR2IpLYt4trdUHwSi/bh+9OCZKTma0pWQqLcLRyzmC8cWQVu199A8+4VSSFAiNVw8QmbaK0bM1pDgJlL3cwCYDd6Xl3vXX/UiZfYRW9bHVPT2YhSnn1W0dxTyB95VeraYPTyanBG8SXrmCg21WNE7RuEsMrrgHJyF/QaPmFFH6vGi6RuZs2mbBfBfipTPh5p1p/9+afJlSfgNSS374X9EYLyJ1gys6L6e86kM6RwOJ3oiMfD3cGrY11Z3YPbmfWSxMq0gVE7bcNrq1PIPSETo4XsbK1bC7un9GXIjWfkgJqKSlAaU2LEI7vYRdNBc69SPak5p8QkrgiwxXJ0sKv2TK6WvccWKM8RBUYC9Bu/JkXRzaDSkqeVj+GQG9Q/+9/v5nH2azSpmgVOIyie6MDj1sEn58lPH0S9OgEnG5cgfAZID5RzAb6yl4xLWw56lEErB4WF6mg== X-Forefront-Antispam-Report: CIP:255.255.255.255;CTRY:;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:VE1PR04MB6560.eurprd04.prod.outlook.com;PTR:;CAT:NONE;SFS:(13230028)(136003)(346002)(376002)(366004)(39860400002)(396003)(451199021)(31696002)(86362001)(41300700001)(8676002)(8936002)(5660300002)(66556008)(66476007)(66946007)(6916009)(316002)(31686004)(2616005)(38100700002)(83380400001)(186003)(36756003)(26005)(6512007)(6486002)(6506007)(53546011)(4326008)(478600001)(2906002)(45980500001)(43740500002);DIR:OUT;SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?B?MDIxNEt1M3oxOS9iTGNzSXBkb3h2TEFpQUdrUDZuUDhyeHJTUmQzbXUvS1VJ?= =?utf-8?B?cVJ0WDhsZ2V3ZS8rWmFKZDBrZWEvZ05NQTF5dGNIVllLSlFqMmNSKzlXM3J6?= =?utf-8?B?TXpQRDFSQVF5UTdMc3FaSlpydlYydFBPdmEzN3F0dnJXeWJNczVHTFB0eEY3?= =?utf-8?B?WERMa3VWaWZIQTZBYS9MUGlJNC9lVHUxZE1JYmJnam94TmIvMDZPcEpUY0Iz?= =?utf-8?B?WHVWSFhGNFZJYUp2aTlRNUFsTjhwSzR3VGJZa1NNSy9FejVuelhSNmdWT1Ns?= =?utf-8?B?Ni9SY3pyZHBvZ0FzR2JwRlNZSzgvTGZOUVJzRzhiSXBnM0s3ZGtoWWFJT3RV?= =?utf-8?B?TUExc0lpR3BoYnlSU3ZwNDhwUVcrU2Vzck94RjNkZjRVUndSa0Z3WWFWNFdY?= =?utf-8?B?TlYwTldDSFRPNU9tM1hjazlJNENzQ3R2aFl6M1VXU05BUis4ZjBpZjJMdHdr?= =?utf-8?B?dXdXMVJoL1IzZTRId0U4dy9hODU5cXNCWS9XRkJldmtHK3RxSVdmL3VmcW5P?= =?utf-8?B?Mld5SkM4YUVFUVVNTEF0VkxMQmFQbjZqNWhYWWlCaGkrcnFYMG93dDIwYnFM?= =?utf-8?B?VGE1aDN3U1dyTUdJVjFISGRzaTRzOWRGZDNOaEVMNXBWa3NXVWpPb1hsT0hr?= =?utf-8?B?aDRxeCtxTXVmb2dnVnVCUnlWOVdocXNNNHAzdXIyZTFiTGQ2VDZGdmhBc3R4?= =?utf-8?B?UFBaRGQwbXA0VS9hNUtOSGowUVNyNmpyV3FWWEExZk1lY042eFVkdXpWZmNX?= =?utf-8?B?YUlVRUNDRllxZVNPd1VJejh6dFZTSFJoOG1HckRtZnB1S2h4czBVWUJlWTI4?= =?utf-8?B?Vjd4UU1ualFPVWpTVXRvVDZDb0E1MWFMZHBuOGY5NkVhd215Zk96QjVEMnA5?= =?utf-8?B?ZEVwcmwvTG1RZVA1TWs5bHlUaVNLeXZjcXoyN3AxOUhuTGE2T2FaUHFoR2JX?= =?utf-8?B?aDFnOXUxOS9wMHh3ZXhhOWlPUWhmNjBjYU5WQW5RNkFaR3g3Y1VMVWJkdkVP?= =?utf-8?B?WEkxK2NaTGhtUzR4UVRlQnJyTlVOU21YRXVxZldyZ1oxbXpuMmQ2Qis1SVVK?= =?utf-8?B?QVFjZ0cyajJHUHhGMTAxZ2MxVVRsVEt1R3NtS0tNVHVLbnA2Q2lOTGZsN0I1?= =?utf-8?B?RDdzaHQ4MFpqeVk4NUxEbHc3OGUrckZwNGNZZ0l6Rnp2a1BkOVhEOFBkRlRC?= =?utf-8?B?QnhHU3VDMnFwT0p2TWhEMFhIQVRPalMzdk1LL01YeFBwdlB6cmtFckdoMmF5?= =?utf-8?B?cEluUC9oYUpSMlBkYWFCcFNuMytaQVFGQTI3MEM4TzhNb2w0RHNUQnp1a2E3?= =?utf-8?B?N1BYMlB4ZEZ3cTAxOUc3WG5LTzZXSk1Na3paL0FxZG1NU3VTTHN3ZmwwVndw?= =?utf-8?B?ZWJDdVFMYXhZa05YSnFSM3ducDdtc3dWQnh5THB1WVNzUG9tS3JacytJOGFM?= =?utf-8?B?a0ppQW4xYmI1WkwvdjhSWitCVktpWndwY0JQakpNTzJhN1dpanUvQ1B5ZHR1?= =?utf-8?B?RndtVjNBRWFrMW9RQWpkb3pJZ1NSTFBHK3A0NFE2eXIzbTNnczEwa0g0bG9D?= =?utf-8?B?TWdIYXpxRjl0TWdHYmthWWVFNldneDBTd2RRWjFEN2srYTlsTGEyOStJTlNt?= =?utf-8?B?U0w5R2tWVEdlaEZhTWJ3Y3p0TDlZVjk4RG5aa0loRVZ0Q0lzcmdnc3lQNExx?= =?utf-8?B?d0JNQWhMSkRiaXU2R0RtTGlkWkNEaE96eC9uaElrNUttd1doVTRIRnlKeiti?= =?utf-8?B?NzlKL01SbEZBNUoxZ3YzT0NYck5maUI5d0h4UXI3RW9HenZieFRIMFdhYmd6?= =?utf-8?B?Ky8rU0FJQ21XVUptd3R5RitObTNPTExLN0VCRWpSN0hjcWx6TVB4QXhhZktj?= =?utf-8?B?WjZvd2lQbTVHQ1hWN2FaV2ZBNnN0U2xPeGlWUmNuTFdOYUZGRTh2UnJDbWt4?= =?utf-8?B?dnNTTWU5QnBLRysxU2NaMkFmRmlIZm5kZ00wSGFKS1o2a3YyR0FNaWhjNUZw?= =?utf-8?B?b3hTME4rWG1VdWxBTTUxV28reGo1QXRINXBKNW95bGJIUVN4VUtLd0J6QTh4?= =?utf-8?B?SFVQZzBtYnZTeFZ0Y2Y3RjhTam5QelJSTjNvVFoxVGRSUXBvcHRPN3k1SVdE?= =?utf-8?Q?xCcj61MAOxSt3dUsZMgTJ7uIO?= X-OriginatorOrg: suse.com X-MS-Exchange-CrossTenant-Network-Message-Id: 1e44bb26-53e9-43e7-e6f0-08db70959f08 X-MS-Exchange-CrossTenant-AuthSource: VE1PR04MB6560.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 19 Jun 2023 07:20:12.4869 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: rTYgj5bTPmOONjh29DSrDLRxrOyrkOZML/49wkU38KIdfdPzMO9PpmEOCqhl4TcYQvTfxxMOvJ4kB3Aw4Ndtkw== X-MS-Exchange-Transport-CrossTenantHeadersStamped: DBBPR04MB7772 X-Spam-Status: No, score=-3027.6 required=5.0 tests=BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_NONE,RCVD_IN_MSPIKE_H2,SPF_HELO_PASS,SPF_PASS,TXREP,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org List-Id: On 16.06.2023 18:59, H.J. Lu wrote: > On Fri, Jun 16, 2023 at 12:32 AM Jan Beulich wrote: >> VBROADCASTSD not supporting 128-bit destinations in any of their AVX, >> AVX2, or AVX512F incarnations is presumably because of VMOVDDUP >> precisely supporting this very operation. (It is therefore different >> from e.g. VPBROADCASTQ, which has no exact equivalent.) Still its >> absence has led to people using VPBROADCASTQ as substitution; this could >> have been avoided if such a pseudo had been supported from the very >> beginning. >> >> Note that the pseudos try to match what the real instructions would have >> used as closely as possible, i.e. VexW0 instead of VexWIG for the AVX >> and AVX2 forms as well as AVX2 in the first place for the register >> source form. >> --- >> For being the first example of us supplying such, this is partly RFC. On >> top of that a question is also whether to indeed have split AVX/AVX2 >> templates, when in principle one (allowing for both memory and register >> source) could do. >> > > I don't think assembler should invent such instructions. May I ask about the "why" behind this? If such a pseudo had been there from the beginning, an admittedly minor mistake like that corrected by gcc commit a4df0ce78d6f likely wouldn't have been made, because no special casing of V2DFmode would have been necessary in the first place. Jan