From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from EUR01-HE1-obe.outbound.protection.outlook.com (mail-he1eur01on2052.outbound.protection.outlook.com [40.107.13.52]) by sourceware.org (Postfix) with ESMTPS id 722E13858C5E for ; Wed, 14 Jun 2023 05:54:42 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 722E13858C5E Authentication-Results: sourceware.org; dmarc=pass (p=quarantine dis=none) header.from=suse.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=suse.com ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=VXrsWHnYbI2EWQwYWD4ZOhu2CyT0XTey6ahgbcGmO1GlJybS5+PSRBB8MVT0VT074G/SwV7R7qkCPF/heJ8ZaZEsY4d35KMvIIHaWL2QK4t1VjSIf9TcUHM7D5ZvcNF+YwF0h/clC2liJxqpE925K5tb9jgP7FEc5WFLsc3OiqTpa+hPfneWc0Kb4o76yw0NDCjis454OXWyD8us3XVV5X+PjWWcAWlXqwJLQz8rsHJeZdy+G5ZmVuTHYymz1FHp3qrW9BVHV7mxIY5T+fESdBUZi0MO+q/RN0Z9ANrjfSSbibnormKGwna72SKy+4nrovOr7fQPnm9+HxiDIBucFw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=J7dubu0rGxxD4USD4XGsASTPRYHXhWYgY3zE1YwsIa4=; b=hoOyU8Fe8v17VNjjJs3DH2bkEbr/gFQRpI7d2xitEsT5zkRv1shdBt5/RszyMTDUoE4ovFhImzCD7Jsf5BAv2P2tZXAi4LUY/Q6xMChRYe0CiUuPXaccy4krgpM56/TON3yJFTkXReDnZ1vhHBPBcM4MNTja/1QX5H6cAlLzbEPFMDJUeO+752oAJ/tOxTyUL3OCwUioKKM6QFR2FDKG2grrzbtKxql62277nYHIjozMPxF3eg4WejxCdgcPG7nnSVFHhjE6vKFAILKW1OMWonAnewXreGQbVkTpSIx85PUqTbPANujsQOKYPjIcwQtfd2QfQPY7WhUo7K79+SZ/RQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=J7dubu0rGxxD4USD4XGsASTPRYHXhWYgY3zE1YwsIa4=; b=yNCshgOPdcjK5ucsNbOs/IOO06xDbb6hIscqSeil9bc6YggpuV6/U1o2sRfPtaO32oQPwff94eB417RL8y3G7MyTygW2+PLLLL6vX/ZBs+QdJgTyjW9BbdQuJzSFNeGHlT75CJPWmblxKFA2hHKS83JfuFn4SlcsTGbxYVz+noW7aayP5M1h6Mg7VC7H0dbitK5kSm2VuqVpFfE4ik4BQluH8K97yLw42F/gqb3COF4Bcy1uQhi02WG8IDs/HWePeruB2N7HGtmM6QjVWpX+XVlWIvKZ0kXu5ZmQOrYM8PSp70tsG6/SYbS3SfNCX2Tz0cOA8zmuaNN90mBrUEV0OQ== Authentication-Results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=suse.com; Received: from VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) by GVXPR04MB9902.eurprd04.prod.outlook.com (2603:10a6:150:116::19) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6455.44; Wed, 14 Jun 2023 05:54:38 +0000 Received: from VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c]) by VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c%5]) with mapi id 15.20.6455.039; Wed, 14 Jun 2023 05:54:37 +0000 Message-ID: <1901e956-dc34-cc03-0419-8d4338174384@suse.com> Date: Wed, 14 Jun 2023 07:54:35 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Content-Language: en-US To: "gcc-patches@gcc.gnu.org" Cc: Kirill Yukhin , Hongtao Liu From: Jan Beulich Subject: [PATCH] x86/AVX512: use VMOVDDUP for broadcast to V2DF Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-ClientProxiedBy: FR0P281CA0180.DEUP281.PROD.OUTLOOK.COM (2603:10a6:d10:b4::10) To VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: VE1PR04MB6560:EE_|GVXPR04MB9902:EE_ X-MS-Office365-Filtering-Correlation-Id: 99176179-08b0-4567-17a5-08db6c9bd621 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: zxXTz73L0bjBzpWcOzr+vgHT8zGlC5g0kdiZMrNu3oIchW1iahGAHN/pwQVNXuCfxJXK57B/xelW40Ve/jLuOvm1IPvpXjTrvTUqbgupZ+y0BqalN4zBxRGOJieOa9kN8klVXzhaPnXGHFsNqWq3v+pjWc9ScqiToaBnnJYzbqqOqb1WnZwAVqePKI1OyyU+BFL7bzHdD2ewHRW6pun0c9OJ5yd/wwRdZ2xhnLm4nvjB+Rpv+ISH6fqqPQAcVQ1yzHLt1oj4WZi9FzVvCFNBE1QI/1Vu3wgLbZKFwfS+ryOHSFf+nc8IL9bxDC/Tkx3wJoG6RZMlqJkvCsFR7Y6AwJ0H+lGrM3XINeOKG57jGcHc38Xf5VdXSZnkxlUUiTW+Et7LT2g3B+lwrEsoYk7Fi2vlSHqA0AVwuTvqeqP9XViZmtW0goNA2O1f1UoNy0g8jkgh+jfbuhRTFyijy0odNcxq8e1+cmHDOMcU/iRIl5EBu8cSLTzeK0hmlB/+1ON063qiOEeiKQW3Ix3ZuWF/e9vyxwz6vrmg+bW4r+ziBTEiw3g3KwYitB+O5r3yvF9MkYKtPmiR7Bylbf2frfVL+oNRbWHuxDLS5qrvExc3FQESl7h+oRXZy47ZasTFLGtJ5bg+yNhtrwAgoy03NvO5hg== X-Forefront-Antispam-Report: CIP:255.255.255.255;CTRY:;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:VE1PR04MB6560.eurprd04.prod.outlook.com;PTR:;CAT:NONE;SFS:(13230028)(366004)(136003)(396003)(346002)(39860400002)(376002)(451199021)(4326008)(6916009)(66476007)(316002)(31686004)(41300700001)(186003)(4744005)(54906003)(2906002)(5660300002)(66556008)(478600001)(8676002)(8936002)(66946007)(6486002)(6506007)(6512007)(26005)(36756003)(86362001)(2616005)(38100700002)(31696002)(45980500001)(43740500002);DIR:OUT;SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?B?MVI2YUhPa1BsK1VqZWM0VC9KNFIxQ1ZuZnh6VjJ1ZjR1MWFGcWl6QVRoV2FJ?= =?utf-8?B?VGxNNGxxNDllVzB1VTIyMElkOXB5WDU0Y0kwSUhlZkpsWWhKZWdzdmdXV21B?= =?utf-8?B?Nnl4MXFKOEhRZkU2Y0I2ZUJJT0NXOG9adjJYSEp6Vk1oNDRiSXJPRGZLNFZ5?= =?utf-8?B?b25XS1FZVGQ3TlkxM1FHZmlQa3hoU1FPb0QyS1ZiV2U5bHQzSDlNMytnWWpC?= =?utf-8?B?bUIvdHk3T1h3OERJSHFUMjc0d0hkSVZOL0ZjTytRN3pmNUhQSUxvdllkd1BY?= =?utf-8?B?TXdrbjNLWXJ0d1BDL2ZSZ2Vhb05kNVRJeUM1bXFERndtS1loZm1zL09PMGZQ?= =?utf-8?B?ZkFKUkRzV3k4bkppbnQrclpXb2kwSVVYZEpNWGV0VlVBc2lrbW9MME1NSjBR?= =?utf-8?B?WVA4aDVzRm5aNktBQTZlaFpUcWdqZ1VhQmE4ZzFjb2pCN3JEcnVFOXY5SzdC?= =?utf-8?B?cXFlR2JhUFB2UGRnakp5TlBYcTlHdTZCcnhTTUNWbnkvek5VZnNSeTZFamFI?= =?utf-8?B?OGs0Y2ZSV1prQkRXVnQ4THhzSmZNTy9uL2tHeEE2eDhzbUsva2pOZVBVWWs4?= =?utf-8?B?c2FBamtWQm9CSjd4eW82RVlac0hqVGlvQWFNS0FiRm1iREZQa0xYRmRMdWZo?= =?utf-8?B?c2VMTlh2RkZnMXdpU2owOTNqVGpIZWJtV1ltdVFrSnpvZWV1UHN5dG9rN0xy?= =?utf-8?B?My9qSVJXY0lycUhJYzVzd3hqS3pKa1JzT1RnT1JlNnl4TVd4Ykdxb0VZZ0Yy?= =?utf-8?B?MjhBOTBFb3BDUzZadjBEWWVtSWhNNkdmeHprK1I2MjQwdFBuOWMwME50NkxO?= =?utf-8?B?K3FyRVNoTUU1UkFFemlHVmw4SFZxWXlFZjQxMGlockZ6ZHNHeDFvNUFjcTU1?= =?utf-8?B?aHZzeWtGZEJIK0l2bGgyU2NjZThWMkVJMHBTOHFNNzE1TnFCUGV4eTl2aG9N?= =?utf-8?B?aGR3SGVod05uaDNnYWlvaTJXak40dGpleEh6SkFpN21ZdXpWM3hyQllQTS9L?= =?utf-8?B?ZGZwUHFDSTZ1RGJkbldQZVZGeHNrWllWQUowaEtzYlhYN3o1UjVTQnpTajhk?= =?utf-8?B?TGs5dk85a3JpREFmSkV4aTVpUXJ2d2VyWENyRnFlMUZpclpiK0hNVDd1eEZy?= =?utf-8?B?aTF5dnViZ0F2OVJWZnBqanVFaWtoODAyR1NOZmZ1UXM4anpqbjJzNWFSTitE?= =?utf-8?B?bWI5dVZ1cmJicTN3NnRsSmp0eWxpTkhrbjVEaGNtUEN1YjFMdjNQdG8rSGJ1?= =?utf-8?B?dEVublRGbStYd0lVMEdjUDNodDhxeXhMaCtEc3RSNEFOd04wdFQvNSs0VFFu?= =?utf-8?B?SGVXRXRwRHpkeUhBTVBCRkZ6NXU1ZkM5RVphbzloeHI0cnVSakVqMjJwYmNx?= =?utf-8?B?UEx2d1RoTzlsdW9WbDFDYVl3cEpNRE9qUlUzTCtBMVczU1RyeWRiY0ZwTVpF?= =?utf-8?B?RC94dFdIVVRRbkVBOS9Md3BXSDhJZ00rSHYrd205c2dLQlFTczlOR0NzV2Vt?= =?utf-8?B?aGY3Z1Q0ZWV1dm94WEZ6eVdBS05IaWVReFVaNUNRSTlpYi9kNjhucU84d2ZE?= =?utf-8?B?dzN0ak1ab0E5ZWZUeU92bDIzMGc4cDA3VVFpRmdQenpsWE9Za2k2TW53YVc5?= =?utf-8?B?MzduTUxsQ0Rma0NySy9Ua0JnamdscGV0VW5ncWlZT3hHbzFxQ1lvYTZrTExY?= =?utf-8?B?ZUxWQ0REblRCNm01NGtNRXNCYk9yeUtLR245SlpkOWltUVFqWHMzU21YTEpz?= =?utf-8?B?Q0VITzRHTWZCZThsdDB4aEU2S2ZuSE5zaWZFRVlTMkxQVjdGKzQ0clZYVjN3?= =?utf-8?B?UHlPT2pIQSttZ2FPSDFVMW9MNUIvcUpTaVRLdFRFY0lpeStSMXFMeFZzRW9H?= =?utf-8?B?Qzk4eUVjZjJhazM2bWJndTJrWDZtOHpLMmd6RW5IZC81VVBFREJ6TWVqOFIy?= =?utf-8?B?N0Rvc0pYVk95SW1PKy9OMFFQTDZsclAzRjJWc2Ira3pseW1DSjFkaXlLWDVY?= =?utf-8?B?L2RjanFxck9odktUNVJWSzdBZ05YUkMvVE5YSkFIOHpBOGxqUWc5N2wzMTIx?= =?utf-8?B?Ull2cENRbG9JV09vRm9ScjVnQ0RENmtLbDc3Q1h5S1M5eVV0Z24rdHJZcEZE?= =?utf-8?Q?Xa6mqE0F7hay7W4hv2u+6u7x7?= X-OriginatorOrg: suse.com X-MS-Exchange-CrossTenant-Network-Message-Id: 99176179-08b0-4567-17a5-08db6c9bd621 X-MS-Exchange-CrossTenant-AuthSource: VE1PR04MB6560.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 14 Jun 2023 05:54:37.3497 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: 5yN3yKgmJrIMP8IApEuwZc+NsIeY4EeEW3e6pG5mO5X7BuG067IZHBHJu/uWwqbGCEfbu5ImvaaZ4qD2jYB3FQ== X-MS-Exchange-Transport-CrossTenantHeadersStamped: GVXPR04MB9902 X-Spam-Status: No, score=-3027.7 required=5.0 tests=BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE,RCVD_IN_MSPIKE_H2,SPF_HELO_PASS,SPF_PASS,TXREP,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org List-Id: Like is already the case for the AVX/AVX2 form, VMOVDDUP - acting on double precision floating values - is more appropriate to use here, and it can also result in shorter insn encodings when source is memory or %xmm0...%xmm7, and no masking is applied (in allowing a 2-byte VEX prefix then instead of a 3-byte one). gcc/ * config/i386/sse.md (_vec_dup): Use vmovddup. --- a/gcc/config/i386/sse.md +++ b/gcc/config/i386/sse.md @@ -25724,9 +25724,9 @@ "TARGET_AVX512F" { /* There is no DF broadcast (in AVX-512*) to 128b register. - Mimic it with integer variant. */ + Mimic it with vmovddup, just like vec_dupv2df does. */ if (mode == V2DFmode) - return "vpbroadcastq\t{%1, %0|%0, %q1}"; + return "vmovddup\t{%1, %0|%0, %q1}"; return "vbroadcast\t{%1, %0|%0, %1}"; }