From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from EUR04-DB3-obe.outbound.protection.outlook.com (mail-eopbgr60070.outbound.protection.outlook.com [40.107.6.70]) by sourceware.org (Postfix) with ESMTPS id 739B83858D39 for ; Mon, 17 Oct 2022 06:39:20 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 739B83858D39 Authentication-Results: sourceware.org; dmarc=pass (p=quarantine dis=none) header.from=suse.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=suse.com ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=hKD2dkmGGOoVr5Eu2nSbScp4hpb6CboQHeOadG0ccf2D3WzkHq7h3S5taZhiCBpZsV5hERfnhUfHuosLoI7vriEbnbbC+KSEF/Pu8y3HH/MJyzOQsD+Mdgk9PNKAV2XIvVNxYkqQ6gNIguNrXUsaBAIb9MmjQIrEdEk9CKuanJYXsRTsE2BwTnPhMaXSKTSbVOF+vry+Lb8W9tPEsf2x5sQxL7ACi86Bs0YV9oI/bAXwW3M1Ivp9cvbOpiIsldUO62BcXXvLjBJyCcmMSHnQrVmtQStmGbnx5oJBSx0a/KGEo2lFPce9IdzDTrGrwo4tEFIFvo0sRo6qWBVAtccVJQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=dqunp1GsZfgxqkY72ydnMQXWysNs/SZRZxseQN3XMcw=; b=l8LomZVrH6HGYaPD94+eul2m5qUQGOwTkVADNJDZTzAaXkXTSBgcJcsLHx5BM9PXLiMLNW73cSr32Fhu0G0XX79Qq/FvA0AmEosAq2ZBv8/AhvcFLVTOGrTxINL4FAGmYze0aCbBp4mE/ygzui6n+jFA9wBiVLHSfFjA7CORgCjejkWUrmOq9APD7j6JIeqZjwjAaAHDjcW4fESE8WyEmkhP17UAUFIM75J8lliRI6V0n6mnrazk45vHn9EEMR4i1xNGYrjrbQ/Q/4YChoZrxODfjMq8pg5Z2lwtL25TWpOBgX7gT33ZMFP6IrOB6xfzqxWO6zuQnK2MJeyjjhKplA== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=dqunp1GsZfgxqkY72ydnMQXWysNs/SZRZxseQN3XMcw=; b=m5KnDwqVfxmGCNb0UjzljzE8Pe5D43D1PTSyJaTmEFSgJe2ARUoL3JMAL+65qw8LbgJ48cFnZVtrn+GHt/x7hegoAYaJI3vCKYZFjvTWeSlmahR4QoM+eXAPHFaah2u3ZV4/qFJRHhbGDY08TZd5cpLLKYl3+juhSteMsfeu9GF39RLjZA60c/OpRrNtBSUB2i9yJSZIkWvCsLeW3yGFuOqEjKsvHSFpfolt1OPMkoU4EY0qPTnA3xi+boYltqj/3tFeG+w0NaAsQLwx+UhHBgiFXknYoZeTngecrx1N2h4JUfyPHEGsie3FJV0MzMfAxCcDeNXvgQVoojNcmVNPwQ== Authentication-Results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=suse.com; Received: from VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) by AM0PR04MB6898.eurprd04.prod.outlook.com (2603:10a6:208:185::11) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.5723.29; Mon, 17 Oct 2022 06:39:17 +0000 Received: from VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::2459:15ae:e6cb:218a]) by VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::2459:15ae:e6cb:218a%7]) with mapi id 15.20.5723.033; Mon, 17 Oct 2022 06:39:17 +0000 Message-ID: <90899ac8-cc4f-52f1-9498-5e7f87fe6355@suse.com> Date: Mon, 17 Oct 2022 08:39:17 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.3.3 Subject: Re: [PATCH] x86: fold AVX512-VNNI disassembler entries with AVX-VNNI ones Content-Language: en-US To: "H.J. Lu" Cc: Binutils , "Jiang, Haochen" , wwwhhhyyy References: <7bac66be-535e-9051-d674-f2f5ba180e17@suse.com> From: Jan Beulich In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-ClientProxiedBy: AM5PR1001CA0059.EURPRD10.PROD.OUTLOOK.COM (2603:10a6:206:15::36) To VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: VE1PR04MB6560:EE_|AM0PR04MB6898:EE_ X-MS-Office365-Filtering-Correlation-Id: be8b5dd2-267f-4d62-195c-08dab00a50a8 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: tkhAjPS6YQm0vZuz2/x4nfCbdf3bdsB+5vIhNnH+CTsH99W+VWiu7bsM3pHgPOwHJyA3iii18SH4l9rEsaDHw/XM6lYs/a+QnlBR+nO3TLvo4W8xVxsxg26GvO5n3Augx7El4i7JBqGbrFVRfN5Uz/Ad3XEjOJi/lha+J4Tc+Nw8EUqGJVMSrZ2f1XRXqOU/TNd3FYzyucIh3DWg+BApYM2ZVDW9KQpwGgWlBVMaeOaCDi+s48rVqYlVLGfrUihf7pItVgfkvqRlQ1YCKyckIYHp8a2aFQSQF4NOO8A3mn8U3XsjHTEm+GpHFolwk90gTZMkPeYaf1F29fpiQSmAj8E31B0o0fsNQmaG4AI0QliXm0V1ot8FnPN3nrW0jTr0MYNRo69BpnzFOZvz8IkBGj1OwjK5Pd53sZWpn+Z7pfsKuNuix0Sxxew0EGJ3K68LcieCN5MhZoaP+eX14ljGGJ5kn0SqeyWf0wS2guzCbCU7aTXnYLNMWB2Ogpec8UbAurSyscXxWHD+B9jroJQpWuvBcIbnEbFsN4U0W1S5Ng+W7O1GIbJ53QDlbIbvLJWUu+rdU0u8Xl7YftQa2wb2Ztwl+27L5o7Yc6DEdhsqna5EVE5be8u13ieNPJ9T8glRarGJD/4IsZGwwHtV3ujNcY2k2TGq7ioT4AJqF0bTkaacD0g+fQOS0q3GeqLfqXiPkjL8QScPDop/TUuPUsfGCNnnKwirVo+MjTng0Gu8nP29eDJXIIuvBogGb2igbaHaRjxJCI7JDQLqOBNYT8bEO9RoMc5D2+3zYqNPKk9MlNc= X-Forefront-Antispam-Report: CIP:255.255.255.255;CTRY:;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:VE1PR04MB6560.eurprd04.prod.outlook.com;PTR:;CAT:NONE;SFS:(13230022)(346002)(136003)(396003)(39860400002)(376002)(366004)(451199015)(2616005)(186003)(54906003)(31696002)(86362001)(38100700002)(5660300002)(2906002)(41300700001)(8936002)(4326008)(8676002)(478600001)(6512007)(26005)(6506007)(53546011)(6486002)(316002)(66476007)(66556008)(66946007)(6916009)(36756003)(31686004)(43740500002)(45980500001);DIR:OUT;SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?B?aW1rbGttTyt2TUdqL2d4Nzc5MERITkY4eHpoeWhYUlRsMDFCMGlnbmdBWVlZ?= =?utf-8?B?Y0R5TFE1SmQ1SEZsejFTMDRoV0l1aFdJNDc2TW8rQ0JIb1RpWS9xc1FvcS90?= =?utf-8?B?R0NSTDBzSW9DM0JUQWhZSlNueENPUTZDRzNVMlBMV2xObldPcXhKT0E1MzRN?= =?utf-8?B?M2ZJdE1INnRvMjlvUW1mZXFNTElzemVLRWU5QVlOdjA4eUpwUUE2bVdYTG1z?= =?utf-8?B?c0E5ZDR6N0dtdXNObFNLdnJMUkJ3YjkyUS84emZPYUY5ZkdpWTdwdHBQUHBu?= =?utf-8?B?ckJQMHdaTVlwc1B4TnNRbWZveGx2MytZTkRQUENNZzBybjk5VVdweVMyYjVJ?= =?utf-8?B?K241QTRhWlo1a2pyK2JCZFhyT3A4U1lVTk55cit0dlRQejZjWUVSemxYWEJQ?= =?utf-8?B?Yk5uVS9BWFRZOE9PU0RZSUVYRlhBZUU3dUV1N2tLTnZnNjFCRHljSyt2N2cz?= =?utf-8?B?djR5NzBROWRJTWpja25GelE0R2NUSTJXc0N1NGRrUnExdWVjQ2hWblMzL2pt?= =?utf-8?B?SXYxeVltRlJWUmVBeE15Y1NpTlZ3UFcyWXpGZDY0NW5wZTlNekNkLzlHbklR?= =?utf-8?B?dlIvVDlmZXkrbnczTHF0a21MUVhhNmppK2k2YXZjSTFUdU1mdFE0enZiYUU1?= =?utf-8?B?OHFMaGdVNzBBRGoyeVdLL01UZjhJT3BlS0hlMVFsZ0MyYUlwK2xpQ1JOUWtE?= =?utf-8?B?WjZwZjRGMjhnTWM3dWtKSnNkUTlJV2FJdUl1VGhwcllaTlZCclNnbmMrRXNU?= =?utf-8?B?ZHN0ZmphZllaUUFodDBDNytIbjRSUExldlZrKzBTeEJjL3c5Sk1FUXlzNzYz?= =?utf-8?B?QlZuTlRBS2tBS2VucmhWT1UrUmNDVUc4YzVwSGthcWdMcjM2RmZGZitWekdK?= =?utf-8?B?b2lJZW1Oemx1RmtyQUJLSmg4YzBtWGVMWW03TjFNYW8yYmV5cmMvbEdtaks2?= =?utf-8?B?c0p1Q1VCZHNpRGw3dm9QY2hiUnN1d0hOOENCRVRlYmJNckVVc3hxTmtGZWls?= =?utf-8?B?QWRyZjg4VXpKbVIyK2x2Vi9IK2NCVTA3bDB6dWVTZFNWbGN5VThjMGJxK0lr?= =?utf-8?B?SFh3YzlkWDlBdjNTbW1Xd0FhRmlRMjhpZFdHNkxSK1l4L2tnQnpvb2JPbkR5?= =?utf-8?B?NVdGY0ltTHV3V2MrWFBtREdMZkZsMHhBTm9MOXJqOSsyWVVlQ2k5MkRiUTVZ?= =?utf-8?B?RkVrdTZZbkNzWDRHRnQydU5WRitzeWRveDRhQTh6dXJYU0tnZ0ltdEdJVW1K?= =?utf-8?B?dlBFRUZ3bWZkdFNLQm5vajErRlNtV09iMFdIY0xwSzB2MVY5dm9ONXF5UEZQ?= =?utf-8?B?cE5PYkhqSklYME9wSmsxZ2wrNWF1TnYvTzNETmlOS2owOGZlWmRSemZGV0RZ?= =?utf-8?B?MFZ2dFJWVlBhMVRNTTZTYXNLeWd6WHl6N3BVNGhHWmphUTNEYlZWNGxoV2xm?= =?utf-8?B?SFJSVk5TZklzZW5EL1JyczdRVWM3aW5TbmpYMnR6VTJGeGdZNUpkS1FwMFlX?= =?utf-8?B?cndiLzN3TVVYenFrOGd6UWZEbk1wQXVjNVViT2Z3Y0NKVkU4NGFpcGcvN2Ft?= =?utf-8?B?TTh2ampYUkNyckRmU0M0WW8vcGc2NkdyVkFiT2hwRE9SSlRYZ013TjNvNHp5?= =?utf-8?B?NWlVVGQzVkFyREhyQUNDYk5ENVlaMno0SVB6SWlZbVhZeDVmM05lZ1hvMGgr?= =?utf-8?B?a3h5L3piTE5IcGQxQlF0blZCZ1hRc2kwR0ZjMU8zWTI3ek1Tc2pJWW5CaDBR?= =?utf-8?B?dVJoQ2pPZU1nVGhINHRDNkxiNG5RMmZuWmdPNXNxblJoeWJSR29YR1djYkhP?= =?utf-8?B?Q1lyKzlLMjVyVG9hcFI5OW9rd0d4cTlxR0txTTJTRThFVTNVL0lJUmJnV1hT?= =?utf-8?B?LzlTdVBhYTkydUpSb3kySjlSWlhhY0NNWUliVEg3UjAyZy9keU9iMVJpekJk?= =?utf-8?B?Z25UTnBpMjNvWWpjN0wvamtObFRDUEMyQ0RkYXNoQ3pKMmVMRjNMREZBbmlM?= =?utf-8?B?bXNwaXRjeVJzbStlYWZzOE5naSt2WHFIUWcrRnM5bG93b3VSdVpPN0xsZlBI?= =?utf-8?B?bmo0S3BZeTcwOXpObm9zWHNDUTBFbnYyT3N1d0RqUzBaTFJIZisyRzF3bVJP?= =?utf-8?Q?DUkpvycsd+S5CL64qBfESF4nx?= X-OriginatorOrg: suse.com X-MS-Exchange-CrossTenant-Network-Message-Id: be8b5dd2-267f-4d62-195c-08dab00a50a8 X-MS-Exchange-CrossTenant-AuthSource: VE1PR04MB6560.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 17 Oct 2022 06:39:17.6639 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: fDDfK+KKC6JtBPL3vn74gfDDBzpp/uXU029JK6AeMhXmXB3JOrWvj0S5rYLtJAq1WqvBA4TNuJDX+Nf61e2lCg== X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM0PR04MB6898 X-Spam-Status: No, score=-3029.8 required=5.0 tests=BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_NONE,RCVD_IN_MSPIKE_H2,SPF_HELO_PASS,SPF_PASS,TXREP autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org List-Id: On 14.10.2022 19:28, H.J. Lu wrote: > On Fri, Oct 14, 2022 at 3:22 AM Jan Beulich wrote: >> >> Make %XV also print the separating blank in the VEX case, while making >> it do nothing for EVEX-encoded insns. This way the AVX-VNNI entries >> can be re-used for AVX512-VNNI, at the same time fixing the lack of >> EVEX.W decoding. >> >> For the AVX-VNNI ones further make sure only VEX.66 forms are actually >> decoded. >> --- >> Irrespective of this change I continue to disagree with the arbitrary >> printing of "{vex}" for the AVX-VNNI insns: If that's meant for >> disambiguation purposes, then EVEX-encoded insns not using EVEX-specific >> functionality by having VEX counterparts (vaddps %xmm0, %xmm0, %xmm0) >> should also be prefixed by "{evex}". > > This is done to match the assembler. There are 3 kinds of VNNI processors: > > 1. AVX512-VNNI only. > 2. AVX-VNNI only. > 3. AVX512-VNNI and AVX-VNNI. > > Since AVX512-VNNI came out first, all VNNI instructions without a prefix > are encoded as AVX512-VNNI. The existing VNNI instructions without > a prefix, generated by compiler or hand written, are encoded with EVEX. > If one needs VNNI with VEX encoding, the {vex} prefix should be used. ... if, as said, AVX512 wasn't turned off altogether. With your model, just look at how odd code using both AVX-VNNI and AVX-VNNI-INT8 then looks: vpdpbssd %ymm0, %ymm5, %ymm6 vpdpbsud %ymm1, %ymm5, %ymm6 {vex} vpdpbusd %ymm2, %ymm5, %ymm6 vpdpbuud %ymm3, %ymm5, %ymm6 Yes, one could further clutter this and add {vex} to every line. But why would anyone want to clutter their code? Plus the same argument then applies to AVX512VL: This came out later than AVX, and the assembler (necessarily) requires {evex} to actually encode these (when there are AVX equivalents). Hence to match the assembler, the disassembler then also ought to emit {evex} for this subset of encodings. Jan > This applies to any AVX extensions which come after EVEX ones, including > AVX-IFMA.