From mboxrd@z Thu Jan 1 00:00:00 1970
Date: Tue, 25 Oct 2022 08:34:10 +0200
Subject: Re: [PATCH 02/10] Support Intel AVX-VNNI-INT8
To: Haochen Jiang
Cc: hjl.tools@gmail.com, "Cui,Lili", binutils@sourceware.org
References: <20221019151534.45521-1-haochen.jiang@intel.com> <20221019151534.45521-3-haochen.jiang@intel.com>
From: Jan Beulich
In-Reply-To: <20221019151534.45521-3-haochen.jiang@intel.com>
Content-Type: text/plain; charset=UTF-8

On 19.10.2022 17:15, Haochen Jiang wrote:
> --- a/opcodes/i386-opc.tbl
> +++ b/opcodes/i386-opc.tbl
> @@ -3270,3 +3270,14 @@ vpmadd52huq, 0x66B5, None, CpuAVX_IFMA, Modrm|Vex|PseudoVexPrefix|Space0F38|VexV
>  vpmadd52luq, 0x66B4, None, CpuAVX_IFMA, Modrm|Vex|PseudoVexPrefix|Space0F38|VexVVVV|VexW1|CheckRegSize|No_bSuf|No_wSuf|No_lSuf|No_sSuf|No_qSuf|No_ldSuf, { RegXMM|RegYMM|Unspecified|BaseIndex, RegXMM|RegYMM, RegXMM|RegYMM }
>
>  // AVX_IFMA instructions end.
> +
> +// AVX_VNNI_INT8 instructions.
> +
> +vpdpbuud, 0x50, None, CpuAVX_VNNI_INT8, Modrm|Vex|Space0F38|VexVVVV|VexW0|CheckRegSize|No_bSuf|No_wSuf|No_lSuf|No_sSuf|No_qSuf|No_ldSuf, { RegXMM|RegYMM|Unspecified|BaseIndex, RegXMM|RegYMM, RegXMM|RegYMM }
> +vpdpbuuds, 0x51, None, CpuAVX_VNNI_INT8, Modrm|Vex|Space0F38|VexVVVV|VexW0|CheckRegSize|No_bSuf|No_wSuf|No_lSuf|No_sSuf|No_qSuf|No_ldSuf, { RegXMM|RegYMM|Unspecified|BaseIndex, RegXMM|RegYMM, RegXMM|RegYMM }
> +vpdpbssd, 0xf250, None, CpuAVX_VNNI_INT8, Modrm|Vex|Space0F38|VexVVVV|VexW0|CheckRegSize|No_bSuf|No_wSuf|No_lSuf|No_sSuf|No_qSuf|No_ldSuf, { RegXMM|RegYMM|Unspecified|BaseIndex, RegXMM|RegYMM, RegXMM|RegYMM }
> +vpdpbssds, 0xf251, None, CpuAVX_VNNI_INT8, Modrm|Vex|Space0F38|VexVVVV|VexW0|CheckRegSize|No_bSuf|No_wSuf|No_lSuf|No_sSuf|No_qSuf|No_ldSuf, { RegXMM|RegYMM|Unspecified|BaseIndex, RegXMM|RegYMM, RegXMM|RegYMM }
> +vpdpbsud, 0xf350, None, CpuAVX_VNNI_INT8, Modrm|Vex|Space0F38|VexVVVV|VexW0|CheckRegSize|No_bSuf|No_wSuf|No_lSuf|No_sSuf|No_qSuf|No_ldSuf, { RegXMM|RegYMM|Unspecified|BaseIndex, RegXMM|RegYMM, RegXMM|RegYMM }
> +vpdpbsuds, 0xf351, None, CpuAVX_VNNI_INT8, Modrm|Vex|Space0F38|VexVVVV|VexW0|CheckRegSize|No_bSuf|No_wSuf|No_lSuf|No_sSuf|No_qSuf|No_ldSuf, { RegXMM|RegYMM|Unspecified|BaseIndex, RegXMM|RegYMM, RegXMM|RegYMM }
> +
> +// AVX_VNNI_INT8 instructions end.

Like for patch 1 as a minor remark: Perhaps better insert right after the
AVX-VNNI templates, to have related things close together?

Jan