From mboxrd@z Thu Jan 1 00:00:00 1970
Date: Wed, 30 Aug 2023 18:16:49 +0200
Subject: Re: [PATCH 5/5] x86: support AVX10.1 vector size restrictions
To: "H.J. Lu"
Cc: Binutils, "Jiang, Haochen"
References: <6f819651-36c0-1c69-8224-fe21f0f96a3f@suse.com> <990c83c3-0776-efdd-e162-5c367f4ebdc2@suse.com> <1b1b6e37-9484-95d8-d63d-c586a064729f@suse.com>
From: Jan Beulich

On 30.08.2023 17:25, H.J. Lu wrote:
> On Wed, Aug 30, 2023 at 12:57 AM Jan Beulich wrote:
>>
>> On 29.08.2023 18:26, H.J. Lu wrote:
>>> On Fri, Aug 25, 2023 at 5:48 AM Jan Beulich wrote:
>>>> @@ -1673,6 +1680,12 @@ an unconditional jump to the target.
>>>>
>>>> Note that the sub-architecture specifiers (starting with a dot) can be prefixed
>>>> with @code{no} to revoke the respective (and any dependent) functionality.
>>>> +Note further that @samp{.avx10.} can be suffixed with a vector length
>>>> +restriction (@samp{/256} or @samp{/128}, with @samp{/512} simply restoring the
>>>> +default). Despite these otherwise being "enabling" specifiers, using these
>>>> +suffixes will disable all insns with wider vector or mask register operands.
>>>> +On SVR4-derived platforms, the separator character @samp{/} can be replaced by
>>>> +@samp{:}.
>>>>
>>>> Following the CPU architecture (but not a sub-architecture, which are those
>>>> starting with a dot), you may specify @samp{jumps} or @samp{nojumps} to
>>>
>>> Although CPUID bits in AVX10 spec may leave an impression that 128-bit,
>>> 256-bit and 512-bit vectors may be enabled independently. But it also says
>>>
>>> A “converged” version of Intel AVX10 with maximum vector lengths of 256
>>> bits and 32-bit opmask registers will be supported across all Intel processors,
>>> while 512-bit vector registers and 64-bit opmasks will continue to be supported
>>> on some P-core processors.
>>>
>>> Adding avx10.1/128 isn't necessary.
>>
>> I agree it isn't necessary, but as expressed before I view it as desirable.
>> Apart from the sentence you quoted the spec later also says "There are
>> currently no plans to support an Intel AVX10/128 implementation." For my
>> choice of also supporting the 128-bit restriction I'd like to put emphasis
>> on "currently". I think I said before that emulation environments (qemu,
>> sde to name just two well-known examples) are free to implement such
>> further restricted ISAs without then becoming out-of-spec.
>>
>> Plus supporting this mode right away has made me make certain adjustments
>> in what I'd call more clean a way, which I view as desirable as well.
>
> Since AVX10 spec doesn't specify if mask registers should be limited to
> 16 bits for AVX10/128, doing it in assembler is premature.

It's hard to see why they would remain wider, all the more so since they
were only 16 bits wide in AVX512F. Plus, of course, nobody needs to use
the options to enforce the 128-bit limit. The way I've coded it, it
matches what the specification says.

Jan
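
For illustration only, the restriction under discussion can be modelled in a few lines of Python. This is a minimal sketch, not code from the patch or from gas: the names `MASK_BITS` and `operand_allowed` are hypothetical. The 512-bit/64-bit and 256-bit/32-bit pairings come from the spec text quoted above; the 128-bit/16-bit pairing is the assumption being debated in this thread, not something the spec states.

```python
# Hypothetical model of the vector length restriction; not binutils code.
# Maps the vector length limit (in bits) to the widest permitted mask operand.
MASK_BITS = {
    512: 64,  # default: 512-bit vectors, 64-bit opmasks (per the quoted spec text)
    256: 32,  # "converged" AVX10: 256-bit vectors, 32-bit opmasks (per the quoted spec text)
    128: 16,  # ASSUMPTION debated in this thread; the spec does not state it
}


def operand_allowed(kind: str, bits: int, vec_limit: int = 512) -> bool:
    """Return True if an operand of the given kind and width fits the limit."""
    if vec_limit not in MASK_BITS:
        raise ValueError(f"unsupported vector length limit: {vec_limit}")
    if kind == "vector":
        return bits <= vec_limit
    if kind == "mask":
        return bits <= MASK_BITS[vec_limit]
    raise ValueError(f"unknown operand kind: {kind}")
```

Under this model a 512-bit vector operand is rejected once the limit is 256, and a 32-bit mask operand is rejected once the limit is 128, which is the behaviour the 16-bit objection is about.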