Date: Fri, 16 Jun 2023 09:29:43 +0200
To: Binutils
Cc: "H.J. Lu"
From: Jan Beulich
Subject: [PATCH 0/4] x86: some more optimization plus a new pseudo insn form

While the first three patches here may merely require establishing how far we want to go with the optimizations that are possible, the last patch is at least partly an RFC, because it goes beyond what vendor documentation states / suggests. Constructive comments welcome.

1: optimize pre-AVX512 {,V}PCMPEQQ with identical sources
2: optimize pre-AVX512 {,V}PCMPGT* with identical sources
3: optimize 128-bit VPBROADCASTQ to VPUNPCKLQDQ
4: provide a 128-bit VBROADCASTSD pseudo

Jan
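
For readers who want to see why the forms named in patches 1 to 3 are candidates for optimization, here is a minimal sketch in Python that models 128-bit registers as plain integers. It only illustrates lane-level semantics; it is not the assembler's implementation. The helper names (lanes, pack, pcmpeq, pcmpgt, vpbroadcastq_128, vpunpcklqdq, equivalences_hold) are hypothetical, and the comparison of PCMPEQQ against a dword compare is an assumption about what a cheaper equivalent could look like, since this cover letter does not say which replacement encodings the patches pick. Patch 4 is not modeled.

```python
import random

def lanes(value, width):
    """Split a 128-bit integer into unsigned lanes of `width` bits, lowest lane first."""
    mask = (1 << width) - 1
    return [(value >> shift) & mask for shift in range(0, 128, width)]

def pack(elems, width):
    """Reassemble lanes (lowest first) into a single integer."""
    mask = (1 << width) - 1
    out = 0
    for i, elem in enumerate(elems):
        out |= (elem & mask) << (i * width)
    return out

def signed(value, width):
    """Reinterpret an unsigned lane as a two's-complement signed value."""
    return value - (1 << width) if value >> (width - 1) else value

def pcmpeq(a, b, width):
    """Model of PCMPEQ{B,W,D,Q}: all-ones lane where equal, zero lane otherwise."""
    ones = (1 << width) - 1
    return pack([ones if x == y else 0
                 for x, y in zip(lanes(a, width), lanes(b, width))], width)

def pcmpgt(a, b, width):
    """Model of PCMPGT{B,W,D,Q}: all-ones lane where a > b (signed), zero otherwise."""
    ones = (1 << width) - 1
    return pack([ones if signed(x, width) > signed(y, width) else 0
                 for x, y in zip(lanes(a, width), lanes(b, width))], width)

def vpbroadcastq_128(src):
    """Model of 128-bit VPBROADCASTQ with a register source: copy the low qword to both lanes."""
    low = lanes(src, 64)[0]
    return pack([low, low], 64)

def vpunpcklqdq(a, b):
    """Model of 128-bit VPUNPCKLQDQ: low qword of a, then low qword of b."""
    return pack([lanes(a, 64)[0], lanes(b, 64)[0]], 64)

def equivalences_hold(x):
    """Check the identical-source properties for one 128-bit input value."""
    all_ones = (1 << 128) - 1
    return (
        # Identical sources always compare equal, whatever the element size.
        pcmpeq(x, x, 64) == pcmpeq(x, x, 32) == all_ones
        # Nothing is greater than itself, so the result is always zero.
        and all(pcmpgt(x, x, w) == 0 for w in (8, 16, 32, 64))
        # Broadcasting the low qword equals unpacking the register with itself.
        and vpbroadcastq_128(x) == vpunpcklqdq(x, x)
    )

rng = random.Random(0)
assert all(equivalences_hold(rng.getrandbits(128)) for _ in range(1000))
```

The point of the model is that with identical sources the result no longer depends on the element size (equality) or on the input at all (greater-than), which is what leaves room for picking a different encoding.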