From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from EUR02-VI1-obe.outbound.protection.outlook.com (mail-vi1eur02on2081.outbound.protection.outlook.com [40.107.241.81]) by sourceware.org (Postfix) with ESMTPS id 456723858418 for ; Tue, 3 Oct 2023 10:48:57 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 456723858418 Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=arm.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=arm.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=armh.onmicrosoft.com; s=selector2-armh-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=p7qccrcFD/swwoOdby7NeKuhhrAZL/u7BShAulpGQGI=; b=E1H64DfcbPliCWcwYKDltlvUHlPPBZXsg/VWj3211mMyGIqlghA0DYByJgRQqveXnEm++GU7pVX96iACqn35584hdF+T1QzN0Zds415gONsZlXlv9bo+5vLFQQq7tA+8AhBy5tZbSssX6ZRZllmj2cGLSFvRS2ulm0+JClIL1q4= Received: from AS8PR04CA0052.eurprd04.prod.outlook.com (2603:10a6:20b:312::27) by DU0PR08MB9557.eurprd08.prod.outlook.com (2603:10a6:10:44e::14) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6838.26; Tue, 3 Oct 2023 10:48:51 +0000 Received: from AM7EUR03FT032.eop-EUR03.prod.protection.outlook.com (2603:10a6:20b:312:cafe::2) by AS8PR04CA0052.outlook.office365.com (2603:10a6:20b:312::27) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6838.33 via Frontend Transport; Tue, 3 Oct 2023 10:48:51 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 63.35.35.123) smtp.mailfrom=arm.com; dkim=pass (signature was verified) header.d=armh.onmicrosoft.com;dmarc=pass action=none header.from=arm.com; Received-SPF: Pass (protection.outlook.com: domain of arm.com designates 63.35.35.123 as permitted sender) receiver=protection.outlook.com; client-ip=63.35.35.123; helo=64aa7808-outbound-1.mta.getcheckrecipient.com; pr=C Received: from 64aa7808-outbound-1.mta.getcheckrecipient.com (63.35.35.123) by AM7EUR03FT032.mail.protection.outlook.com (100.127.140.65) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6863.24 via Frontend Transport; Tue, 3 Oct 2023 10:48:51 +0000 Received: ("Tessian outbound fdf44c93bd44:v211"); Tue, 03 Oct 2023 10:48:51 +0000 X-CheckRecipientChecked: true X-CR-MTA-CID: 8024e88380ec9639 X-CR-MTA-TID: 64aa7808 Received: from f0ea79977de9.2 by 64aa7808-outbound-1.mta.getcheckrecipient.com id DAF9F467-7BCD-4AA0-9FC7-EAB2F9FD0EC4.1; Tue, 03 Oct 2023 10:48:44 +0000 Received: from EUR02-AM0-obe.outbound.protection.outlook.com by 64aa7808-outbound-1.mta.getcheckrecipient.com with ESMTPS id f0ea79977de9.2 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384); Tue, 03 Oct 2023 10:48:44 +0000 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Y8LXvATbX2r+ry5177k6c0L0Pw7V5R6K2DX7Hnt5nK+yuJoTCU5bSZ6JUIDViwJQeh4rAiLH4HtI5pRXB2f319XzPAkCHzd792vtDyI4MGgTD70Fl7MgNZjAKGX+v11E1y+fSgjsrgaGjQREasewVc7CVUn6DKTrvS7jD0Vzj/YYRlzIcfce8vzZxlQQOagv4mDEUJumyod5yyp8s6R0h8zSECVQolBw9EzQPbDEFId6XGMOL9I94A3GtnHtPdkwMapvjFpKcb3fqx80iL65ORWD3z32b8Vh6iQMWzwv2KCcsTmI/yH5WWqTG3vK7bxWXgHZH2IaBmn+PQpUqpEyfQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=p7qccrcFD/swwoOdby7NeKuhhrAZL/u7BShAulpGQGI=; b=K5CqoBbc1g5VdB2U4/PhJFx3s54EfYfKgPbUJzPR7ngi2161a8ZVP9mqOXsObgQiqPgfjBU7HeRZblWNmx6GVSwW0zCJjt1v+dSfPyPSdk6+GvaNLAnDP3owIud8D/ZSeY71jeh0l6t+X2OGWEg4U1n17nU1rOtV0MTfEdyVYcqH1NTwG46LUnUYjBl9xk3I4LM+oSDgKWReV85lAkChBLcfDyaibMNe2ngIs3mY3M74k4VVo1mCK5fzrT1M1dqqqXO4CWt/PHz6c6PQRQVMNYokdNO7jMm/ItmWqFZqXtrZ+NJYkfvrNdj9KPYolYC0G+JoYVDTKP7lTWJiWK1Jkw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=arm.com; dmarc=pass action=none header.from=arm.com; dkim=pass header.d=arm.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=armh.onmicrosoft.com; s=selector2-armh-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=p7qccrcFD/swwoOdby7NeKuhhrAZL/u7BShAulpGQGI=; b=E1H64DfcbPliCWcwYKDltlvUHlPPBZXsg/VWj3211mMyGIqlghA0DYByJgRQqveXnEm++GU7pVX96iACqn35584hdF+T1QzN0Zds415gONsZlXlv9bo+5vLFQQq7tA+8AhBy5tZbSssX6ZRZllmj2cGLSFvRS2ulm0+JClIL1q4= Authentication-Results-Original: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=arm.com; Received: from DB9PR08MB7179.eurprd08.prod.outlook.com (2603:10a6:10:2cc::19) by GV1PR08MB8238.eurprd08.prod.outlook.com (2603:10a6:150:5e::18) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6838.29; Tue, 3 Oct 2023 10:48:42 +0000 Received: from DB9PR08MB7179.eurprd08.prod.outlook.com ([fe80::e34a:7a41:96db:8aba]) by DB9PR08MB7179.eurprd08.prod.outlook.com ([fe80::e34a:7a41:96db:8aba%4]) with mapi id 15.20.6838.029; Tue, 3 Oct 2023 10:48:42 +0000 Date: Tue, 3 Oct 2023 11:48:29 +0100 From: Szabolcs Nagy To: Joe Ramsay , Subject: Re: [PATCH] aarch64: Improve SVE sin polynomial Message-ID: References: <20230803115453.14801-1-Joe.Ramsay@arm.com> Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20230803115453.14801-1-Joe.Ramsay@arm.com> X-ClientProxiedBy: LO3P265CA0022.GBRP265.PROD.OUTLOOK.COM (2603:10a6:600:387::13) To DB9PR08MB7179.eurprd08.prod.outlook.com (2603:10a6:10:2cc::19) MIME-Version: 1.0 X-MS-TrafficTypeDiagnostic: DB9PR08MB7179:EE_|GV1PR08MB8238:EE_|AM7EUR03FT032:EE_|DU0PR08MB9557:EE_ X-MS-Office365-Filtering-Correlation-Id: 62d5ab2e-be2b-40fa-c750-08dbc3fe54eb x-checkrecipientrouted: true NoDisclaimer: true X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam-Untrusted: BCL:0; X-Microsoft-Antispam-Message-Info-Original: yrLZWbT7PTieE3rXktNdVwc1KYOQeugymHeh20s5xEQAFoyhFG8TDmw1lIr+Y+3z5mg3LQx65J5GtxLV+SqgwaLfFsJMOL3AjFYlaK9FMUzADkNYZvT2Zaz9imxLv4SgrA0Tz95073NaTWpFzQoApp3WzcbsIYfhgaucyUISLadxb+Wz+cNs3s109GIfHZL9j8vRKKwuG3JONsRWh5cdbSNDm2SPxlozLq+BIsYyx3r01sifUmmlORFQULBH2D9RarjSr3bibGh4I9ut0SspD3eD1JpAWfezq01vF0ThjSUkjoUAcxF0zr6MkhnAaMRLLRmvwHJLimFxulY02y5QhF8Ar+DhnrteLwSVy09huPmYkZV0ZLWQ0UW7EzjdSXnSyaT5+hDgZ/5nHGco0YQQ1leDllmVem7ge5SsT3wjdOSlqD+eiyvark43P5pbfeNO3j74HjCk8rTNl5qe1RowATKZZCEkopkYOw+CDQn5WcfhWbWi2TNNvqqxSU6LAa0d2ibe4ID7GbIdP39R5IdNeFF8PGAgsuw5qm4ImiGMHKv2piDKt4PGIOafsOrT4QIu X-Forefront-Antispam-Report-Untrusted: CIP:255.255.255.255;CTRY:;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:DB9PR08MB7179.eurprd08.prod.outlook.com;PTR:;CAT:NONE;SFS:(13230031)(39860400002)(366004)(376002)(396003)(136003)(346002)(230922051799003)(64100799003)(1800799009)(451199024)(186009)(6486002)(478600001)(36756003)(44832011)(5660300002)(6506007)(4744005)(2906002)(8936002)(8676002)(110136005)(66556008)(66946007)(316002)(66476007)(6666004)(41300700001)(6512007)(86362001)(2616005)(26005)(83380400001)(38100700002);DIR:OUT;SFP:1101; X-MS-Exchange-Transport-CrossTenantHeadersStamped: GV1PR08MB8238 Original-Authentication-Results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=arm.com; X-EOPAttributedMessage: 0 X-MS-Exchange-Transport-CrossTenantHeadersStripped: AM7EUR03FT032.eop-EUR03.prod.protection.outlook.com X-MS-PublicTrafficType: Email X-MS-Office365-Filtering-Correlation-Id-Prvs: f9dfab3e-417c-4494-cf6a-08dbc3fe4f60 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: wM2rzPYCl2mzDW4UxUwfV1ICTpAZc6Kq2jHazG+Ml+bARjqYpxrZ/6WPeuJVCjfwq6YeRLJLmR7E3ll1qEd+QYzry+J0hHRx/0u/cyQ+zgtdtDIyOGPV35PGzfrHecs6SvdfJNGGbvB6jb9wcp7nFfFS+9UUYQb6PRlAYG+FJUT8l2E+vsmWJ8BMa7HAWYI1ntXBTr5UGFjDvdw3f2ZAhGJYKE9pVt4IOFOS2Vwmu0hSwRH/KVaHxSHZ8H2qDMgEugzJdyxd1gd1eSvinFIdJ/Zpzv9/QfWG1v8M1Kk2vRYWcs2weWiou7lIFTufWA/RUor92YsqYaewgvIbDhiv0p/S33UVkN0CFK8z94TUWatplT/a2MDw8TgYSKHfsxmUEWxfB3DKIQFW01Ors245EsYKkLxW/HfpS6BtXRycrElzLXVF1PnwiZQughG8gVBTl1NxtoFjnlLrSkFUt4FivWC1zt/cea/DbASFISvoKlFzJ5VfnBxk4DUaVQl3YlcpI9UcC+v5WzQ8eMjt/t9fiViBABetmF4tElDwsTl2PrUZnVshfG1CeqJQPNDVKDxumjs4bghDpxYYuINzrwOK8hXP/WsI53usSmW+zeesGJHsWRkGApUA3n0bH5Z6mYKRLQG2ZAurQx7Kl8PXDrjdeWJw2nzlObbd/XHLnwKuhi9Y4wTHgY4oGf9iVAmEjaSGuMOTzU28tzGW9mgZ3ljyX/5Xfq8qRA7b3+ioH85805JouSxzKefZnijq6ogYs2Zq X-Forefront-Antispam-Report: CIP:63.35.35.123;CTRY:IE;LANG:en;SCL:1;SRV:;IPV:CAL;SFV:NSPM;H:64aa7808-outbound-1.mta.getcheckrecipient.com;PTR:ec2-63-35-35-123.eu-west-1.compute.amazonaws.com;CAT:NONE;SFS:(13230031)(4636009)(39860400002)(136003)(376002)(396003)(346002)(230922051799003)(451199024)(186009)(64100799003)(1800799009)(82310400011)(36840700001)(40470700004)(46966006)(44832011)(6486002)(6512007)(6506007)(40460700003)(2616005)(86362001)(356005)(81166007)(36756003)(40480700001)(36860700001)(82740400003)(4744005)(336012)(2906002)(478600001)(316002)(47076005)(6666004)(83380400001)(8676002)(41300700001)(26005)(8936002)(70586007)(5660300002)(70206006)(110136005);DIR:OUT;SFP:1101; X-OriginatorOrg: arm.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 03 Oct 2023 10:48:51.6279 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 62d5ab2e-be2b-40fa-c750-08dbc3fe54eb X-MS-Exchange-CrossTenant-Id: f34e5979-57d9-4aaa-ad4d-b122a662184d X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=f34e5979-57d9-4aaa-ad4d-b122a662184d;Ip=[63.35.35.123];Helo=[64aa7808-outbound-1.mta.getcheckrecipient.com] X-MS-Exchange-CrossTenant-AuthSource: AM7EUR03FT032.eop-EUR03.prod.protection.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: DU0PR08MB9557 X-Spam-Status: No, score=-5.8 required=5.0 tests=BAYES_00,DKIM_SIGNED,DKIM_VALID,FORGED_SPF_HELO,KAM_DMARC_NONE,RCVD_IN_DNSWL_NONE,RCVD_IN_MSPIKE_H2,SPF_HELO_PASS,SPF_NONE,TXREP,UNPARSEABLE_RELAY autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org List-Id: The 08/03/2023 12:54, Joe Ramsay via Libc-alpha wrote: > The polynomial used in the AdvSIMD variant was found to be more > efficient than FTRIG intrinsics, with acceptable loss of precision. i think we want this change, and if somebody finds a better polynomial we can update both the sve and advsimd implementations. > - /* Polynomial coefficients are hard-wired in the FTMAD instruction. */ > + /* Worst-case error is 2.8+0.5 ulp in [-pi/2, pi/2]. */ known worst-case is 2.87 ulp now > + svuint64_t odd = svlsl_x (pg, svreinterpret_u64 (n), 63); > + odd = sveor_m ( > + svcmpeq (pg, svreinterpret_u64 (x), svreinterpret_u64 (sv_f64 (-0.0))), we don't need to special case -0.0 any more.