From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from EUR04-HE1-obe.outbound.protection.outlook.com (mail-eopbgr70050.outbound.protection.outlook.com [40.107.7.50]) by sourceware.org (Postfix) with ESMTPS id 3F5173858400 for ; Wed, 10 Nov 2021 14:34:41 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 3F5173858400 Received: from AM5P194CA0018.EURP194.PROD.OUTLOOK.COM (2603:10a6:203:8f::28) by DBBPR08MB4694.eurprd08.prod.outlook.com (2603:10a6:10:de::17) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4669.13; Wed, 10 Nov 2021 14:34:38 +0000 Received: from AM5EUR03FT050.eop-EUR03.prod.protection.outlook.com (2603:10a6:203:8f:cafe::dd) by AM5P194CA0018.outlook.office365.com (2603:10a6:203:8f::28) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4669.11 via Frontend Transport; Wed, 10 Nov 2021 14:34:38 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 63.35.35.123) smtp.mailfrom=arm.com; dkim=pass (signature was verified) header.d=armh.onmicrosoft.com;dmarc=pass action=none header.from=arm.com; Received-SPF: Pass (protection.outlook.com: domain of arm.com designates 63.35.35.123 as permitted sender) receiver=protection.outlook.com; client-ip=63.35.35.123; helo=64aa7808-outbound-1.mta.getcheckrecipient.com; Received: from 64aa7808-outbound-1.mta.getcheckrecipient.com (63.35.35.123) by AM5EUR03FT050.mail.protection.outlook.com (10.152.17.47) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4649.14 via Frontend Transport; Wed, 10 Nov 2021 14:34:37 +0000 Received: ("Tessian outbound 6ebd41198c5d:v108"); Wed, 10 Nov 2021 14:34:37 +0000 X-CR-MTA-TID: 64aa7808 Received: from fc9207e4dd93.1 by 64aa7808-outbound-1.mta.getcheckrecipient.com id B21B59F9-D6FF-43C6-86B8-9967713A6223.1; Wed, 10 Nov 2021 14:34:12 +0000 Received: from EUR05-AM6-obe.outbound.protection.outlook.com by 64aa7808-outbound-1.mta.getcheckrecipient.com with ESMTPS id fc9207e4dd93.1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384); Wed, 10 Nov 2021 14:34:12 +0000 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=H7frI78O2qIoeZCLzyojrMQ3nkEP45mBqjNGciRposWbfB8ZGg/ykxd6N8baUXvlDlMGHr+HBBkkzOEipoxIKLwPio2xHw7XBQvVUYGDXFkBMj9aQUl9o3BKAyk6NQdg6w+RL0+RPLhB030GDfZI3DTqmDJ9P1ZC85STHk7MRgvdEvdSLIRUSPCH0hpbdEkLN9A1O3KuiOmQBq8VXGBgMqiOo9dBajevu0eNDgoMhi8XqzuUo7yUV/gBJtyM++9AxmYbyErHgqTlRb7phFQLXbn1a8fVXaiZ2/FAsCB4tAMr9buFPSNYvuQzaWuj3JfiTx1ulqekQgSSihvrIiO1Rg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=do8GBx2W0L3j5Ebk5gulF3+Jrlplning+mZe/0wLXt8=; b=QJvoZcN8HAhokkseO+ZeQFUTloqEAlWw7z7SUM8XrtRsei9cYJkS6VKjiKyT932vUPH6tLXj9qyWn9ges3p7dOxH2d9MYyrC2uDQgIE2+pDle3Ear29mcm5AATI/0UUm6u5ld8PeAc7YbhZKvsXVG45esYERTXXd//JziqR+JB5it5WoeqZ0EyGCZdwBIfIKOQoVG+sS4ise4LPtzt5t9R/vMOI9EZ+bHJMhGZnTOcK4QkYzR/ttDwJmk2/m7DLTwl36936smXxwyT8K6VR78OhwxkMOgY9ovd4ZmPgPkWB5XBaExk2di+HnjxWGZl6bNUxCHBranikRmMxuySMirQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=arm.com; dmarc=pass action=none header.from=arm.com; dkim=pass header.d=arm.com; arc=none Received: from VE1PR08MB5599.eurprd08.prod.outlook.com (2603:10a6:800:1a1::12) by VI1PR0802MB2525.eurprd08.prod.outlook.com (2603:10a6:800:b1::12) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4690.15; Wed, 10 Nov 2021 14:34:09 +0000 Received: from VE1PR08MB5599.eurprd08.prod.outlook.com ([fe80::e49f:f587:130d:78e4]) by VE1PR08MB5599.eurprd08.prod.outlook.com ([fe80::e49f:f587:130d:78e4%9]) with mapi id 15.20.4690.016; Wed, 10 Nov 2021 14:34:09 +0000 From: Wilco Dijkstra To: "Paul A. Clarke" , Adhemerval Zanella CC: "libc-alpha@sourceware.org" , Tulio Magno Quites Machado Filho Subject: Re: [PATCH v3 5/7] math: Remove powerpc e_hypot Thread-Topic: [PATCH v3 5/7] math: Remove powerpc e_hypot Thread-Index: AQHXz14N6PTp+7MyoU+R61LgXgFRjav7oRMAgAE4tmg= Date: Wed, 10 Nov 2021 14:34:09 +0000 Message-ID: References: <20211101202059.1026032-1-adhemerval.zanella@linaro.org> <20211101202059.1026032-6-adhemerval.zanella@linaro.org> <20211109192800.GA4930@li-24c3614c-2adc-11b2-a85c-85f334518bdb.ibm.com> In-Reply-To: <20211109192800.GA4930@li-24c3614c-2adc-11b2-a85c-85f334518bdb.ibm.com> Accept-Language: en-GB, en-US Content-Language: en-GB X-MS-Has-Attach: X-MS-TNEF-Correlator: suggested_attachment_session_id: cd4fbea5-50d6-9e0e-6e0a-b2a6857f9da1 Authentication-Results-Original: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=arm.com; x-ms-publictraffictype: Email X-MS-Office365-Filtering-Correlation-Id: 1f5c44dd-cf9f-4a83-f18a-08d9a4573923 x-ms-traffictypediagnostic: VI1PR0802MB2525:|DBBPR08MB4694: X-Microsoft-Antispam-PRVS: x-checkrecipientrouted: true nodisclaimer: true x-ms-oob-tlc-oobclassifiers: OLM:8273;OLM:8273; X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam-Untrusted: BCL:0; X-Microsoft-Antispam-Message-Info-Original: Qp20CU+xH05nKY1+fuAe+c5UOBg6pNm9yH4LVVDFDv1ingkmGJCp+TYuX9PYyMenFwtRgbEE2ZnVr7YS8r7G8qamofskcbFGTG/hvGxbXNd9S76Pawyg9jLVTQnqu2rIVbNt8SvMo6MJFnMZ2bVVzkZ4+zXQ0NMcEEHGteSbrmIukl2qHpkOU4Rl0Ksnnk5Xgm6kDjzqtGlP/ubabJ6pML6ca0KybClW4OT6bq73eK9+r4Yo1hm3zF8NEuKmEP5JfrFflEfUyY/YBkC5uS2jdZlFy9KBMdGue/OZxkRaXNLQtw+uF3XB48ofYeCw8phhd+IgsuzHEN4Dt4sntQ37YYLIMyo26k5F0h+3dZ9883U39jq1FzZj4VIJRJtcUl86jN3+8VKNpgRgWEVe2IMEI4L243YJb5qD8x1vd9BhBgYk5gSqtSTkiwWfBm52uOM6Y0e34POCYlAV9PZzkIcp7IKBbK+kFeP5qE7MBx3ifkdzL0Sl9ZN4OaNUrR8LztYOwrToCL7mDRQfqzZYsYAUrF/SUiNQUkNWqjzf9C5slbge9uAJtHq10wTBsNxowgNTK3ZsDm9p6vZS12NezbzhBJTuZSal6fPzjchyUx9tty24kvQvaH3mfkG4nTQWPVvs5cNmZXj8wqBxSCUl1QFOYhbyYgMhTO+u4IBDedPstEAaISniCd9vBPKYqZPkU+Qs7Tak20WgjuuycjW19LOIoA== X-Forefront-Antispam-Report-Untrusted: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR08MB5599.eurprd08.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(366004)(9686003)(55016002)(316002)(8936002)(38100700002)(86362001)(52536014)(8676002)(186003)(7696005)(71200400001)(122000001)(4326008)(6506007)(38070700005)(5660300002)(83380400001)(76116006)(26005)(33656002)(66476007)(66556008)(64756008)(66446008)(2906002)(66946007)(508600001)(4744005)(54906003)(110136005)(91956017); DIR:OUT; SFP:1101; Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-MS-Exchange-Transport-CrossTenantHeadersStamped: VI1PR0802MB2525 Original-Authentication-Results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=arm.com; X-EOPAttributedMessage: 0 X-MS-Exchange-Transport-CrossTenantHeadersStripped: AM5EUR03FT050.eop-EUR03.prod.protection.outlook.com X-MS-Office365-Filtering-Correlation-Id-Prvs: a8efab00-cc27-44d1-fd99-08d9a4572840 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: Y39o6XvJSZqvejTUD7wuN4Cb9Rs1iyyf7vuxjq2xC0YDkcWpQKVbNDIuk0TB/tKcQ4iGbY0sRJUNBltMFldGC5G+SWvwcBt2C7rABlriAHv3I/jEnFJ4ht8Le/k3obr72lbbfNUwKcmdrHs4gjLTDvoqvpg2aSZbtM/d5eRPFuSXzMKL85n1oRI6iwZHmMtYp7JtzYZiVb9Dkdqeldr64D0SGarxvKbyZGAmkhqqY4DA+TqWf4H8JAHgkIDp1s2cuKKG9O4RO1ynP5dVdwpeyriqWzIunI9QkjuR7LD+j8Fj7R/Bop/neuz4Z+es3Co1Ecoy52Efri0eXbiazPSb2SAv0QxH22wZB3BMpYsPt6rT2Z7/tuL5wGMbc+elF9hfiHSLA0E/cdWhKts5vjAI3YRGpOmHrr+/t5zteIvFVcER78xnz3443+XpSb0O7YaL2EIyHw0xUHi13SUYFcHkq0JLJuiMxYo/FQ/aWMgrFrq+t34Vbv6QMfULhVE2CvAPfoeRVZsgpSPtvmLCX9N4pET0L1rV8GB2idAQkwI+Wff7sIc09lRoUJpnkmgPIKVAwH43LfRItnZ2gAlOK0s3GQYF5P2va0BrL8FX8NoBuU9UV39tEWUHPOXr1WePQXKMfVjbAguVoybfqJk16xvjR4j1/gosCcW+wYIzcYkqELCX91eY06yUqwU26kVhH5WBq7IAbJrLqwQ0m0chllQxvw== X-Forefront-Antispam-Report: CIP:63.35.35.123; CTRY:IE; LANG:en; SCL:1; SRV:; IPV:CAL; SFV:NSPM; H:64aa7808-outbound-1.mta.getcheckrecipient.com; PTR:ec2-63-35-35-123.eu-west-1.compute.amazonaws.com; CAT:NONE; SFS:(4636009)(36840700001)(46966006)(316002)(83380400001)(8936002)(81166007)(47076005)(8676002)(9686003)(356005)(186003)(36860700001)(55016002)(2906002)(70206006)(7696005)(54906003)(70586007)(336012)(33656002)(4326008)(26005)(6506007)(110136005)(86362001)(52536014)(4744005)(5660300002)(508600001)(82310400003); DIR:OUT; SFP:1101; X-OriginatorOrg: arm.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 10 Nov 2021 14:34:37.6883 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 1f5c44dd-cf9f-4a83-f18a-08d9a4573923 X-MS-Exchange-CrossTenant-Id: f34e5979-57d9-4aaa-ad4d-b122a662184d X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=f34e5979-57d9-4aaa-ad4d-b122a662184d; Ip=[63.35.35.123]; Helo=[64aa7808-outbound-1.mta.getcheckrecipient.com] X-MS-Exchange-CrossTenant-AuthSource: AM5EUR03FT050.eop-EUR03.prod.protection.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: DBBPR08MB4694 X-Spam-Status: No, score=-3.3 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS, TVD_SUBJ_WIPE_DEBT, TXREP, UNPARSEABLE_RELAY autolearn=no autolearn_force=no version=3.4.4 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 10 Nov 2021 14:34:43 -0000 Hi Paul,=0A= =0A= > The patches see poorer performance on Power10, also:=0A= =0A= Was that with the last patch to remove the redirects as well? The latest ve= rsion=0A= is still slower even with that on AArch64...=0A= =0A= I think the new algorithm will always be slower due to the dependent sqrt a= nd=0A= division. So it's hard to improve unless we only use it for special cases (= eg. when=0A= ax and ay are close). Returning sqrt (fma (ax, ax, ay * ay)) is about twice= as fast=0A= and gives just over 1 ULP, so we're losing a lot of performance for a small= ULP=0A= improvement.=0A= =0A= Cheers,=0A= Wilco=0A=