From: Wilco Dijkstra
To: "Paul A. Clarke"
CC: Adhemerval Zanella, 'GNU C Library'
Subject: Re: RFC: Improve hypot performance
Date: Thu, 18 Nov 2021 12:37:24 +0000
In-Reply-To: <20211117190249.GC7755@li-24c3614c-2adc-11b2-a85c-85f334518bdb.ibm.com>
List-Id: Libc-alpha mailing list

Hi Paul,

> On Power10, this implementation still has a large delta compared to the
> current implementation:

It's obvious something went wrong here since it is slower than Adhemerval's
version - did you look at the generated code?

It should use fma, and no division or function calls (if you see calls to
fmin/fmax you need to set FAST_FMINMAX to 0).

Cheers,
Wilco