From: Kwok Cheung Yeung
Date: Wed, 2 Nov 2022 00:50:40 +0000
To: gcc-patches
CC: Catherine Moore, Andrew Stubbs
Subject: [OG12] [committed] amdgcn: Enable SIMD vectorization of math library functions

Hello

I have committed the following patches onto the devel/omp/gcc-12 development branch:

863579c4e30 amdgcn: Enable SIMD vectorization of math functions
bd9a6106b95 amdgcn: Add SIMD versions of math routines to libgcc
d3a2a1cc424 amdgcn: Add builtins for vector floor/floorf
a3c04a367a9 amdgcn: Fix expansion of builtin for vector fabs operation

These patches implement a vectorized version of most of the C math library for AMD GCN. These routines will be used when math functions are used in auto-vectorized code. Note that -fno-math-errno must be specified on the command line in most cases before the compiler will consider using these functions.

Vectors smaller than the native 64 element ones are also supported (by masking off the unused lanes), which can be useful for SLP vectorized code.

Kwok Yeung