From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from server.nextmovesoftware.com (server.nextmovesoftware.com [162.254.253.69]) by sourceware.org (Postfix) with ESMTPS id C10263858D1E for ; Sat, 6 May 2023 17:26:13 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org C10263858D1E Authentication-Results: sourceware.org; dmarc=none (p=none dis=none) header.from=nextmovesoftware.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=nextmovesoftware.com DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=nextmovesoftware.com; s=default; h=Content-Type:MIME-Version:Message-ID: Date:Subject:To:From:Sender:Reply-To:Cc:Content-Transfer-Encoding:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:In-Reply-To:References:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=aZQTtW0y5VpDdZmJCHRUHFtHMh5WBFHFMzCv3X4PXlc=; b=pSVQXEU9DEj+F0L0KrLIHUp3p+ wf7tYsX/9ZWgxHcookNHfLEMQhk39djO0w103sf8QGQFVhw+Z7wxv1JUlQ+OePzyPDsjInBZQku41 FNai0DNagoP55VUmL1xTLidhmFA+tCmMagtXcwUzGXeMX6NeEEcDks58kOF2PaVwlUi32gQ5yWlNK UyqoG7X77c5cWtE/nPetLNoIrQWU90O1TZOtk/em8gQ38xtvtbsWbJh3ZE3nbDf/lxSqSQK12c6FC NlnqkoJbJESXCmAPX1GtO7Z2a1tkwtc/mGICv+gY8EqF8mo4+B9e63HDQZMNKvi++ZuprBKWmwp8L VR/+Rosg==; Received: from host86-169-41-81.range86-169.btcentralplus.com ([86.169.41.81]:50163 helo=Dell) by server.nextmovesoftware.com with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1pvLfh-0004tw-0T for gcc-patches@gcc.gnu.org; Sat, 06 May 2023 13:26:13 -0400 From: "Roger Sayle" To: "'GCC Patches'" Subject: [libgcc PATCH] Add bit reversal functions __bitrev[qhsd]i2. Date: Sat, 6 May 2023 18:26:11 +0100 Message-ID: <00c401d9803f$dafe3c90$90fab5b0$@nextmovesoftware.com> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_NextPart_000_00C5_01D98048.3CC2A490" X-Mailer: Microsoft Outlook 16.0 Thread-Index: AdmAO2YVL782eB0dQCCYTLokdiovGw== Content-Language: en-gb X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - server.nextmovesoftware.com X-AntiAbuse: Original Domain - gcc.gnu.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - nextmovesoftware.com X-Get-Message-Sender-Via: server.nextmovesoftware.com: authenticated_id: roger@nextmovesoftware.com X-Authenticated-Sender: server.nextmovesoftware.com: roger@nextmovesoftware.com X-Source: X-Source-Args: X-Source-Dir: X-Spam-Status: No, score=-10.9 required=5.0 tests=BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,GIT_PATCH_0,RCVD_IN_BARRACUDACENTRAL,SPF_HELO_NONE,SPF_PASS,TXREP,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org List-Id: This is a multipart message in MIME format. ------=_NextPart_000_00C5_01D98048.3CC2A490 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit This patch proposes adding run-time library support for bit reversal, by adding a __bitrevsi2 function to libgcc. Thoughts/opinions? I'm also tempted to add __popcount[qh]i2 and __parity[qh]i2 to libgcc, to allow the RTL optimizers to perform narrowing operations, but I'm curious to hear whether QImode and HImode support, though more efficient, is frowned by the libgcc maintainers/philosophy. This patch has been tested on x86_64-pc-linux-gnu with make bootstrap and make -k check, both with and without --target_board=unix{-m32} and on nvptx-none, with no new regressions. Ok for mainline? 2023-05-06 Roger Sayle gcc/ChangeLog * doc/libgcc.texi (__bitrevqi2): Document bit reversal run-time functions; __bitrevqi2, __bitrevhi2, __bitrevsi2 and __bitrevdi2. libgcc/ChangeLog * Makfile.in (lib2funcs): Add __bitrev[qhsd]i2. * libgcc-std.ver.in (GCC_14.0.0): Add __bitrev[qhsd]i2. * libgcc2.c (__bitrevqi2): New function. (__bitrevhi2): Likewise. (__bitrevsi2): Likewise. (__bitrevdi2): Likewise. * libgcc2.h (__bitrevqi2): Prototype here. (__bitrevhi2): Likewise. (__bitrevsi2): Likewise. (__bitrevdi2): Likewise. Thanks in advance, Roger -- ------=_NextPart_000_00C5_01D98048.3CC2A490 Content-Type: text/plain; name="patchlg.txt" Content-Transfer-Encoding: quoted-printable Content-Disposition: attachment; filename="patchlg.txt" diff --git a/gcc/doc/libgcc.texi b/gcc/doc/libgcc.texi=0A= index 73aa803..7611347 100644=0A= --- a/gcc/doc/libgcc.texi=0A= +++ b/gcc/doc/libgcc.texi=0A= @@ -218,6 +218,13 @@ These functions return the number of bits set in = @var{a}.=0A= These functions return the @var{a} byteswapped.=0A= @end deftypefn=0A= =0A= +@deftypefn {Runtime Function} int8_t __bitrevqi2 (int8_t @var{a})=0A= +@deftypefnx {Runtime Function} int16_t __bitrevhi2 (int16_t @var{a})=0A= +@deftypefnx {Runtime Function} int32_t __bitrevsi2 (int32_t @var{a})=0A= +@deftypefnx {Runtime Function} int64_t __bitrevdi2 (int64_t @var{a})=0A= +These functions return the bit reversed @var{a}.=0A= +@end deftypefn=0A= +=0A= @node Soft float library routines=0A= @section Routines for floating point emulation=0A= @cindex soft float library=0A= diff --git a/libgcc/Makefile.in b/libgcc/Makefile.in=0A= index 6c4dc79..67c54df 100644=0A= --- a/libgcc/Makefile.in=0A= +++ b/libgcc/Makefile.in=0A= @@ -446,7 +446,7 @@ lib2funcs =3D _muldi3 _negdi2 _lshrdi3 _ashldi3 = _ashrdi3 _cmpdi2 _ucmpdi2 \=0A= _paritysi2 _paritydi2 _powisf2 _powidf2 _powixf2 _powitf2 \=0A= _mulhc3 _mulsc3 _muldc3 _mulxc3 _multc3 _divhc3 _divsc3 \=0A= _divdc3 _divxc3 _divtc3 _bswapsi2 _bswapdi2 _clrsbsi2 \=0A= - _clrsbdi2=0A= + _clrsbdi2 _bitrevqi2 _bitrevhi2 _bitrevsi2 _bitrevdi2=0A= =0A= # The floating-point conversion routines that involve a single-word = integer.=0A= # XX stands for the integer mode.=0A= diff --git a/libgcc/libgcc-std.ver.in b/libgcc/libgcc-std.ver.in=0A= index c4f87a5..2198b0e 100644=0A= --- a/libgcc/libgcc-std.ver.in=0A= +++ b/libgcc/libgcc-std.ver.in=0A= @@ -1944,3 +1944,12 @@ GCC_7.0.0 {=0A= __PFX__divmoddi4=0A= __PFX__divmodti4=0A= }=0A= +=0A= +%inherit GCC_14.0.0 GCC_7.0.0=0A= +GCC_14.0.0 {=0A= + # bit reversal functions=0A= + __PFX__bitrevqi2=0A= + __PFX__bitrevhi2=0A= + __PFX__bitrevsi2=0A= + __PFX__bitrevdi2=0A= +}=0A= diff --git a/libgcc/libgcc2.c b/libgcc/libgcc2.c=0A= index e0017d1..2bef2a1 100644=0A= --- a/libgcc/libgcc2.c=0A= +++ b/libgcc/libgcc2.c=0A= @@ -488,6 +488,54 @@ __bswapdi2 (DItype u)=0A= | (((u) & 0x00000000000000ffull) << 56));=0A= }=0A= #endif=0A= +=0C=0A= +#ifdef L_bitrevqi2=0A= +QItype=0A= +__bitrevqi2 (QItype x)=0A= +{=0A= + UQItype u =3D x;=0A= + u =3D (((u) >> 1) & 0x55) | (((u) & 0x55) << 1);=0A= + u =3D (((u) >> 2) & 0x33) | (((u) & 0x33) << 2);=0A= + return ((u) >> 4) | ((u) << 4);=0A= +}=0A= +#endif=0A= +#ifdef L_bitrevhi2=0A= +HItype=0A= +__bitrevhi2 (HItype x)=0A= +{=0A= + UHItype u =3D x;=0A= + u =3D (((u) >> 1) & 0x5555) | (((u) & 0x5555) << 1);=0A= + u =3D (((u) >> 2) & 0x3333) | (((u) & 0x3333) << 2);=0A= + u =3D (((u) >> 4) & 0x0f0f) | (((u) & 0x0f0f) << 4);=0A= + return ((u) >> 8) | ((u) << 8);=0A= +}=0A= +#endif=0A= +#ifdef L_bitrevsi2=0A= +SItype=0A= +__bitrevsi2 (SItype x)=0A= +{=0A= + USItype u =3D x;=0A= + u =3D (((u) >> 1) & 0x55555555) | (((u) & 0x55555555) << 1);=0A= + u =3D (((u) >> 2) & 0x33333333) | (((u) & 0x33333333) << 2);=0A= + u =3D (((u) >> 4) & 0x0f0f0f0f) | (((u) & 0x0f0f0f0f) << 4);=0A= + return __bswapsi2 (u);=0A= +}=0A= +#endif=0A= +#ifdef L_bitrevdi2=0A= +DItype=0A= +__bitrevdi2 (DItype x)=0A= +{=0A= + UDItype u =3D x;=0A= + u =3D (((u) >> 1) & 0x5555555555555555ll)=0A= + | (((u) & 0x5555555555555555ll) << 1);=0A= + u =3D (((u) >> 2) & 0x3333333333333333ll)=0A= + | (((u) & 0x3333333333333333ll) << 2);=0A= + u =3D (((u) >> 4) & 0x0f0f0f0f0f0f0f0fll)=0A= + | (((u) & 0x0f0f0f0f0f0f0f0fll) << 4);=0A= + return __bswapdi2 (u);=0A= +}=0A= +#endif=0A= +=0C=0A= #ifdef L_ffssi2=0A= #undef int=0A= int=0A= diff --git a/libgcc/libgcc2.h b/libgcc/libgcc2.h=0A= index 3ec9bbd..e1abc0d 100644=0A= --- a/libgcc/libgcc2.h=0A= +++ b/libgcc/libgcc2.h=0A= @@ -338,6 +338,10 @@ typedef int shift_count_type __attribute__((mode = (__libgcc_shift_count__)));=0A= #define __udiv_w_sdiv __N(udiv_w_sdiv)=0A= #define __clear_cache __N(clear_cache)=0A= #define __enable_execute_stack __N(enable_execute_stack)=0A= +#define __bitrevqi2 __N(bitrevqi2)=0A= +#define __bitrevhi2 __N(bitrevhi2)=0A= +#define __bitrevsi2 __N(bitrevsi2)=0A= +#define __bitrevdi2 __N(bitrevdi2)=0A= =0A= #ifndef __powisf2=0A= #define __powisf2 __N(powisf2)=0A= @@ -426,6 +430,15 @@ extern DWtype __subvDI3 (DWtype, DWtype);=0A= extern DWtype __mulvDI3 (DWtype, DWtype);=0A= extern DWtype __negvDI2 (DWtype);=0A= =0A= +extern QItype __bitrevqi2 (QItype);=0A= +extern HItype __bitrevhi2 (HItype);=0A= +#if MIN_UNITS_PER_WORD > 1=0A= +extern SItype __bitrevsi2 (SItype);=0A= +#endif=0A= +#if __SIZEOF_LONG_LONG__ > 4=0A= +extern DItype __bitrevdi2 (DItype);=0A= +#endif=0A= +=0A= #ifdef COMPAT_SIMODE_TRAPPING_ARITHMETIC=0A= #define __absvsi2 __N(absvsi2)=0A= #define __negvsi2 __N(negvsi2)=0A= ------=_NextPart_000_00C5_01D98048.3CC2A490--