From mboxrd@z Thu Jan 1 00:00:00 1970
From: "H.J. Lu"
To: libc-alpha@sourceware.org
Subject: [PATCH] x86-64: Add x87 fmod and remainder [BZ #30179]
Date: Thu, 9 Mar 2023 10:33:12 -0800
Message-Id: <20230309183312.205763-1-hjl.tools@gmail.com>
X-Mailer: git-send-email 2.39.2
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit

X87 (fprem/fprem1) implementations of fmod and remainder are much faster
than generic fmod and remainder.  Add e_fmod.S, e_fmodf.S, e_remainder.S
and e_remainderf.S with fprem/fprem1.  This fixes BZ #30179.
---
 sysdeps/x86_64/fpu/e_fmod.S       | 22 ++++++++++++++++++++++
 sysdeps/x86_64/fpu/e_fmodf.S      | 22 ++++++++++++++++++++++
 sysdeps/x86_64/fpu/e_remainder.S  | 22 ++++++++++++++++++++++
 sysdeps/x86_64/fpu/e_remainderf.S | 22 ++++++++++++++++++++++
 4 files changed, 88 insertions(+)
 create mode 100644 sysdeps/x86_64/fpu/e_fmod.S
 create mode 100644 sysdeps/x86_64/fpu/e_fmodf.S
 create mode 100644 sysdeps/x86_64/fpu/e_remainder.S
 create mode 100644 sysdeps/x86_64/fpu/e_remainderf.S

diff --git a/sysdeps/x86_64/fpu/e_fmod.S b/sysdeps/x86_64/fpu/e_fmod.S
new file mode 100644
index 0000000000..4bdc8a1ab0
--- /dev/null
+++ b/sysdeps/x86_64/fpu/e_fmod.S
@@ -0,0 +1,22 @@
+/*
+ * Public domain.
+ */
+
+#include
+#include
+
+ENTRY(__ieee754_fmod)
+	movsd	%xmm0, -16(%rsp)
+	movsd	%xmm1, -8(%rsp)
+	fldl	-8(%rsp)
+	fldl	-16(%rsp)
+1:	fprem
+	fstsw	%ax
+	sahf
+	jp	1b
+	fstp	%st(1)
+	fstpl	-8(%rsp)
+	movsd	-8(%rsp), %xmm0
+	ret
+END (__ieee754_fmod)
+libm_alias_finite (__ieee754_fmod, __fmod)
diff --git a/sysdeps/x86_64/fpu/e_fmodf.S b/sysdeps/x86_64/fpu/e_fmodf.S
new file mode 100644
index 0000000000..6f76daff01
--- /dev/null
+++ b/sysdeps/x86_64/fpu/e_fmodf.S
@@ -0,0 +1,22 @@
+/*
+ * Public domain.
+ */
+
+#include
+#include
+
+ENTRY(__ieee754_fmodf)
+	movss	%xmm0, -8(%rsp)
+	movss	%xmm1, -4(%rsp)
+	flds	-4(%rsp)
+	flds	-8(%rsp)
+1:	fprem
+	fstsw	%ax
+	sahf
+	jp	1b
+	fstp	%st(1)
+	fstps	-4(%rsp)
+	movss	-4(%rsp), %xmm0
+	ret
+END (__ieee754_fmodf)
+libm_alias_finite (__ieee754_fmodf, __fmodf)
diff --git a/sysdeps/x86_64/fpu/e_remainder.S b/sysdeps/x86_64/fpu/e_remainder.S
new file mode 100644
index 0000000000..be2184f25a
--- /dev/null
+++ b/sysdeps/x86_64/fpu/e_remainder.S
@@ -0,0 +1,22 @@
+/*
+ * Public domain.
+ */
+
+#include
+#include
+
+ENTRY(__ieee754_remainder)
+	movsd	%xmm0, -16(%rsp)
+	movsd	%xmm1, -8(%rsp)
+	fldl	-8(%rsp)
+	fldl	-16(%rsp)
+1:	fprem1
+	fstsw	%ax
+	sahf
+	jp	1b
+	fstp	%st(1)
+	fstpl	-8(%rsp)
+	movsd	-8(%rsp), %xmm0
+	ret
+END (__ieee754_remainder)
+libm_alias_finite (__ieee754_remainder, __remainder)
diff --git a/sysdeps/x86_64/fpu/e_remainderf.S b/sysdeps/x86_64/fpu/e_remainderf.S
new file mode 100644
index 0000000000..42972d3f84
--- /dev/null
+++ b/sysdeps/x86_64/fpu/e_remainderf.S
@@ -0,0 +1,22 @@
+/*
+ * Public domain.
+ */
+
+#include
+#include
+
+ENTRY(__ieee754_remainderf)
+	movss	%xmm0, -8(%rsp)
+	movss	%xmm1, -4(%rsp)
+	flds	-4(%rsp)
+	flds	-8(%rsp)
+1:	fprem1
+	fstsw	%ax
+	sahf
+	jp	1b
+	fstp	%st(1)
+	fstps	-4(%rsp)
+	movss	-4(%rsp), %xmm0
+	ret
+END (__ieee754_remainderf)
+libm_alias_finite (__ieee754_remainderf, __remainderf)
-- 
2.39.2
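
For readers comparing the two instruction variants: fprem yields the
remainder with the quotient truncated toward zero (fmod semantics), while
fprem1 rounds the quotient to nearest, ties to even (IEEE remainder
semantics).  The following is only a rough C sketch of that difference
and is not part of the patch.  The names ref_fmod, ref_remainder,
round_half_even and self_check are hypothetical, and the code is only
meaningful when x/y is exactly representable and fits in a long long; it
does not model the partial-reduction loop (the "jp 1b" on the C2 flag)
used in the assembly.

```c
#include <assert.h>

/* Hypothetical helper: round q to the nearest integer, ties to even.
   Only valid while q fits in a long long.  */
static long long
round_half_even (double q)
{
  long long n = (long long) q;		/* Truncates toward zero.  */
  double frac = q - (double) n;		/* Has the sign of q.  */

  if (frac > 0.5 || (frac == 0.5 && (n & 1)))
    n += 1;
  else if (frac < -0.5 || (frac == -0.5 && (n & 1)))
    n -= 1;
  return n;
}

/* fmod-style result: quotient truncated toward zero (what fprem yields
   once reduction is complete).  Illustrative only.  */
static double
ref_fmod (double x, double y)
{
  return x - (double) (long long) (x / y) * y;
}

/* remainder-style result: quotient rounded to nearest-even (what fprem1
   yields once reduction is complete).  Illustrative only.  */
static double
ref_remainder (double x, double y)
{
  return x - (double) round_half_even (x / y) * y;
}

/* Small sanity check on exactly representable inputs.  */
static void
self_check (void)
{
  assert (ref_fmod (5.5, 2.0) == 1.5);
  assert (ref_remainder (5.5, 2.0) == -0.5);
}
```

Calling self_check () from a test driver exercises both helpers; the
result of ref_fmod keeps the sign of x, whereas ref_remainder may return
a value of the opposite sign with magnitude at most |y|/2.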