From: HAO CHEN GUI <guihaoc@linux.ibm.com>
To: gcc-patches <gcc-patches@gcc.gnu.org>
Cc: Segher Boessenkool <segher@kernel.crashing.org>,
David <dje.gcc@gmail.com>, "Kewen.Lin" <linkw@linux.ibm.com>,
Peter Bergner <bergner@linux.ibm.com>
Subject: Ping^2 [PATCH v6, rs6000] Implemented f[min/max]_optab by xs[min/max]dp [PR103605]
Date: Mon, 1 Aug 2022 10:03:23 +0800 [thread overview]
Message-ID: <60e33434-d830-d580-2cb7-f7882d2d1b21@linux.ibm.com> (raw)
In-Reply-To: <504feaec-9e0c-8768-9adf-435e877dedc2@linux.ibm.com>
Hi,
Gentle ping this:
https://gcc.gnu.org/pipermail/gcc-patches/2022-June/597158.html
Thanks.
On 4/7/2022 下午 2:32, HAO CHEN GUI wrote:
> Hi,
> Gentle ping this:
> https://gcc.gnu.org/pipermail/gcc-patches/2022-June/597158.html
> Thanks.
>
> On 24/6/2022 上午 10:02, HAO CHEN GUI wrote:
>> Hi,
>> This patch implements optab f[min/max]_optab by xs[min/max]dp on rs6000.
>> Tests show that outputs of xs[min/max]dp are consistent with the standard
>> of C99 fmin/max.
>>
>> This patch also binds __builtin_vsx_xs[min/max]dp to fmin/max instead
>> of smin/max. So the builtins always generate xs[min/max]dp on all
>> platforms.
>>
>> Bootstrapped and tested on ppc64 Linux BE and LE with no regressions.
>> Is this okay for trunk? Any recommendations? Thanks a lot.
>>
>> ChangeLog
>> 2022-06-24 Haochen Gui <guihaoc@linux.ibm.com>
>>
>> gcc/
>> PR target/103605
>> * config/rs6000/rs6000.md (FMINMAX): New.
>> (minmax_op): New.
>> (f<minmax_op><mode>3): New pattern by UNSPEC_FMAX and UNSPEC_FMIN.
>> * config/rs6000/rs6000-builtins.def (__builtin_vsx_xsmaxdp): Set
>> pattern to fmaxdf3.
>> (__builtin_vsx_xsmindp): Set pattern to fmindf3.
>>
>> gcc/testsuite/
>> PR target/103605
>> * gcc.dg/powerpc/pr103605.c: New.
>>
>>
>> patch.diff
>> diff --git a/gcc/config/rs6000/rs6000-builtins.def b/gcc/config/rs6000/rs6000-builtins.def
>> index f4a9f24bcc5..8b735493b40 100644
>> --- a/gcc/config/rs6000/rs6000-builtins.def
>> +++ b/gcc/config/rs6000/rs6000-builtins.def
>> @@ -1613,10 +1613,10 @@
>> XSCVSPDP vsx_xscvspdp {}
>>
>> const double __builtin_vsx_xsmaxdp (double, double);
>> - XSMAXDP smaxdf3 {}
>> + XSMAXDP fmaxdf3 {}
>>
>> const double __builtin_vsx_xsmindp (double, double);
>> - XSMINDP smindf3 {}
>> + XSMINDP fmindf3 {}
>>
>> const double __builtin_vsx_xsrdpi (double);
>> XSRDPI vsx_xsrdpi {}
>> diff --git a/gcc/config/rs6000/rs6000.md b/gcc/config/rs6000/rs6000.md
>> index bf85baa5370..ae0dd98f0f9 100644
>> --- a/gcc/config/rs6000/rs6000.md
>> +++ b/gcc/config/rs6000/rs6000.md
>> @@ -158,6 +158,8 @@ (define_c_enum "unspec"
>> UNSPEC_HASHCHK
>> UNSPEC_XXSPLTIDP_CONST
>> UNSPEC_XXSPLTIW_CONST
>> + UNSPEC_FMAX
>> + UNSPEC_FMIN
>> ])
>>
>> ;;
>> @@ -5341,6 +5343,22 @@ (define_insn_and_split "*s<minmax><mode>3_fpr"
>> DONE;
>> })
>>
>> +
>> +(define_int_iterator FMINMAX [UNSPEC_FMAX UNSPEC_FMIN])
>> +
>> +(define_int_attr minmax_op [(UNSPEC_FMAX "max")
>> + (UNSPEC_FMIN "min")])
>> +
>> +(define_insn "f<minmax_op><mode>3"
>> + [(set (match_operand:SFDF 0 "vsx_register_operand" "=wa")
>> + (unspec:SFDF [(match_operand:SFDF 1 "vsx_register_operand" "wa")
>> + (match_operand:SFDF 2 "vsx_register_operand" "wa")]
>> + FMINMAX))]
>> + "TARGET_VSX && !flag_finite_math_only"
>> + "xs<minmax_op>dp %x0,%x1,%x2"
>> + [(set_attr "type" "fp")]
>> +)
>> +
>> (define_expand "mov<mode>cc"
>> [(set (match_operand:GPR 0 "gpc_reg_operand")
>> (if_then_else:GPR (match_operand 1 "comparison_operator")
>> diff --git a/gcc/testsuite/gcc.target/powerpc/pr103605.c b/gcc/testsuite/gcc.target/powerpc/pr103605.c
>> new file mode 100644
>> index 00000000000..1c938d40e61
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.target/powerpc/pr103605.c
>> @@ -0,0 +1,37 @@
>> +/* { dg-do compile } */
>> +/* { dg-require-effective-target powerpc_vsx_ok } */
>> +/* { dg-options "-O2 -mvsx" } */
>> +/* { dg-final { scan-assembler-times {\mxsmaxdp\M} 3 } } */
>> +/* { dg-final { scan-assembler-times {\mxsmindp\M} 3 } } */
>> +
>> +#include <math.h>
>> +
>> +double test1 (double d0, double d1)
>> +{
>> + return fmin (d0, d1);
>> +}
>> +
>> +float test2 (float d0, float d1)
>> +{
>> + return fmin (d0, d1);
>> +}
>> +
>> +double test3 (double d0, double d1)
>> +{
>> + return fmax (d0, d1);
>> +}
>> +
>> +float test4 (float d0, float d1)
>> +{
>> + return fmax (d0, d1);
>> +}
>> +
>> +double test5 (double d0, double d1)
>> +{
>> + return __builtin_vsx_xsmindp (d0, d1);
>> +}
>> +
>> +double test6 (double d0, double d1)
>> +{
>> + return __builtin_vsx_xsmaxdp (d0, d1);
>> +}
next prev parent reply other threads:[~2022-08-01 2:03 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-06-24 2:02 HAO CHEN GUI
2022-07-04 6:32 ` Ping " HAO CHEN GUI
2022-08-01 2:03 ` HAO CHEN GUI [this message]
2022-09-21 5:20 ` Ping^3 " HAO CHEN GUI
2022-09-21 9:34 ` Kewen.Lin
2022-09-21 21:56 ` Segher Boessenkool
2022-09-22 2:28 ` Kewen.Lin
2022-09-22 9:59 ` HAO CHEN GUI
2022-09-22 13:56 ` Segher Boessenkool
2022-09-22 14:05 ` Segher Boessenkool
2022-09-26 5:58 ` Kewen.Lin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=60e33434-d830-d580-2cb7-f7882d2d1b21@linux.ibm.com \
--to=guihaoc@linux.ibm.com \
--cc=bergner@linux.ibm.com \
--cc=dje.gcc@gmail.com \
--cc=gcc-patches@gcc.gnu.org \
--cc=linkw@linux.ibm.com \
--cc=segher@kernel.crashing.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).