public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: chenglulu <chenglulu@loongson.cn>
To: Xi Ruoyao <xry111@xry111.site>, gcc-patches@gcc.gnu.org
Cc: i@xen0n.name, xuchenghua@loongson.cn
Subject: Re: [PATCH v3 2/5] LoongArch: Use standard pattern name and RTX code for LSX/LASX muh instructions
Date: Thu, 23 Nov 2023 20:08:00 +0800	[thread overview]
Message-ID: <df1f7cf0-cff0-1e24-412c-3ed9900a4e86@loongson.cn> (raw)
In-Reply-To: <20231120004728.205167-3-xry111@xry111.site>

LGTM.

Thanks!

在 2023/11/20 上午8:47, Xi Ruoyao 写道:
> Removes unnecessary UNSPECs and make the muh instructions useful with
> GNU vectors or auto vectorization.
>
> gcc/ChangeLog:
>
> 	* config/loongarch/simd.md (muh): New code attribute mapping
> 	any_extend to smul_highpart or umul_highpart.
> 	(<su>mul<mode>3_highpart): New define_insn.
> 	* config/loongarch/lsx.md (UNSPEC_LSX_VMUH_S): Remove.
> 	(UNSPEC_LSX_VMUH_U): Remove.
> 	(lsx_vmuh_s_<lsxfmt>): Remove.
> 	(lsx_vmuh_u_<lsxfmt>): Remove.
> 	* config/loongarch/lasx.md (UNSPEC_LASX_XVMUH_S): Remove.
> 	(UNSPEC_LASX_XVMUH_U): Remove.
> 	(lasx_xvmuh_s_<lasxfmt>): Remove.
> 	(lasx_xvmuh_u_<lasxfmt>): Remove.
> 	* config/loongarch/loongarch-builtins.cc (CODE_FOR_lsx_vmuh_b):
> 	Redefine to standard pattern name.
> 	(CODE_FOR_lsx_vmuh_h): Likewise.
> 	(CODE_FOR_lsx_vmuh_w): Likewise.
> 	(CODE_FOR_lsx_vmuh_d): Likewise.
> 	(CODE_FOR_lsx_vmuh_bu): Likewise.
> 	(CODE_FOR_lsx_vmuh_hu): Likewise.
> 	(CODE_FOR_lsx_vmuh_wu): Likewise.
> 	(CODE_FOR_lsx_vmuh_du): Likewise.
> 	(CODE_FOR_lasx_xvmuh_b): Likewise.
> 	(CODE_FOR_lasx_xvmuh_h): Likewise.
> 	(CODE_FOR_lasx_xvmuh_w): Likewise.
> 	(CODE_FOR_lasx_xvmuh_d): Likewise.
> 	(CODE_FOR_lasx_xvmuh_bu): Likewise.
> 	(CODE_FOR_lasx_xvmuh_hu): Likewise.
> 	(CODE_FOR_lasx_xvmuh_wu): Likewise.
> 	(CODE_FOR_lasx_xvmuh_du): Likewise.
>
> gcc/testsuite/ChangeLog:
>
> 	* gcc.target/loongarch/vect-muh.c: New test.
> ---
>   gcc/config/loongarch/lasx.md                  | 22 ------------
>   gcc/config/loongarch/loongarch-builtins.cc    | 32 ++++++++---------
>   gcc/config/loongarch/lsx.md                   | 22 ------------
>   gcc/config/loongarch/simd.md                  | 16 +++++++++
>   gcc/testsuite/gcc.target/loongarch/vect-muh.c | 36 +++++++++++++++++++
>   5 files changed, 68 insertions(+), 60 deletions(-)
>   create mode 100644 gcc/testsuite/gcc.target/loongarch/vect-muh.c
>
> diff --git a/gcc/config/loongarch/lasx.md b/gcc/config/loongarch/lasx.md
> index d4a56c307c4..023a023b44e 100644
> --- a/gcc/config/loongarch/lasx.md
> +++ b/gcc/config/loongarch/lasx.md
> @@ -68,8 +68,6 @@ (define_c_enum "unspec" [
>     UNSPEC_LASX_BRANCH
>     UNSPEC_LASX_BRANCH_V
>   
> -  UNSPEC_LASX_XVMUH_S
> -  UNSPEC_LASX_XVMUH_U
>     UNSPEC_LASX_MXVEXTW_U
>     UNSPEC_LASX_XVSLLWIL_S
>     UNSPEC_LASX_XVSLLWIL_U
> @@ -2823,26 +2821,6 @@ (define_insn "neg<mode>2"
>     [(set_attr "type" "simd_logic")
>      (set_attr "mode" "<MODE>")])
>   
> -(define_insn "lasx_xvmuh_s_<lasxfmt>"
> -  [(set (match_operand:ILASX 0 "register_operand" "=f")
> -	(unspec:ILASX [(match_operand:ILASX 1 "register_operand" "f")
> -		       (match_operand:ILASX 2 "register_operand" "f")]
> -		      UNSPEC_LASX_XVMUH_S))]
> -  "ISA_HAS_LASX"
> -  "xvmuh.<lasxfmt>\t%u0,%u1,%u2"
> -  [(set_attr "type" "simd_int_arith")
> -   (set_attr "mode" "<MODE>")])
> -
> -(define_insn "lasx_xvmuh_u_<lasxfmt_u>"
> -  [(set (match_operand:ILASX 0 "register_operand" "=f")
> -	(unspec:ILASX [(match_operand:ILASX 1 "register_operand" "f")
> -		       (match_operand:ILASX 2 "register_operand" "f")]
> -		      UNSPEC_LASX_XVMUH_U))]
> -  "ISA_HAS_LASX"
> -  "xvmuh.<lasxfmt_u>\t%u0,%u1,%u2"
> -  [(set_attr "type" "simd_int_arith")
> -   (set_attr "mode" "<MODE>")])
> -
>   (define_insn "lasx_xvsllwil_s_<dlasxfmt>_<lasxfmt>"
>     [(set (match_operand:<VDMODE256> 0 "register_operand" "=f")
>   	(unspec:<VDMODE256> [(match_operand:ILASX_WHB 1 "register_operand" "f")
> diff --git a/gcc/config/loongarch/loongarch-builtins.cc b/gcc/config/loongarch/loongarch-builtins.cc
> index cbd833aa283..a6fcc1c731e 100644
> --- a/gcc/config/loongarch/loongarch-builtins.cc
> +++ b/gcc/config/loongarch/loongarch-builtins.cc
> @@ -319,6 +319,14 @@ AVAIL_ALL (lasx, ISA_HAS_LASX)
>   #define CODE_FOR_lsx_vmod_hu CODE_FOR_umodv8hi3
>   #define CODE_FOR_lsx_vmod_wu CODE_FOR_umodv4si3
>   #define CODE_FOR_lsx_vmod_du CODE_FOR_umodv2di3
> +#define CODE_FOR_lsx_vmuh_b CODE_FOR_smulv16qi3_highpart
> +#define CODE_FOR_lsx_vmuh_h CODE_FOR_smulv8hi3_highpart
> +#define CODE_FOR_lsx_vmuh_w CODE_FOR_smulv4si3_highpart
> +#define CODE_FOR_lsx_vmuh_d CODE_FOR_smulv2di3_highpart
> +#define CODE_FOR_lsx_vmuh_bu CODE_FOR_umulv16qi3_highpart
> +#define CODE_FOR_lsx_vmuh_hu CODE_FOR_umulv8hi3_highpart
> +#define CODE_FOR_lsx_vmuh_wu CODE_FOR_umulv4si3_highpart
> +#define CODE_FOR_lsx_vmuh_du CODE_FOR_umulv2di3_highpart
>   #define CODE_FOR_lsx_vmul_b CODE_FOR_mulv16qi3
>   #define CODE_FOR_lsx_vmul_h CODE_FOR_mulv8hi3
>   #define CODE_FOR_lsx_vmul_w CODE_FOR_mulv4si3
> @@ -439,14 +447,6 @@ AVAIL_ALL (lasx, ISA_HAS_LASX)
>   #define CODE_FOR_lsx_vfnmsub_s CODE_FOR_vfnmsubv4sf4_nmsub4
>   #define CODE_FOR_lsx_vfnmsub_d CODE_FOR_vfnmsubv2df4_nmsub4
>   
> -#define CODE_FOR_lsx_vmuh_b CODE_FOR_lsx_vmuh_s_b
> -#define CODE_FOR_lsx_vmuh_h CODE_FOR_lsx_vmuh_s_h
> -#define CODE_FOR_lsx_vmuh_w CODE_FOR_lsx_vmuh_s_w
> -#define CODE_FOR_lsx_vmuh_d CODE_FOR_lsx_vmuh_s_d
> -#define CODE_FOR_lsx_vmuh_bu CODE_FOR_lsx_vmuh_u_bu
> -#define CODE_FOR_lsx_vmuh_hu CODE_FOR_lsx_vmuh_u_hu
> -#define CODE_FOR_lsx_vmuh_wu CODE_FOR_lsx_vmuh_u_wu
> -#define CODE_FOR_lsx_vmuh_du CODE_FOR_lsx_vmuh_u_du
>   #define CODE_FOR_lsx_vsllwil_h_b CODE_FOR_lsx_vsllwil_s_h_b
>   #define CODE_FOR_lsx_vsllwil_w_h CODE_FOR_lsx_vsllwil_s_w_h
>   #define CODE_FOR_lsx_vsllwil_d_w CODE_FOR_lsx_vsllwil_s_d_w
> @@ -588,6 +588,14 @@ AVAIL_ALL (lasx, ISA_HAS_LASX)
>   #define CODE_FOR_lasx_xvmul_h CODE_FOR_mulv16hi3
>   #define CODE_FOR_lasx_xvmul_w CODE_FOR_mulv8si3
>   #define CODE_FOR_lasx_xvmul_d CODE_FOR_mulv4di3
> +#define CODE_FOR_lasx_xvmuh_b CODE_FOR_smulv32qi3_highpart
> +#define CODE_FOR_lasx_xvmuh_h CODE_FOR_smulv16hi3_highpart
> +#define CODE_FOR_lasx_xvmuh_w CODE_FOR_smulv8si3_highpart
> +#define CODE_FOR_lasx_xvmuh_d CODE_FOR_smulv4di3_highpart
> +#define CODE_FOR_lasx_xvmuh_bu CODE_FOR_umulv32qi3_highpart
> +#define CODE_FOR_lasx_xvmuh_hu CODE_FOR_umulv16hi3_highpart
> +#define CODE_FOR_lasx_xvmuh_wu CODE_FOR_umulv8si3_highpart
> +#define CODE_FOR_lasx_xvmuh_du CODE_FOR_umulv4di3_highpart
>   #define CODE_FOR_lasx_xvclz_b CODE_FOR_clzv32qi2
>   #define CODE_FOR_lasx_xvclz_h CODE_FOR_clzv16hi2
>   #define CODE_FOR_lasx_xvclz_w CODE_FOR_clzv8si2
> @@ -697,14 +705,6 @@ AVAIL_ALL (lasx, ISA_HAS_LASX)
>   #define CODE_FOR_lasx_xvavgr_hu CODE_FOR_lasx_xvavgr_u_hu
>   #define CODE_FOR_lasx_xvavgr_wu CODE_FOR_lasx_xvavgr_u_wu
>   #define CODE_FOR_lasx_xvavgr_du CODE_FOR_lasx_xvavgr_u_du
> -#define CODE_FOR_lasx_xvmuh_b CODE_FOR_lasx_xvmuh_s_b
> -#define CODE_FOR_lasx_xvmuh_h CODE_FOR_lasx_xvmuh_s_h
> -#define CODE_FOR_lasx_xvmuh_w CODE_FOR_lasx_xvmuh_s_w
> -#define CODE_FOR_lasx_xvmuh_d CODE_FOR_lasx_xvmuh_s_d
> -#define CODE_FOR_lasx_xvmuh_bu CODE_FOR_lasx_xvmuh_u_bu
> -#define CODE_FOR_lasx_xvmuh_hu CODE_FOR_lasx_xvmuh_u_hu
> -#define CODE_FOR_lasx_xvmuh_wu CODE_FOR_lasx_xvmuh_u_wu
> -#define CODE_FOR_lasx_xvmuh_du CODE_FOR_lasx_xvmuh_u_du
>   #define CODE_FOR_lasx_xvssran_b_h CODE_FOR_lasx_xvssran_s_b_h
>   #define CODE_FOR_lasx_xvssran_h_w CODE_FOR_lasx_xvssran_s_h_w
>   #define CODE_FOR_lasx_xvssran_w_d CODE_FOR_lasx_xvssran_s_w_d
> diff --git a/gcc/config/loongarch/lsx.md b/gcc/config/loongarch/lsx.md
> index c1c3719e383..537afaf9625 100644
> --- a/gcc/config/loongarch/lsx.md
> +++ b/gcc/config/loongarch/lsx.md
> @@ -64,8 +64,6 @@ (define_c_enum "unspec" [
>     UNSPEC_LSX_VSRLR
>     UNSPEC_LSX_VSRLRI
>     UNSPEC_LSX_VSHUF
> -  UNSPEC_LSX_VMUH_S
> -  UNSPEC_LSX_VMUH_U
>     UNSPEC_LSX_VEXTW_S
>     UNSPEC_LSX_VEXTW_U
>     UNSPEC_LSX_VSLLWIL_S
> @@ -2506,26 +2504,6 @@ (define_insn "vneg<mode>2"
>     [(set_attr "type" "simd_logic")
>      (set_attr "mode" "<MODE>")])
>   
> -(define_insn "lsx_vmuh_s_<lsxfmt>"
> -  [(set (match_operand:ILSX 0 "register_operand" "=f")
> -	(unspec:ILSX [(match_operand:ILSX 1 "register_operand" "f")
> -		      (match_operand:ILSX 2 "register_operand" "f")]
> -		     UNSPEC_LSX_VMUH_S))]
> -  "ISA_HAS_LSX"
> -  "vmuh.<lsxfmt>\t%w0,%w1,%w2"
> -  [(set_attr "type" "simd_int_arith")
> -   (set_attr "mode" "<MODE>")])
> -
> -(define_insn "lsx_vmuh_u_<lsxfmt_u>"
> -  [(set (match_operand:ILSX 0 "register_operand" "=f")
> -	(unspec:ILSX [(match_operand:ILSX 1 "register_operand" "f")
> -		      (match_operand:ILSX 2 "register_operand" "f")]
> -		     UNSPEC_LSX_VMUH_U))]
> -  "ISA_HAS_LSX"
> -  "vmuh.<lsxfmt_u>\t%w0,%w1,%w2"
> -  [(set_attr "type" "simd_int_arith")
> -   (set_attr "mode" "<MODE>")])
> -
>   (define_insn "lsx_vextw_s_d"
>     [(set (match_operand:V2DI 0 "register_operand" "=f")
>   	(unspec:V2DI [(match_operand:V4SI 1 "register_operand" "f")]
> diff --git a/gcc/config/loongarch/simd.md b/gcc/config/loongarch/simd.md
> index f371e201127..79324183233 100644
> --- a/gcc/config/loongarch/simd.md
> +++ b/gcc/config/loongarch/simd.md
> @@ -187,6 +187,22 @@ (define_insn_and_split "fix_trunc<mode><vimode>2"
>     [(set_attr "type" "simd_fcvt")
>      (set_attr "mode" "<MODE>")])
>   
> +;; <x>vmuh.{b/h/w/d}
> +
> +(define_code_attr muh
> +  [(sign_extend "smul_highpart")
> +   (zero_extend "umul_highpart")])
> +
> +(define_insn "<su>mul<mode>3_highpart"
> +  [(set (match_operand:IVEC 0 "register_operand" "=f")
> +	(<muh>:IVEC (match_operand:IVEC 1 "register_operand" "f")
> +		    (match_operand:IVEC 2 "register_operand" "f")))
> +   (any_extend (const_int 0))]
> +  ""
> +  "<x>vmuh.<simdfmt><u>\t%<wu>0,%<wu>1,%<wu>2"
> +  [(set_attr "type" "simd_int_arith")
> +   (set_attr "mode" "<MODE>")])
> +
>   ; The LoongArch SX Instructions.
>   (include "lsx.md")
>   
> diff --git a/gcc/testsuite/gcc.target/loongarch/vect-muh.c b/gcc/testsuite/gcc.target/loongarch/vect-muh.c
> new file mode 100644
> index 00000000000..a788840b23c
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/loongarch/vect-muh.c
> @@ -0,0 +1,36 @@
> +/* { dg-do compile } */
> +/* { dg-options "-mlasx -O3" } */
> +/* { dg-final { scan-assembler "\tvmuh\.w\t" } } */
> +/* { dg-final { scan-assembler "\tvmuh\.wu\t" } } */
> +/* { dg-final { scan-assembler "\txvmuh\.w\t" } } */
> +/* { dg-final { scan-assembler "\txvmuh\.wu\t" } } */
> +
> +int a[8], b[8], c[8];
> +
> +void
> +test1 (void)
> +{
> +  for (int i = 0; i < 4; i++)
> +    c[i] = ((long)a[i] * (long)b[i]) >> 32;
> +}
> +
> +void
> +test2 (void)
> +{
> +  for (int i = 0; i < 4; i++)
> +    c[i] = ((long)(unsigned)a[i] * (long)(unsigned)b[i]) >> 32;
> +}
> +
> +void
> +test3 (void)
> +{
> +  for (int i = 0; i < 8; i++)
> +    c[i] = ((long)a[i] * (long)b[i]) >> 32;
> +}
> +
> +void
> +test4 (void)
> +{
> +  for (int i = 0; i < 8; i++)
> +    c[i] = ((long)(unsigned)a[i] * (long)(unsigned)b[i]) >> 32;
> +}


  reply	other threads:[~2023-11-23 12:08 UTC|newest]

Thread overview: 33+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-11-20  0:47 [PATCH v3 0/5] LoongArch: SIMD fixes and optimizations Xi Ruoyao
2023-11-20  0:47 ` [PATCH v3 1/5] LoongArch: Fix usage of LSX and LASX frint/ftint instructions [PR112578] Xi Ruoyao
2023-11-23  6:35   ` chenglulu
2023-11-23  7:11     ` Xi Ruoyao
2023-11-23  7:31       ` chenglulu
2023-11-23  8:13         ` chenglulu
2023-11-23  9:02           ` Xi Ruoyao
2023-11-23  9:12             ` chenglulu
2023-11-23 10:12               ` Xi Ruoyao
2023-11-23 12:06                 ` Xi Ruoyao
2023-11-23 18:03                 ` Joseph Myers
2023-11-24  2:39                   ` Xi Ruoyao
2023-11-24  8:01                     ` chenglulu
2023-11-24  8:26                       ` Xi Ruoyao
2023-11-24  8:36                         ` chenglulu
2023-11-24  8:42                           ` Xi Ruoyao
2023-11-24  9:46                             ` chenglulu
2023-11-24 10:30                               ` Xi Ruoyao
2023-11-24 14:59                                 ` chenglulu
2023-11-23  8:54         ` Xi Ruoyao
2023-11-20  0:47 ` [PATCH v3 2/5] LoongArch: Use standard pattern name and RTX code for LSX/LASX muh instructions Xi Ruoyao
2023-11-23 12:08   ` chenglulu [this message]
2023-11-20  0:47 ` [PATCH v3 3/5] LoongArch: Use standard pattern name and RTX code for LSX/LASX rotate shift Xi Ruoyao
2023-11-23  8:42   ` chenglulu
2023-11-20  0:47 ` [PATCH v3 4/5] LoongArch: Remove lrint_allow_inexact Xi Ruoyao
2023-11-23  8:23   ` chenglulu
2023-11-23  8:58     ` Xi Ruoyao
2023-11-23  9:14       ` chenglulu
2023-11-23 12:24         ` Xi Ruoyao
2023-11-23 14:39           ` chenglulu
2023-11-20  0:47 ` [PATCH v3 5/5] LoongArch: Use LSX for scalar FP rounding with explicit rounding mode Xi Ruoyao
2023-11-29  7:12 ` Pushed: [PATCH v3 0/5] LoongArch: SIMD fixes and optimizations Xi Ruoyao
2023-11-29  7:45   ` chenglulu

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=df1f7cf0-cff0-1e24-412c-3ed9900a4e86@loongson.cn \
    --to=chenglulu@loongson.cn \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=i@xen0n.name \
    --cc=xry111@xry111.site \
    --cc=xuchenghua@loongson.cn \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).