public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
* [PATCH] RISC-V: Support cond vmulh.vv and vmulu.vv
@ 2023-09-12 16:25 Lehua Ding
  2023-09-13  8:50 ` Kito Cheng
  0 siblings, 1 reply; 3+ messages in thread
From: Lehua Ding @ 2023-09-12 16:25 UTC (permalink / raw)
  To: gcc-patches
  Cc: juzhe.zhong, kito.cheng, rdapp.gcc, palmer, jeffreyalaw, lehua.ding

This patch adds combine patterns to combine vmulh[u].vv + vcond_mask
to mask vmulh[u].vv. For vmulsu.vv, it can not be produced in midend
currently. We will send another patch to take this issue.

gcc/ChangeLog:

	* config/riscv/autovec-opt.md (*cond_<mulh_table><mode>3_highpart):
	New combine pattern.
	* config/riscv/autovec.md (smul<mode>3_highpart): Mrege smul and umul.
	(<mulh_table><mode>3_highpart): Merged pattern.
	(umul<mode>3_highpart): Mrege smul and umul.
	* config/riscv/vector-iterators.md (umul): New iterators.
	(UNSPEC_VMULHU): New iterators.

gcc/testsuite/ChangeLog:

	* gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c: New test.
	* gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c: New test.
	* gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c: New test.
	* gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c: New test.

---
 gcc/config/riscv/autovec-opt.md               | 23 ++++++++++++-
 gcc/config/riscv/autovec.md                   | 22 ++----------
 gcc/config/riscv/vector-iterators.md          |  4 +++
 .../riscv/rvv/autovec/cond/cond_mulh-1.c      | 29 ++++++++++++++++
 .../riscv/rvv/autovec/cond/cond_mulh-2.c      | 30 ++++++++++++++++
 .../riscv/rvv/autovec/cond/cond_mulh_run-1.c  | 32 +++++++++++++++++
 .../riscv/rvv/autovec/cond/cond_mulh_run-2.c  | 34 +++++++++++++++++++
 7 files changed, 154 insertions(+), 20 deletions(-)
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c

diff --git a/gcc/config/riscv/autovec-opt.md b/gcc/config/riscv/autovec-opt.md
index 0d2721f0b29..552be48bf73 100644
--- a/gcc/config/riscv/autovec-opt.md
+++ b/gcc/config/riscv/autovec-opt.md
@@ -970,6 +970,28 @@
 }
  [(set_attr "type" "vnshift")])
 
+;; Combine vmulh.vv/vmulhu.vv + vcond_mask
+(define_insn_and_split "*cond_<mulh_table><mode>3_highpart"
+   [(set (match_operand:VFULLI 0 "register_operand")
+    (if_then_else:VFULLI
+      (match_operand:<VM> 1 "register_operand")
+      (mulh:VFULLI
+        (match_operand:VFULLI 2 "register_operand")
+        (match_operand:VFULLI 3 "register_operand"))
+      (match_operand:VFULLI 4 "register_operand")))]
+   "TARGET_VECTOR && can_create_pseudo_p ()"
+   "#"
+   "&& 1"
+   [(const_int 0)]
+{
+  insn_code icode = code_for_pred_mulh (<MULH_UNSPEC>, <MODE>mode);
+  rtx ops[] = {operands[0], operands[1], operands[2], operands[3], operands[4],
+               gen_int_mode (GET_MODE_NUNITS (<MODE>mode), Pmode)};
+  riscv_vector::expand_cond_len_binop (icode, ops);
+   DONE;
+}
+[(set_attr "type" "vector")])
+
 ;; =============================================================================
 ;; Combine extend + binop to widen_binop
 ;; =============================================================================
@@ -1172,7 +1194,6 @@
 }
 [(set_attr "type" "vfwmul")])
 
-
 ;; =============================================================================
 ;; Misc combine patterns
 ;; =============================================================================
diff --git a/gcc/config/riscv/autovec.md b/gcc/config/riscv/autovec.md
index e9dd40af935..b4ac22bb97b 100644
--- a/gcc/config/riscv/autovec.md
+++ b/gcc/config/riscv/autovec.md
@@ -1569,9 +1569,9 @@
 ;; - vmulhu.vv
 ;; -------------------------------------------------------------------------
 
-(define_insn_and_split "smul<mode>3_highpart"
+(define_insn_and_split "<mulh_table><mode>3_highpart"
   [(set (match_operand:VFULLI 0 "register_operand")
-        (smul_highpart:VFULLI
+        (mulh:VFULLI
           (match_operand:VFULLI 1 "register_operand")
           (match_operand:VFULLI 2 "register_operand")))]
   "TARGET_VECTOR && can_create_pseudo_p ()"
@@ -1579,23 +1579,7 @@
   "&& 1"
   [(const_int 0)]
 {
-  insn_code icode = code_for_pred_mulh (UNSPEC_VMULHS, <MODE>mode);
-  riscv_vector::emit_vlmax_insn (icode, riscv_vector::BINARY_OP, operands);
-  DONE;
-}
-[(set_attr "type" "vimul")])
-
-(define_insn_and_split "umul<mode>3_highpart"
-  [(set (match_operand:VFULLI 0 "register_operand")
-        (umul_highpart:VFULLI
-          (match_operand:VFULLI 1 "register_operand")
-          (match_operand:VFULLI 2 "register_operand")))]
-  "TARGET_VECTOR && can_create_pseudo_p ()"
-  "#"
-  "&& 1"
-  [(const_int 0)]
-{
-  insn_code icode = code_for_pred_mulh (UNSPEC_VMULHU, <MODE>mode);
+  insn_code icode = code_for_pred_mulh (<MULH_UNSPEC>, <MODE>mode);
   riscv_vector::emit_vlmax_insn (icode, riscv_vector::BINARY_OP, operands);
   DONE;
 }
diff --git a/gcc/config/riscv/vector-iterators.md b/gcc/config/riscv/vector-iterators.md
index 2f7f7cbe08c..e70a9bc5c74 100644
--- a/gcc/config/riscv/vector-iterators.md
+++ b/gcc/config/riscv/vector-iterators.md
@@ -2354,6 +2354,10 @@
 (define_code_iterator sat_int_plus_binop [ss_plus us_plus])
 (define_code_iterator sat_int_minus_binop [ss_minus us_minus])
 
+(define_code_iterator mulh [smul_highpart umul_highpart])
+(define_code_attr mulh_table [(smul_highpart "smul") (umul_highpart "umul")])
+(define_code_attr MULH_UNSPEC [(smul_highpart "UNSPEC_VMULHS") (umul_highpart "UNSPEC_VMULHU")])
+
 (define_code_iterator any_widen_binop [plus minus mult])
 (define_code_iterator plus_minus [plus minus])
 
diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
new file mode 100644
index 00000000000..6ff2dc580a4
--- /dev/null
+++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
@@ -0,0 +1,29 @@
+/* { dg-do compile } */
+/* { dg-additional-options "-march=rv32gcv_zvfh -mabi=ilp32d --param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
+
+#include <stdint-gcc.h>
+
+#define DEF_LOOP(TYPE, WTYPE)                                                  \
+  void __attribute__ ((noipa))                                                 \
+  test_##TYPE##_##WTYPE (TYPE *__restrict a, TYPE *__restrict b,               \
+			 TYPE *__restrict c, TYPE *__restrict pred, int n)     \
+  {                                                                            \
+    for (int i = 0; i < n; ++i)                                                \
+      c[i] = pred[i]                                                           \
+	       ? (TYPE) (((WTYPE) a[i] * (WTYPE) b[i]) >> sizeof (TYPE) * 8)   \
+	       : c[i];                                                         \
+  }
+
+#define TEST_ALL(T)                                                            \
+  T (int8_t, int16_t)                                                          \
+  T (int16_t, int32_t)                                                         \
+  T (int32_t, int64_t)                                                         \
+  T (uint8_t, uint16_t)                                                        \
+  T (uint16_t, uint32_t)                                                       \
+  T (uint32_t, uint64_t)
+
+TEST_ALL (DEF_LOOP)
+
+/* { dg-final { scan-assembler-times {\tvmulh\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 3 } } */
+/* { dg-final { scan-assembler-times {\tvmulhu\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 3 } } */
+/* { dg-final { scan-assembler-not {\tvmerge\.vvm\t} } } */
diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
new file mode 100644
index 00000000000..c4c2b50f203
--- /dev/null
+++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
@@ -0,0 +1,30 @@
+/* { dg-do compile } */
+/* { dg-additional-options "-march=rv32gcv_zvfh -mabi=ilp32d --param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
+
+#include <stdint-gcc.h>
+
+#define DEF_LOOP(TYPE1, WTYPE1, TYPE2, WTYPE2, TYPE3)                          \
+  void __attribute__ ((noipa))                                                 \
+  test_##TYPE1##_##TYPE2 (TYPE1 *__restrict a, TYPE2 *__restrict b,            \
+			  TYPE3 *__restrict c, TYPE3 *__restrict pred, int n)  \
+  {                                                                            \
+    for (int i = 0; i < n; ++i)                                                \
+      c[i]                                                                     \
+	= pred[i]                                                              \
+	    ? (TYPE3) (((WTYPE1) a[i] * (WTYPE2) b[i]) >> sizeof (TYPE1) * 8)  \
+	    : c[i];                                                            \
+  }
+
+#define TEST_ALL(T)                                                            \
+  T (int8_t, int16_t, uint8_t, uint16_t, int8_t)                               \
+  T (uint8_t, uint16_t, int8_t, int16_t, int8_t)                               \
+  T (int16_t, int32_t, uint16_t, uint32_t, int16_t)                            \
+  T (uint16_t, uint32_t, int16_t, int32_t, int16_t)                            \
+  T (int32_t, int64_t, uint32_t, uint64_t, int32_t)                            \
+  T (uint32_t, uint64_t, int32_t, int64_t, int32_t)
+
+TEST_ALL (DEF_LOOP)
+
+/* FIXME: need midend support: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111381 */
+/* { dg-final { scan-assembler-times {\tvmulhsu\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 6 {xfail riscv*-*-*} } } */
+/* { dg-final { scan-assembler-not {\tvmerge\.vvm\t} } } */
diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
new file mode 100644
index 00000000000..8260472add7
--- /dev/null
+++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
@@ -0,0 +1,32 @@
+/* { dg-do run { target { riscv_vector } } } */
+/* { dg-additional-options "--param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
+
+#include "cond_mulh-1.c"
+
+#define N 99
+
+#define TEST_LOOP(TYPE, WTYPE)                                                 \
+  {                                                                            \
+    TYPE a[N], b[N], c[N], pred[N];                                            \
+    for (int i = 0; i < N; ++i)                                                \
+      {                                                                        \
+	a[i] = i * i;                                                          \
+	b[i] = ((i + 2) % 3) * (i + 1);                                        \
+	pred[i] = i % 3;                                                       \
+	asm volatile ("" ::: "memory");                                        \
+      }                                                                        \
+    test_##TYPE##_##WTYPE (a, b, c, pred, N);                                  \
+    for (int i = 0; i < N; ++i)                                                \
+      if (c[i]                                                                 \
+	  != (pred[i]                                                          \
+		? (TYPE) (((WTYPE) a[i] * (WTYPE) b[i]) >> sizeof (TYPE) * 8)  \
+		: c[i]))                                                       \
+	__builtin_abort ();                                                    \
+  }
+
+int
+main ()
+{
+  TEST_ALL (TEST_LOOP)
+  return 0;
+}
diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
new file mode 100644
index 00000000000..24612ff84f7
--- /dev/null
+++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
@@ -0,0 +1,34 @@
+/* { dg-do run { target { riscv_vector } } } */
+/* { dg-additional-options "--param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
+
+#include "cond_mulh-2.c"
+
+#define N 99
+
+#define TEST_LOOP(TYPE1, WTYPE1, TYPE2, WTYPE2, TYPE3)                         \
+  {                                                                            \
+    TYPE1 a[N];                                                                \
+    TYPE2 b[N];                                                                \
+    TYPE3 c[N], pred[N];                                                       \
+    for (int i = 0; i < N; ++i)                                                \
+      {                                                                        \
+	a[i] = i * i;                                                          \
+	b[i] = ((i + 2) % 3) * (i + 1);                                        \
+	pred[i] = i % 3;                                                       \
+	asm volatile ("" ::: "memory");                                        \
+      }                                                                        \
+    test_##TYPE1##_##TYPE2 (a, b, c, pred, N);                                 \
+    for (int i = 0; i < N; ++i)                                                \
+      if (c[i]                                                                 \
+	  != (pred[i] ? (TYPE3) (((WTYPE1) a[i] * (WTYPE2) b[i])               \
+				 >> sizeof (TYPE1) * 8)                        \
+		      : c[i]))                                                 \
+	__builtin_abort ();                                                    \
+  }
+
+int
+main ()
+{
+  TEST_ALL (TEST_LOOP)
+  return 0;
+}
-- 
2.36.3


^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] RISC-V: Support cond vmulh.vv and vmulu.vv
  2023-09-12 16:25 [PATCH] RISC-V: Support cond vmulh.vv and vmulu.vv Lehua Ding
@ 2023-09-13  8:50 ` Kito Cheng
  2023-09-13 10:35   ` Lehua Ding
  0 siblings, 1 reply; 3+ messages in thread
From: Kito Cheng @ 2023-09-13  8:50 UTC (permalink / raw)
  To: Lehua Ding; +Cc: gcc-patches, juzhe.zhong, rdapp.gcc, palmer, jeffreyalaw

LGTM, thanks :)

On Wed, Sep 13, 2023 at 12:25 AM Lehua Ding <lehua.ding@rivai.ai> wrote:
>
> This patch adds combine patterns to combine vmulh[u].vv + vcond_mask
> to mask vmulh[u].vv. For vmulsu.vv, it can not be produced in midend
> currently. We will send another patch to take this issue.
>
> gcc/ChangeLog:
>
>         * config/riscv/autovec-opt.md (*cond_<mulh_table><mode>3_highpart):
>         New combine pattern.
>         * config/riscv/autovec.md (smul<mode>3_highpart): Mrege smul and umul.
>         (<mulh_table><mode>3_highpart): Merged pattern.
>         (umul<mode>3_highpart): Mrege smul and umul.
>         * config/riscv/vector-iterators.md (umul): New iterators.
>         (UNSPEC_VMULHU): New iterators.
>
> gcc/testsuite/ChangeLog:
>
>         * gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c: New test.
>         * gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c: New test.
>         * gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c: New test.
>         * gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c: New test.
>
> ---
>  gcc/config/riscv/autovec-opt.md               | 23 ++++++++++++-
>  gcc/config/riscv/autovec.md                   | 22 ++----------
>  gcc/config/riscv/vector-iterators.md          |  4 +++
>  .../riscv/rvv/autovec/cond/cond_mulh-1.c      | 29 ++++++++++++++++
>  .../riscv/rvv/autovec/cond/cond_mulh-2.c      | 30 ++++++++++++++++
>  .../riscv/rvv/autovec/cond/cond_mulh_run-1.c  | 32 +++++++++++++++++
>  .../riscv/rvv/autovec/cond/cond_mulh_run-2.c  | 34 +++++++++++++++++++
>  7 files changed, 154 insertions(+), 20 deletions(-)
>  create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
>  create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
>  create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
>  create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
>
> diff --git a/gcc/config/riscv/autovec-opt.md b/gcc/config/riscv/autovec-opt.md
> index 0d2721f0b29..552be48bf73 100644
> --- a/gcc/config/riscv/autovec-opt.md
> +++ b/gcc/config/riscv/autovec-opt.md
> @@ -970,6 +970,28 @@
>  }
>   [(set_attr "type" "vnshift")])
>
> +;; Combine vmulh.vv/vmulhu.vv + vcond_mask
> +(define_insn_and_split "*cond_<mulh_table><mode>3_highpart"
> +   [(set (match_operand:VFULLI 0 "register_operand")
> +    (if_then_else:VFULLI
> +      (match_operand:<VM> 1 "register_operand")
> +      (mulh:VFULLI
> +        (match_operand:VFULLI 2 "register_operand")
> +        (match_operand:VFULLI 3 "register_operand"))
> +      (match_operand:VFULLI 4 "register_operand")))]
> +   "TARGET_VECTOR && can_create_pseudo_p ()"
> +   "#"
> +   "&& 1"
> +   [(const_int 0)]
> +{
> +  insn_code icode = code_for_pred_mulh (<MULH_UNSPEC>, <MODE>mode);
> +  rtx ops[] = {operands[0], operands[1], operands[2], operands[3], operands[4],
> +               gen_int_mode (GET_MODE_NUNITS (<MODE>mode), Pmode)};
> +  riscv_vector::expand_cond_len_binop (icode, ops);
> +   DONE;
> +}
> +[(set_attr "type" "vector")])
> +
>  ;; =============================================================================
>  ;; Combine extend + binop to widen_binop
>  ;; =============================================================================
> @@ -1172,7 +1194,6 @@
>  }
>  [(set_attr "type" "vfwmul")])
>
> -
>  ;; =============================================================================
>  ;; Misc combine patterns
>  ;; =============================================================================
> diff --git a/gcc/config/riscv/autovec.md b/gcc/config/riscv/autovec.md
> index e9dd40af935..b4ac22bb97b 100644
> --- a/gcc/config/riscv/autovec.md
> +++ b/gcc/config/riscv/autovec.md
> @@ -1569,9 +1569,9 @@
>  ;; - vmulhu.vv
>  ;; -------------------------------------------------------------------------
>
> -(define_insn_and_split "smul<mode>3_highpart"
> +(define_insn_and_split "<mulh_table><mode>3_highpart"
>    [(set (match_operand:VFULLI 0 "register_operand")
> -        (smul_highpart:VFULLI
> +        (mulh:VFULLI
>            (match_operand:VFULLI 1 "register_operand")
>            (match_operand:VFULLI 2 "register_operand")))]
>    "TARGET_VECTOR && can_create_pseudo_p ()"
> @@ -1579,23 +1579,7 @@
>    "&& 1"
>    [(const_int 0)]
>  {
> -  insn_code icode = code_for_pred_mulh (UNSPEC_VMULHS, <MODE>mode);
> -  riscv_vector::emit_vlmax_insn (icode, riscv_vector::BINARY_OP, operands);
> -  DONE;
> -}
> -[(set_attr "type" "vimul")])
> -
> -(define_insn_and_split "umul<mode>3_highpart"
> -  [(set (match_operand:VFULLI 0 "register_operand")
> -        (umul_highpart:VFULLI
> -          (match_operand:VFULLI 1 "register_operand")
> -          (match_operand:VFULLI 2 "register_operand")))]
> -  "TARGET_VECTOR && can_create_pseudo_p ()"
> -  "#"
> -  "&& 1"
> -  [(const_int 0)]
> -{
> -  insn_code icode = code_for_pred_mulh (UNSPEC_VMULHU, <MODE>mode);
> +  insn_code icode = code_for_pred_mulh (<MULH_UNSPEC>, <MODE>mode);
>    riscv_vector::emit_vlmax_insn (icode, riscv_vector::BINARY_OP, operands);
>    DONE;
>  }
> diff --git a/gcc/config/riscv/vector-iterators.md b/gcc/config/riscv/vector-iterators.md
> index 2f7f7cbe08c..e70a9bc5c74 100644
> --- a/gcc/config/riscv/vector-iterators.md
> +++ b/gcc/config/riscv/vector-iterators.md
> @@ -2354,6 +2354,10 @@
>  (define_code_iterator sat_int_plus_binop [ss_plus us_plus])
>  (define_code_iterator sat_int_minus_binop [ss_minus us_minus])
>
> +(define_code_iterator mulh [smul_highpart umul_highpart])
> +(define_code_attr mulh_table [(smul_highpart "smul") (umul_highpart "umul")])
> +(define_code_attr MULH_UNSPEC [(smul_highpart "UNSPEC_VMULHS") (umul_highpart "UNSPEC_VMULHU")])
> +
>  (define_code_iterator any_widen_binop [plus minus mult])
>  (define_code_iterator plus_minus [plus minus])
>
> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
> new file mode 100644
> index 00000000000..6ff2dc580a4
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
> @@ -0,0 +1,29 @@
> +/* { dg-do compile } */
> +/* { dg-additional-options "-march=rv32gcv_zvfh -mabi=ilp32d --param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
> +
> +#include <stdint-gcc.h>
> +
> +#define DEF_LOOP(TYPE, WTYPE)                                                  \
> +  void __attribute__ ((noipa))                                                 \
> +  test_##TYPE##_##WTYPE (TYPE *__restrict a, TYPE *__restrict b,               \
> +                        TYPE *__restrict c, TYPE *__restrict pred, int n)     \
> +  {                                                                            \
> +    for (int i = 0; i < n; ++i)                                                \
> +      c[i] = pred[i]                                                           \
> +              ? (TYPE) (((WTYPE) a[i] * (WTYPE) b[i]) >> sizeof (TYPE) * 8)   \
> +              : c[i];                                                         \
> +  }
> +
> +#define TEST_ALL(T)                                                            \
> +  T (int8_t, int16_t)                                                          \
> +  T (int16_t, int32_t)                                                         \
> +  T (int32_t, int64_t)                                                         \
> +  T (uint8_t, uint16_t)                                                        \
> +  T (uint16_t, uint32_t)                                                       \
> +  T (uint32_t, uint64_t)
> +
> +TEST_ALL (DEF_LOOP)
> +
> +/* { dg-final { scan-assembler-times {\tvmulh\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 3 } } */
> +/* { dg-final { scan-assembler-times {\tvmulhu\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 3 } } */
> +/* { dg-final { scan-assembler-not {\tvmerge\.vvm\t} } } */
> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
> new file mode 100644
> index 00000000000..c4c2b50f203
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
> @@ -0,0 +1,30 @@
> +/* { dg-do compile } */
> +/* { dg-additional-options "-march=rv32gcv_zvfh -mabi=ilp32d --param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
> +
> +#include <stdint-gcc.h>
> +
> +#define DEF_LOOP(TYPE1, WTYPE1, TYPE2, WTYPE2, TYPE3)                          \
> +  void __attribute__ ((noipa))                                                 \
> +  test_##TYPE1##_##TYPE2 (TYPE1 *__restrict a, TYPE2 *__restrict b,            \
> +                         TYPE3 *__restrict c, TYPE3 *__restrict pred, int n)  \
> +  {                                                                            \
> +    for (int i = 0; i < n; ++i)                                                \
> +      c[i]                                                                     \
> +       = pred[i]                                                              \
> +           ? (TYPE3) (((WTYPE1) a[i] * (WTYPE2) b[i]) >> sizeof (TYPE1) * 8)  \
> +           : c[i];                                                            \
> +  }
> +
> +#define TEST_ALL(T)                                                            \
> +  T (int8_t, int16_t, uint8_t, uint16_t, int8_t)                               \
> +  T (uint8_t, uint16_t, int8_t, int16_t, int8_t)                               \
> +  T (int16_t, int32_t, uint16_t, uint32_t, int16_t)                            \
> +  T (uint16_t, uint32_t, int16_t, int32_t, int16_t)                            \
> +  T (int32_t, int64_t, uint32_t, uint64_t, int32_t)                            \
> +  T (uint32_t, uint64_t, int32_t, int64_t, int32_t)
> +
> +TEST_ALL (DEF_LOOP)
> +
> +/* FIXME: need midend support: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111381 */
> +/* { dg-final { scan-assembler-times {\tvmulhsu\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 6 {xfail riscv*-*-*} } } */
> +/* { dg-final { scan-assembler-not {\tvmerge\.vvm\t} } } */
> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
> new file mode 100644
> index 00000000000..8260472add7
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
> @@ -0,0 +1,32 @@
> +/* { dg-do run { target { riscv_vector } } } */
> +/* { dg-additional-options "--param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
> +
> +#include "cond_mulh-1.c"
> +
> +#define N 99
> +
> +#define TEST_LOOP(TYPE, WTYPE)                                                 \
> +  {                                                                            \
> +    TYPE a[N], b[N], c[N], pred[N];                                            \
> +    for (int i = 0; i < N; ++i)                                                \
> +      {                                                                        \
> +       a[i] = i * i;                                                          \
> +       b[i] = ((i + 2) % 3) * (i + 1);                                        \
> +       pred[i] = i % 3;                                                       \
> +       asm volatile ("" ::: "memory");                                        \
> +      }                                                                        \
> +    test_##TYPE##_##WTYPE (a, b, c, pred, N);                                  \
> +    for (int i = 0; i < N; ++i)                                                \
> +      if (c[i]                                                                 \
> +         != (pred[i]                                                          \
> +               ? (TYPE) (((WTYPE) a[i] * (WTYPE) b[i]) >> sizeof (TYPE) * 8)  \
> +               : c[i]))                                                       \
> +       __builtin_abort ();                                                    \
> +  }
> +
> +int
> +main ()
> +{
> +  TEST_ALL (TEST_LOOP)
> +  return 0;
> +}
> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
> new file mode 100644
> index 00000000000..24612ff84f7
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
> @@ -0,0 +1,34 @@
> +/* { dg-do run { target { riscv_vector } } } */
> +/* { dg-additional-options "--param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
> +
> +#include "cond_mulh-2.c"
> +
> +#define N 99
> +
> +#define TEST_LOOP(TYPE1, WTYPE1, TYPE2, WTYPE2, TYPE3)                         \
> +  {                                                                            \
> +    TYPE1 a[N];                                                                \
> +    TYPE2 b[N];                                                                \
> +    TYPE3 c[N], pred[N];                                                       \
> +    for (int i = 0; i < N; ++i)                                                \
> +      {                                                                        \
> +       a[i] = i * i;                                                          \
> +       b[i] = ((i + 2) % 3) * (i + 1);                                        \
> +       pred[i] = i % 3;                                                       \
> +       asm volatile ("" ::: "memory");                                        \
> +      }                                                                        \
> +    test_##TYPE1##_##TYPE2 (a, b, c, pred, N);                                 \
> +    for (int i = 0; i < N; ++i)                                                \
> +      if (c[i]                                                                 \
> +         != (pred[i] ? (TYPE3) (((WTYPE1) a[i] * (WTYPE2) b[i])               \
> +                                >> sizeof (TYPE1) * 8)                        \
> +                     : c[i]))                                                 \
> +       __builtin_abort ();                                                    \
> +  }
> +
> +int
> +main ()
> +{
> +  TEST_ALL (TEST_LOOP)
> +  return 0;
> +}
> --
> 2.36.3
>

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] RISC-V: Support cond vmulh.vv and vmulu.vv
  2023-09-13  8:50 ` Kito Cheng
@ 2023-09-13 10:35   ` Lehua Ding
  0 siblings, 0 replies; 3+ messages in thread
From: Lehua Ding @ 2023-09-13 10:35 UTC (permalink / raw)
  To: Kito Cheng; +Cc: gcc-patches, juzhe.zhong, rdapp.gcc, palmer, jeffreyalaw

Committed, thanks Kito.

On 2023/9/13 16:50, Kito Cheng wrote:
> LGTM, thanks :)
> 
> On Wed, Sep 13, 2023 at 12:25 AM Lehua Ding <lehua.ding@rivai.ai> wrote:
>>
>> This patch adds combine patterns to combine vmulh[u].vv + vcond_mask
>> to mask vmulh[u].vv. For vmulsu.vv, it can not be produced in midend
>> currently. We will send another patch to take this issue.
>>
>> gcc/ChangeLog:
>>
>>          * config/riscv/autovec-opt.md (*cond_<mulh_table><mode>3_highpart):
>>          New combine pattern.
>>          * config/riscv/autovec.md (smul<mode>3_highpart): Mrege smul and umul.
>>          (<mulh_table><mode>3_highpart): Merged pattern.
>>          (umul<mode>3_highpart): Mrege smul and umul.
>>          * config/riscv/vector-iterators.md (umul): New iterators.
>>          (UNSPEC_VMULHU): New iterators.
>>
>> gcc/testsuite/ChangeLog:
>>
>>          * gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c: New test.
>>          * gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c: New test.
>>          * gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c: New test.
>>          * gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c: New test.
>>
>> ---
>>   gcc/config/riscv/autovec-opt.md               | 23 ++++++++++++-
>>   gcc/config/riscv/autovec.md                   | 22 ++----------
>>   gcc/config/riscv/vector-iterators.md          |  4 +++
>>   .../riscv/rvv/autovec/cond/cond_mulh-1.c      | 29 ++++++++++++++++
>>   .../riscv/rvv/autovec/cond/cond_mulh-2.c      | 30 ++++++++++++++++
>>   .../riscv/rvv/autovec/cond/cond_mulh_run-1.c  | 32 +++++++++++++++++
>>   .../riscv/rvv/autovec/cond/cond_mulh_run-2.c  | 34 +++++++++++++++++++
>>   7 files changed, 154 insertions(+), 20 deletions(-)
>>   create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
>>   create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
>>   create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
>>   create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
>>
>> diff --git a/gcc/config/riscv/autovec-opt.md b/gcc/config/riscv/autovec-opt.md
>> index 0d2721f0b29..552be48bf73 100644
>> --- a/gcc/config/riscv/autovec-opt.md
>> +++ b/gcc/config/riscv/autovec-opt.md
>> @@ -970,6 +970,28 @@
>>   }
>>    [(set_attr "type" "vnshift")])
>>
>> +;; Combine vmulh.vv/vmulhu.vv + vcond_mask
>> +(define_insn_and_split "*cond_<mulh_table><mode>3_highpart"
>> +   [(set (match_operand:VFULLI 0 "register_operand")
>> +    (if_then_else:VFULLI
>> +      (match_operand:<VM> 1 "register_operand")
>> +      (mulh:VFULLI
>> +        (match_operand:VFULLI 2 "register_operand")
>> +        (match_operand:VFULLI 3 "register_operand"))
>> +      (match_operand:VFULLI 4 "register_operand")))]
>> +   "TARGET_VECTOR && can_create_pseudo_p ()"
>> +   "#"
>> +   "&& 1"
>> +   [(const_int 0)]
>> +{
>> +  insn_code icode = code_for_pred_mulh (<MULH_UNSPEC>, <MODE>mode);
>> +  rtx ops[] = {operands[0], operands[1], operands[2], operands[3], operands[4],
>> +               gen_int_mode (GET_MODE_NUNITS (<MODE>mode), Pmode)};
>> +  riscv_vector::expand_cond_len_binop (icode, ops);
>> +   DONE;
>> +}
>> +[(set_attr "type" "vector")])
>> +
>>   ;; =============================================================================
>>   ;; Combine extend + binop to widen_binop
>>   ;; =============================================================================
>> @@ -1172,7 +1194,6 @@
>>   }
>>   [(set_attr "type" "vfwmul")])
>>
>> -
>>   ;; =============================================================================
>>   ;; Misc combine patterns
>>   ;; =============================================================================
>> diff --git a/gcc/config/riscv/autovec.md b/gcc/config/riscv/autovec.md
>> index e9dd40af935..b4ac22bb97b 100644
>> --- a/gcc/config/riscv/autovec.md
>> +++ b/gcc/config/riscv/autovec.md
>> @@ -1569,9 +1569,9 @@
>>   ;; - vmulhu.vv
>>   ;; -------------------------------------------------------------------------
>>
>> -(define_insn_and_split "smul<mode>3_highpart"
>> +(define_insn_and_split "<mulh_table><mode>3_highpart"
>>     [(set (match_operand:VFULLI 0 "register_operand")
>> -        (smul_highpart:VFULLI
>> +        (mulh:VFULLI
>>             (match_operand:VFULLI 1 "register_operand")
>>             (match_operand:VFULLI 2 "register_operand")))]
>>     "TARGET_VECTOR && can_create_pseudo_p ()"
>> @@ -1579,23 +1579,7 @@
>>     "&& 1"
>>     [(const_int 0)]
>>   {
>> -  insn_code icode = code_for_pred_mulh (UNSPEC_VMULHS, <MODE>mode);
>> -  riscv_vector::emit_vlmax_insn (icode, riscv_vector::BINARY_OP, operands);
>> -  DONE;
>> -}
>> -[(set_attr "type" "vimul")])
>> -
>> -(define_insn_and_split "umul<mode>3_highpart"
>> -  [(set (match_operand:VFULLI 0 "register_operand")
>> -        (umul_highpart:VFULLI
>> -          (match_operand:VFULLI 1 "register_operand")
>> -          (match_operand:VFULLI 2 "register_operand")))]
>> -  "TARGET_VECTOR && can_create_pseudo_p ()"
>> -  "#"
>> -  "&& 1"
>> -  [(const_int 0)]
>> -{
>> -  insn_code icode = code_for_pred_mulh (UNSPEC_VMULHU, <MODE>mode);
>> +  insn_code icode = code_for_pred_mulh (<MULH_UNSPEC>, <MODE>mode);
>>     riscv_vector::emit_vlmax_insn (icode, riscv_vector::BINARY_OP, operands);
>>     DONE;
>>   }
>> diff --git a/gcc/config/riscv/vector-iterators.md b/gcc/config/riscv/vector-iterators.md
>> index 2f7f7cbe08c..e70a9bc5c74 100644
>> --- a/gcc/config/riscv/vector-iterators.md
>> +++ b/gcc/config/riscv/vector-iterators.md
>> @@ -2354,6 +2354,10 @@
>>   (define_code_iterator sat_int_plus_binop [ss_plus us_plus])
>>   (define_code_iterator sat_int_minus_binop [ss_minus us_minus])
>>
>> +(define_code_iterator mulh [smul_highpart umul_highpart])
>> +(define_code_attr mulh_table [(smul_highpart "smul") (umul_highpart "umul")])
>> +(define_code_attr MULH_UNSPEC [(smul_highpart "UNSPEC_VMULHS") (umul_highpart "UNSPEC_VMULHU")])
>> +
>>   (define_code_iterator any_widen_binop [plus minus mult])
>>   (define_code_iterator plus_minus [plus minus])
>>
>> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
>> new file mode 100644
>> index 00000000000..6ff2dc580a4
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
>> @@ -0,0 +1,29 @@
>> +/* { dg-do compile } */
>> +/* { dg-additional-options "-march=rv32gcv_zvfh -mabi=ilp32d --param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
>> +
>> +#include <stdint-gcc.h>
>> +
>> +#define DEF_LOOP(TYPE, WTYPE)                                                  \
>> +  void __attribute__ ((noipa))                                                 \
>> +  test_##TYPE##_##WTYPE (TYPE *__restrict a, TYPE *__restrict b,               \
>> +                        TYPE *__restrict c, TYPE *__restrict pred, int n)     \
>> +  {                                                                            \
>> +    for (int i = 0; i < n; ++i)                                                \
>> +      c[i] = pred[i]                                                           \
>> +              ? (TYPE) (((WTYPE) a[i] * (WTYPE) b[i]) >> sizeof (TYPE) * 8)   \
>> +              : c[i];                                                         \
>> +  }
>> +
>> +#define TEST_ALL(T)                                                            \
>> +  T (int8_t, int16_t)                                                          \
>> +  T (int16_t, int32_t)                                                         \
>> +  T (int32_t, int64_t)                                                         \
>> +  T (uint8_t, uint16_t)                                                        \
>> +  T (uint16_t, uint32_t)                                                       \
>> +  T (uint32_t, uint64_t)
>> +
>> +TEST_ALL (DEF_LOOP)
>> +
>> +/* { dg-final { scan-assembler-times {\tvmulh\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 3 } } */
>> +/* { dg-final { scan-assembler-times {\tvmulhu\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 3 } } */
>> +/* { dg-final { scan-assembler-not {\tvmerge\.vvm\t} } } */
>> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
>> new file mode 100644
>> index 00000000000..c4c2b50f203
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
>> @@ -0,0 +1,30 @@
>> +/* { dg-do compile } */
>> +/* { dg-additional-options "-march=rv32gcv_zvfh -mabi=ilp32d --param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
>> +
>> +#include <stdint-gcc.h>
>> +
>> +#define DEF_LOOP(TYPE1, WTYPE1, TYPE2, WTYPE2, TYPE3)                          \
>> +  void __attribute__ ((noipa))                                                 \
>> +  test_##TYPE1##_##TYPE2 (TYPE1 *__restrict a, TYPE2 *__restrict b,            \
>> +                         TYPE3 *__restrict c, TYPE3 *__restrict pred, int n)  \
>> +  {                                                                            \
>> +    for (int i = 0; i < n; ++i)                                                \
>> +      c[i]                                                                     \
>> +       = pred[i]                                                              \
>> +           ? (TYPE3) (((WTYPE1) a[i] * (WTYPE2) b[i]) >> sizeof (TYPE1) * 8)  \
>> +           : c[i];                                                            \
>> +  }
>> +
>> +#define TEST_ALL(T)                                                            \
>> +  T (int8_t, int16_t, uint8_t, uint16_t, int8_t)                               \
>> +  T (uint8_t, uint16_t, int8_t, int16_t, int8_t)                               \
>> +  T (int16_t, int32_t, uint16_t, uint32_t, int16_t)                            \
>> +  T (uint16_t, uint32_t, int16_t, int32_t, int16_t)                            \
>> +  T (int32_t, int64_t, uint32_t, uint64_t, int32_t)                            \
>> +  T (uint32_t, uint64_t, int32_t, int64_t, int32_t)
>> +
>> +TEST_ALL (DEF_LOOP)
>> +
>> +/* FIXME: need midend support: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111381 */
>> +/* { dg-final { scan-assembler-times {\tvmulhsu\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 6 {xfail riscv*-*-*} } } */
>> +/* { dg-final { scan-assembler-not {\tvmerge\.vvm\t} } } */
>> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
>> new file mode 100644
>> index 00000000000..8260472add7
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
>> @@ -0,0 +1,32 @@
>> +/* { dg-do run { target { riscv_vector } } } */
>> +/* { dg-additional-options "--param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
>> +
>> +#include "cond_mulh-1.c"
>> +
>> +#define N 99
>> +
>> +#define TEST_LOOP(TYPE, WTYPE)                                                 \
>> +  {                                                                            \
>> +    TYPE a[N], b[N], c[N], pred[N];                                            \
>> +    for (int i = 0; i < N; ++i)                                                \
>> +      {                                                                        \
>> +       a[i] = i * i;                                                          \
>> +       b[i] = ((i + 2) % 3) * (i + 1);                                        \
>> +       pred[i] = i % 3;                                                       \
>> +       asm volatile ("" ::: "memory");                                        \
>> +      }                                                                        \
>> +    test_##TYPE##_##WTYPE (a, b, c, pred, N);                                  \
>> +    for (int i = 0; i < N; ++i)                                                \
>> +      if (c[i]                                                                 \
>> +         != (pred[i]                                                          \
>> +               ? (TYPE) (((WTYPE) a[i] * (WTYPE) b[i]) >> sizeof (TYPE) * 8)  \
>> +               : c[i]))                                                       \
>> +       __builtin_abort ();                                                    \
>> +  }
>> +
>> +int
>> +main ()
>> +{
>> +  TEST_ALL (TEST_LOOP)
>> +  return 0;
>> +}
>> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
>> new file mode 100644
>> index 00000000000..24612ff84f7
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
>> @@ -0,0 +1,34 @@
>> +/* { dg-do run { target { riscv_vector } } } */
>> +/* { dg-additional-options "--param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
>> +
>> +#include "cond_mulh-2.c"
>> +
>> +#define N 99
>> +
>> +#define TEST_LOOP(TYPE1, WTYPE1, TYPE2, WTYPE2, TYPE3)                         \
>> +  {                                                                            \
>> +    TYPE1 a[N];                                                                \
>> +    TYPE2 b[N];                                                                \
>> +    TYPE3 c[N], pred[N];                                                       \
>> +    for (int i = 0; i < N; ++i)                                                \
>> +      {                                                                        \
>> +       a[i] = i * i;                                                          \
>> +       b[i] = ((i + 2) % 3) * (i + 1);                                        \
>> +       pred[i] = i % 3;                                                       \
>> +       asm volatile ("" ::: "memory");                                        \
>> +      }                                                                        \
>> +    test_##TYPE1##_##TYPE2 (a, b, c, pred, N);                                 \
>> +    for (int i = 0; i < N; ++i)                                                \
>> +      if (c[i]                                                                 \
>> +         != (pred[i] ? (TYPE3) (((WTYPE1) a[i] * (WTYPE2) b[i])               \
>> +                                >> sizeof (TYPE1) * 8)                        \
>> +                     : c[i]))                                                 \
>> +       __builtin_abort ();                                                    \
>> +  }
>> +
>> +int
>> +main ()
>> +{
>> +  TEST_ALL (TEST_LOOP)
>> +  return 0;
>> +}
>> --
>> 2.36.3
>>
> 

-- 
Best,
Lehua

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2023-09-13 10:45 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-09-12 16:25 [PATCH] RISC-V: Support cond vmulh.vv and vmulu.vv Lehua Ding
2023-09-13  8:50 ` Kito Cheng
2023-09-13 10:35   ` Lehua Ding

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).