public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: Lehua Ding <lehua.ding@rivai.ai>
To: Kito Cheng <kito.cheng@gmail.com>
Cc: gcc-patches@gcc.gnu.org, juzhe.zhong@rivai.ai,
	rdapp.gcc@gmail.com, palmer@rivosinc.com, jeffreyalaw@gmail.com
Subject: Re: [PATCH] RISC-V: Support cond vmulh.vv and vmulu.vv
Date: Wed, 13 Sep 2023 18:35:44 +0800	[thread overview]
Message-ID: <8349D5CF626A0FDC+c3e23287-6b24-4f59-bc2c-bf8c5ac2f048@rivai.ai> (raw)
In-Reply-To: <CA+yXCZC6vMRPa0xhMwBVrMXhZYUOvSBcFd+KirNv-U_5Ai4eHA@mail.gmail.com>

Committed, thanks Kito.

On 2023/9/13 16:50, Kito Cheng wrote:
> LGTM, thanks :)
> 
> On Wed, Sep 13, 2023 at 12:25 AM Lehua Ding <lehua.ding@rivai.ai> wrote:
>>
>> This patch adds combine patterns to combine vmulh[u].vv + vcond_mask
>> to mask vmulh[u].vv. For vmulsu.vv, it can not be produced in midend
>> currently. We will send another patch to take this issue.
>>
>> gcc/ChangeLog:
>>
>>          * config/riscv/autovec-opt.md (*cond_<mulh_table><mode>3_highpart):
>>          New combine pattern.
>>          * config/riscv/autovec.md (smul<mode>3_highpart): Mrege smul and umul.
>>          (<mulh_table><mode>3_highpart): Merged pattern.
>>          (umul<mode>3_highpart): Mrege smul and umul.
>>          * config/riscv/vector-iterators.md (umul): New iterators.
>>          (UNSPEC_VMULHU): New iterators.
>>
>> gcc/testsuite/ChangeLog:
>>
>>          * gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c: New test.
>>          * gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c: New test.
>>          * gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c: New test.
>>          * gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c: New test.
>>
>> ---
>>   gcc/config/riscv/autovec-opt.md               | 23 ++++++++++++-
>>   gcc/config/riscv/autovec.md                   | 22 ++----------
>>   gcc/config/riscv/vector-iterators.md          |  4 +++
>>   .../riscv/rvv/autovec/cond/cond_mulh-1.c      | 29 ++++++++++++++++
>>   .../riscv/rvv/autovec/cond/cond_mulh-2.c      | 30 ++++++++++++++++
>>   .../riscv/rvv/autovec/cond/cond_mulh_run-1.c  | 32 +++++++++++++++++
>>   .../riscv/rvv/autovec/cond/cond_mulh_run-2.c  | 34 +++++++++++++++++++
>>   7 files changed, 154 insertions(+), 20 deletions(-)
>>   create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
>>   create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
>>   create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
>>   create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
>>
>> diff --git a/gcc/config/riscv/autovec-opt.md b/gcc/config/riscv/autovec-opt.md
>> index 0d2721f0b29..552be48bf73 100644
>> --- a/gcc/config/riscv/autovec-opt.md
>> +++ b/gcc/config/riscv/autovec-opt.md
>> @@ -970,6 +970,28 @@
>>   }
>>    [(set_attr "type" "vnshift")])
>>
>> +;; Combine vmulh.vv/vmulhu.vv + vcond_mask
>> +(define_insn_and_split "*cond_<mulh_table><mode>3_highpart"
>> +   [(set (match_operand:VFULLI 0 "register_operand")
>> +    (if_then_else:VFULLI
>> +      (match_operand:<VM> 1 "register_operand")
>> +      (mulh:VFULLI
>> +        (match_operand:VFULLI 2 "register_operand")
>> +        (match_operand:VFULLI 3 "register_operand"))
>> +      (match_operand:VFULLI 4 "register_operand")))]
>> +   "TARGET_VECTOR && can_create_pseudo_p ()"
>> +   "#"
>> +   "&& 1"
>> +   [(const_int 0)]
>> +{
>> +  insn_code icode = code_for_pred_mulh (<MULH_UNSPEC>, <MODE>mode);
>> +  rtx ops[] = {operands[0], operands[1], operands[2], operands[3], operands[4],
>> +               gen_int_mode (GET_MODE_NUNITS (<MODE>mode), Pmode)};
>> +  riscv_vector::expand_cond_len_binop (icode, ops);
>> +   DONE;
>> +}
>> +[(set_attr "type" "vector")])
>> +
>>   ;; =============================================================================
>>   ;; Combine extend + binop to widen_binop
>>   ;; =============================================================================
>> @@ -1172,7 +1194,6 @@
>>   }
>>   [(set_attr "type" "vfwmul")])
>>
>> -
>>   ;; =============================================================================
>>   ;; Misc combine patterns
>>   ;; =============================================================================
>> diff --git a/gcc/config/riscv/autovec.md b/gcc/config/riscv/autovec.md
>> index e9dd40af935..b4ac22bb97b 100644
>> --- a/gcc/config/riscv/autovec.md
>> +++ b/gcc/config/riscv/autovec.md
>> @@ -1569,9 +1569,9 @@
>>   ;; - vmulhu.vv
>>   ;; -------------------------------------------------------------------------
>>
>> -(define_insn_and_split "smul<mode>3_highpart"
>> +(define_insn_and_split "<mulh_table><mode>3_highpart"
>>     [(set (match_operand:VFULLI 0 "register_operand")
>> -        (smul_highpart:VFULLI
>> +        (mulh:VFULLI
>>             (match_operand:VFULLI 1 "register_operand")
>>             (match_operand:VFULLI 2 "register_operand")))]
>>     "TARGET_VECTOR && can_create_pseudo_p ()"
>> @@ -1579,23 +1579,7 @@
>>     "&& 1"
>>     [(const_int 0)]
>>   {
>> -  insn_code icode = code_for_pred_mulh (UNSPEC_VMULHS, <MODE>mode);
>> -  riscv_vector::emit_vlmax_insn (icode, riscv_vector::BINARY_OP, operands);
>> -  DONE;
>> -}
>> -[(set_attr "type" "vimul")])
>> -
>> -(define_insn_and_split "umul<mode>3_highpart"
>> -  [(set (match_operand:VFULLI 0 "register_operand")
>> -        (umul_highpart:VFULLI
>> -          (match_operand:VFULLI 1 "register_operand")
>> -          (match_operand:VFULLI 2 "register_operand")))]
>> -  "TARGET_VECTOR && can_create_pseudo_p ()"
>> -  "#"
>> -  "&& 1"
>> -  [(const_int 0)]
>> -{
>> -  insn_code icode = code_for_pred_mulh (UNSPEC_VMULHU, <MODE>mode);
>> +  insn_code icode = code_for_pred_mulh (<MULH_UNSPEC>, <MODE>mode);
>>     riscv_vector::emit_vlmax_insn (icode, riscv_vector::BINARY_OP, operands);
>>     DONE;
>>   }
>> diff --git a/gcc/config/riscv/vector-iterators.md b/gcc/config/riscv/vector-iterators.md
>> index 2f7f7cbe08c..e70a9bc5c74 100644
>> --- a/gcc/config/riscv/vector-iterators.md
>> +++ b/gcc/config/riscv/vector-iterators.md
>> @@ -2354,6 +2354,10 @@
>>   (define_code_iterator sat_int_plus_binop [ss_plus us_plus])
>>   (define_code_iterator sat_int_minus_binop [ss_minus us_minus])
>>
>> +(define_code_iterator mulh [smul_highpart umul_highpart])
>> +(define_code_attr mulh_table [(smul_highpart "smul") (umul_highpart "umul")])
>> +(define_code_attr MULH_UNSPEC [(smul_highpart "UNSPEC_VMULHS") (umul_highpart "UNSPEC_VMULHU")])
>> +
>>   (define_code_iterator any_widen_binop [plus minus mult])
>>   (define_code_iterator plus_minus [plus minus])
>>
>> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
>> new file mode 100644
>> index 00000000000..6ff2dc580a4
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-1.c
>> @@ -0,0 +1,29 @@
>> +/* { dg-do compile } */
>> +/* { dg-additional-options "-march=rv32gcv_zvfh -mabi=ilp32d --param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
>> +
>> +#include <stdint-gcc.h>
>> +
>> +#define DEF_LOOP(TYPE, WTYPE)                                                  \
>> +  void __attribute__ ((noipa))                                                 \
>> +  test_##TYPE##_##WTYPE (TYPE *__restrict a, TYPE *__restrict b,               \
>> +                        TYPE *__restrict c, TYPE *__restrict pred, int n)     \
>> +  {                                                                            \
>> +    for (int i = 0; i < n; ++i)                                                \
>> +      c[i] = pred[i]                                                           \
>> +              ? (TYPE) (((WTYPE) a[i] * (WTYPE) b[i]) >> sizeof (TYPE) * 8)   \
>> +              : c[i];                                                         \
>> +  }
>> +
>> +#define TEST_ALL(T)                                                            \
>> +  T (int8_t, int16_t)                                                          \
>> +  T (int16_t, int32_t)                                                         \
>> +  T (int32_t, int64_t)                                                         \
>> +  T (uint8_t, uint16_t)                                                        \
>> +  T (uint16_t, uint32_t)                                                       \
>> +  T (uint32_t, uint64_t)
>> +
>> +TEST_ALL (DEF_LOOP)
>> +
>> +/* { dg-final { scan-assembler-times {\tvmulh\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 3 } } */
>> +/* { dg-final { scan-assembler-times {\tvmulhu\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 3 } } */
>> +/* { dg-final { scan-assembler-not {\tvmerge\.vvm\t} } } */
>> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
>> new file mode 100644
>> index 00000000000..c4c2b50f203
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh-2.c
>> @@ -0,0 +1,30 @@
>> +/* { dg-do compile } */
>> +/* { dg-additional-options "-march=rv32gcv_zvfh -mabi=ilp32d --param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
>> +
>> +#include <stdint-gcc.h>
>> +
>> +#define DEF_LOOP(TYPE1, WTYPE1, TYPE2, WTYPE2, TYPE3)                          \
>> +  void __attribute__ ((noipa))                                                 \
>> +  test_##TYPE1##_##TYPE2 (TYPE1 *__restrict a, TYPE2 *__restrict b,            \
>> +                         TYPE3 *__restrict c, TYPE3 *__restrict pred, int n)  \
>> +  {                                                                            \
>> +    for (int i = 0; i < n; ++i)                                                \
>> +      c[i]                                                                     \
>> +       = pred[i]                                                              \
>> +           ? (TYPE3) (((WTYPE1) a[i] * (WTYPE2) b[i]) >> sizeof (TYPE1) * 8)  \
>> +           : c[i];                                                            \
>> +  }
>> +
>> +#define TEST_ALL(T)                                                            \
>> +  T (int8_t, int16_t, uint8_t, uint16_t, int8_t)                               \
>> +  T (uint8_t, uint16_t, int8_t, int16_t, int8_t)                               \
>> +  T (int16_t, int32_t, uint16_t, uint32_t, int16_t)                            \
>> +  T (uint16_t, uint32_t, int16_t, int32_t, int16_t)                            \
>> +  T (int32_t, int64_t, uint32_t, uint64_t, int32_t)                            \
>> +  T (uint32_t, uint64_t, int32_t, int64_t, int32_t)
>> +
>> +TEST_ALL (DEF_LOOP)
>> +
>> +/* FIXME: need midend support: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111381 */
>> +/* { dg-final { scan-assembler-times {\tvmulhsu\.vv\tv[0-9]+,v[0-9]+,v[0-9]+,v0.t} 6 {xfail riscv*-*-*} } } */
>> +/* { dg-final { scan-assembler-not {\tvmerge\.vvm\t} } } */
>> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
>> new file mode 100644
>> index 00000000000..8260472add7
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-1.c
>> @@ -0,0 +1,32 @@
>> +/* { dg-do run { target { riscv_vector } } } */
>> +/* { dg-additional-options "--param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
>> +
>> +#include "cond_mulh-1.c"
>> +
>> +#define N 99
>> +
>> +#define TEST_LOOP(TYPE, WTYPE)                                                 \
>> +  {                                                                            \
>> +    TYPE a[N], b[N], c[N], pred[N];                                            \
>> +    for (int i = 0; i < N; ++i)                                                \
>> +      {                                                                        \
>> +       a[i] = i * i;                                                          \
>> +       b[i] = ((i + 2) % 3) * (i + 1);                                        \
>> +       pred[i] = i % 3;                                                       \
>> +       asm volatile ("" ::: "memory");                                        \
>> +      }                                                                        \
>> +    test_##TYPE##_##WTYPE (a, b, c, pred, N);                                  \
>> +    for (int i = 0; i < N; ++i)                                                \
>> +      if (c[i]                                                                 \
>> +         != (pred[i]                                                          \
>> +               ? (TYPE) (((WTYPE) a[i] * (WTYPE) b[i]) >> sizeof (TYPE) * 8)  \
>> +               : c[i]))                                                       \
>> +       __builtin_abort ();                                                    \
>> +  }
>> +
>> +int
>> +main ()
>> +{
>> +  TEST_ALL (TEST_LOOP)
>> +  return 0;
>> +}
>> diff --git a/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
>> new file mode 100644
>> index 00000000000..24612ff84f7
>> --- /dev/null
>> +++ b/gcc/testsuite/gcc.target/riscv/rvv/autovec/cond/cond_mulh_run-2.c
>> @@ -0,0 +1,34 @@
>> +/* { dg-do run { target { riscv_vector } } } */
>> +/* { dg-additional-options "--param=riscv-autovec-preference=scalable -fno-vect-cost-model" } */
>> +
>> +#include "cond_mulh-2.c"
>> +
>> +#define N 99
>> +
>> +#define TEST_LOOP(TYPE1, WTYPE1, TYPE2, WTYPE2, TYPE3)                         \
>> +  {                                                                            \
>> +    TYPE1 a[N];                                                                \
>> +    TYPE2 b[N];                                                                \
>> +    TYPE3 c[N], pred[N];                                                       \
>> +    for (int i = 0; i < N; ++i)                                                \
>> +      {                                                                        \
>> +       a[i] = i * i;                                                          \
>> +       b[i] = ((i + 2) % 3) * (i + 1);                                        \
>> +       pred[i] = i % 3;                                                       \
>> +       asm volatile ("" ::: "memory");                                        \
>> +      }                                                                        \
>> +    test_##TYPE1##_##TYPE2 (a, b, c, pred, N);                                 \
>> +    for (int i = 0; i < N; ++i)                                                \
>> +      if (c[i]                                                                 \
>> +         != (pred[i] ? (TYPE3) (((WTYPE1) a[i] * (WTYPE2) b[i])               \
>> +                                >> sizeof (TYPE1) * 8)                        \
>> +                     : c[i]))                                                 \
>> +       __builtin_abort ();                                                    \
>> +  }
>> +
>> +int
>> +main ()
>> +{
>> +  TEST_ALL (TEST_LOOP)
>> +  return 0;
>> +}
>> --
>> 2.36.3
>>
> 

-- 
Best,
Lehua

      reply	other threads:[~2023-09-13 10:45 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-09-12 16:25 Lehua Ding
2023-09-13  8:50 ` Kito Cheng
2023-09-13 10:35   ` Lehua Ding [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=8349D5CF626A0FDC+c3e23287-6b24-4f59-bc2c-bf8c5ac2f048@rivai.ai \
    --to=lehua.ding@rivai.ai \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=jeffreyalaw@gmail.com \
    --cc=juzhe.zhong@rivai.ai \
    --cc=kito.cheng@gmail.com \
    --cc=palmer@rivosinc.com \
    --cc=rdapp.gcc@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).