public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: Hongtao Liu <crazylht@gmail.com>
To: Richard Biener <richard.guenther@gmail.com>
Cc: liuhongt <hongtao.liu@intel.com>, gcc-patches@gcc.gnu.org
Subject: Re: [PATCH] [x86] Refine maskstore patterns with UNSPEC_MASKMOV.
Date: Tue, 27 Jun 2023 15:46:22 +0800	[thread overview]
Message-ID: <CAMZc-bz+9JEe1FFLyRkpWg8kaMzhtUbqpX1PUxPUvLd1LykOVA@mail.gmail.com> (raw)
In-Reply-To: <CAMZc-bxBPz3aMMw5L3DoxxABb79+H1AXc+6Mhf5fLxvmH5K1Eg@mail.gmail.com>

On Tue, Jun 27, 2023 at 3:28 PM Hongtao Liu <crazylht@gmail.com> wrote:
>
> On Tue, Jun 27, 2023 at 3:20 PM Richard Biener via Gcc-patches
> <gcc-patches@gcc.gnu.org> wrote:
> >
> > On Tue, Jun 27, 2023 at 7:38 AM liuhongt <hongtao.liu@intel.com> wrote:
> > >
> > > At the rtl level, we cannot guarantee that the maskstore is not optimized
> > > to other full-memory accesses, as the current implementations are equivalent
> > > in terms of pattern, to solve this potential problem, this patch refines
> > > the pattern of the maskstore and the intrinsics with unspec.
> > >
> > > One thing I'm not sure is VCOND_EXPR, should VCOND_EXPR also expect
> > > fault suppression for masked-out elements?
> >
> > You mean the vcond and vcond_eq optabs?  No, those do not expect
> > fault suppression.
> Yes, vcond/vcond_eq, thanks for clarifying.
> >
> > >
> > > Currently we're still using vec_merge for both AVX2 and AVX512 target.
> > >
> > > ------------------------
> > > Similar like r14-2070-gc79476da46728e
> > >
> > > If mem_addr points to a memory region with less than whole vector size
> > > bytes of accessible memory and k is a mask that would prevent reading
> > > the inaccessible bytes from mem_addr, add UNSPEC_MASKMOV to prevent
> > > it to be transformed to any other whole memory access instructions.
> > >
> > > Bootstrapped and regtested on x86_64-pc-linu-gnu{-m32,}.
> > > Ready to push to trunk.
I'm going to backpart this patch and masload one[1] to GCC11/GCC12/GCC13

[1] https://gcc.gnu.org/pipermail/gcc-patches/2023-June/622410.html
> > >
> > > gcc/ChangeLog:
> > >
> > >         PR rtl-optimization/110237
> > >         * config/i386/sse.md (<avx512>_store<mode>_mask): Refine with
> > >         UNSPEC_MASKMOV.
> > >         (maskstore<mode><avx512fmaskmodelower): Ditto.
> > >         (*<avx512>_store<mode>_mask): New define_insn, it's renamed
> > >         from original <avx512>_store<mode>_mask.
> > > ---
> > >  gcc/config/i386/sse.md | 69 ++++++++++++++++++++++++++++++++++--------
> > >  1 file changed, 57 insertions(+), 12 deletions(-)
> > >
> > > diff --git a/gcc/config/i386/sse.md b/gcc/config/i386/sse.md
> > > index 3b50c7117f8..812cfca4b92 100644
> > > --- a/gcc/config/i386/sse.md
> > > +++ b/gcc/config/i386/sse.md
> > > @@ -1608,7 +1608,7 @@ (define_insn "<avx512>_blendm<mode>"
> > >     (set_attr "prefix" "evex")
> > >     (set_attr "mode" "<sseinsnmode>")])
> > >
> > > -(define_insn "<avx512>_store<mode>_mask"
> > > +(define_insn "*<avx512>_store<mode>_mask"
> > >    [(set (match_operand:V48_AVX512VL 0 "memory_operand" "=m")
> > >         (vec_merge:V48_AVX512VL
> > >           (match_operand:V48_AVX512VL 1 "register_operand" "v")
> > > @@ -1636,7 +1636,7 @@ (define_insn "<avx512>_store<mode>_mask"
> > >     (set_attr "memory" "store")
> > >     (set_attr "mode" "<sseinsnmode>")])
> > >
> > > -(define_insn "<avx512>_store<mode>_mask"
> > > +(define_insn "*<avx512>_store<mode>_mask"
> > >    [(set (match_operand:VI12HFBF_AVX512VL 0 "memory_operand" "=m")
> > >         (vec_merge:VI12HFBF_AVX512VL
> > >           (match_operand:VI12HFBF_AVX512VL 1 "register_operand" "v")
> > > @@ -27008,21 +27008,66 @@ (define_expand "maskstore<mode><sseintvecmodelower>"
> > >    "TARGET_AVX")
> > >
> > >  (define_expand "maskstore<mode><avx512fmaskmodelower>"
> > > -  [(set (match_operand:V48H_AVX512VL 0 "memory_operand")
> > > -       (vec_merge:V48H_AVX512VL
> > > -         (match_operand:V48H_AVX512VL 1 "register_operand")
> > > -         (match_dup 0)
> > > -         (match_operand:<avx512fmaskmode> 2 "register_operand")))]
> > > +  [(set (match_operand:V48_AVX512VL 0 "memory_operand")
> > > +       (unspec:V48_AVX512VL
> > > +         [(match_operand:V48_AVX512VL 1 "register_operand")
> > > +          (match_dup 0)
> > > +          (match_operand:<avx512fmaskmode> 2 "register_operand")]
> > > +         UNSPEC_MASKMOV))]
> > >    "TARGET_AVX512F")
> > >
> > >  (define_expand "maskstore<mode><avx512fmaskmodelower>"
> > > -  [(set (match_operand:VI12_AVX512VL 0 "memory_operand")
> > > -       (vec_merge:VI12_AVX512VL
> > > -         (match_operand:VI12_AVX512VL 1 "register_operand")
> > > -         (match_dup 0)
> > > -         (match_operand:<avx512fmaskmode> 2 "register_operand")))]
> > > +  [(set (match_operand:VI12HFBF_AVX512VL 0 "memory_operand")
> > > +       (unspec:VI12HFBF_AVX512VL
> > > +         [(match_operand:VI12HFBF_AVX512VL 1 "register_operand")
> > > +          (match_dup 0)
> > > +          (match_operand:<avx512fmaskmode> 2 "register_operand")]
> > > +         UNSPEC_MASKMOV))]
> > >    "TARGET_AVX512BW")
> > >
> > > +(define_insn "<avx512>_store<mode>_mask"
> > > +  [(set (match_operand:V48_AVX512VL 0 "memory_operand" "=m")
> > > +       (unspec:V48_AVX512VL
> > > +         [(match_operand:V48_AVX512VL 1 "register_operand" "v")
> > > +          (match_dup 0)
> > > +          (match_operand:<avx512fmaskmode> 2 "register_operand" "Yk")]
> > > +         UNSPEC_MASKMOV))]
> > > +  "TARGET_AVX512F"
> > > +{
> > > +  if (FLOAT_MODE_P (GET_MODE_INNER (<MODE>mode)))
> > > +    {
> > > +      if (misaligned_operand (operands[0], <MODE>mode))
> > > +       return "vmovu<ssemodesuffix>\t{%1, %0%{%2%}|%0%{%2%}, %1}";
> > > +      else
> > > +       return "vmova<ssemodesuffix>\t{%1, %0%{%2%}|%0%{%2%}, %1}";
> > > +    }
> > > +  else
> > > +    {
> > > +      if (misaligned_operand (operands[0], <MODE>mode))
> > > +       return "vmovdqu<ssescalarsize>\t{%1, %0%{%2%}|%0%{%2%}, %1}";
> > > +      else
> > > +       return "vmovdqa<ssescalarsize>\t{%1, %0%{%2%}|%0%{%2%}, %1}";
> > > +    }
> > > +}
> > > +  [(set_attr "type" "ssemov")
> > > +   (set_attr "prefix" "evex")
> > > +   (set_attr "memory" "store")
> > > +   (set_attr "mode" "<sseinsnmode>")])
> > > +
> > > +(define_insn "<avx512>_store<mode>_mask"
> > > +  [(set (match_operand:VI12HFBF_AVX512VL 0 "memory_operand" "=m")
> > > +       (unspec:VI12HFBF_AVX512VL
> > > +         [(match_operand:VI12HFBF_AVX512VL 1 "register_operand" "v")
> > > +          (match_dup 0)
> > > +          (match_operand:<avx512fmaskmode> 2 "register_operand" "Yk")]
> > > +          UNSPEC_MASKMOV))]
> > > +  "TARGET_AVX512BW"
> > > +  "vmovdqu<ssescalarsize>\t{%1, %0%{%2%}|%0%{%2%}, %1}"
> > > +  [(set_attr "type" "ssemov")
> > > +   (set_attr "prefix" "evex")
> > > +   (set_attr "memory" "store")
> > > +   (set_attr "mode" "<sseinsnmode>")])
> > > +
> > >  (define_expand "cbranch<mode>4"
> > >    [(set (reg:CC FLAGS_REG)
> > >         (compare:CC (match_operand:VI48_AVX 1 "register_operand")
> > > --
> > > 2.39.1.388.g2fc9e9ca3c
> > >
>
>
>
> --
> BR,
> Hongtao



-- 
BR,
Hongtao

      reply	other threads:[~2023-06-27  7:46 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-06-27  5:38 liuhongt
2023-06-27  7:20 ` Richard Biener
2023-06-27  7:28   ` Hongtao Liu
2023-06-27  7:46     ` Hongtao Liu [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CAMZc-bz+9JEe1FFLyRkpWg8kaMzhtUbqpX1PUxPUvLd1LykOVA@mail.gmail.com \
    --to=crazylht@gmail.com \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=hongtao.liu@intel.com \
    --cc=richard.guenther@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).