From: Hongtao Liu <crazylht@gmail.com>
To: Richard Biener <richard.guenther@gmail.com>
Cc: liuhongt <hongtao.liu@intel.com>, gcc-patches@gcc.gnu.org
Subject: Re: [PATCH] [x86] Refine maskstore patterns with UNSPEC_MASKMOV.
Date: Tue, 27 Jun 2023 15:28:54 +0800 [thread overview]
Message-ID: <CAMZc-bxBPz3aMMw5L3DoxxABb79+H1AXc+6Mhf5fLxvmH5K1Eg@mail.gmail.com> (raw)
In-Reply-To: <CAFiYyc169Am6u3v+-R0iRy+J3EYarAKd8Xo2b07bpJaAf7U5Tw@mail.gmail.com>
On Tue, Jun 27, 2023 at 3:20 PM Richard Biener via Gcc-patches
<gcc-patches@gcc.gnu.org> wrote:
>
> On Tue, Jun 27, 2023 at 7:38 AM liuhongt <hongtao.liu@intel.com> wrote:
> >
> > At the rtl level, we cannot guarantee that the maskstore is not optimized
> > to other full-memory accesses, as the current implementations are equivalent
> > in terms of pattern, to solve this potential problem, this patch refines
> > the pattern of the maskstore and the intrinsics with unspec.
> >
> > One thing I'm not sure is VCOND_EXPR, should VCOND_EXPR also expect
> > fault suppression for masked-out elements?
>
> You mean the vcond and vcond_eq optabs? No, those do not expect
> fault suppression.
Yes, vcond/vcond_eq, thanks for clarifying.
>
> >
> > Currently we're still using vec_merge for both AVX2 and AVX512 target.
> >
> > ------------------------
> > Similar like r14-2070-gc79476da46728e
> >
> > If mem_addr points to a memory region with less than whole vector size
> > bytes of accessible memory and k is a mask that would prevent reading
> > the inaccessible bytes from mem_addr, add UNSPEC_MASKMOV to prevent
> > it to be transformed to any other whole memory access instructions.
> >
> > Bootstrapped and regtested on x86_64-pc-linu-gnu{-m32,}.
> > Ready to push to trunk.
> >
> > gcc/ChangeLog:
> >
> > PR rtl-optimization/110237
> > * config/i386/sse.md (<avx512>_store<mode>_mask): Refine with
> > UNSPEC_MASKMOV.
> > (maskstore<mode><avx512fmaskmodelower): Ditto.
> > (*<avx512>_store<mode>_mask): New define_insn, it's renamed
> > from original <avx512>_store<mode>_mask.
> > ---
> > gcc/config/i386/sse.md | 69 ++++++++++++++++++++++++++++++++++--------
> > 1 file changed, 57 insertions(+), 12 deletions(-)
> >
> > diff --git a/gcc/config/i386/sse.md b/gcc/config/i386/sse.md
> > index 3b50c7117f8..812cfca4b92 100644
> > --- a/gcc/config/i386/sse.md
> > +++ b/gcc/config/i386/sse.md
> > @@ -1608,7 +1608,7 @@ (define_insn "<avx512>_blendm<mode>"
> > (set_attr "prefix" "evex")
> > (set_attr "mode" "<sseinsnmode>")])
> >
> > -(define_insn "<avx512>_store<mode>_mask"
> > +(define_insn "*<avx512>_store<mode>_mask"
> > [(set (match_operand:V48_AVX512VL 0 "memory_operand" "=m")
> > (vec_merge:V48_AVX512VL
> > (match_operand:V48_AVX512VL 1 "register_operand" "v")
> > @@ -1636,7 +1636,7 @@ (define_insn "<avx512>_store<mode>_mask"
> > (set_attr "memory" "store")
> > (set_attr "mode" "<sseinsnmode>")])
> >
> > -(define_insn "<avx512>_store<mode>_mask"
> > +(define_insn "*<avx512>_store<mode>_mask"
> > [(set (match_operand:VI12HFBF_AVX512VL 0 "memory_operand" "=m")
> > (vec_merge:VI12HFBF_AVX512VL
> > (match_operand:VI12HFBF_AVX512VL 1 "register_operand" "v")
> > @@ -27008,21 +27008,66 @@ (define_expand "maskstore<mode><sseintvecmodelower>"
> > "TARGET_AVX")
> >
> > (define_expand "maskstore<mode><avx512fmaskmodelower>"
> > - [(set (match_operand:V48H_AVX512VL 0 "memory_operand")
> > - (vec_merge:V48H_AVX512VL
> > - (match_operand:V48H_AVX512VL 1 "register_operand")
> > - (match_dup 0)
> > - (match_operand:<avx512fmaskmode> 2 "register_operand")))]
> > + [(set (match_operand:V48_AVX512VL 0 "memory_operand")
> > + (unspec:V48_AVX512VL
> > + [(match_operand:V48_AVX512VL 1 "register_operand")
> > + (match_dup 0)
> > + (match_operand:<avx512fmaskmode> 2 "register_operand")]
> > + UNSPEC_MASKMOV))]
> > "TARGET_AVX512F")
> >
> > (define_expand "maskstore<mode><avx512fmaskmodelower>"
> > - [(set (match_operand:VI12_AVX512VL 0 "memory_operand")
> > - (vec_merge:VI12_AVX512VL
> > - (match_operand:VI12_AVX512VL 1 "register_operand")
> > - (match_dup 0)
> > - (match_operand:<avx512fmaskmode> 2 "register_operand")))]
> > + [(set (match_operand:VI12HFBF_AVX512VL 0 "memory_operand")
> > + (unspec:VI12HFBF_AVX512VL
> > + [(match_operand:VI12HFBF_AVX512VL 1 "register_operand")
> > + (match_dup 0)
> > + (match_operand:<avx512fmaskmode> 2 "register_operand")]
> > + UNSPEC_MASKMOV))]
> > "TARGET_AVX512BW")
> >
> > +(define_insn "<avx512>_store<mode>_mask"
> > + [(set (match_operand:V48_AVX512VL 0 "memory_operand" "=m")
> > + (unspec:V48_AVX512VL
> > + [(match_operand:V48_AVX512VL 1 "register_operand" "v")
> > + (match_dup 0)
> > + (match_operand:<avx512fmaskmode> 2 "register_operand" "Yk")]
> > + UNSPEC_MASKMOV))]
> > + "TARGET_AVX512F"
> > +{
> > + if (FLOAT_MODE_P (GET_MODE_INNER (<MODE>mode)))
> > + {
> > + if (misaligned_operand (operands[0], <MODE>mode))
> > + return "vmovu<ssemodesuffix>\t{%1, %0%{%2%}|%0%{%2%}, %1}";
> > + else
> > + return "vmova<ssemodesuffix>\t{%1, %0%{%2%}|%0%{%2%}, %1}";
> > + }
> > + else
> > + {
> > + if (misaligned_operand (operands[0], <MODE>mode))
> > + return "vmovdqu<ssescalarsize>\t{%1, %0%{%2%}|%0%{%2%}, %1}";
> > + else
> > + return "vmovdqa<ssescalarsize>\t{%1, %0%{%2%}|%0%{%2%}, %1}";
> > + }
> > +}
> > + [(set_attr "type" "ssemov")
> > + (set_attr "prefix" "evex")
> > + (set_attr "memory" "store")
> > + (set_attr "mode" "<sseinsnmode>")])
> > +
> > +(define_insn "<avx512>_store<mode>_mask"
> > + [(set (match_operand:VI12HFBF_AVX512VL 0 "memory_operand" "=m")
> > + (unspec:VI12HFBF_AVX512VL
> > + [(match_operand:VI12HFBF_AVX512VL 1 "register_operand" "v")
> > + (match_dup 0)
> > + (match_operand:<avx512fmaskmode> 2 "register_operand" "Yk")]
> > + UNSPEC_MASKMOV))]
> > + "TARGET_AVX512BW"
> > + "vmovdqu<ssescalarsize>\t{%1, %0%{%2%}|%0%{%2%}, %1}"
> > + [(set_attr "type" "ssemov")
> > + (set_attr "prefix" "evex")
> > + (set_attr "memory" "store")
> > + (set_attr "mode" "<sseinsnmode>")])
> > +
> > (define_expand "cbranch<mode>4"
> > [(set (reg:CC FLAGS_REG)
> > (compare:CC (match_operand:VI48_AVX 1 "register_operand")
> > --
> > 2.39.1.388.g2fc9e9ca3c
> >
--
BR,
Hongtao
next prev parent reply other threads:[~2023-06-27 7:29 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-06-27 5:38 liuhongt
2023-06-27 7:20 ` Richard Biener
2023-06-27 7:28 ` Hongtao Liu [this message]
2023-06-27 7:46 ` Hongtao Liu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAMZc-bxBPz3aMMw5L3DoxxABb79+H1AXc+6Mhf5fLxvmH5K1Eg@mail.gmail.com \
--to=crazylht@gmail.com \
--cc=gcc-patches@gcc.gnu.org \
--cc=hongtao.liu@intel.com \
--cc=richard.guenther@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).