From: Hongtao Liu <crazylht@gmail.com>
To: Jan Beulich <jbeulich@suse.com>
Cc: "gcc-patches@gcc.gnu.org" <gcc-patches@gcc.gnu.org>,
Kirill Yukhin <kirill.yukhin@gmail.com>,
Hongtao Liu <hongtao.liu@intel.com>
Subject: Re: [PATCH v3] x86: make VPTERNLOG* usable on less than 512-bit operands with just AVX512F
Date: Tue, 27 Jun 2023 13:11:46 +0800 [thread overview]
Message-ID: <CAMZc-bxaFtCVoKZLq51WzfO1=WZUHqTGpytYerKERbnjjL8jfQ@mail.gmail.com> (raw)
In-Reply-To: <CAMZc-bx63-a3Hmgv5+ZQGb=XeDkEPgbopqeOdB7KURGdKOyk4w@mail.gmail.com>
On Tue, Jun 20, 2023 at 5:34 PM Hongtao Liu <crazylht@gmail.com> wrote:
>
> On Tue, Jun 20, 2023 at 5:03 PM Jan Beulich <jbeulich@suse.com> wrote:
> >
> > On 20.06.2023 10:33, Hongtao Liu wrote:
> > > On Tue, Jun 20, 2023 at 3:07 PM Jan Beulich via Gcc-patches
> > > <gcc-patches@gcc.gnu.org> wrote:
> > >>
> > >> I guess the underlying pattern, going along the lines of what
> > >> <mask_codefor>one_cmpl<mode>2<mask_name> uses, can be applied elsewhere
> > >> as well.
> > > That should be guarded with !TARGET_PREFER_AVX256, let's handle that
> > > in a separate patch.
> >
> > Sure, and as indicated there are more places where similar things could
> > be done.
> >
> > >> --- /dev/null
> > >> +++ b/gcc/testsuite/gcc.target/i386/avx512f-copysign.c
> > >> @@ -0,0 +1,32 @@
> > >> +/* { dg-do compile } */
> > >> +/* { dg-options "-mavx512f -mno-avx512vl -O2" } */
> > > Please explicitly add -mprefer-vector-width=512, our tester will also
> > > test unix{-m32 \-march=cascadelake,\ -march=cascadelake} which set the
> > > - mprefer-vector-width=256, -mprefer-vector-width=512 in dg-options
> > > can overwrite that.
> >
> > Oh, I see. Will do. And I expect I then also need to adjust the newly
> > added avx512f-dupv2di.c from the earlier patch. I guess I could commit
> > that option addition there as obvious?
> Still need to send out the patch, and commit as an obvious fix.
> >
> > > Others LGTM.
> >
> > May I take this as "okay with that change", or should I submit v4?
> Okay. no need for a v4 version.
> >
avx512f-copysign.c failed for -m32, we need to add -mfpmath=sse to dg-options.
cs_df:
.LFB0:
.cfi_startproc
fldl 12(%esp)
fxam
fnstsw %ax
fstp %st(0)
fldl 4(%esp)
fabs
testb $2, %ah
je .L1
fchs
.L1:
ret
.cfi_endproc
.LFE0:
.size cs_df, .-cs_df
.p2align 4
.globl cs_sf
.type cs_sf, @function
cs_sf:
.LFB1:
.cfi_startproc
flds 8(%esp)
fxam
fnstsw %ax
fstp %st(0)
flds 4(%esp)
testb $2, %ah
fabs
fld %st(0)
fchs
fcmove %st(1), %st
fstp %st(1)
ret
.cfi_endproc
.LFE1:
.size cs_sf, .-cs_sf
.p2align 4
.globl cs_v2df
.type cs_v2df, @function
> > Jan
>
>
>
> --
> BR,
> Hongtao
--
BR,
Hongtao
next prev parent reply other threads:[~2023-06-27 5:11 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-06-20 7:06 Jan Beulich
2023-06-20 8:33 ` Hongtao Liu
2023-06-20 9:03 ` Jan Beulich
2023-06-20 9:34 ` Hongtao Liu
2023-06-27 5:11 ` Hongtao Liu [this message]
2023-07-04 15:29 ` Jan Beulich
2023-07-05 1:15 ` Liu, Hongtao
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CAMZc-bxaFtCVoKZLq51WzfO1=WZUHqTGpytYerKERbnjjL8jfQ@mail.gmail.com' \
--to=crazylht@gmail.com \
--cc=gcc-patches@gcc.gnu.org \
--cc=hongtao.liu@intel.com \
--cc=jbeulich@suse.com \
--cc=kirill.yukhin@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).