public inbox for gcc-cvs@sourceware.org
help / color / mirror / Atom feed
* [gcc r12-4601] i386: Fix wrong codegen for V8HF move without TARGET_AVX512F
@ 2021-10-21  8:59 Hongyu Wang
  0 siblings, 0 replies; only message in thread
From: Hongyu Wang @ 2021-10-21  8:59 UTC (permalink / raw)
  To: gcc-cvs

https://gcc.gnu.org/g:c8a889fc0e115d40a2d02f32842655f3eadc8fa1

commit r12-4601-gc8a889fc0e115d40a2d02f32842655f3eadc8fa1
Author: Hongyu Wang <hongyu.wang@intel.com>
Date:   Wed Oct 20 13:13:39 2021 +0800

    i386: Fix wrong codegen for V8HF move without TARGET_AVX512F
    
    Since _Float16 type is enabled under sse2 target, returning
    V8HFmode vector without AVX512F target would generate wrong
    vmovdqa64 instruction. Adjust ix86_get_ssemov to avoid this.
    
    gcc/ChangeLog:
            PR target/102812
            * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
            move to use the same logic as HImode.
    
    gcc/testsuite/ChangeLog:
            PR target/102812
            * gcc.target/i386/pr102812.c: New test.

Diff:
---
 gcc/config/i386/i386.c                   | 15 ++++++++++++---
 gcc/testsuite/gcc.target/i386/pr102812.c | 12 ++++++++++++
 2 files changed, 24 insertions(+), 3 deletions(-)

diff --git a/gcc/config/i386/i386.c b/gcc/config/i386/i386.c
index e3988f82b56..299e1ab2621 100644
--- a/gcc/config/i386/i386.c
+++ b/gcc/config/i386/i386.c
@@ -5399,9 +5399,18 @@ ix86_get_ssemov (rtx *operands, unsigned size,
       switch (scalar_mode)
 	{
 	case E_HFmode:
-	  opcode = (misaligned_p
-		    ? (TARGET_AVX512BW ? "vmovdqu16" : "vmovdqu64")
-		    : "vmovdqa64");
+	  if (evex_reg_p)
+	    opcode = (misaligned_p
+		      ? (TARGET_AVX512BW
+			 ? "vmovdqu16"
+			 : "vmovdqu64")
+		      : "vmovdqa64");
+	  else
+	    opcode = (misaligned_p
+		      ? (TARGET_AVX512BW
+			 ? "vmovdqu16"
+			 : "%vmovdqu")
+		      : "%vmovdqa");
 	  break;
 	case E_SFmode:
 	  opcode = misaligned_p ? "%vmovups" : "%vmovaps";
diff --git a/gcc/testsuite/gcc.target/i386/pr102812.c b/gcc/testsuite/gcc.target/i386/pr102812.c
new file mode 100644
index 00000000000..bad4fa9394e
--- /dev/null
+++ b/gcc/testsuite/gcc.target/i386/pr102812.c
@@ -0,0 +1,12 @@
+/* PR target/102812 */
+/* { dg-do compile } */
+/* { dg-options "-O2 -msse4 -mno-avx" } */
+/* { dg-final { scan-assembler-not "vmovdqa64\t" } } */
+/* { dg-final { scan-assembler "movdqa\t" } } */
+
+typedef _Float16 v8hf __attribute__((__vector_size__ (16)));
+
+v8hf t (_Float16 a)
+{
+    return (v8hf) {a, 0, 0, 0, 0, 0, 0, 0};
+}


^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2021-10-21  8:59 UTC | newest]

Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-10-21  8:59 [gcc r12-4601] i386: Fix wrong codegen for V8HF move without TARGET_AVX512F Hongyu Wang

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).