public inbox for gcc-cvs@sourceware.org
help / color / mirror / Atom feed
* [gcc r12-1727] expand: Fix up empty class return optimization [PR101160]
@ 2021-06-22 13:24 Jakub Jelinek
  0 siblings, 0 replies; only message in thread
From: Jakub Jelinek @ 2021-06-22 13:24 UTC (permalink / raw)
  To: gcc-cvs

https://gcc.gnu.org/g:9b613e825d706b18f69e40edaee3eaf27d28f5cb

commit r12-1727-g9b613e825d706b18f69e40edaee3eaf27d28f5cb
Author: Jakub Jelinek <jakub@redhat.com>
Date:   Tue Jun 22 15:21:35 2021 +0200

    expand: Fix up empty class return optimization [PR101160]
    
    On Mon, Jun 14, 2021 at 11:24:22PM -0400, Jason Merrill via Gcc-patches wrote:
    > The x86_64 psABI says that an empty class isn't passed or returned in memory or
    > registers, so we shouldn't set %eax in this function.  Is this a reasonable
    > place to implement that?  Another possibility would be to remove the hack to
    > prevent i386.c:function_value_64 from returning NULL in this case and fix the
    > callers to deal, but that seems like more work.
    >
    > The df-scan hunk catches the case where we look at a 0-length reg and build
    > a range the length of unsigned int, which happened before I changed
    > assign_parms to match expand_function_end.
    
    The assign_params change unfortunately breaks e.g. the following testcase.
    The problem is that some passes (e.g. subreg lowering but assign_parms
    comments also talk about delayed slot scheduling) rely on crtl->return_rtx
    not to contain pseudo registers, and the assign_parms change results
    in the pseudo in there not being replaced with a hard register.
    
    The following patch instead clears the crtl->return_rtx if a function
    returns TYPE_EMPTY_P structure, that way (use (pseudo)) is not emitted
    into the IL and it is treated like more like functions returning void.
    
    I've also changed the effective target on the empty-class1.C testcase, so
    that it doesn't fail on x86_64-linux with -m32 testing.
    
    2021-06-22  Jakub Jelinek  <jakub@redhat.com>
    
            PR middle-end/101160
            * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
            clear crtl->return_rtx instead of keeping it referencing a pseudo.
    
            * g++.target/i386/empty-class1.C: Require lp64 effective target
            instead of x86_64-*-*.
            * g++.target/i386/empty-class2.C: New test.

Diff:
---
 gcc/function.c                               | 21 +++++++++++++--------
 gcc/testsuite/g++.target/i386/empty-class1.C |  2 +-
 gcc/testsuite/g++.target/i386/empty-class2.C | 20 ++++++++++++++++++++
 3 files changed, 34 insertions(+), 9 deletions(-)

diff --git a/gcc/function.c b/gcc/function.c
index 6abaf3d116f..00b2fe70c7d 100644
--- a/gcc/function.c
+++ b/gcc/function.c
@@ -3821,17 +3821,22 @@ assign_parms (tree fndecl)
       tree decl_result = DECL_RESULT (fndecl);
       rtx decl_rtl = DECL_RTL (decl_result);
 
-      if ((REG_P (decl_rtl)
-	   ? REGNO (decl_rtl) >= FIRST_PSEUDO_REGISTER
-	   : DECL_REGISTER (decl_result))
-	  /* Unless the psABI says not to.  */
-	  && !TYPE_EMPTY_P (TREE_TYPE (decl_result)))
+      if (REG_P (decl_rtl)
+	  ? REGNO (decl_rtl) >= FIRST_PSEUDO_REGISTER
+	  : DECL_REGISTER (decl_result))
 	{
 	  rtx real_decl_rtl;
 
-	  real_decl_rtl = targetm.calls.function_value (TREE_TYPE (decl_result),
-							fndecl, true);
-	  REG_FUNCTION_VALUE_P (real_decl_rtl) = 1;
+	  /* Unless the psABI says not to.  */
+	  if (TYPE_EMPTY_P (TREE_TYPE (decl_result)))
+	    real_decl_rtl = NULL_RTX;
+	  else
+	    {
+	      real_decl_rtl
+		= targetm.calls.function_value (TREE_TYPE (decl_result),
+						fndecl, true);
+	      REG_FUNCTION_VALUE_P (real_decl_rtl) = 1;
+	    }
 	  /* The delay slot scheduler assumes that crtl->return_rtx
 	     holds the hard register containing the return value, not a
 	     temporary pseudo.  */
diff --git a/gcc/testsuite/g++.target/i386/empty-class1.C b/gcc/testsuite/g++.target/i386/empty-class1.C
index c1992772d26..96a1fad5046 100644
--- a/gcc/testsuite/g++.target/i386/empty-class1.C
+++ b/gcc/testsuite/g++.target/i386/empty-class1.C
@@ -1,5 +1,5 @@
 // PR target/88529
-// { dg-do compile { target { c++11 && x86_64-*-* } } }
+// { dg-do compile { target { c++11 && lp64 } } }
 // { dg-additional-options -fdump-rtl-expand }
 // { dg-final { scan-rtl-dump-not "set" "expand" } }
 // The x86_64 psABI says that f() doesn't put the return value anywhere.
diff --git a/gcc/testsuite/g++.target/i386/empty-class2.C b/gcc/testsuite/g++.target/i386/empty-class2.C
new file mode 100644
index 00000000000..b9317c56706
--- /dev/null
+++ b/gcc/testsuite/g++.target/i386/empty-class2.C
@@ -0,0 +1,20 @@
+// PR middle-end/101160
+// Test passing aligned empty aggregate
+// { dg-do compile }
+// { dg-options "-O2" }
+// { dg-additional-options "-Wno-psabi" { target { { i?86-*-* x86_64-*-* } && ilp32 } } }
+
+struct S { union {} a; } __attribute__((aligned));
+
+S
+foo (S arg)
+{
+  return arg;
+}
+
+void
+bar (void)
+{
+  S arg;
+  foo (arg);
+}


^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2021-06-22 13:24 UTC | newest]

Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-06-22 13:24 [gcc r12-1727] expand: Fix up empty class return optimization [PR101160] Jakub Jelinek

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).