public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: Ramana Radhakrishnan <ramana.radhakrishnan@foss.arm.com>
To: Wilco Dijkstra <Wilco.Dijkstra@arm.com>,
	GCC Patches <gcc-patches@gcc.gnu.org>
Cc: nd <nd@arm.com>
Subject: Re: [PATCH][AArch64 - v2] Simplify eh_return implementation
Date: Thu, 01 Sep 2016 13:40:00 -0000	[thread overview]
Message-ID: <d7bc617f-2f79-e8be-5f56-f5031dfe7d24@foss.arm.com> (raw)
In-Reply-To: <AM5PR0802MB2610E248D5510D8CFC1AD7B6831D0@AM5PR0802MB2610.eurprd08.prod.outlook.com>



On 10/08/16 17:26, Wilco Dijkstra wrote:
> I noticed it would still be a good idea to add an extra barrier in the epilog as the
> scheduler doesn't appear to handle aliases of frame accesses properly.
> 
> This patch simplifies the handling of the EH return value.  We force the use of the
> frame pointer so the return location is always at FP + 8.  This means we can emit
> a simple volatile access in EH_RETURN_HANDLER_RTX without needing md
> patterns, splitters and frame offset calculations.  The new implementation also
> fixes various bugs in aarch64_final_eh_return_addr, which does not work with
> -fomit-frame-pointer, alloca or outgoing arguments.
> 
> Bootstrap OK, GCC Regression OK, OK for trunk? Would it be useful to backport
> this to GCC6.x?


Can you please file a PR for this and add some testcases ?  This sounds like a serious enough problem that needs to be looked at probably going back since the dawn of time.



Ramana
> 
> ChangeLog:
> 2016-08-10  Wilco Dijkstra  <wdijkstr@arm.com>
> gcc/
> 	* config/aarch64/aarch64.md (eh_return): Remove pattern and splitter.
> 	* config/aarch64/aarch64.h (AARCH64_EH_STACKADJ_REGNUM): Remove.
> 	(EH_RETURN_HANDLER_RTX): New define.
> 	* config/aarch64/aarch64.c (aarch64_frame_pointer_required):
> 	Force frame pointer in EH return functions.
> 	(aarch64_expand_epilogue): Add barrier for eh_return.
> 	(aarch64_final_eh_return_addr): Remove.
> 	(aarch64_eh_return_handler_rtx): New function.
> 	* config/aarch64/aarch64-protos.h (aarch64_final_eh_return_addr):
> 	Remove.
> 	(aarch64_eh_return_handler_rtx): New prototype.
> --
> 
> diff --git a/gcc/config/aarch64/aarch64-protos.h b/gcc/config/aarch64/aarch64-protos.h
> index 3cdd69b8af1089a839e5d45cda94bc70a15cd777..327c0a97f6f687604afef249b79ac22628418070 100644
> --- a/gcc/config/aarch64/aarch64-protos.h
> +++ b/gcc/config/aarch64/aarch64-protos.h
> @@ -358,7 +358,7 @@ int aarch64_hard_regno_mode_ok (unsigned, machine_mode);
>  int aarch64_hard_regno_nregs (unsigned, machine_mode);
>  int aarch64_uxt_size (int, HOST_WIDE_INT);
>  int aarch64_vec_fpconst_pow_of_2 (rtx);
> -rtx aarch64_final_eh_return_addr (void);
> +rtx aarch64_eh_return_handler_rtx (void);
>  rtx aarch64_mask_from_zextract_ops (rtx, rtx);
>  const char *aarch64_output_move_struct (rtx *operands);
>  rtx aarch64_return_addr (int, rtx);
> diff --git a/gcc/config/aarch64/aarch64.h b/gcc/config/aarch64/aarch64.h
> index 003fec87e41db618570663f28cc2387a87e8252a..fa81e4b853dafcccc08842955288861ec7e7acca 100644
> --- a/gcc/config/aarch64/aarch64.h
> +++ b/gcc/config/aarch64/aarch64.h
> @@ -400,9 +400,9 @@ extern unsigned aarch64_architecture_version;
>  #define ASM_DECLARE_FUNCTION_NAME(STR, NAME, DECL)	\
>    aarch64_declare_function_name (STR, NAME, DECL)
>  
> -/* The register that holds the return address in exception handlers.  */
> -#define AARCH64_EH_STACKADJ_REGNUM	(R0_REGNUM + 4)
> -#define EH_RETURN_STACKADJ_RTX	gen_rtx_REG (Pmode, AARCH64_EH_STACKADJ_REGNUM)
> +/* For EH returns X4 contains the stack adjustment.  */
> +#define EH_RETURN_STACKADJ_RTX	gen_rtx_REG (Pmode, R4_REGNUM)
> +#define EH_RETURN_HANDLER_RTX  aarch64_eh_return_handler_rtx ()
>  
>  /* Don't use __builtin_setjmp until we've defined it.  */
>  #undef DONT_USE_BUILTIN_SETJMP
> diff --git a/gcc/config/aarch64/aarch64.c b/gcc/config/aarch64/aarch64.c
> index 5a25fce17785af9f9dc12e0f2a9609af09af0b35..bb8baff1e7a06942c8b8f51c1d6b341673401ef9 100644
> --- a/gcc/config/aarch64/aarch64.c
> +++ b/gcc/config/aarch64/aarch64.c
> @@ -2739,6 +2739,10 @@ aarch64_frame_pointer_required (void)
>        && (!crtl->is_leaf || df_regs_ever_live_p (LR_REGNUM)))
>      return true;
>  
> +  /* Force a frame pointer for EH returns so the return address is at FP+8.  */
> +  if (crtl->calls_eh_return)
> +    return true;
> +
>    return false;
>  }
>  
> @@ -3298,7 +3302,8 @@ aarch64_expand_epilogue (bool for_sibcall)
>  			 + cfun->machine->frame.saved_varargs_size) != 0;
>  
>    /* Emit a barrier to prevent loads from a deallocated stack.  */
> -  if (final_adjust > crtl->outgoing_args_size || cfun->calls_alloca)
> +  if (final_adjust > crtl->outgoing_args_size || cfun->calls_alloca
> +      || crtl->calls_eh_return)
>      {
>        emit_insn (gen_stack_tie (stack_pointer_rtx, stack_pointer_rtx));
>        need_barrier_p = false;
> @@ -3366,52 +3371,15 @@ aarch64_expand_epilogue (bool for_sibcall)
>      emit_jump_insn (ret_rtx);
>  }
>  
> -/* Return the place to copy the exception unwinding return address to.
> -   This will probably be a stack slot, but could (in theory be the
> -   return register).  */
> +/* Implement EH_RETURN_HANDLER_RTX.  The return address is stored at FP + 8.
> +   The access needs to be volatile to prevent it from being removed.  */
>  rtx
> -aarch64_final_eh_return_addr (void)
> +aarch64_eh_return_handler_rtx (void)
>  {
> -  HOST_WIDE_INT fp_offset;
> -
> -  aarch64_layout_frame ();
> -
> -  fp_offset = cfun->machine->frame.frame_size
> -	      - cfun->machine->frame.hard_fp_offset;
> -
> -  if (cfun->machine->frame.reg_offset[LR_REGNUM] < 0)
> -    return gen_rtx_REG (DImode, LR_REGNUM);
> -
> -  /* DSE and CSELIB do not detect an alias between sp+k1 and fp+k2.  This can
> -     result in a store to save LR introduced by builtin_eh_return () being
> -     incorrectly deleted because the alias is not detected.
> -     So in the calculation of the address to copy the exception unwinding
> -     return address to, we note 2 cases.
> -     If FP is needed and the fp_offset is 0, it means that SP = FP and hence
> -     we return a SP-relative location since all the addresses are SP-relative
> -     in this case.  This prevents the store from being optimized away.
> -     If the fp_offset is not 0, then the addresses will be FP-relative and
> -     therefore we return a FP-relative location.  */
> -
> -  if (frame_pointer_needed)
> -    {
> -      if (fp_offset)
> -        return gen_frame_mem (DImode,
> -			      plus_constant (Pmode, hard_frame_pointer_rtx, UNITS_PER_WORD));
> -      else
> -        return gen_frame_mem (DImode,
> -			      plus_constant (Pmode, stack_pointer_rtx, UNITS_PER_WORD));
> -    }
> -
> -  /* If FP is not needed, we calculate the location of LR, which would be
> -     at the top of the saved registers block.  */
> -
> -  return gen_frame_mem (DImode,
> -			plus_constant (Pmode,
> -				       stack_pointer_rtx,
> -				       fp_offset
> -				       + cfun->machine->frame.saved_regs_size
> -				       - 2 * UNITS_PER_WORD));
> +  rtx tmp = gen_frame_mem (Pmode,
> +    plus_constant (Pmode, hard_frame_pointer_rtx, UNITS_PER_WORD));
> +  MEM_VOLATILE_P (tmp) = true;
> +  return tmp;
>  }
>  
>  /* Output code to add DELTA to the first argument, and then jump
> diff --git a/gcc/config/aarch64/aarch64.md b/gcc/config/aarch64/aarch64.md
> index 21f5a6aba74d28f04b9391ba917453a4cd7de1af..7d86aa7c0cb2fdb30889badc172d2270eeadb1e5 100644
> --- a/gcc/config/aarch64/aarch64.md
> +++ b/gcc/config/aarch64/aarch64.md
> @@ -591,25 +591,6 @@
>    [(set_attr "type" "branch")]
>  )
>  
> -(define_insn "eh_return"
> -  [(unspec_volatile [(match_operand:DI 0 "register_operand" "r")]
> -    UNSPECV_EH_RETURN)]
> -  ""
> -  "#"
> -  [(set_attr "type" "branch")]
> -
> -)
> -
> -(define_split
> -  [(unspec_volatile [(match_operand:DI 0 "register_operand" "")]
> -    UNSPECV_EH_RETURN)]
> -  "reload_completed"
> -  [(set (match_dup 1) (match_dup 0))]
> -  {
> -    operands[1] = aarch64_final_eh_return_addr ();
> -  }
> -)
> -
>  (define_insn "*cb<optab><mode>1"
>    [(set (pc) (if_then_else (EQL (match_operand:GPI 0 "register_operand" "r")
>  				(const_int 0))
> 

  parent reply	other threads:[~2016-09-01 13:40 UTC|newest]

Thread overview: 21+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-08-04 11:21 [PATCH][AArch64] " Wilco Dijkstra
2016-08-10 16:26 ` [PATCH][AArch64 - v2] " Wilco Dijkstra
2016-08-23 14:48   ` Wilco Dijkstra
2016-08-26 15:03     ` Jiong Wang
2016-08-26 18:47       ` Wilco Dijkstra
2016-09-01 12:33     ` Wilco Dijkstra
2016-09-01 13:40   ` Ramana Radhakrishnan [this message]
2016-09-02 11:31     ` [PATCH][AArch64 - v3] " Wilco Dijkstra
2016-09-12 14:51       ` Wilco Dijkstra
2016-09-21 14:50       ` Wilco Dijkstra
2016-10-17 12:41       ` Wilco Dijkstra
2016-10-25  9:48         ` Wilco Dijkstra
2016-11-02 16:49           ` Wilco Dijkstra
2016-11-14 13:05             ` Wilco Dijkstra
2016-12-06 15:27               ` Wilco Dijkstra
2016-12-14 16:43                 ` Wilco Dijkstra
2017-01-13 17:54       ` James Greenhalgh
2017-01-13 19:51         ` Wilco Dijkstra
2017-01-16 11:05           ` James Greenhalgh
2017-01-16 15:01             ` [PATCH][AArch64 - v4] " Wilco Dijkstra
2017-01-17 18:04               ` James Greenhalgh

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=d7bc617f-2f79-e8be-5f56-f5031dfe7d24@foss.arm.com \
    --to=ramana.radhakrishnan@foss.arm.com \
    --cc=Wilco.Dijkstra@arm.com \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=nd@arm.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).