Re: [PATCH][ARM][RFC] PR target/65578 Fix gcc.dg/torture/stackalign/builtin-apply-4.c for single-precision fpus

public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed

From: Kyrill Tkachov <kyrylo.tkachov@foss.arm.com>
To: Ramana Radhakrishnan <ramana.gcc@googlemail.com>
Cc: GCC Patches <gcc-patches@gcc.gnu.org>,
	 Ramana Radhakrishnan <ramana.radhakrishnan@arm.com>,
	Richard Earnshaw <Richard.Earnshaw@arm.com>
Subject: Re: [PATCH][ARM][RFC] PR target/65578 Fix gcc.dg/torture/stackalign/builtin-apply-4.c for single-precision fpus
Date: Thu, 07 Apr 2016 12:32:00 -0000	[thread overview]
Message-ID: <57065346.1040008@foss.arm.com> (raw)
In-Reply-To: <CAJA7tRaZfOT5qfi_+9e-Kw3_UZAt2036u-EoOeBD-BprNS5Xqw@mail.gmail.com>

Hi Ramana,

On 23/03/16 12:09, Ramana Radhakrishnan wrote:
> On Tue, Feb 9, 2016 at 5:21 PM, Kyrill Tkachov
> <kyrylo.tkachov@foss.arm.com> wrote:
>> Hi all,
>>
>> In this wrong-code PR the builtin-apply-4.c test fails with -flto but only
>> when targeting an fpu
>> with only single-precision capabilities.
>>
>> bar is a function returing a double. For non-LTO compilation the caller of
>> bar reads the return value
>> from it from the s0 and s1 VFP registers like expected, but for -flto the
>> caller seems to expect the
>> return value from the r0 and r1 regs.  The RTL dumps show that too.
>>
>> Debugging the calls to arm_function_value show that in the -flto compilation
>> the function bar is deemed
>> to be a local function call and assigned the ARM_PCS_AAPCS_LOCAL PCS
>> variant, whereas for the non-LTO (and non-breaking)
>> compilation it uses the ARM_PCS_AAPCS_VFP variant.
>>
>> Further down in use_vfp_abi when deciding whether to use VFP registers for
>> the result there is a bit of
>> logic that rejects VFP registers when handling the ARM_PCS_AAPCS_LOCAL
>> variant with a double precision value
>> on an FPU that is not TARGET_VFP_DOUBLE.
>>
>> This seems wrong for ARM_PCS_AAPCS_LOCAL to me. ARM_PCS_AAPCS_LOCAL means
>> that the function doesn't escape
>> the translation unit and we can thus use whatever variant we want. From what
>> I understand we want to use the
>> VFP regs when possible for FP values.
>>
>> So this patch removes that restriction and for the testcase the caller of
>> bar correctly reads the return
>> value of bar from the VFP registers and everything works.
>>
>> This patch has been bootstrapped and tested on arm-none-linux-gnueabihf
>> configured with --with-fpu=fpv4-sp-d16.
>> The bootstrapped was performed with LTO.
>> I didn't see any regressions.
>>
>> It seems that this logic was put there in 2009 with r154034 as part of a
>> large patch to enable support for half-precision
>> floating point.
>>
>> I'm not very familiar with this part of the code, so is this a safe patch to
>> do?
>> The patch should only ever change behaviour for single-precision-only fpus
>> and only for static functions
>> that don't get called outside their translation units (or during LTO I
>> suppose) so there shouldn't
>> be any ABI problems, I think.
>>
>> Is this ok for trunk?
> I spent sometime this morning reading through this patch and it does
> look reasonably ok. The AAPCS tests if run for hardfloat should catch
> any regressions. However given the stage we are in I'd like this
> tested through compat.exp and struct-layout.exp across the range of
> ABIs and FPU options to ensure we haven't missed anything. Richard ,
> could you also give this a once over ?

I've ran compat.exp and struct-layout-1.exp against GCC 5
and a trunk compiler without this patch and it didn't expose
any failures with any /-mfpu options that I tried.

Thanks,
Kyrill

>
> Ramana
>
>
>
>
>
>
>
>> Thanks,
>> Kyrill
>>
>> 2016-02-09  Kyrylo Tkachov  <kyrylo.tkachov@arm.com>
>>
>>      PR target/65578
>>      * config/arm/arm.c (use_vfp_abi): Remove id_double argument.
>>      Don't check for is_double and TARGET_VFP_DOUBLE.
>>      (aapcs_vfp_is_call_or_return_candidate): Update callsite.
>>      (aapcs_vfp_is_return_candidate): Likewise.
>>      (aapcs_vfp_is_call_candidate): Likewise.
>>      (aapcs_vfp_allocate_return_reg): Likewise.

next prev parent reply	other threads:[~2016-04-07 12:32 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-02-09 17:21 Kyrill Tkachov
2016-02-09 17:24 ` Kyrill Tkachov
2016-02-17 10:12   ` Kyrill Tkachov
2016-02-24 13:48     ` Kyrill Tkachov
2016-03-02 13:46       ` Kyrill Tkachov
2016-03-09  9:32         ` Kyrill Tkachov
2016-03-23 12:25 ` Ramana Radhakrishnan
2016-04-07 12:32   ` Kyrill Tkachov [this message]
2016-04-08 10:10 ` Richard Earnshaw (lists)

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=57065346.1040008@foss.arm.com \
    --to=kyrylo.tkachov@foss.arm.com \
    --cc=Richard.Earnshaw@arm.com \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=ramana.gcc@googlemail.com \
    --cc=ramana.radhakrishnan@arm.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).