public inbox for binutils@sourceware.org
 help / color / mirror / Atom feed
From: Jens Remus <jremus@linux.ibm.com>
To: Indu Bhagat <indu.bhagat@oracle.com>, binutils@sourceware.org
Cc: Andreas Krebbel <krebbel@linux.ibm.com>
Subject: Re: [RFC PATCH 1/1] sframe: Represent FP without RA on stack
Date: Tue, 23 Apr 2024 17:44:25 +0200	[thread overview]
Message-ID: <16c9b3b9-5b65-4722-81b4-0578ffe3409c@linux.ibm.com> (raw)
In-Reply-To: <23e2d1d2-0637-46fa-ba5c-002f292e62c7@oracle.com>

Am 23.04.2024 um 01:58 schrieb Indu Bhagat:
> On 4/22/24 08:58, Jens Remus wrote:
>> If an architecture uses both SFrame RA and FP tracking SFrame assumes
>> that the RA offset is the 2nd offset and the FP offset is the 3rd offset
>> following the SFrame FRE. An architecture does not need to store both on
>> the stack. SFrame cannot represent a FP without RA on stack, since it
>> cannot distinguish whether the 2nd offset is the RA or FP offset.
>>
>> Use an invalid SFrame FRE RA offset value of zero as dummy padding to
>> represent the FP being saved on the stack when the RA is not saved on
>> the stack.
>>
>> include/
>>     * sframe.h (SFRAME_FRE_RA_OFFSET_INVALID): New macro defining
>>     the invalid RA offset value used to represent a dummy padding
>>     offset.
>>
>> gas/
>>     * gen-sframe.c (get_fre_num_offsets): Accommodate for dummy
>>     padding RA offset if FP without RA on stack.
>>     (sframe_get_fre_offset_size): Likewise.
>>     (output_sframe_row_entry): Write a dummy padding RA offset
>>     if FP without RA needs to be represented.
>>
>> libsframe/
>>     * sframe-dump.c (dump_sframe_func_with_fres): Treat invalid RA
>>     offsets as if they were undefined. Display them as "u*" to
>>     distinguish them.
>>
>> Signed-off-by: Jens Remus <jremus@linux.ibm.com>
>> ---
>>
>> Notes (jremus):
>>      This patch eliminates 497 occurrences of the warning "skipping 
>> SFrame
>>      FDE due to FP without RA on stack" for a build of glibc on s390x. 
>> For
>>      libc.so this increases the number of FDEs by 166 and the number of
>>      FREs by 861, while adding 337 dummy padding RA offsets. With a total
>>      of 28157 offsets the dummy padding offsets account for ~1.20 % of 
>> the
>>      offsets.
> 
> While this increase seems small, it does look wasteful.
> 
> An orthogonal question below...
> 
>>      SFrame statistics without patch:
>>          VALUE        TOTAL      MIN        MAX        AVG
>>          FDEs:        3478       -          -          -
>>          FREs/FDE:    14441      1          15         4
>>          Offsets/FDE: 28157      1          31         8
>>             8-bit:    0          0          0          0
>>            16-bit:    28157      1          31         8
>>            32-bit:    0          0          0          0
>>          Offsets/FRE: 28157      1          3          1
>>             8-bit:    -          0          0          0
>>            16-bit:    -          1          3          1
>>            32-bit:    -          0          0          0
>>      SFrame statistics with patch applied:
>>          VALUE        TOTAL      MIN        MAX        AVG
>>          FDEs:        3644       -          -          -
>>          FREs/FDE:    15302      1          20         4
>>          Offsets/FDE: 29944      1          38         8
>>             8-bit:    0          0          0          0
>>            16-bit:    29944      1          38         8
>>            32-bit:    0          0          0          0
>>          Offsets/FRE: 29944      1          3          1
>>             8-bit:    -          0          0          0
>>            16-bit:    -          1          3          1
>>            32-bit:    -          0          0          0
>>          O_Padd/FDE:  337        -          -          0
>>             8-bit:    0
>>            16-bit:    337
>>            32-bit:    0
>>      Note that on s390x the offsets are at minimum 16-bits in size, 
>> due to
>>      the mandatory CFA offset being at least 160.
>>
> 
> IIUC, all stack layouts supported in the ABI use the offset 160. Is that 
> right ? I am wondering if adjusting the stored offsets in the SFrame 
> section (by decrementing 160 from it) will work ?
> 
> If yes, we could encode this constant in SFrame aux hdr bytes for s390x.

Thank you for the hint! Using a constant adjustment of -160 on s390x for 
the CFA offset from CFA base register should work to allow for 8-bit 
offsets to be used. Aren't all tracked offsets (i.e. CFA, FP, and RA) 
signed anyway? Thus applying a constant adjustment should work in any case?

Couldn't it simply be an architecture specific constant in the code to 
begin with? For example a new macro, which is only applied when defined?

#define S390_SFRAME_CFA_OFFSET_ADJUSTMENT -160
#define SFRAME_CFA_OFFSET_ADJUSTMENT S390_SFRAME_CFA_OFFSET_ADJUSTMENT

Implementing this in the SFrame auxiliary header would of course allow 
to implement this enhancement at a later stage and to change the 
adjustment value in the future, as the linker can then either reject or 
merge different adjustment values during link editing.

I wonder whether it would make sense to store the FP register number in 
the SFrame auxiliary header for s390x as well. Register 11 is just a 
convention of the compilers and not defined by the ABI. That would 
enable us to choose a different register as frame pointer in the future.

> 
>>   gas/gen-sframe.c        | 50 +++++++++++++++++++----------------------
>>   include/sframe.h        |  9 ++++++--
>>   libsframe/sframe-dump.c |  4 ++++
>>   3 files changed, 34 insertions(+), 29 deletions(-)
>>
>> diff --git a/gas/gen-sframe.c b/gas/gen-sframe.c
>> index 4cc86eb6c815..990b08d87953 100644
>> --- a/gas/gen-sframe.c
>> +++ b/gas/gen-sframe.c
>> @@ -347,7 +347,9 @@ get_fre_num_offsets (struct sframe_row_entry 
>> *sframe_fre)
>>       fre_num_offsets++;
>>   #ifdef SFRAME_FRE_RA_TRACKING
>>     if (sframe_ra_tracking_p ()
>> -      && sframe_fre->ra_loc == SFRAME_FRE_ELEM_LOC_STACK)
>> +      && (sframe_fre->ra_loc == SFRAME_FRE_ELEM_LOC_STACK
>> +      /* Accommodate for padding RA offset if FP without RA on 
>> stack.  */
>> +      || sframe_fre->bp_loc == SFRAME_FRE_ELEM_LOC_STACK))
>>       fre_num_offsets++;
>>   #endif
>>     return fre_num_offsets;
>> @@ -371,9 +373,14 @@ sframe_get_fre_offset_size (struct 
>> sframe_row_entry *sframe_fre)
>>     if (sframe_fre->bp_loc == SFRAME_FRE_ELEM_LOC_STACK)
>>       bp_offset_size = get_offset_size_in_bytes (sframe_fre->bp_offset);
>>   #ifdef SFRAME_FRE_RA_TRACKING
>> -  if (sframe_ra_tracking_p ()
>> -      && sframe_fre->ra_loc == SFRAME_FRE_ELEM_LOC_STACK)
>> -    ra_offset_size = get_offset_size_in_bytes (sframe_fre->ra_offset);
>> +  if (sframe_ra_tracking_p ())
>> +    {
>> +      if (sframe_fre->ra_loc == SFRAME_FRE_ELEM_LOC_STACK)
>> +    ra_offset_size = get_offset_size_in_bytes (sframe_fre->ra_offset);
>> +      /* Accommodate for padding RA offset if FP without RA on 
>> stack.  */
>> +      else if (sframe_fre->bp_loc == SFRAME_FRE_ELEM_LOC_STACK)
>> +    ra_offset_size = get_offset_size_in_bytes 
>> (SFRAME_FRE_RA_OFFSET_INVALID);
>> +    }
>>   #endif
>>     /* Get the maximum size needed to represent the offsets.  */
>> @@ -537,11 +544,19 @@ output_sframe_row_entry (symbolS *fde_start_addr,
>>     fre_write_offsets++;
>>   #ifdef SFRAME_FRE_RA_TRACKING
>> -  if (sframe_ra_tracking_p ()
>> -      && sframe_fre->ra_loc == SFRAME_FRE_ELEM_LOC_STACK)
>> +  if (sframe_ra_tracking_p ())
>>       {
>> -      fre_offset_func_map[idx].out_func (sframe_fre->ra_offset);
>> -      fre_write_offsets++;
>> +      if (sframe_fre->ra_loc == SFRAME_FRE_ELEM_LOC_STACK)
>> +    {
>> +      fre_offset_func_map[idx].out_func (sframe_fre->ra_offset);
>> +      fre_write_offsets++;
>> +    }
>> +      /* Write padding RA offset if FP without RA on stack.  */
>> +      else if (sframe_fre->bp_loc == SFRAME_FRE_ELEM_LOC_STACK)
>> +    {
>> +      fre_offset_func_map[idx].out_func (SFRAME_FRE_RA_OFFSET_INVALID);
>> +      fre_write_offsets++;
>> +    }
>>       }
>>   #endif
>>     if (sframe_fre->bp_loc == SFRAME_FRE_ELEM_LOC_STACK)
>> @@ -1497,25 +1512,6 @@ sframe_do_fde (struct sframe_xlate_ctx *xlate_ctx,
>>       = get_dw_fde_end_addrS (xlate_ctx->dw_fde);
>>       }
>> -#ifdef SFRAME_FRE_RA_TRACKING
>> -  if (sframe_ra_tracking_p ())
>> -    {
>> -      struct sframe_row_entry *fre;
>> -
>> -      /* Iterate over the scratchpad FREs and validate them.  */
>> -      for (fre = xlate_ctx->first_fre; fre; fre = fre->next)
>> -    {
>> -      /* SFrame format cannot represent FP on stack without RA on 
>> stack.  */
>> -      if (fre->ra_loc != SFRAME_FRE_ELEM_LOC_STACK
>> -          && fre->bp_loc == SFRAME_FRE_ELEM_LOC_STACK)
>> -        {
>> -          as_warn (_("skipping SFrame FDE due to FP without RA on 
>> stack"));
>> -          return SFRAME_XLATE_ERR_NOTREPRESENTED;
>> -        }
>> -    }
>> -    }
>> -#endif /* SFRAME_FRE_RA_TRACKING  */
>> -
>>     return SFRAME_XLATE_OK;
>>   }
>> diff --git a/include/sframe.h b/include/sframe.h
>> index 90bc92a32f84..d1a26875b3e2 100644
>> --- a/include/sframe.h
>> +++ b/include/sframe.h
>> @@ -237,6 +237,9 @@ typedef struct sframe_func_desc_entry
>>      may or may not be tracked.  */
>>   #define SFRAME_FRE_FP_OFFSET_IDX    2
>> +/* Invalid RA offset.  Used as padding to represent FP without RA on 
>> stack.  */
>> +#define SFRAME_FRE_RA_OFFSET_INVALID 0
>> +
>>   typedef struct sframe_fre_info
>>   {
>>     /* Information about
>> @@ -288,9 +291,11 @@ typedef struct sframe_fre_info
>>       offset1 (interpreted as CFA = BASE_REG + offset1)
>>       if RA is being tracked
>> -      offset2 (interpreted as RA = CFA + offset2)
>> +      offset2 (interpreted as RA = CFA + offset2; an offset value of
>> +           SFRAME_FRE_RA_OFFSET_INVALID indicates a dummy padding RA 
>> offset
>> +           to represent FP without RA saved on stack)
>>         if FP is being tracked
>> -    offset3 (intrepreted as FP = CFA + offset2)
>> +    offset3 (intrepreted as FP = CFA + offset3)
> 
> I too noticed this typo recently and have a patch fixing this.
> 
>>         fi
>>       else
>>         if FP is being tracked
>> diff --git a/libsframe/sframe-dump.c b/libsframe/sframe-dump.c
>> index 40ea531314ba..3ea4bc327efd 100644
>> --- a/libsframe/sframe-dump.c
>> +++ b/libsframe/sframe-dump.c
>> @@ -199,6 +199,10 @@ dump_sframe_func_with_fres (sframe_decoder_ctx 
>> *sfd_ctx,
>>         if (sframe_decoder_get_fixed_ra_offset (sfd_ctx)
>>         != SFRAME_CFA_FIXED_RA_INVALID)
>>       strcpy (temp, "f");
>> +      /* If an ABI does track RA offset, e.g. AArch64 and S390, it 
>> can be a
>> +     dummy as padding to represent FP without RA being saved on 
>> stack.  */
>> +      else if (err[2] == 0 && ra_offset == SFRAME_FRE_RA_OFFSET_INVALID)
>> +    sprintf (temp, "u*");
>>         else if (err[2] == 0)
>>       {
>>         if (is_sframe_abi_arch_s390 (sfd_ctx) && (ra_offset & 1))
> 

Regards,
Jens
-- 
Jens Remus
Linux on Z Development (D3303) and z/VSE Support
+49-7031-16-1128 Office
jremus@de.ibm.com

IBM

IBM Deutschland Research & Development GmbH; Vorsitzender des 
Aufsichtsrats: Wolfgang Wendt; Geschäftsführung: David Faller; Sitz der 
Gesellschaft: Böblingen; Registergericht: Amtsgericht Stuttgart, HRB 243294
IBM Data Privacy Statement: https://www.ibm.com/privacy/

  reply	other threads:[~2024-04-23 15:44 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-04-22 15:58 [RFC PATCH 0/1] sframe: Represent FP without RA on stack (padding) Jens Remus
2024-04-22 15:58 ` [RFC PATCH 1/1] sframe: Represent FP without RA on stack Jens Remus
2024-04-22 23:58   ` Indu Bhagat
2024-04-23 15:44     ` Jens Remus [this message]
2024-04-25  6:59       ` Indu Bhagat
2024-04-22 15:59 [RFC PATCH 0/1] sframe: Represent FP without RA on stack (bitmap) Jens Remus
2024-04-22 15:59 ` [RFC PATCH 1/1] sframe: Represent FP without RA on stack Jens Remus

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=16c9b3b9-5b65-4722-81b4-0578ffe3409c@linux.ibm.com \
    --to=jremus@linux.ibm.com \
    --cc=binutils@sourceware.org \
    --cc=indu.bhagat@oracle.com \
    --cc=krebbel@linux.ibm.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).