public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: Sandra Loosemore <sandra@codesourcery.com>
To: GCC Patches <gcc-patches@gcc.gnu.org>,
	  Nigel Stephens <nigel@mips.com>,  Guy Morrogh <guym@mips.com>,
	David Ung <davidu@mips.com>,   Thiemo Seufer <ths@mips.com>,
	  richard@codesourcery.com
Subject: Re: PATCH: fine-tuning for can_store_by_pieces
Date: Thu, 16 Aug 2007 19:41:00 -0000	[thread overview]
Message-ID: <46C4A7DF.3010306@codesourcery.com> (raw)
In-Reply-To: <87ps1nop2x.fsf@firetop.home>

Richard Sandiford wrote:

>> + /* This is similar to CLEAR_RATIO, but for a non-zero constant, so when
>> +    optimizing for size adjust the ratio to account for the overhead of
>> +    loading the constant and replicating it across the word.  In fact in
>> +    that case it is never worth inlining, since calling memset should
>> +    always be smaller.  */
>> + 
>> + #define SET_RATIO	(optimize_size ? 3 : 15)
> 
> Is there a lower limit to the size of memsets for which SET_RATIO is
> used?  I'm not convinced that memset is smaller for power-of-2 stores up
> to word or (if aligned) doubleword size.  (I'm assuming here that we use
> left/right stores for unaligned 4-byte and (on 64-bit targets) 8-byte
> stores.  If we don't, we should.)

This is being handled by the non-target-specific code in builtins.c.  If 
optimize_size is true, it won't even consider can_store_by_pieces if the byte 
count is greater than 1, and the next alternative it tries is setmem.

> Perhaps one of these days we should add a setmem pattern...

Uh, yeah.  :-)  For optimize_size, it looks like it would be a win over the 
libcall to memset in the case where the buffer is properly aligned for an 
integer move and the value is a constant.  For a non-constant value, there's the 
extra overhead of replicating it across the word, as noted in the comment above.

>> + /* STORE_BY_PIECES_P can be used when copying a constant string, but
>> +    in that case each word takes 3 insns (lui, ori, sw), or more in
>> +    64-bit mode, instead of 2 (lw, sw). So better to always fail this
>> +    and let the move_by_pieces code copy the string from read-only
>> +    memory.  */
>> + 
>> + #define STORE_BY_PIECES_P(SIZE, ALIGN) 0
> 
> However, lui/ori/sw could easily be better speedwise in some situations,
> so should this be dependent on optimize_size too?

Ermmm.  In what "some situations" would it better?

> Finally, a general comment: the call costs seem to be based on *-elf
> targets, where a direct call is allowed.  The cost of a call is higher
> sizewise for n32 and n64 abicalls.  It is higher still for o32 abicalls,
> where we have to restore $gp afterwards.  (The cost for all three is even
> higher if the function didn't otherwise need $gp, but I don't think we
> can really account for that.)

I can figure this out by trial and error, of course, but do you (or Nigel, etc) 
know offhand what the cost adjustments should be for each ABI?

> Please check whether this patch fixes PR 11787.  (It should, if the
> cost are good.)  If it does, add "PR target/11787" to the changelog.

Yes, it looks like it does fix that issue.  I will mark the ChangeLog for the 
next version of the patch.

-Sandra

  reply	other threads:[~2007-08-16 19:41 UTC|newest]

Thread overview: 30+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-08-15 17:15 Sandra Loosemore
2007-08-15 17:22 ` Andrew Pinski
2007-08-15 18:32   ` Sandra Loosemore
2007-08-15 19:53     ` Nigel Stephens
2007-08-15 19:58   ` Sandra Loosemore
2007-08-17  4:50   ` Mark Mitchell
2007-08-17 13:24     ` Sandra Loosemore
2007-08-17 18:55       ` Mark Mitchell
2007-08-16  8:34 ` Richard Sandiford
2007-08-16 19:41   ` Sandra Loosemore [this message]
2007-08-19  0:03   ` Sandra Loosemore
2007-08-20  8:22     ` Richard Sandiford
2007-08-20 23:38       ` Sandra Loosemore
2007-08-21  8:21         ` Richard Sandiford
2007-08-21 10:34           ` Nigel Stephens
2007-08-21 11:53             ` Richard Sandiford
2007-08-21 12:14               ` Nigel Stephens
2007-08-21 12:35                 ` Richard Sandiford
2007-08-21 13:54           ` Sandra Loosemore
2007-08-21 14:22             ` Richard Sandiford
2007-08-21 20:39               ` Sandra Loosemore
2007-08-21 20:56                 ` Richard Sandiford
2007-08-23 14:35                   ` Sandra Loosemore
2007-08-23 14:44                     ` Richard Sandiford
2007-08-25  5:35                       ` [committed] " Sandra Loosemore
2007-08-25  9:18                         ` Jakub Jelinek
2007-08-25  9:58                           ` Jakub Jelinek
2007-08-25 14:30                           ` gcc.c-torture/execute/20030221-1.c regressed with "fine-tuning for can_store_by_pieces" Hans-Peter Nilsson
2007-08-25 14:40                           ` [committed] Re: PATCH: fine-tuning for can_store_by_pieces Sandra Loosemore
2007-08-24 22:06                     ` Mark Mitchell

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=46C4A7DF.3010306@codesourcery.com \
    --to=sandra@codesourcery.com \
    --cc=davidu@mips.com \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=guym@mips.com \
    --cc=nigel@mips.com \
    --cc=richard@codesourcery.com \
    --cc=ths@mips.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).