public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
From: David Daney <ddaney@avtrex.com>
To: gcc-patches@gcc.gnu.org, rsandifo@nildram.co.uk
Subject: Re: [Patch] 2/3 MIPS support for builtin __builtin_flush_icache().
Date: Sun, 08 Jul 2007 20:00:00 -0000	[thread overview]
Message-ID: <46913DEA.7090904@avtrex.com> (raw)
In-Reply-To: <87odiq3cov.fsf@firetop.home>

[-- Attachment #1: Type: text/plain, Size: 8670 bytes --]

Richard Sandiford wrote:
> Thanks for all the updates.
>
> David Daney <ddaney@avtrex.com> writes:
>   
>> I kept the call to mips_cache_flush_func pretty much unchanged because I 
>> didn't want to break compatibility with the command line options that 
>> allow this to be changed.   Also I added CLEAR_INSN_CACHE that expands 
>> in libgcc that is used if  __builtin___clear_cache() cannot be expanded 
>> in-line.  This is slightly inefficient as it adds an extra function call 
>> layer through libgcc and has to do the arithmetic to convert the 
>> function parameters, but I am thinking I shouldn't be concerned about a 
>> few extra cycles doing this when we are about to do a system call.  The 
>> alternative is to have __builtin___clear_cache() directly expand to 
>> mips_cache_flush_func and leave libgcc out of the picture for MIPS. 
>>     
>
> I'm happy either way.  I suppose the latter is appealing in some ways --
> to mirror what you said above, it means that -mflush-func will be
> honoured -- but I can see why the consistency of calling a libgcc
> function is good too.
>
> Perhaps if we aren't sure, the libgcc-less version is better.
> Once we add the function, we'll have to keep it for ABI compatibility.
>   
Right.  That is what I did in this new version of the patch.
>   
>> Index: config/mips/mips.h
>> ===================================================================
>> --- config/mips/mips.h	(revision 125997)
>> +++ config/mips/mips.h	(working copy)
>> @@ -136,6 +136,10 @@ extern const struct mips_cpu_info mips_c
>>  extern const struct mips_cpu_info *mips_arch_info;
>>  extern const struct mips_cpu_info *mips_tune_info;
>>  extern const struct mips_rtx_cost_data *mips_cost;
>> +
>> +extern bool mips_builtin_clear_cache_inline_p (void);
>> +#undef TARGET_BUILTIN_CLEAR_CACHE_INLINE_P
>> +#define TARGET_BUILTIN_CLEAR_CACHE_INLINE_P mips_builtin_clear_cache_inline_p
>>     
>
> I think the last two lines belong in mips.c, and that
> mips_builtin_clear_cache_inline_p can be static.  See how
> similar hooks are handled.
>   
OK, this is moot now.  I removed the target hook from the architecture 
independent part of the patch.

>   
>> +;; Flush the instruction cache starting as operands[0] with length
>> +;; operands[1].
>> +(define_expand "flush_icache"
>> +  [(match_operand:SI 0 "pmode_register_operand")
>> +   (match_operand:SI 1 "register_operand")]
>>     
>
> I think this should be:
>
>   [(match_operand 0 "pmode_register_operand")
>    (match_operand:SI 1 "const_arith_operand")]
>   
This insn is gone now.

>> +;; Expand in-line code to clear the instruction cache between operand[0] and
>> +;; operand[1].
>> +(define_expand "clear_cache"
>> +  [(match_operand:SI 0 "general_operand")
>> +   (match_operand:SI 1 "general_operand")]
>>     
>
> I still think these should be:
>
>   [(match_operand 0 "pmode_register_operand")
>    (match_operand 1 "pmode_register_operand")]
>   
Fixed.

>> +(define_insn "sync"
>> +  [(unspec_volatile [(const_int 0)] UNSPEC_SYNC)]
>> +  ""
>> +  "sync"
>> +  [(set_attr "length" "4")])
>>     
>
> Best make this conditional on ISA_HAS_SYNC, just to be on the safe side.
> Very minor nit, but please drop the length attribute, since it's just
> restating the default.  Likewise for the others.
>
>   
Fixed.
>> +(define_insn "synci"
>> +  [(unspec_volatile [(match_operand:SI 0 "general_operand" "d")] UNSPEC_SYNCI)]
>> +  ""
>> +  "synci\t0(%0)"
>> +  [(set_attr "length" "4")])
>>     
>
> ISA_HAS_SYNC again.  Please also make the predicate "pmode_register_operand"
> (without a mode).
>
>   
Fixed.
>> +(define_insn "rdhwr"
>> +  [(set (match_operand:SI 0 "general_operand" "=d")
>> +        (unspec_volatile [(match_operand:SI 1 "immediate_operand" "n")]
>> +        UNSPEC_RDHWR))]
>> +  ""
>> +  "rdhwr\t%0,$%1"
>> +  [(set_attr "length" "4")])
>>     
>
> Getting even more minor here, but const_int_operand is a little
> more specific than immediate_operand, and matches "n" better.
>
>   
Fixed.
>> Index: config/mips/mips.c
>> ===================================================================
>> --- config/mips/mips.c	(revision 125997)
>> +++ config/mips/mips.c	(working copy)
>> @@ -3759,6 +3759,47 @@ mips_block_move_loop (rtx dest, rtx src,
>>      mips_block_move_straight (dest, src, leftover);
>>  }
>>  \f
>> +
>> +/* Return true if we will expand __builtin_clear_cache() in-line. */
>> +bool
>>     
>
> Silly formatting stuff: two spaces after ".".  Usual (and local) style
> is to have a blank line between comment and function.
>
> I tend to prefer comments like:
>
>    Implement TARGET_BUILTIN_CLEAR_CACHE_INLINE_P.
>
> instead, so that you know instantly which tm.texi macro you need to
> look at in order to get the full interface.
>
>   
Agreed, but moot.  This part was removed.
>> +/* Expand a loop of synci insns */
>> +void
>> +mips_expand_synci_loop (rtx begin, rtx end)
>>     
>
> Blank line, and ".  */".  Pedantically, you should mention every
> argument in the comment, so how about:
>
> /* Expand a loop of synci insns for the address range [BEGIN, END).  */
>
>   
Fixed.  That is what I used.

>> +{
>> +  rtx end_addr, inc, label, label_ref, cmp, cmp_result;
>> +
>> +  begin = force_reg(Pmode, begin);
>> +  end = force_reg(Pmode, end);
>>     
>
> This wouldn't be needed after the pmode_register_operand change,
> although I agree it is needed as things stand.
>
>   
Removed.
>> +  /* Load INC with the cache line size (rdhwr INC,$1). */
>> +  inc = gen_reg_rtx (SImode);
>> +  emit_insn (gen_rdhwr (inc ,gen_rtx_CONST_INT(SImode, 1)));
>>     
>
> CONST_INT objects don't have modes, and formatting.  Should be:
>
>   emit_insn (gen_rdhwr (inc, const1_rtx));
>
>   
Fixed.
>> +  /* Loop back to here. */
>> +  label = gen_label_rtx ();
>> +  emit_label (label);
>> +
>> +  emit_insn (gen_synci (begin));
>> +
>> +  cmp = gen_reg_rtx (SImode);
>>     
>
> Pmode rather than SImode.
>
>   
Fixed.
>> +  emit_insn (gen_rtx_SET (VOIDmode, cmp, gen_rtx_GTU (Pmode, begin, end)));
>>     
>
> mips_emit_binary (GTU, cmp, begin, end);
>
>   
Fixed.

>> +
>> +  emit_insn (gen_rtx_SET (VOIDmode, begin, gen_rtx_PLUS (Pmode, begin, inc)));
>>     
>
> Either:
>
>     emit_insn (gen_add3_insn (begin, begin, inc));
>
> or:
>
>     mips_emit_binary (PLUS, begin, begin, inc);
>
> Probably the latter, givne the GTU thing above.
>
>   
Fixed.  I chose the latter option as suggested.

>> +
>> +  label_ref = gen_rtx_LABEL_REF (VOIDmode, label);
>> +  cmp_result = gen_rtx_EQ (VOIDmode, cmp, gen_rtx_CONST_INT(SImode, 0));
>> +  emit_jump_insn (gen_rtx_SET (VOIDmode, pc_rtx,
>> +                               gen_rtx_IF_THEN_ELSE (VOIDmode, cmp_result,
>> +                                                     label_ref,
>> +                                                     pc_rtx)));
>>     
>
> cmp_result = gen_rtx_EQ (VOIDmode, cmp, const0_rtx);
> emit_jump_insn (gen_condjump (cmp_result, label));
>   
Fixed.

> All in all, very minor comments.  Thanks again for your work on this.
>   
OK, how about this new version.

INITIALIZE_TRAMPOLINE now uses the new clear_cache insn.  This means 
that it has to add the trampoline size to the ADDR argument before 
expanding clear_cache.  If clear_cache expands to a library call, it has 
to subtract BEGIN from END to get the length.  Looking at the generated 
code it appears that the RTL optimizers see that this 
addition/subtraction pair cancel each other out, and somewhat optimal 
code is generated.

Other than that and removing the __builtin_clear_cache_inline_p target 
hook as suggested by Mark Mitchell, this version of the patch only 
contains some of the cleanups you suggested.

Bootstrapped and regression tested  on: x86_64-unknown-linux-gnu all 
default languages.
i686-unknown-linux-gnu to mipsel-linux (--with-arch=mips32 and 
--with-arch-mips32r2) cross tested with --enable-languages-c,c++,java 
with no regressions.

OK to commit?

2007-07-08  David Daney  <ddaney@avtrex.com>

    * config/mips/mips.h (ISA_HAS_SYNCI): New target capability
    predicate.
    (INITIALIZE_TRAMPOLINE): Emit clear_cache insn instead of  library
    call.
    * config/mips/mips.c (mips_expand_synci_loop): New function.
    * config/mips/mips.md (UNSPEC_CLEAR_HAZARD): New constant.
    (UNSPEC_RDHWR): Same.
    (UNSPEC_SYNCI): Same.
    (UNSPEC_SYNC): Same.
    (clear_cache): New expand.
    (sync): New insn.
    (synci): Same.
    (rdhwr): Same.
    (clear_hazard): Same.
    * config/mips/mips-protos.h (mips_expand_synci_loop): Declare
    function.
    * testsuite/gcc.target/mips/clear-cache-1.c: New test.
    * testsuite/gcc.target/mips/clear-cache-2.c: New test.


[-- Attachment #2: flush-cache2.diff --]
[-- Type: text/x-patch, Size: 7653 bytes --]

Index: gcc/config/mips/mips.h
===================================================================
--- gcc/config/mips/mips.h	(revision 125997)
+++ gcc/config/mips/mips.h	(working copy)
@@ -770,6 +770,10 @@ extern const struct mips_rtx_cost_data *
 				 || ISA_MIPS32R2			\
 				 || ISA_MIPS64				\
 				 || TARGET_MIPS5500)
+
+/* ISA includes synci, jr.hb and jalr.hb */
+#define ISA_HAS_SYNCI ISA_MIPS32R2
+
 \f
 /* Add -G xx support.  */
 
@@ -2116,21 +2120,16 @@ typedef struct mips_args {
 
 #define INITIALIZE_TRAMPOLINE(ADDR, FUNC, CHAIN)			    \
 {									    \
-  rtx func_addr, chain_addr;						    \
+  rtx func_addr, chain_addr, end_addr;                                      \
 									    \
   func_addr = plus_constant (ADDR, 32);					    \
   chain_addr = plus_constant (func_addr, GET_MODE_SIZE (ptr_mode));	    \
   emit_move_insn (gen_rtx_MEM (ptr_mode, func_addr), FUNC);		    \
   emit_move_insn (gen_rtx_MEM (ptr_mode, chain_addr), CHAIN);		    \
-									    \
-  /* Flush both caches.  We need to flush the data cache in case	    \
-     the system has a write-back cache.  */				    \
-  /* ??? Should check the return value for errors.  */			    \
-  if (mips_cache_flush_func && mips_cache_flush_func[0])		    \
-    emit_library_call (gen_rtx_SYMBOL_REF (Pmode, mips_cache_flush_func),   \
-		       0, VOIDmode, 3, ADDR, Pmode,			    \
-		       GEN_INT (TRAMPOLINE_SIZE), TYPE_MODE (integer_type_node),\
-		       GEN_INT (3), TYPE_MODE (integer_type_node));	    \
+  end_addr = gen_reg_rtx (Pmode);					    \
+  emit_insn (gen_add3_insn (end_addr, copy_rtx (ADDR),			    \
+                            GEN_INT (TRAMPOLINE_SIZE)));		    \
+  emit_insn (gen_clear_cache (copy_rtx (ADDR), end_addr));		    \
 }
 \f
 /* Addressing modes, and classification of registers for them.  */
Index: gcc/config/mips/mips.c
===================================================================
--- gcc/config/mips/mips.c	(revision 125997)
+++ gcc/config/mips/mips.c	(working copy)
@@ -3759,6 +3759,33 @@ mips_block_move_loop (rtx dest, rtx src,
     mips_block_move_straight (dest, src, leftover);
 }
 \f
+
+/* Expand a loop of synci insns for the address range [BEGIN, END).  */
+
+void
+mips_expand_synci_loop (rtx begin, rtx end)
+{
+  rtx inc, label, cmp, cmp_result;
+
+  /* Load INC with the cache line size (rdhwr INC,$1). */
+  inc = gen_reg_rtx (SImode);
+  emit_insn (gen_rdhwr (inc , const1_rtx));
+
+  /* Loop back to here.  */
+  label = gen_label_rtx ();
+  emit_label (label);
+
+  emit_insn (gen_synci (begin));
+
+  cmp = gen_reg_rtx (Pmode);
+  mips_emit_binary (GTU, cmp, begin, end);
+
+  mips_emit_binary (PLUS, begin, begin, inc);
+
+  cmp_result = gen_rtx_EQ (VOIDmode, cmp, const0_rtx);
+  emit_jump_insn (gen_condjump (cmp_result, label));
+}
+\f
 /* Expand a movmemsi instruction.  */
 
 bool
Index: gcc/config/mips/mips.md
===================================================================
--- gcc/config/mips/mips.md	(revision 125997)
+++ gcc/config/mips/mips.md	(working copy)
@@ -50,6 +50,10 @@ (define_constants
    (UNSPEC_TLS_GET_TP		28)
    (UNSPEC_MFHC1		31)
    (UNSPEC_MTHC1		32)
+   (UNSPEC_CLEAR_HAZARD		33)
+   (UNSPEC_RDHWR		34)
+   (UNSPEC_SYNCI		35)
+   (UNSPEC_SYNC			36)
 
    (UNSPEC_ADDRESS_FIRST	100)
 
@@ -4171,6 +4175,69 @@ (define_insn "cprestore"
 }
   [(set_attr "type" "store")
    (set_attr "length" "4,12")])
+
+;; Expand in-line code to clear the instruction cache between operand[0] and
+;; operand[1].
+(define_expand "clear_cache"
+  [(match_operand 0 "pmode_register_operand")
+   (match_operand 1 "pmode_register_operand")]
+  ""
+  "
+{
+  if (ISA_HAS_SYNCI)
+    {
+      mips_expand_synci_loop (operands[0], operands[1]);
+      emit_insn (gen_sync ());
+      emit_insn (gen_clear_hazard ());
+    }
+  else if (mips_cache_flush_func && mips_cache_flush_func[0])
+    {
+      rtx len = gen_reg_rtx (SImode);
+      emit_insn (gen_sub3_insn (len, operands[1], operands[0]));
+      /* Flush both caches.  We need to flush the data cache in case
+         the system has a write-back cache.  */
+      emit_library_call (gen_rtx_SYMBOL_REF (Pmode, mips_cache_flush_func),
+                         0, VOIDmode, 3, operands[0], Pmode,
+                         len, TYPE_MODE (integer_type_node),
+                         GEN_INT (3), TYPE_MODE (integer_type_node));
+   }
+  DONE;
+}")
+
+(define_insn "sync"
+  [(unspec_volatile [(const_int 0)] UNSPEC_SYNC)]
+  "ISA_HAS_SYNCI"
+  "sync")
+
+(define_insn "synci"
+  [(unspec_volatile [(match_operand 0 "pmode_register_operand" "d")]
+		    UNSPEC_SYNCI)]
+  "ISA_HAS_SYNCI"
+  "synci\t0(%0)")
+
+(define_insn "rdhwr"
+  [(set (match_operand:SI 0 "general_operand" "=d")
+        (unspec_volatile [(match_operand:SI 1 "const_int_operand" "n")]
+        UNSPEC_RDHWR))]
+  "ISA_HAS_SYNCI"
+  "rdhwr\t%0,$%1")
+
+(define_insn "clear_hazard"
+  [(unspec_volatile [(const_int 0)] UNSPEC_CLEAR_HAZARD)
+   (clobber (reg:SI 31))]
+  "ISA_HAS_SYNCI"
+{
+  return ".set\tpush\n"
+         "\t.set\tnoreorder\n"
+         "\t.set\tnomacro\n"
+         "\tbal\t1f\n"
+         "\tnop\n"
+         "1:\taddiu\t$31,$31,12\n"
+         "\tjr.hb\t$31\n"
+         "\tnop\n"
+         "\t.set\tpop";
+}
+  [(set_attr "length" "20")])
 \f
 ;; Block moves, see mips.c for more details.
 ;; Argument 0 is the destination
Index: gcc/config/mips/mips-protos.h
===================================================================
--- gcc/config/mips/mips-protos.h	(revision 125997)
+++ gcc/config/mips/mips-protos.h	(working copy)
@@ -1,6 +1,6 @@
 /* Prototypes of target machine for GNU compiler.  MIPS version.
    Copyright (C) 1989, 1990, 1991, 1992, 1993, 1994, 1995, 1996, 1997, 1998,
-   1999, 2001, 2002, 2003, 2004, 2005 Free Software Foundation, Inc.
+   1999, 2001, 2002, 2003, 2004, 2005, 2007 Free Software Foundation, Inc.
    Contributed by A. Lichnewsky (lich@inria.inria.fr).
    Changed by Michael Meissner	(meissner@osf.org).
    64-bit r4000 support by Ian Lance Taylor (ian@cygnus.com) and
@@ -185,6 +185,7 @@ extern void mips_expand_call (rtx, rtx, 
 extern void mips_emit_fcc_reload (rtx, rtx, rtx);
 extern void mips_set_return_address (rtx, rtx);
 extern bool mips_expand_block_move (rtx, rtx, rtx);
+extern void mips_expand_synci_loop (rtx, rtx);
 
 extern void init_cumulative_args (CUMULATIVE_ARGS *, tree, rtx);
 extern void function_arg_advance (CUMULATIVE_ARGS *, enum machine_mode,
Index: gcc/testsuite/gcc.target/mips/clear-cache-1.c
===================================================================
--- gcc/testsuite/gcc.target/mips/clear-cache-1.c	(revision 0)
+++ gcc/testsuite/gcc.target/mips/clear-cache-1.c	(revision 0)
@@ -0,0 +1,13 @@
+/* { dg-do compile } */
+/* { dg-mips-options "-O2 -mips32r2" } */
+/* { dg-final { scan-assembler "synci" } } */
+/* { dg-final { scan-assembler "jr.hb" } } */
+/* { dg-final { scan-assembler-not "_flush_cache" } } */
+
+void f()
+{
+  int size = 40;
+  char *memory = __builtin_alloca(size);
+  __builtin___clear_cache(memory, memory + size);
+}
+
Index: gcc/testsuite/gcc.target/mips/clear-cache-2.c
===================================================================
--- gcc/testsuite/gcc.target/mips/clear-cache-2.c	(revision 0)
+++ gcc/testsuite/gcc.target/mips/clear-cache-2.c	(revision 0)
@@ -0,0 +1,13 @@
+/* { dg-do compile } */
+/* { dg-mips-options "-O2 -mips32" } */
+/* { dg-final { scan-assembler-not "synci" } } */
+/* { dg-final { scan-assembler-not "jr.hb" } } */
+/* { dg-final { scan-assembler "_flush_cache" } } */
+
+void f()
+{
+  int size = 40;
+  char *memory = __builtin_alloca(size);
+  __builtin___clear_cache(memory, memory + size);
+}
+

  reply	other threads:[~2007-07-08 19:42 UTC|newest]

Thread overview: 34+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-07-01  5:01 [Patch] 1/3 Add new " David Daney
2007-07-01  5:05 ` [Patch] 2/3 MIPS support for " David Daney
2007-07-01 10:56   ` Richard Sandiford
2007-07-05  7:50     ` David Daney
2007-07-05 19:05       ` Richard Sandiford
2007-07-08 20:00         ` David Daney [this message]
2007-07-09 19:07           ` Richard Sandiford
2007-07-11  5:45             ` David Daney
2007-07-01  5:11 ` [Patch] 3/3 FFI: Use __builtin_flush_icache() to flush MIPS i-cache David Daney
2007-07-05  8:34   ` David Daney
2007-07-05 19:08     ` Richard Sandiford
2007-07-11 18:55       ` Tom Tromey
2007-07-01  8:01 ` [Patch] 1/3 Add new builtin __builtin_flush_icache() Paolo Bonzini
2007-07-01 17:51   ` Thiemo Seufer
2007-07-01 18:47   ` David Daney
2007-07-02  5:04     ` Paolo Bonzini
2007-07-03 23:55       ` Mark Mitchell
2007-07-04  7:18         ` Paolo Bonzini
2007-07-04 17:17           ` Mark Mitchell
2007-07-04 17:10         ` David Daney
2007-07-04 17:51           ` Mark Mitchell
2007-07-05  7:36             ` David Daney
2007-07-05 17:59               ` Richard Sandiford
2007-07-05 18:23                 ` David Daney
2007-07-05 19:11                   ` Richard Sandiford
2007-07-06  6:07               ` Mark Mitchell
2007-07-06  6:19                 ` David Daney
2007-07-06 16:39                   ` Mark Mitchell
2007-07-08 19:22                     ` David Daney
2007-07-09 19:04                       ` Richard Sandiford
2007-07-11  2:22                       ` Mark Mitchell
2007-07-11  4:47                         ` David Daney
2007-07-05  9:05 ` [Patch] 4/3 i386 target support for __builtin___clear_cache() David Daney
2007-07-08 20:39   ` David Daney

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=46913DEA.7090904@avtrex.com \
    --to=ddaney@avtrex.com \
    --cc=gcc-patches@gcc.gnu.org \
    --cc=rsandifo@nildram.co.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).