From: Richard Henderson <richard.henderson@linaro.org>
To: gcc-patches@gcc.gnu.org
Cc: richard.sandiford@arm.com, segher@kernel.crashing.org,
richard.earnshaw@arm.com, Wilco.Dijkstra@arm.com,
marcus.shawcroft@arm.com, kyrylo.tkachov@arm.com
Subject: [PATCH v2 06/11] aarch64: Use UNSPEC_ADCS for add-with-carry + output flags
Date: Thu, 2 Apr 2020 11:53:48 -0700 [thread overview]
Message-ID: <20200402185353.11047-7-richard.henderson@linaro.org> (raw)
In-Reply-To: <20200402185353.11047-1-richard.henderson@linaro.org>
Similar to UNSPEC_SBCS, we can unify the signed/unsigned overflow
paths by using an unspec.
Accept -1 for the second input by using SBCS.
* config/aarch64/aarch64.md (UNSPEC_ADCS): New.
(addvti4, uaddvti4): Use adddi_carryin_cmp.
(add<GPI>3_carryinC): Remove.
(*add<GPI>3_carryinC_zero): Remove.
(*add<GPI>3_carryinC): Remove.
(add<GPI>3_carryinV): Remove.
(*add<GPI>3_carryinV_zero): Remove.
(*add<GPI>3_carryinV): Remove.
(add<GPI>3_carryin_cmp): New expander.
(*add<GPI>3_carryin_cmp): New pattern.
(*add<GPI>3_carryin_cmp_0): New pattern.
(*cmn<GPI>3_carryin): New pattern.
(*cmn<GPI>3_carryin_0): New pattern.
---
gcc/config/aarch64/aarch64.md | 206 +++++++++++++++-------------------
1 file changed, 89 insertions(+), 117 deletions(-)
diff --git a/gcc/config/aarch64/aarch64.md b/gcc/config/aarch64/aarch64.md
index 564dea390be..99023494fa1 100644
--- a/gcc/config/aarch64/aarch64.md
+++ b/gcc/config/aarch64/aarch64.md
@@ -281,6 +281,7 @@
UNSPEC_GEN_TAG_RND ; Generate a random 4-bit MTE tag.
UNSPEC_TAG_SPACE ; Translate address to MTE tag address space.
UNSPEC_LD1RO
+ UNSPEC_ADCS
UNSPEC_SBCS
])
@@ -2062,7 +2063,7 @@
aarch64_expand_addsubti (operands[0], operands[1], operands[2],
CODE_FOR_adddi3_compareV,
CODE_FOR_adddi3_compareC,
- CODE_FOR_adddi3_carryinV);
+ CODE_FOR_adddi3_carryin_cmp);
aarch64_gen_unlikely_cbranch (NE, CC_Vmode, operands[3]);
DONE;
})
@@ -2077,7 +2078,7 @@
aarch64_expand_addsubti (operands[0], operands[1], operands[2],
CODE_FOR_adddi3_compareC,
CODE_FOR_adddi3_compareC,
- CODE_FOR_adddi3_carryinC);
+ CODE_FOR_adddi3_carryin_cmp);
aarch64_gen_unlikely_cbranch (GEU, CC_ADCmode, operands[3]);
DONE;
})
@@ -2579,133 +2580,104 @@
[(set_attr "type" "adc_reg")]
)
-(define_expand "add<mode>3_carryinC"
+(define_expand "add<mode>3_carryin_cmp"
[(parallel
- [(set (match_dup 3)
- (compare:CC_ADC
- (plus:<DWI>
- (plus:<DWI>
- (match_dup 4)
- (zero_extend:<DWI>
- (match_operand:GPI 1 "register_operand")))
- (zero_extend:<DWI>
- (match_operand:GPI 2 "register_operand")))
- (match_dup 6)))
- (set (match_operand:GPI 0 "register_operand")
- (plus:GPI
- (plus:GPI (match_dup 5) (match_dup 1))
- (match_dup 2)))])]
+ [(set (match_dup 3)
+ (unspec:CC
+ [(match_operand:GPI 1 "aarch64_reg_or_zero")
+ (match_operand:GPI 2 "aarch64_reg_zero_minus1")
+ (match_dup 4)]
+ UNSPEC_ADCS))
+ (set (match_operand:GPI 0 "register_operand")
+ (unspec:GPI
+ [(match_dup 1) (match_dup 2) (match_dup 4)]
+ UNSPEC_ADCS))])]
""
-{
- operands[3] = gen_rtx_REG (CC_ADCmode, CC_REGNUM);
- rtx ccin = gen_rtx_REG (CC_Cmode, CC_REGNUM);
- operands[4] = gen_rtx_LTU (<DWI>mode, ccin, const0_rtx);
- operands[5] = gen_rtx_LTU (<MODE>mode, ccin, const0_rtx);
- operands[6] = immed_wide_int_const (wi::shwi (1, <DWI>mode)
- << GET_MODE_BITSIZE (<MODE>mode),
- TImode);
-})
+ {
+ operands[3] = gen_rtx_REG (CCmode, CC_REGNUM);
+ operands[4] = gen_rtx_GEU (<MODE>mode, operands[3], const0_rtx);
+ }
+)
-(define_insn "*add<mode>3_carryinC_zero"
- [(set (reg:CC_ADC CC_REGNUM)
- (compare:CC_ADC
- (plus:<DWI>
- (match_operand:<DWI> 2 "aarch64_carry_operation" "")
- (zero_extend:<DWI> (match_operand:GPI 1 "register_operand" "r")))
- (match_operand 4 "const_scalar_int_operand" "")))
- (set (match_operand:GPI 0 "register_operand" "=r")
- (plus:GPI (match_operand:GPI 3 "aarch64_carry_operation" "")
- (match_dup 1)))]
- "rtx_mode_t (operands[4], <DWI>mode)
- == (wi::shwi (1, <DWI>mode) << (unsigned) GET_MODE_BITSIZE (<MODE>mode))"
- "adcs\\t%<w>0, %<w>1, <w>zr"
+(define_insn "*add<mode>3_carryin_cmp"
+ [(set (reg:CC CC_REGNUM)
+ (unspec:CC
+ [(match_operand:GPI 1 "aarch64_reg_or_zero" "%rZ,rZ")
+ (match_operand:GPI 2 "aarch64_reg_zero_minus1" "rZ,UsM")
+ (match_operand:GPI 3 "aarch64_carry_operation" "")]
+ UNSPEC_ADCS))
+ (set (match_operand:GPI 0 "register_operand" "=r,r")
+ (unspec:GPI
+ [(match_dup 1) (match_dup 2) (match_dup 3)]
+ UNSPEC_ADCS))]
+ ""
+ "@
+ adcs\\t%<w>0, %<w>1, %<w>2
+ sbcs\\t%<w>0, %<w>1, <w>zr"
[(set_attr "type" "adc_reg")]
)
-(define_insn "*add<mode>3_carryinC"
- [(set (reg:CC_ADC CC_REGNUM)
- (compare:CC_ADC
- (plus:<DWI>
- (plus:<DWI>
- (match_operand:<DWI> 3 "aarch64_carry_operation" "")
- (zero_extend:<DWI> (match_operand:GPI 1 "register_operand" "r")))
- (zero_extend:<DWI> (match_operand:GPI 2 "register_operand" "r")))
- (match_operand 5 "const_scalar_int_operand" "")))
- (set (match_operand:GPI 0 "register_operand" "=r")
- (plus:GPI
- (plus:GPI (match_operand:GPI 4 "aarch64_carry_operation" "")
- (match_dup 1))
- (match_dup 2)))]
- "rtx_mode_t (operands[5], <DWI>mode)
- == (wi::shwi (1, <DWI>mode) << (unsigned) GET_MODE_BITSIZE (<MODE>mode))"
- "adcs\\t%<w>0, %<w>1, %<w>2"
+(define_insn "*cmn<mode>3_carryin"
+ [(set (reg:CC CC_REGNUM)
+ (unspec:CC
+ [(match_operand:GPI 0 "aarch64_reg_or_zero" "%rZ,rZ")
+ (match_operand:GPI 1 "aarch64_reg_zero_minus1" "rZ,UsM")
+ (match_operand:GPI 2 "aarch64_carry_operation" "")]
+ UNSPEC_ADCS))]
+ ""
+ "@
+ adcs\\t<w>zr, %<w>0, %<w>1
+ sbcs\\t<w>zr, %<w>0, <w>zr"
[(set_attr "type" "adc_reg")]
)
-(define_expand "add<mode>3_carryinV"
- [(parallel
- [(set (reg:CC_V CC_REGNUM)
- (compare:CC_V
- (plus:<DWI>
- (plus:<DWI>
- (match_dup 3)
- (sign_extend:<DWI>
- (match_operand:GPI 1 "register_operand")))
- (sign_extend:<DWI>
- (match_operand:GPI 2 "register_operand")))
- (sign_extend:<DWI>
- (plus:GPI
- (plus:GPI (match_dup 4) (match_dup 1))
- (match_dup 2)))))
- (set (match_operand:GPI 0 "register_operand")
- (plus:GPI
- (plus:GPI (match_dup 4) (match_dup 1))
- (match_dup 2)))])]
- ""
-{
- rtx cc = gen_rtx_REG (CC_Cmode, CC_REGNUM);
- operands[3] = gen_rtx_LTU (<DWI>mode, cc, const0_rtx);
- operands[4] = gen_rtx_LTU (<MODE>mode, cc, const0_rtx);
-})
-
-(define_insn "*add<mode>3_carryinV_zero"
- [(set (reg:CC_V CC_REGNUM)
- (compare:CC_V
- (plus:<DWI>
- (match_operand:<DWI> 2 "aarch64_carry_operation" "")
- (sign_extend:<DWI> (match_operand:GPI 1 "register_operand" "r")))
- (sign_extend:<DWI>
- (plus:GPI
- (match_operand:GPI 3 "aarch64_carry_operation" "")
- (match_dup 1)))))
+;; If combine can show that the borrow is 0, fold ADCS to ADDS.
+(define_insn_and_split "*add<mode>3_carryin_cmp_0"
+ [(set (reg:CC CC_REGNUM)
+ (unspec:CC
+ [(match_operand:GPI 1 "aarch64_reg_or_zero" "%rk")
+ (match_operand:GPI 2 "aarch64_plus_immediate" "rIJ")
+ (const_int 0)]
+ UNSPEC_ADCS))
(set (match_operand:GPI 0 "register_operand" "=r")
- (plus:GPI (match_dup 3) (match_dup 1)))]
- ""
- "adcs\\t%<w>0, %<w>1, <w>zr"
- [(set_attr "type" "adc_reg")]
+ (unspec:GPI
+ [(match_dup 1) (match_dup 2) (const_int 0)]
+ UNSPEC_ADCS))]
+ ""
+ "#"
+ ""
+ [(scratch)]
+ {
+ if (CONST_INT_P (operands[1]))
+ {
+ /* If operand2 is also constant, this must be before reload.
+ Expanding this to an explicit plus of two constants would
+ result in invalid rtl. */
+ if (CONST_INT_P (operands[2]))
+ FAIL;
+ std::swap (operands[1], operands[2]);
+ }
+ emit_insn (gen_add<mode>3_compare0 (operands[0], operands[1],
+ operands[2]));
+ DONE;
+ }
)
-(define_insn "*add<mode>3_carryinV"
- [(set (reg:CC_V CC_REGNUM)
- (compare:CC_V
- (plus:<DWI>
- (plus:<DWI>
- (match_operand:<DWI> 3 "aarch64_carry_operation" "")
- (sign_extend:<DWI> (match_operand:GPI 1 "register_operand" "r")))
- (sign_extend:<DWI> (match_operand:GPI 2 "register_operand" "r")))
- (sign_extend:<DWI>
- (plus:GPI
- (plus:GPI
- (match_operand:GPI 4 "aarch64_carry_operation" "")
- (match_dup 1))
- (match_dup 2)))))
- (set (match_operand:GPI 0 "register_operand" "=r")
- (plus:GPI
- (plus:GPI (match_dup 4) (match_dup 1))
- (match_dup 2)))]
- ""
- "adcs\\t%<w>0, %<w>1, %<w>2"
- [(set_attr "type" "adc_reg")]
+;; ??? There's no one add*compare*cconly pattern that covers both C and V
+;; into which this can be split. Leave it whole for now.
+(define_insn "*cmn<mode>3_carryin_0"
+ [(set (reg:CC CC_REGNUM)
+ (unspec:CC
+ [(match_operand:GPI 0 "aarch64_reg_or_zero" "%rk,rk,rZ")
+ (match_operand:GPI 1 "aarch64_plus_operand" "I,J,rZ")
+ (const_int 0)]
+ UNSPEC_ADCS))]
+ ""
+ "@
+ cmn\\t%<w>0, %<w>1
+ cmp\\t%<w>0, #%n1
+ cmn\\t%<w>0, %<w>1"
+ [(set_attr "type" "alus_imm,alus_imm,alus_sreg")]
)
(define_insn "*add_uxt<mode>_shift2"
--
2.20.1
next prev parent reply other threads:[~2020-04-02 18:54 UTC|newest]
Thread overview: 33+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-04-02 18:53 [PATCH v2 00/11] aarch64: Implement TImode comparisons Richard Henderson
2020-04-02 18:53 ` [PATCH v2 01/11] aarch64: Accept 0 as first argument to compares Richard Henderson
2020-04-02 18:53 ` [PATCH v2 02/11] aarch64: Accept zeros in add<GPI>3_carryin Richard Henderson
2020-04-02 18:53 ` [PATCH v2 03/11] aarch64: Provide expander for sub<GPI>3_compare1 Richard Henderson
2020-04-02 18:53 ` [PATCH v2 04/11] aarch64: Introduce aarch64_expand_addsubti Richard Henderson
2020-04-02 18:53 ` [PATCH v2 05/11] aarch64: Use UNSPEC_SBCS for subtract-with-borrow + output flags Richard Henderson
2020-04-09 21:52 ` Segher Boessenkool
2020-04-10 3:50 ` Richard Henderson
2020-04-02 18:53 ` Richard Henderson [this message]
2020-04-02 18:53 ` [PATCH v2 07/11] aarch64: Remove CC_ADCmode Richard Henderson
2020-04-02 18:53 ` [PATCH v2 08/11] aarch64: Accept -1 as second argument to add<mode>3_carryin Richard Henderson
2020-04-02 18:53 ` [PATCH v2 09/11] aarch64: Adjust result of aarch64_gen_compare_reg Richard Henderson
2020-04-02 18:53 ` [PATCH v2 10/11] aarch64: Implement TImode comparisons Richard Henderson
2020-04-02 18:53 ` [PATCH v2 11/11] aarch64: Implement absti2 Richard Henderson
2020-04-02 18:55 ` [PATCH v2 00/11] aarch64: Implement TImode comparisons Richard Henderson
2020-04-03 11:34 ` Richard Earnshaw (lists)
2020-04-03 12:27 ` Richard Sandiford
2020-04-03 13:17 ` Richard Earnshaw (lists)
2020-04-03 15:03 ` Richard Sandiford
2020-04-06 9:27 ` Richard Earnshaw (lists)
2020-04-06 11:19 ` Richard Sandiford
2020-04-06 12:22 ` Richard Biener
2020-04-08 9:10 ` Richard Sandiford
2020-04-07 9:52 ` Richard Earnshaw (lists)
2020-04-07 16:32 ` Richard Sandiford
2020-04-07 17:05 ` Richard Henderson
2020-04-07 19:43 ` Segher Boessenkool
2020-04-07 20:27 ` Segher Boessenkool
2020-04-07 21:43 ` Richard Henderson
2020-04-07 23:58 ` Segher Boessenkool
2020-04-08 0:50 ` Richard Henderson
2020-04-08 20:56 ` Segher Boessenkool
2020-04-08 9:06 ` Richard Sandiford
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200402185353.11047-7-richard.henderson@linaro.org \
--to=richard.henderson@linaro.org \
--cc=Wilco.Dijkstra@arm.com \
--cc=gcc-patches@gcc.gnu.org \
--cc=kyrylo.tkachov@arm.com \
--cc=marcus.shawcroft@arm.com \
--cc=richard.earnshaw@arm.com \
--cc=richard.sandiford@arm.com \
--cc=segher@kernel.crashing.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).