* [PATCH][ARM] PR56315 Improve codegen for DImode immediates - XOR
@ 2013-06-07 13:03 Kyrylo Tkachov
2013-06-07 13:04 ` Kyrylo Tkachov
0 siblings, 1 reply; 3+ messages in thread
From: Kyrylo Tkachov @ 2013-06-07 13:03 UTC (permalink / raw)
To: gcc-patches; +Cc: Ramana Radhakrishnan, Richard Earnshaw
Hi all,
In a similar vein to
http://gcc.gnu.org/ml/gcc-patches/2013-05/msg01838.html this patch lets
us use the immediate forms of the eor instruction when dealing with
64-bit constants.
The insn is changed into an insn_and_split and merged with the NEON
version. A new constraint is introduced to make sure the pattern accepts
the correct range of immediates.
Now for code:
unsigned long long xor64 (unsigned long long input)
{
return input ^ 0x200000004ULL;
}
we generate:
eor r0, r0, #4
eor r1, r1, #2
without any mov immediates.
Regtested arm-none-eabi on qemu with and without -mthumb.
Ok for trunk?
Thanks,
Kyrill
gcc/
2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
PR target/56315
* config/arm/arm.md (*xordi3_insn): Change to insn_and_split.
(xordi3): Change operand 2 constraint to arm_xordi_operand.
* config/arm/constraints.md (Dg): New constraint.
* config/arm/neon.md (xordi3_neon): Remove.
(neon_veor<mode>): Generate xordi3 instead of xordi3_neon.
* config/arm/predicates.md (arm_xordi_operand): New predicate.
gcc/testsuite/
2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
PR target/56315
* gcc.target/arm/xordi3-opt.c: New test.
^ permalink raw reply [flat|nested] 3+ messages in thread
* RE: [PATCH][ARM] PR56315 Improve codegen for DImode immediates - XOR
2013-06-07 13:03 [PATCH][ARM] PR56315 Improve codegen for DImode immediates - XOR Kyrylo Tkachov
@ 2013-06-07 13:04 ` Kyrylo Tkachov
2013-06-07 13:23 ` Richard Earnshaw
0 siblings, 1 reply; 3+ messages in thread
From: Kyrylo Tkachov @ 2013-06-07 13:04 UTC (permalink / raw)
To: gcc-patches; +Cc: Ramana Radhakrishnan, Richard Earnshaw
[-- Attachment #1: Type: text/plain, Size: 2316 bytes --]
Oops, once again with patch attached...
gcc/
2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
PR target/56315
* config/arm/arm.md (*xordi3_insn): Change to insn_and_split.
(xordi3): Change operand 2 constraint to arm_xordi_operand.
* config/arm/constraints.md (Dg): New constraint.
* config/arm/neon.md (xordi3_neon): Remove.
(neon_veor<mode>): Generate xordi3 instead of xordi3_neon.
* config/arm/predicates.md (arm_xordi_operand): New predicate.
gcc/testsuite/
2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
PR target/56315
* gcc.target/arm/xordi3-opt.c: New test.
> -----Original Message-----
> From: gcc-patches-owner@gcc.gnu.org [mailto:gcc-patches-
> owner@gcc.gnu.org] On Behalf Of Kyrylo Tkachov
> Sent: 07 June 2013 14:03
> To: gcc-patches@gcc.gnu.org
> Cc: Ramana Radhakrishnan; Richard Earnshaw
> Subject: [PATCH][ARM] PR56315 Improve codegen for DImode immediates
> - XOR
>
> Hi all,
>
> In a similar vein to
> http://gcc.gnu.org/ml/gcc-patches/2013-05/msg01838.html this patch
> lets
> us use the immediate forms of the eor instruction when dealing with
> 64-bit constants.
> The insn is changed into an insn_and_split and merged with the NEON
> version. A new constraint is introduced to make sure the pattern
> accepts
> the correct range of immediates.
>
> Now for code:
> unsigned long long xor64 (unsigned long long input)
> {
> return input ^ 0x200000004ULL;
> }
>
> we generate:
> eor r0, r0, #4
> eor r1, r1, #2
>
> without any mov immediates.
>
> Regtested arm-none-eabi on qemu with and without -mthumb.
>
> Ok for trunk?
>
> Thanks,
> Kyrill
>
> gcc/
> 2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>
> PR target/56315
> * config/arm/arm.md (*xordi3_insn): Change to insn_and_split.
> (xordi3): Change operand 2 constraint to arm_xordi_operand.
> * config/arm/constraints.md (Dg): New constraint.
> * config/arm/neon.md (xordi3_neon): Remove.
> (neon_veor<mode>): Generate xordi3 instead of xordi3_neon.
> * config/arm/predicates.md (arm_xordi_operand): New
> predicate.
>
>
> gcc/testsuite/
> 2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>
> PR target/56315
> * gcc.target/arm/xordi3-opt.c: New test.
>
>
>
[-- Attachment #2: xordi.patch --]
[-- Type: application/octet-stream, Size: 5069 bytes --]
diff --git a/gcc/config/arm/arm.c b/gcc/config/arm/arm.c
index 63eb2b7..26eab28 100644
--- a/gcc/config/arm/arm.c
+++ b/gcc/config/arm/arm.c
@@ -2687,6 +2687,7 @@ const_ok_for_dimode_op (HOST_WIDE_INT i, enum rtx_code code)
{
case AND:
case IOR:
+ case XOR:
return (const_ok_for_op (hi_val, code) || hi_val == 0xFFFFFFFF)
&& (const_ok_for_op (lo_val, code) || lo_val == 0xFFFFFFFF);
case PLUS:
diff --git a/gcc/config/arm/arm.md b/gcc/config/arm/arm.md
index 95293bd..b17ffee 100644
--- a/gcc/config/arm/arm.md
+++ b/gcc/config/arm/arm.md
@@ -3190,19 +3190,49 @@
(define_expand "xordi3"
[(set (match_operand:DI 0 "s_register_operand" "")
(xor:DI (match_operand:DI 1 "s_register_operand" "")
- (match_operand:DI 2 "s_register_operand" "")))]
+ (match_operand:DI 2 "arm_xordi_operand" "")))]
"TARGET_32BIT"
""
)
-(define_insn "*xordi3_insn"
- [(set (match_operand:DI 0 "s_register_operand" "=&r,&r")
- (xor:DI (match_operand:DI 1 "s_register_operand" "%0,r")
- (match_operand:DI 2 "s_register_operand" "r,r")))]
- "TARGET_32BIT && !TARGET_IWMMXT && !TARGET_NEON"
- "#"
- [(set_attr "length" "8")
- (set_attr "predicable" "yes")]
+(define_insn_and_split "*xordi3_insn"
+ [(set (match_operand:DI 0 "s_register_operand" "=w,&r,&r,&r,&r,?w")
+ (xor:DI (match_operand:DI 1 "s_register_operand" "w ,%0,r ,0 ,r ,w")
+ (match_operand:DI 2 "arm_xordi_operand" "w ,r ,r ,Dg,Dg,w")))]
+ "TARGET_32BIT && !TARGET_IWMMXT"
+{
+ switch (which_alternative)
+ {
+ case 1:
+ case 2:
+ case 3:
+ case 4: /* fall through */
+ return "#";
+ case 0: /* fall through */
+ case 5: return "veor\t%P0, %P1, %P2";
+ default: gcc_unreachable ();
+ }
+}
+ "TARGET_32BIT && !TARGET_IWMMXT && reload_completed
+ && !(IS_VFP_REGNUM (REGNO (operands[0])))"
+ [(set (match_dup 3) (match_dup 4))
+ (set (match_dup 5) (match_dup 6))]
+ "
+ {
+ operands[3] = gen_lowpart (SImode, operands[0]);
+ operands[5] = gen_highpart (SImode, operands[0]);
+
+ operands[4] = simplify_gen_binary (XOR, SImode,
+ gen_lowpart (SImode, operands[1]),
+ gen_lowpart (SImode, operands[2]));
+ operands[6] = simplify_gen_binary (XOR, SImode,
+ gen_highpart (SImode, operands[1]),
+ gen_highpart_mode (SImode, DImode, operands[2]));
+
+ }"
+ [(set_attr "length" "*,8,8,8,8,*")
+ (set_attr "neon_type" "neon_int_1,*,*,*,*,neon_int_1")
+ (set_attr "arch" "neon_for_64bits,*,*,*,*,avoid_neon_for_64bits")]
)
(define_insn "*xordi_zesidi_di"
diff --git a/gcc/config/arm/constraints.md b/gcc/config/arm/constraints.md
index 53cbff6..7cd8e31 100644
--- a/gcc/config/arm/constraints.md
+++ b/gcc/config/arm/constraints.md
@@ -266,6 +266,12 @@
(and (match_code "const_int")
(match_test "TARGET_32BIT && const_ok_for_dimode_op (ival, IOR)")))
+(define_constraint "Dg"
+ "@internal
+ In ARM/Thumb-2 state a const_int that can be used by insn xordi."
+ (and (match_code "const_int")
+ (match_test "TARGET_32BIT && const_ok_for_dimode_op (ival, XOR)")))
+
(define_constraint "Di"
"@internal
In ARM/Thumb-2 state a const_int or const_double where both the high
diff --git a/gcc/config/arm/neon.md b/gcc/config/arm/neon.md
index 1697008..e814df0 100644
--- a/gcc/config/arm/neon.md
+++ b/gcc/config/arm/neon.md
@@ -782,21 +782,6 @@
[(set_attr "neon_type" "neon_int_1")]
)
-(define_insn "xordi3_neon"
- [(set (match_operand:DI 0 "s_register_operand" "=w,?&r,?&r,?w")
- (xor:DI (match_operand:DI 1 "s_register_operand" "%w,0,r,w")
- (match_operand:DI 2 "s_register_operand" "w,r,r,w")))]
- "TARGET_NEON"
- "@
- veor\t%P0, %P1, %P2
- #
- #
- veor\t%P0, %P1, %P2"
- [(set_attr "neon_type" "neon_int_1,*,*,neon_int_1")
- (set_attr "length" "*,8,8,*")
- (set_attr "arch" "neon_for_64bits,*,*,avoid_neon_for_64bits")]
-)
-
(define_insn "one_cmpl<mode>2"
[(set (match_operand:VDQ 0 "s_register_operand" "=w")
(not:VDQ (match_operand:VDQ 1 "s_register_operand" "w")))]
@@ -5605,7 +5590,7 @@
(match_operand:SI 3 "immediate_operand" "")]
"TARGET_NEON"
{
- emit_insn (gen_xor<mode>3<V_suf64> (operands[0], operands[1], operands[2]));
+ emit_insn (gen_xor<mode>3 (operands[0], operands[1], operands[2]));
DONE;
})
diff --git a/gcc/config/arm/predicates.md b/gcc/config/arm/predicates.md
index 3cec569..d169cb2 100644
--- a/gcc/config/arm/predicates.md
+++ b/gcc/config/arm/predicates.md
@@ -179,6 +179,11 @@
(match_test "const_ok_for_dimode_op (INTVAL (op), IOR)"))
(match_operand 0 "neon_logic_op2")))
+(define_predicate "arm_xordi_operand"
+ (ior (match_operand 0 "s_register_operand")
+ (and (match_code "const_int")
+ (match_test "const_ok_for_dimode_op (INTVAL (op), XOR)"))))
+
(define_predicate "arm_adddi_operand"
(ior (match_operand 0 "s_register_operand")
(and (match_code "const_int")
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [PATCH][ARM] PR56315 Improve codegen for DImode immediates - XOR
2013-06-07 13:04 ` Kyrylo Tkachov
@ 2013-06-07 13:23 ` Richard Earnshaw
0 siblings, 0 replies; 3+ messages in thread
From: Richard Earnshaw @ 2013-06-07 13:23 UTC (permalink / raw)
To: Kyrylo Tkachov; +Cc: gcc-patches, Ramana Radhakrishnan
On 07/06/13 14:04, Kyrylo Tkachov wrote:
> Oops, once again with patch attached...
>
> gcc/
> 2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>
> PR target/56315
> * config/arm/arm.md (*xordi3_insn): Change to insn_and_split.
> (xordi3): Change operand 2 constraint to arm_xordi_operand.
> * config/arm/constraints.md (Dg): New constraint.
> * config/arm/neon.md (xordi3_neon): Remove.
> (neon_veor<mode>): Generate xordi3 instead of xordi3_neon.
> * config/arm/predicates.md (arm_xordi_operand): New predicate.
>
>
> gcc/testsuite/
> 2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>
> PR target/56315
> * gcc.target/arm/xordi3-opt.c: New test.
>
>
>> -----Original Message-----
>> From: gcc-patches-owner@gcc.gnu.org [mailto:gcc-patches-
>> owner@gcc.gnu.org] On Behalf Of Kyrylo Tkachov
>> Sent: 07 June 2013 14:03
>> To: gcc-patches@gcc.gnu.org
>> Cc: Ramana Radhakrishnan; Richard Earnshaw
>> Subject: [PATCH][ARM] PR56315 Improve codegen for DImode immediates
>> - XOR
>>
>> Hi all,
>>
>> In a similar vein to
>> http://gcc.gnu.org/ml/gcc-patches/2013-05/msg01838.html this patch
>> lets
>> us use the immediate forms of the eor instruction when dealing with
>> 64-bit constants.
>> The insn is changed into an insn_and_split and merged with the NEON
>> version. A new constraint is introduced to make sure the pattern
>> accepts
>> the correct range of immediates.
>>
>> Now for code:
>> unsigned long long xor64 (unsigned long long input)
>> {
>> return input ^ 0x200000004ULL;
>> }
>>
>> we generate:
>> eor r0, r0, #4
>> eor r1, r1, #2
>>
>> without any mov immediates.
>>
>> Regtested arm-none-eabi on qemu with and without -mthumb.
>>
>> Ok for trunk?
>>
>> Thanks,
>> Kyrill
>>
>> gcc/
>> 2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>>
>> PR target/56315
>> * config/arm/arm.md (*xordi3_insn): Change to insn_and_split.
>> (xordi3): Change operand 2 constraint to arm_xordi_operand.
>> * config/arm/constraints.md (Dg): New constraint.
>> * config/arm/neon.md (xordi3_neon): Remove.
>> (neon_veor<mode>): Generate xordi3 instead of xordi3_neon.
>> * config/arm/predicates.md (arm_xordi_operand): New
>> predicate.
>>
>>
>> gcc/testsuite/
>> 2013-06-07 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>>
>> PR target/56315
>> * gcc.target/arm/xordi3-opt.c: New test.
>>
OK.
R.
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2013-06-07 13:23 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-06-07 13:03 [PATCH][ARM] PR56315 Improve codegen for DImode immediates - XOR Kyrylo Tkachov
2013-06-07 13:04 ` Kyrylo Tkachov
2013-06-07 13:23 ` Richard Earnshaw
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).