* [PATCH][AArch64] PR target/70809: Delete aarch64_vmls<mode> pattern
@ 2016-05-17 10:38 Kyrill Tkachov
2016-05-17 11:05 ` James Greenhalgh
0 siblings, 1 reply; 3+ messages in thread
From: Kyrill Tkachov @ 2016-05-17 10:38 UTC (permalink / raw)
To: GCC Patches; +Cc: Marcus Shawcroft, Richard Earnshaw, James Greenhalgh
[-- Attachment #1: Type: text/plain, Size: 926 bytes --]
Hi all,
The aarch64_vmls<mode> pattern claims to perform a normal vector floating-point multiply-subtract but in fact
performs a fused multiply-subtract. This is fine when -ffp-contract=fast, but it's not guarded on anything so
will generate the FMLS instruction even when -ffp-contract=off.
The solution is just to delete the pattern. If -ffp-contract=fast then an fma operation will have been generated
and the fnma<mode>4 would be used to generate the FMLS instruction.
Bootstrapped and tested on aarch64-none-linux-gnu.
Ok for trunk and GCC 6 and 5? GCC 4.9 needs a different -mtune option in the testcase to trigger the testcase...
Thanks,
Kyrill
2016-05-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
PR target/70809
* config/aarch64/aarch64-simd.md (aarch64_vmls<mode>): Delete.
2016-05-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
PR target/70809
* gcc.target/aarch64/pr70809_1.c: New test.
[-- Attachment #2: aarch64-vmls-delete.patch --]
[-- Type: text/x-patch, Size: 1738 bytes --]
diff --git a/gcc/config/aarch64/aarch64-simd.md b/gcc/config/aarch64/aarch64-simd.md
index a66948a28e99f4437824a8640b092f7be1c917f6..90272a09f2dd925cfc01caa09e9e8963a8e6c6ed 100644
--- a/gcc/config/aarch64/aarch64-simd.md
+++ b/gcc/config/aarch64/aarch64-simd.md
@@ -1919,16 +1919,6 @@ (define_expand "vec_pack_trunc_df"
}
)
-(define_insn "aarch64_vmls<mode>"
- [(set (match_operand:VDQF 0 "register_operand" "=w")
- (minus:VDQF (match_operand:VDQF 1 "register_operand" "0")
- (mult:VDQF (match_operand:VDQF 2 "register_operand" "w")
- (match_operand:VDQF 3 "register_operand" "w"))))]
- "TARGET_SIMD"
- "fmls\\t%0.<Vtype>, %2.<Vtype>, %3.<Vtype>"
- [(set_attr "type" "neon_fp_mla_<Vetype>_scalar<q>")]
-)
-
;; FP Max/Min
;; Max/Min are introduced by idiom recognition by GCC's mid-end. An
;; expression like:
diff --git a/gcc/testsuite/gcc.target/aarch64/pr70809_1.c b/gcc/testsuite/gcc.target/aarch64/pr70809_1.c
new file mode 100644
index 0000000000000000000000000000000000000000..df88c71c42afc7fafff703f801bbfced8daafc95
--- /dev/null
+++ b/gcc/testsuite/gcc.target/aarch64/pr70809_1.c
@@ -0,0 +1,18 @@
+/* PR target/70809. */
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffp-contract=off -mtune=xgene1" } */
+
+/* Check that vector FMLS is not generated when contraction is disabled. */
+
+void
+foo (float *__restrict__ __attribute__ ((aligned (16))) a,
+ float *__restrict__ __attribute__ ((aligned (16))) x,
+ float *__restrict__ __attribute__ ((aligned (16))) y,
+ float *__restrict__ __attribute__ ((aligned (16))) z)
+{
+ unsigned i = 0;
+ for (i = 0; i < 256; i++)
+ a[i] = x[i] - (y[i] * z[i]);
+}
+
+/* { dg-final { scan-assembler-not "fmls\tv.*" } } */
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [PATCH][AArch64] PR target/70809: Delete aarch64_vmls<mode> pattern
2016-05-17 10:38 [PATCH][AArch64] PR target/70809: Delete aarch64_vmls<mode> pattern Kyrill Tkachov
@ 2016-05-17 11:05 ` James Greenhalgh
2016-05-20 9:04 ` Kyrill Tkachov
0 siblings, 1 reply; 3+ messages in thread
From: James Greenhalgh @ 2016-05-17 11:05 UTC (permalink / raw)
To: Kyrill Tkachov; +Cc: GCC Patches, Marcus Shawcroft, Richard Earnshaw, nd
On Tue, May 17, 2016 at 11:37:57AM +0100, Kyrill Tkachov wrote:
> Hi all,
>
> The aarch64_vmls<mode> pattern claims to perform a normal vector
> floating-point multiply-subtract but in fact performs a fused
> multiply-subtract. This is fine when -ffp-contract=fast, but it's not guarded
> on anything so will generate the FMLS instruction even when
> -ffp-contract=off.
>
> The solution is just to delete the pattern. If -ffp-contract=fast then an fma
> operation will have been generated and the fnma<mode>4 would be used to
> generate the FMLS instruction.
>
> Bootstrapped and tested on aarch64-none-linux-gnu.
>
> Ok for trunk and GCC 6 and 5? GCC 4.9 needs a different -mtune option in the
> testcase to trigger the testcase...
OK, thanks.
Please consider the GCC 4.9 backport preapproved with whatever flag is
needed to expose the issue.
Thanks,
James
>
> Thanks,
> Kyrill
>
> 2016-05-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>
> PR target/70809
> * config/aarch64/aarch64-simd.md (aarch64_vmls<mode>): Delete.
>
> 2016-05-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>
> PR target/70809
> * gcc.target/aarch64/pr70809_1.c: New test.
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [PATCH][AArch64] PR target/70809: Delete aarch64_vmls<mode> pattern
2016-05-17 11:05 ` James Greenhalgh
@ 2016-05-20 9:04 ` Kyrill Tkachov
0 siblings, 0 replies; 3+ messages in thread
From: Kyrill Tkachov @ 2016-05-20 9:04 UTC (permalink / raw)
To: James Greenhalgh; +Cc: GCC Patches, Marcus Shawcroft, Richard Earnshaw
[-- Attachment #1: Type: text/plain, Size: 1315 bytes --]
On 17/05/16 12:04, James Greenhalgh wrote:
> On Tue, May 17, 2016 at 11:37:57AM +0100, Kyrill Tkachov wrote:
>> Hi all,
>>
>> The aarch64_vmls<mode> pattern claims to perform a normal vector
>> floating-point multiply-subtract but in fact performs a fused
>> multiply-subtract. This is fine when -ffp-contract=fast, but it's not guarded
>> on anything so will generate the FMLS instruction even when
>> -ffp-contract=off.
>>
>> The solution is just to delete the pattern. If -ffp-contract=fast then an fma
>> operation will have been generated and the fnma<mode>4 would be used to
>> generate the FMLS instruction.
>>
>> Bootstrapped and tested on aarch64-none-linux-gnu.
>>
>> Ok for trunk and GCC 6 and 5? GCC 4.9 needs a different -mtune option in the
>> testcase to trigger the testcase...
> OK, thanks.
>
> Please consider the GCC 4.9 backport preapproved with whatever flag is
> needed to expose the issue.
Thanks.
Here is the 4.9 backport.
Committing there.
Kyrill
> Thanks,
> James
>
>> Thanks,
>> Kyrill
>>
>> 2016-05-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>>
>> PR target/70809
>> * config/aarch64/aarch64-simd.md (aarch64_vmls<mode>): Delete.
>>
>> 2016-05-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
>>
>> PR target/70809
>> * gcc.target/aarch64/pr70809_1.c: New test.
>
[-- Attachment #2: aarch64-vmls-49.patch --]
[-- Type: text/x-patch, Size: 1739 bytes --]
diff --git a/gcc/config/aarch64/aarch64-simd.md b/gcc/config/aarch64/aarch64-simd.md
index 7626ed31fd688102ca81cd13a88e02891c971c99..9776816bfa7c7a174184bf8ddf90d73c88e40d2f 100644
--- a/gcc/config/aarch64/aarch64-simd.md
+++ b/gcc/config/aarch64/aarch64-simd.md
@@ -1733,16 +1733,6 @@ (define_expand "vec_pack_trunc_df"
}
)
-(define_insn "aarch64_vmls<mode>"
- [(set (match_operand:VDQF 0 "register_operand" "=w")
- (minus:VDQF (match_operand:VDQF 1 "register_operand" "0")
- (mult:VDQF (match_operand:VDQF 2 "register_operand" "w")
- (match_operand:VDQF 3 "register_operand" "w"))))]
- "TARGET_SIMD"
- "fmls\\t%0.<Vtype>, %2.<Vtype>, %3.<Vtype>"
- [(set_attr "type" "neon_fp_mla_<Vetype>_scalar<q>")]
-)
-
;; FP Max/Min
;; Max/Min are introduced by idiom recognition by GCC's mid-end. An
;; expression like:
diff --git a/gcc/testsuite/gcc.target/aarch64/pr70809_1.c b/gcc/testsuite/gcc.target/aarch64/pr70809_1.c
new file mode 100644
index 0000000000000000000000000000000000000000..2e1bbf4b0902109f95949e9edbec9ec03fe40c78
--- /dev/null
+++ b/gcc/testsuite/gcc.target/aarch64/pr70809_1.c
@@ -0,0 +1,18 @@
+/* PR target/70809. */
+/* { dg-do compile } */
+/* { dg-options "-O2 -ftree-vectorize -ffp-contract=off -mtune=generic" } */
+
+/* Check that vector FMLS is not generated when contraction is disabled. */
+
+void
+foo (float *__restrict__ __attribute__ ((aligned (16))) a,
+ float *__restrict__ __attribute__ ((aligned (16))) x,
+ float *__restrict__ __attribute__ ((aligned (16))) y,
+ float *__restrict__ __attribute__ ((aligned (16))) z)
+{
+ unsigned i = 0;
+ for (i = 0; i < 256; i++)
+ a[i] = x[i] - (y[i] * z[i]);
+}
+
+/* { dg-final { scan-assembler-not "fmls\tv.*" } } */
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2016-05-20 9:04 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-05-17 10:38 [PATCH][AArch64] PR target/70809: Delete aarch64_vmls<mode> pattern Kyrill Tkachov
2016-05-17 11:05 ` James Greenhalgh
2016-05-20 9:04 ` Kyrill Tkachov
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).