public inbox for gcc-bugs@sourceware.org
* [Bug target/82139] unnecessary movapd with _mm_castsi128_pd to use BLENDPD on __m128i results
[not found] <bug-82139-4@http.gcc.gnu.org/bugzilla/>
@ 2021-09-04 3:01 ` pinskia at gcc dot gnu.org
2021-09-06 6:09 ` crazylht at gmail dot com
1 sibling, 0 replies; 2+ messages in thread
From: pinskia at gcc dot gnu.org @ 2021-09-04 3:01 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82139
Andrew Pinski <pinskia at gcc dot gnu.org> changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|                            |2021-09-04
           Keywords|                            |ra
             Status|UNCONFIRMED                 |NEW
     Ever confirmed|0                           |1
--- Comment #1 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
It is worse on the trunk:
.L2:
movdqu (%rdi), %xmm1
movdqu (%rdi), %xmm0
addq $16, %rdi
paddd %xmm3, %xmm1
paddd %xmm2, %xmm0
blendpd $2, %xmm0, %xmm1
movups %xmm1, -16(%rdi)
cmpq %rdi, %rax
jne .L2
Why two loads from %rdi here?
This is done during RA as far as I can tell.
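
For readers without the original testcase at hand (it is not part of this thread), here is a minimal sketch of the kind of loop that produces code of this shape. It is an assumption reconstructed from the assembly above, not the reporter's source: the function name `blend_add` and the parameters `a` and `b` are hypothetical. Each iteration loads 16 bytes once, adds two different constant vectors to that same value, and keeps the low 64 bits of the first sum and the high 64 bits of the second (BLENDPD with immediate 2). A portable scalar path is included so the sketch also builds where SSE4.1 is not enabled.

```c
#include <stddef.h>
#include <stdint.h>
#ifdef __SSE4_1__
#include <smmintrin.h>
#endif

/* Hypothetical reproducer shape (names are illustrative, not from the bug).
   p      : n_vecs groups of four 32-bit integers, updated in place
   a, b   : loop-invariant constant vectors (xmm3 / xmm2 in the listing) */
void blend_add(uint32_t *p, size_t n_vecs,
               const uint32_t a[4], const uint32_t b[4])
{
#ifdef __SSE4_1__
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);
    for (size_t i = 0; i < n_vecs; i++, p += 4) {
        /* A single load is enough: both adds consume the same value. */
        __m128i v  = _mm_loadu_si128((const __m128i *)p);
        __m128i lo = _mm_add_epi32(v, va);
        __m128i hi = _mm_add_epi32(v, vb);
        /* imm 2 (binary 10): low double from lo, high double from hi. */
        __m128d r = _mm_blend_pd(_mm_castsi128_pd(lo),
                                 _mm_castsi128_pd(hi), 2);
        _mm_storeu_si128((__m128i *)p, _mm_castpd_si128(r));
    }
#else
    /* Scalar model of the same dataflow: elements 0-1 form the low
       64-bit lane, elements 2-3 the high 64-bit lane. */
    for (size_t i = 0; i < n_vecs; i++, p += 4) {
        p[0] += a[0];
        p[1] += a[1];
        p[2] += b[2];
        p[3] += b[3];
    }
#endif
}
```

Compiling a function like this with -O2 -msse4 -mno-avx and comparing the inner loop against the listing above is one way to check whether the duplicated load is still emitted.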
* [Bug target/82139] unnecessary movapd with _mm_castsi128_pd to use BLENDPD on __m128i results
From: crazylht at gmail dot com @ 2021-09-06 6:09 UTC (permalink / raw)
To: gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82139
--- Comment #2 from Hongtao.liu <crazylht at gmail dot com> ---
(In reply to Andrew Pinski from comment #1)
> It is worse on the trunk:
> .L2:
> movdqu (%rdi), %xmm1
> movdqu (%rdi), %xmm0
> addq $16, %rdi
> paddd %xmm3, %xmm1
> paddd %xmm2, %xmm0
> blendpd $2, %xmm0, %xmm1
> movups %xmm1, -16(%rdi)
> cmpq %rdi, %rax
> jne .L2
>
> Why two loads from %rdi here?
> This is done during RA as far as I can tell.
It looks like the generic cost model should be updated.
With -O2 -msse4 -mno-avx -mtune=skylake, the generated code looks optimal:
movdqu (%rdi), %xmm0
movdqa %xmm3, %xmm1
paddd %xmm0, %xmm1
paddd %xmm2, %xmm0
blendpd $2, %xmm0, %xmm1
movups %xmm1, (%rdi)
addq $16, %rdi
cmpq %rdi, %rax
jne .L2