public inbox for gcc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug c/40170] New: multiple movzx instructions generated when no partial register stall is evident
@ 2009-05-16 14:14 adam at consulting dot net dot nz
2009-05-17 19:47 ` [Bug target/40170] redundant zero extensions rguenth at gcc dot gnu dot org
0 siblings, 1 reply; 2+ messages in thread
From: adam at consulting dot net dot nz @ 2009-05-16 14:14 UTC (permalink / raw)
To: gcc-bugs
#include <stdint.h>
#include <stdio.h>
uint8_t nibble(size_t addr) {
size_t byte_addr;
uint8_t bit;
__asm__("shr $1, %[addr]; setc %[bit]\n"
: [addr] "=rm" (byte_addr), [bit] "=rm" (bit)
: "[addr]" (addr)
: "cc");
uint8_t byte=((uint8_t *) byte_addr)[0];
byte >>= bit;
byte >>= bit;
byte >>= bit;
byte >>= bit;
return byte & 15;
}
int main() {
return 0;
}
Generated code for nibble() at -O3 (Intel syntax):
400480: 48 d1 ef shr rdi,1
400483: 0f 92 c1 setb cl
400486: 0f b6 07 movzx eax,BYTE PTR [rdi]
400489: 0f b6 c9 movzx ecx,cl
40048c: d3 f8 sar eax,cl
40048e: 0f b6 c0 movzx eax,al
400491: d3 f8 sar eax,cl
400493: 0f b6 c0 movzx eax,al
400496: d3 f8 sar eax,cl
400498: 0f b6 c0 movzx eax,al
40049b: d3 f8 sar eax,cl
40049d: 83 e0 0f and eax,0xf
4004a0: c3 ret
Suggested code:
400480: 48 d1 ef shr rdi,1
400483: 0f 92 c1 setb cl
400486: 0f b6 07 movzx eax,BYTE PTR [rdi]
4004xx: d3 f8 sar eax,cl
4004xx: d3 f8 sar eax,cl
4004xx: d3 f8 sar eax,cl
4004xx: d3 f8 sar eax,cl
4004xx: 83 e0 0f and eax,0xf
4004xx: c3 ret
[Alternatively multiply CL by 4 and perform one right shift by CL]
I do not see a partial register stall in the suggested code that the additional
movzx instructions address.
--
Summary: multiple movzx instructions generated when no partial
register stall is evident
Product: gcc
Version: 4.4.1
Status: UNCONFIRMED
Severity: trivial
Priority: P3
Component: c
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: adam at consulting dot net dot nz
GCC build triplet: core2
GCC host triplet: linux
GCC target triplet: x86_64
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40170
^ permalink raw reply [flat|nested] 2+ messages in thread
* [Bug target/40170] redundant zero extensions
2009-05-16 14:14 [Bug c/40170] New: multiple movzx instructions generated when no partial register stall is evident adam at consulting dot net dot nz
@ 2009-05-17 19:47 ` rguenth at gcc dot gnu dot org
0 siblings, 0 replies; 2+ messages in thread
From: rguenth at gcc dot gnu dot org @ 2009-05-17 19:47 UTC (permalink / raw)
To: gcc-bugs
------- Comment #1 from rguenth at gcc dot gnu dot org 2009-05-17 19:47 -------
byte >>= bit;
is according to C
byte = (int)byte >> bit;
you see the (partly redundant) zero-extensions. It has nothing to do with
partial register stalls.
--
rguenth at gcc dot gnu dot org changed:
What |Removed |Added
----------------------------------------------------------------------------
Severity|trivial |normal
GCC build triplet|core2 |
GCC host triplet|linux |
Keywords| |missed-optimization
Summary|multiple movzx instructions |redundant zero extensions
|generated when no partial |
|register stall is evident |
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40170
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2009-05-17 19:47 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2009-05-16 14:14 [Bug c/40170] New: multiple movzx instructions generated when no partial register stall is evident adam at consulting dot net dot nz
2009-05-17 19:47 ` [Bug target/40170] redundant zero extensions rguenth at gcc dot gnu dot org
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).