public inbox for gcc-bugs@sourceware.org help / color / mirror / Atom feed
From: "thiago at kde dot org" <gcc-bugzilla@gcc.gnu.org> To: gcc-bugs@gcc.gnu.org Subject: [Bug target/110184] New: [i386] Missed optimisation: atomic operations should use PF, ZF and SF Date: Thu, 08 Jun 2023 22:55:32 +0000 [thread overview] Message-ID: <bug-110184-4@http.gcc.gnu.org/bugzilla/> (raw) https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110184 Bug ID: 110184 Summary: [i386] Missed optimisation: atomic operations should use PF, ZF and SF Product: gcc Version: 13.1.1 Status: UNCONFIRMED Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: thiago at kde dot org Target Milestone: --- Follow up from https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102566 The x86 locked ALU operations always set PF, ZF and SF, so the atomic builtins could use those to emit more optimal code instead of a cmpxchg loop. Given: template <auto Op> int atomic_rmw_op(std::atomic_int &i) { int old = Op(i); if (old == 0) return 1; if (old < 0) return 2; return 0; } ------- Starting with the non-standard __atomic_OP_fetch, the current code for inline int andn_fetch_1(std::atomic_int &i) { return __atomic_and_fetch((int *)&i, ~1, 0); } is L33: movl %eax, %edx andl $-2, %edx lock cmpxchgl %edx, (%rdi) jne .L33 movl %edx, %eax shrl $31, %eax addl %eax, %eax // eax = 2 if edx < 0 testl %edx, %edx movl $1, %edx cmove %edx, %eax But it could be more optimally written as: movl %ecx, 1 movl %edx, 2 xorl %eax, %eax lock andl $-2, (%rdi) cmove %ecx, %eax cmovs %edx, %eax The other __atomic_OP_fetch operations are very similar. I note that GCC already realises that if you perform __atomic_and_fetch(ptr, 1), the result can't have the sign bit set. ------- For the standard atomic_fetch_OP operations, there are a couple of caveats: fetch_and: if the retrieved value is ANDed again with the same pattern; for example: int pattern = 0x80000001; return i.fetch_and(pattern, std::memory_order_relaxed) & pattern; This appears to be partially implemented, depending on what the pattern is. For example, it generates the optimal code for pattern = 3, 15, 0x7fffffff, 0x80000000. It appears to be related to testing for either SF or ZF, but not both. fetch_or: always for SF, for the useful case when the pattern being ORed doesn't already contain the sign bit. If it does (a "non-useful case"), then the comparison is a constant, and likewise for ZF because it's never set if the pattern isn't zero. fetch_xor: always, because the original value is reconstructible. Avoid generating unnecessary code in case the code already does the XOR itself, as in: return i.fetch_xor(1, std::memory_order_relaxed) ^ 1; See https://gcc.godbolt.org/z/n9bMnaE4e for full results.
next reply other threads:[~2023-06-08 22:55 UTC|newest] Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top 2023-06-08 22:55 thiago at kde dot org [this message] 2023-06-08 23:12 ` [Bug target/110184] " pinskia at gcc dot gnu.org 2023-06-08 23:16 ` [Bug target/110184] [x86] " pinskia at gcc dot gnu.org 2023-11-19 17:58 ` securesneakers at gmail dot com 2023-11-19 17:59 ` securesneakers at gmail dot com
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=bug-110184-4@http.gcc.gnu.org/bugzilla/ \ --to=gcc-bugzilla@gcc.gnu.org \ --cc=gcc-bugs@gcc.gnu.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).