From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by sourceware.org (Postfix) with ESMTP id 920773858C27 for ; Tue, 5 Apr 2022 13:04:52 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 920773858C27 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 41AECD6E; Tue, 5 Apr 2022 06:04:52 -0700 (PDT) Received: from FVFF77S0Q05N (unknown [10.57.8.234]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 9B8213F5A1; Tue, 5 Apr 2022 06:04:49 -0700 (PDT) Date: Tue, 5 Apr 2022 14:04:45 +0100 From: Mark Rutland To: linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org Cc: linux-arch@vger.kernel.org, gcc@gcc.gnu.org, catalin.marinas@arm.com, will@kernel.org, marcan@marcan.st, maz@kernel.org, szabolcs.nagy@arm.com, f.fainelli@gmail.com, opendmb@gmail.com, Andrew Pinski , Ard Biesheuvel , Peter Zijlstra , x86@kernel.org, andrew.cooper3@citrix.com, Jeremy Linton Subject: Re: GCC 12 miscompilation of volatile asm (was: Re: [PATCH] arm64/io: Remind compiler that there is a memory side effect) Message-ID: References: <20220401164406.61583-1-jeremy.linton@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-5.3 required=5.0 tests=BAYES_00, KAM_DMARC_STATUS, SPF_HELO_NONE, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.4 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org X-BeenThere: gcc@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 05 Apr 2022 13:04:54 -0000 Sorry, I copied the wrong version of the x86_64 assembly as generated by GCC 11.2.0). Updated below. On Tue, Apr 05, 2022 at 01:51:30PM +0100, Mark Rutland wrote: > My x86_64 test case is: > > | unsigned long rdmsr(unsigned long reg) > | { > | unsigned int lo, hi; > | > | asm volatile( > | "rdmsr" > | : "=d" (hi), "=a" (lo) > | : "c" (reg) > | ); > | > | return ((unsigned long)hi << 32) | lo; > | } > | > | void wrmsr(unsigned long reg, unsigned long val) > | { > | unsigned int lo = val; > | unsigned int hi = val >> 32; > | > | asm volatile( > | "wrmsr" > | : > | : "d" (hi), "a" (lo), "c" (reg) > | ); > | } > | > | void msr_rmw_set_bits(unsigned long reg, unsigned long bits) > | { > | unsigned long val; > | > | val = rdmsr(reg); > | val |= bits; > | wrmsr(reg, val); > | } > | > | void func_with_msr_side_effects(unsigned long reg) > | { > | msr_rmw_set_bits(reg, 1UL << 0); > | msr_rmw_set_bits(reg, 1UL << 1); > | msr_rmw_set_bits(reg, 1UL << 2); > | msr_rmw_set_bits(reg, 1UL << 3); > | } > > Per compiler explorer (https://godbolt.org/z/cveff9hq5) GCC trunk currently > compiles this as: > > | msr_rmw_set_bits: > | mov rcx, rdi > | rdmsr > | sal rdx, 32 > | mov eax, eax > | or rax, rsi > | or rax, rdx > | mov rdx, rax > | shr rdx, 32 > | wrmsr > | ret > | func_with_msr_side_effects: > | ret > GCC 11.2 compiles that as: | rdmsr: | mov rcx, rdi | rdmsr | sal rdx, 32 | mov eax, eax | or rax, rdx | ret | wrmsr: | mov rax, rsi | mov rdx, rsi | shr rdx, 32 | mov rcx, rdi | wrmsr | ret | msr_rmw_set_bits: | mov rcx, rdi | rdmsr | sal rdx, 32 | mov eax, eax | or rax, rsi | or rax, rdx | mov rdx, rax | shr rdx, 32 | wrmsr | ret | func_with_msr_side_effects: | push rbx | mov rbx, rdi | mov esi, 1 | call msr_rmw_set_bits | mov esi, 2 | mov rdi, rbx | call msr_rmw_set_bits | mov esi, 4 | mov rdi, rbx | call msr_rmw_set_bits | mov esi, 8 | mov rdi, rbx | call msr_rmw_set_bits | pop rbx | ret Thanks, Mark.