From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 48) id A442A3847718; Wed, 3 Apr 2024 23:36:43 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org A442A3847718 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1712187403; bh=jpBR0mi0NBLQ6p15wsaatFAx9idljxB4pyY15ioG79Q=; h=From:To:Subject:Date:In-Reply-To:References:From; b=bdzNNZhWlzLIh21uDi9omGByeJcj6TC7cUK35U/rC1F0WCacLc72uq6Ddf1fyytir wmkUtGq6K+7VqSsUJl22NTOt38rMBB4wHmFAaHRLyv1RVfcCCbtDJOQKYxI1f16hFA NKCFBu7GQWJuV8RMsCWbyFBXxGUavG35G7Z6fvII= From: "pinskia at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug target/40771] generated code is ~25% slower when autovectorization is enabled Date: Wed, 03 Apr 2024 23:36:42 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: target X-Bugzilla-Version: 4.5.0 X-Bugzilla-Keywords: missed-optimization X-Bugzilla-Severity: enhancement X-Bugzilla-Who: pinskia at gcc dot gnu.org X-Bugzilla-Status: UNCONFIRMED X-Bugzilla-Resolution: X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 List-Id: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D40771 --- Comment #4 from Andrew Pinski --- AARCH64 vectorization looks decent too: ``` dup v31.8h, w0 adrp x2, .LC0 adrp x0, .LC1 adrp x1, .LANCHOR0 ldr q30, [x2, #:lo12:.LC0] ldr q29, [x0, #:lo12:.LC1] add v30.8h, v31.8h, v30.8h add v29.8h, v31.8h, v29.8h uzp2 v29.16b, v30.16b, v29.16b str q29, [x1, #:lo12:.LANCHOR0] ``` The only improvement that can be made there is with SVE, those ldr could be `index` instructions instead but that is PR 113328 .=