From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 48) id 6F0BF3858D35; Fri, 31 Jul 2020 14:14:02 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 6F0BF3858D35 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1596204842; bh=E+E1vrX4l7eT+QQc/tiYWsukZmSrXlkUEEe5d0cgwb4=; h=From:To:Subject:Date:In-Reply-To:References:From; b=alw2wEtfBKhh+aBa1Pi695SsbzQnebJtu1rRbMJjXlrpVFy3qfR9fV5zGLejOw0E1 JzJIEHbIf7V8dEEZ4xOsmOOJ6g8bL7JNzCFkFx5X1yR31zj6q42h9bL5OResKM5BYp AkTpUVf7cyn4v0uEDw4bSW9rHe/duNnvRoIqWxo0= From: "vries at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug target/96401] [nvptx] Take advantage of subword ld/st/cvt Date: Fri, 31 Jul 2020 14:14:02 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: target X-Bugzilla-Version: 11.0 X-Bugzilla-Keywords: X-Bugzilla-Severity: enhancement X-Bugzilla-Who: vries at gcc dot gnu.org X-Bugzilla-Status: UNCONFIRMED X-Bugzilla-Resolution: X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: gcc-bugs@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-bugs mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 31 Jul 2020 14:14:02 -0000 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D96401 --- Comment #3 from Tom de Vries --- Note that with the proposed TARGET_TRULY_NOOP_TRUNCATION -> false change ( https://gcc.gnu.org/pipermail/gcc-patches/2020-July/549896.html ), we start= out with the same ptx insns, but with the cvt.u16.u32 a truncate instead of a subreg move: ... //(insn 5 2 6 2 // (set (reg:SI 22 [ v$0_1 ]) // (mem/v/c:SI (reg/f:DI 2 %frame) [1 v+0 S4 A128])) // "test.c":7:6 6 {*movsi_insn} // (nil)) ld.u32 %r22, [%frame]; // 5 [c=3D4] *movsi_insn/1 //(insn 6 5 9 2 // (set (reg:HI 24 [ v$0_1 ]) // (truncate:HI (reg:SI 22 [ v$0_1 ]))) "test.c":7:6 30 {truncsihi2} // (expr_list:REG_DEAD (reg:SI 22 [ v$0_1 ]) // (nil))) cvt.u16.u32 %r24, %r22; // 6 [c=3D4] truncsihi2= /0 //(insn 9 6 12 2 // (set (mem/v/c:HI (plus:DI (reg/f:DI 2 %frame) // (const_int 4 [0x4])) [2 v2+0 S2 A32]) // (reg:HI 24 [ v$0_1 ])) "test.c":7:6 5 {*movhi_insn} // (expr_list:REG_DEAD (reg:HI 24 [ v$0_1 ]) // (nil))) st.u16 [%frame+4], %r24; // 9 [c=3D4] *movhi_ins= n/2 ... Still, with the changes in comment 1 enabled we end up with the desired two insns, though a bit later, at cse2 (265r), and not using movhi_insn: ... (insn 9 5 0 2 (set (mem/v/c:HI (plus:DI (reg/f:DI 2 %frame) (const_int 4 [0x4])) [2 v2+0 S2 A32]) (truncate:HI (reg:SI 22 [ v$0_1 ]))) "test.c":7:6 30 {truncsihi2} (expr_list:REG_DEAD (reg:HI 24 [ v$0_1 ]) (nil))) ... so we might get this just with the nvptx_modes_tieable_p change.=