From: "juzhe.zhong at rivai dot ai"
To: gcc-bugs@gcc.gnu.org
Subject: [Bug c/108654] New: Incorrect codegen of RVV GCC
Date: Fri, 03 Feb 2023 08:20:07 +0000

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108654

            Bug ID: 108654
           Summary: Incorrect codegen of RVV GCC
           Product: gcc
           Version: 13.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: c
          Assignee: unassigned at gcc dot gnu.org
          Reporter: juzhe.zhong at rivai dot ai
  Target Milestone: ---

#include "riscv_vector.h"
void foo5_3 (int32_t * restrict in, int32_t * restrict out, size_t n, int cond)
{
  vint8m1_t v = *(vint8m1_t*)in;
  *(vint8m1_t*)out = v;
  vbool8_t v3 = *(vbool8_t*)in;
  *(vbool8_t*)(out + 200) = v3;
  vbool16_t v4 = *(vbool16_t *)in;
  *(vbool16_t *)(out + 300) = v4;
}

ASM:
foo5_3:
        csrr    t0,vlenb
        slli    t1,t0,1
        csrr    a5,vlenb
        sub     sp,sp,t1
        slli    a3,a5,1
        vl1re8.v        v24,0(a0)
        add     a3,a3,sp
        vs1r.v  v24,0(a1)
        addi    a4,a1,800
        sub     a5,a3,a5
        vsetvli a3,zero,e8,m1,ta,ma
        vsm.v   v24,0(a4)
        vs1r.v  v24,0(a5)
        addi    a1,a1,1200
        csrr    t0,vlenb
        slli    t1,t0,1
        vsetvli a5,zero,e8,mf2,ta,ma
        vsm.v   v24,0(a1)
        add     sp,sp,t1
        jr      ra

There are two issues here:

First, the two vlm.v mask loads (one for the vbool8_t read, one for the
vbool16_t read) are missing, which is incorrect codegen.

Second, the code quality is poor; that part is a duplicate of the bug filed here:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108185
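
For reference, the sizes involved in the listing above can be worked out with a small plain-C sketch. It is not part of the original report and does not use the RVV intrinsics; the helper names (whole_reg_bytes, mask_bytes, int32_offset_bytes) are hypothetical, and it assumes vl is set to VLMAX, as the "vsetvli ...,zero,..." instructions in the listing request. It relies on two facts from the RVV specification: a whole-register access (vl1re8.v / vs1r.v) moves VLEN/8 bytes, while vlm.v / vsm.v move ceil(vl/8) bytes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Bytes moved by a whole-register access (vl1re8.v / vs1r.v): VLEN / 8. */
static size_t whole_reg_bytes(size_t vlen_bits)
{
    assert(vlen_bits >= 64 && vlen_bits % 8 == 0);
    return vlen_bits / 8;
}

/* Bytes moved by vlm.v / vsm.v for vbool<ratio>_t, assuming vl == VLMAX.
   VLMAX is VLEN / ratio mask bits, and the instruction transfers
   ceil(vl / 8) bytes. */
static size_t mask_bytes(size_t vlen_bits, size_t ratio)
{
    assert(ratio != 0 && vlen_bits % ratio == 0);
    size_t vl = vlen_bits / ratio;
    return (vl + 7) / 8;
}

/* Byte offset of (out + n) when out is an int32_t pointer. */
static size_t int32_offset_bytes(size_t n)
{
    return n * sizeof(int32_t);
}
```

With an assumed VLEN of 128 bits, whole_reg_bytes(128) is 16, mask_bytes(128, 8) is 2 (vbool8_t) and mask_bytes(128, 16) is 1 (vbool16_t). int32_offset_bytes(200) and int32_offset_bytes(300) give 800 and 1200, matching the "addi a4,a1,800" and "addi a1,a1,1200" immediates in the listing.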