From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 48) id AA2293870C27; Wed, 16 Mar 2022 15:03:11 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org AA2293870C27 From: "rguenth at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug tree-optimization/104935] [12 Regression] 554.roms_r ELF binary increased with r12-7612-g69619acd8d9b5856 Date: Wed, 16 Mar 2022 15:03:11 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: tree-optimization X-Bugzilla-Version: 12.0 X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: rguenth at gcc dot gnu.org X-Bugzilla-Status: UNCONFIRMED X-Bugzilla-Resolution: X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: 12.0 X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: gcc-bugs@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-bugs mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 16 Mar 2022 15:03:11 -0000 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D104935 --- Comment #3 from Richard Biener --- So in 554.roms_r I see cases (like for mod_grid.F90:allocate_grid) where we now vectorize more V4DI stores from a CTOR of scalars which reduces code size so jump-threading now goes wild (from DOM threading), threading across the long repetition of if (div =3D=3D 0) ; else ... =3D ... / div; if (div =3D=3D 0) ; else ... =3D ... / div; where the vectorized blob is now smaller than the threading threshold. For extract_sta.F90 we now vectorize two more loops with low VF (high VF is not profitable) but using only strided loads (they are reductions) which has extra size cost on the scalar epilogues plus we are vectorizing conditional reductions here. It doesn't look overly bad here. There's also a TU with a size win btw, but overall we vectorize more.=