From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <gcc-bugzilla@gcc.gnu.org>
Received: by sourceware.org (Postfix, from userid 48)
 id AA2293870C27; Wed, 16 Mar 2022 15:03:11 +0000 (GMT)
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org AA2293870C27
From: "rguenth at gcc dot gnu.org" <gcc-bugzilla@gcc.gnu.org>
To: gcc-bugs@gcc.gnu.org
Subject: [Bug tree-optimization/104935] [12 Regression] 554.roms_r ELF binary
 increased with r12-7612-g69619acd8d9b5856
Date: Wed, 16 Mar 2022 15:03:11 +0000
X-Bugzilla-Reason: CC
X-Bugzilla-Type: changed
X-Bugzilla-Watch-Reason: None
X-Bugzilla-Product: gcc
X-Bugzilla-Component: tree-optimization
X-Bugzilla-Version: 12.0
X-Bugzilla-Keywords: 
X-Bugzilla-Severity: normal
X-Bugzilla-Who: rguenth at gcc dot gnu.org
X-Bugzilla-Status: UNCONFIRMED
X-Bugzilla-Resolution: 
X-Bugzilla-Priority: P3
X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org
X-Bugzilla-Target-Milestone: 12.0
X-Bugzilla-Flags: 
X-Bugzilla-Changed-Fields: 
Message-ID: <bug-104935-4-qzL8cU9FR8@http.gcc.gnu.org/bugzilla/>
In-Reply-To: <bug-104935-4@http.gcc.gnu.org/bugzilla/>
References: <bug-104935-4@http.gcc.gnu.org/bugzilla/>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/
Auto-Submitted: auto-generated
MIME-Version: 1.0
X-BeenThere: gcc-bugs@gcc.gnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Gcc-bugs mailing list <gcc-bugs.gcc.gnu.org>
List-Unsubscribe: <https://gcc.gnu.org/mailman/options/gcc-bugs>,
 <mailto:gcc-bugs-request@gcc.gnu.org?subject=unsubscribe>
List-Archive: <https://gcc.gnu.org/pipermail/gcc-bugs/>
List-Post: <mailto:gcc-bugs@gcc.gnu.org>
List-Help: <mailto:gcc-bugs-request@gcc.gnu.org?subject=help>
List-Subscribe: <https://gcc.gnu.org/mailman/listinfo/gcc-bugs>,
 <mailto:gcc-bugs-request@gcc.gnu.org?subject=subscribe>
X-List-Received-Date: Wed, 16 Mar 2022 15:03:11 -0000

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D104935
--- Comment #3 from Richard Biener <rguenth at gcc dot gnu.org> ---
So in 554.roms_r I see cases (like for mod_grid.F90:allocate_grid) where
we now vectorize more V4DI stores from a CTOR of scalars which reduces code
size so jump-threading now goes wild (from DOM threading), threading across
the long repetition of

  if (div =3D=3D 0)
    ;
  else
    ... =3D ... / div;


  <vectorized blob>
  if (div =3D=3D 0)
    ;
  else
    ... =3D ... / div;

where the vectorized blob is now smaller than the threading threshold.

For extract_sta.F90 we now vectorize two more loops with low VF (high VF
is not profitable) but using only strided loads (they are reductions)
which has extra size cost on the scalar epilogues plus we are vectorizing
conditional reductions here.  It doesn't look overly bad here.

There's also a TU with a size win btw, but overall we vectorize more.=