From mboxrd@z Thu Jan 1 00:00:00 1970
Received: by sourceware.org (Postfix, from userid 48) id 911523858415; Mon, 31 Jul 2023 12:56:07 +0000 (GMT)
From: "cvs-commit at gcc dot gnu.org"
To: gcc-bugs@gcc.gnu.org
Subject: [Bug target/110625] [AArch64] Vect: SLP fails to vectorize a loop as the reduction_latency calculated by new costs is too large
Date: Mon, 31 Jul 2023 12:56:05 +0000
X-Bugzilla-Reason: CC
X-Bugzilla-Type: changed
X-Bugzilla-Watch-Reason: None
X-Bugzilla-Product: gcc
X-Bugzilla-Component: target
X-Bugzilla-Version: 14.0
X-Bugzilla-Keywords: missed-optimization
X-Bugzilla-Severity: normal
X-Bugzilla-Who: cvs-commit at gcc dot gnu.org
X-Bugzilla-Status: ASSIGNED
X-Bugzilla-Priority: P3
X-Bugzilla-Assigned-To: rsandifo at gcc dot gnu.org
X-Bugzilla-Target-Milestone: ---
X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/
Content-Type: text/plain; charset="UTF-8"
Auto-Submitted: auto-generated
MIME-Version: 1.0

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110625

--- Comment #12 from CVS Commits ---
The master branch has been updated by Hao Liu :

https://gcc.gnu.org/g:bf67bf4880ce5be0b6e48c7c35828528b7be12ed

commit r14-2877-gbf67bf4880ce5be0b6e48c7c35828528b7be12ed
Author: Hao Liu
Date:   Mon Jul 31 20:53:37 2023 +0800

    AArch64: Do not increase the vect reduction latency by multiplying count [PR110625]

    The new costs should only count reduction latency by multiplying count for
    single_defuse_cycle.  For other situations, this will increase the reduction
    latency a lot and miss vectorization opportunities.

    Tested on aarch64-linux-gnu.

    gcc/ChangeLog:

            PR target/110625
            * config/aarch64/aarch64.cc (count_ops): Only '* count' for
            single_defuse_cycle while counting reduction_latency.

    gcc/testsuite/ChangeLog:

            * gcc.target/aarch64/pr110625_1.c: New testcase.
            * gcc.target/aarch64/pr110625_2.c: New testcase.
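
For readers who want the rule in the commit message spelled out, here is a minimal sketch of the idea, not the actual change to count_ops in aarch64.cc. The type OpsSummary, the function note_reduction, and its parameters are hypothetical names invented for illustration. The sketch assumes that a reduction's latency is multiplied by count only when the copies form a single def-use cycle, and is otherwise counted once.

```cpp
#include <algorithm>

// Hypothetical stand-in for a per-loop operation summary; not GCC's real type.
struct OpsSummary {
    unsigned reduction_latency = 0;
};

// Illustrative only: record the latency contributed by one reduction statement.
//   base_latency        latency of a single reduction operation (assumed input)
//   count               number of vector statements the reduction expands into
//   single_defuse_cycle true when all `count` copies chain through one accumulator
inline void note_reduction(OpsSummary &ops, unsigned base_latency,
                           unsigned count, bool single_defuse_cycle) {
    // Only a serial chain pays the latency `count` times; independent
    // accumulators can overlap, so the latency is counted once.
    unsigned latency = single_defuse_cycle ? base_latency * count : base_latency;
    ops.reduction_latency = std::max(ops.reduction_latency, latency);
}
```

With base_latency 2 and count 4, this sketch records a latency of 8 for a single def-use cycle and 2 otherwise, which is the difference that, per the commit message, decides whether vectorization looks profitable.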