From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from smtp-out2.suse.de (smtp-out2.suse.de [IPv6:2001:67c:2178:6::1d]) by sourceware.org (Postfix) with ESMTPS id 4F4FE385770B for ; Tue, 11 Jul 2023 10:22:25 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 4F4FE385770B Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=suse.de Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=suse.de Received: from relay2.suse.de (relay2.suse.de [149.44.160.134]) by smtp-out2.suse.de (Postfix) with ESMTP id BD99E20263 for ; Tue, 11 Jul 2023 10:22:23 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_rsa; t=1689070943; h=from:from:reply-to:date:date:to:to:cc:mime-version:mime-version: content-type:content-type; bh=xZXSWLhCjt1eL3UW23NwZVPnRuKFNVfZgDi35aAEmVw=; b=ptNf6n+ZocArVVkD43EkL9FRwBfSGjhYEqmEuq+IKbdd/58cpxpgQOy37dQVg5mcWpf+vv +grn+vT9+b6pjYE/W0DPiPuuMU7gyzUq0dc5wTUcfklCQtK0RzWMwYH+8qUhbB7K3O+BpP cp9lhlF2kIdINqMKmKDZQQZCdRp6RDw= DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=suse.de; s=susede2_ed25519; t=1689070943; h=from:from:reply-to:date:date:to:to:cc:mime-version:mime-version: content-type:content-type; bh=xZXSWLhCjt1eL3UW23NwZVPnRuKFNVfZgDi35aAEmVw=; b=npxkrgf/aPMz4lYQRKr8WXe0AP4xK7UhF6aGV0PE/RF6C9hjwI0HYGVW0WhSnnQWRWiFtq 5ouXdTTJeqZtL+Bg== Received: from wotan.suse.de (wotan.suse.de [10.160.0.1]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by relay2.suse.de (Postfix) with ESMTPS id B36ED2C145 for ; Tue, 11 Jul 2023 10:22:23 +0000 (UTC) Date: Tue, 11 Jul 2023 10:22:23 +0000 (UTC) From: Richard Biener To: gcc-patches@gcc.gnu.org Subject: [PATCH] tree-optimization/110614 - SLP splat and re-align (optimized) User-Agent: Alpine 2.22 (LSU 394 2020-01-19) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII X-Spam-Status: No, score=-10.5 required=5.0 tests=BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,GIT_PATCH_0,MISSING_MID,SPF_HELO_NONE,SPF_PASS,TXREP,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org List-Id: Message-ID: <20230711102223.CAqCM9evBczKi-LSMvAR_BWax8T24z9wa6e_4Mh-8Ug@z> The following properly guards the re-align (optimized) paths used on old power CPUs for the added case of SLP splats from non-grouped loads. Testcases are existing in dg-torture. Bootstrapped and tested on x86_64-unknown-linux-gnu, pushed. PR tree-optimization/110614 * tree-vect-data-refs.cc (vect_supportable_dr_alignment): SLP splats are not suitable for re-align ops. --- gcc/tree-vect-data-refs.cc | 9 +++++---- 1 file changed, 5 insertions(+), 4 deletions(-) diff --git a/gcc/tree-vect-data-refs.cc b/gcc/tree-vect-data-refs.cc index ab2af103cb4..9edc8989de9 100644 --- a/gcc/tree-vect-data-refs.cc +++ b/gcc/tree-vect-data-refs.cc @@ -6829,10 +6829,11 @@ vect_supportable_dr_alignment (vec_info *vinfo, dr_vec_info *dr_info, same alignment, instead it depends on the SLP group size. */ if (loop_vinfo && STMT_SLP_TYPE (stmt_info) - && !multiple_p (LOOP_VINFO_VECT_FACTOR (loop_vinfo) - * (DR_GROUP_SIZE - (DR_GROUP_FIRST_ELEMENT (stmt_info))), - TYPE_VECTOR_SUBPARTS (vectype))) + && (!STMT_VINFO_GROUPED_ACCESS (stmt_info) + || !multiple_p (LOOP_VINFO_VECT_FACTOR (loop_vinfo) + * (DR_GROUP_SIZE + (DR_GROUP_FIRST_ELEMENT (stmt_info))), + TYPE_VECTOR_SUBPARTS (vectype)))) ; else if (!loop_vinfo || (nested_in_vect_loop -- 2.35.3