From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from smtp-out2.suse.de (smtp-out2.suse.de [195.135.220.29]) by sourceware.org (Postfix) with ESMTPS id 5AEBF3858C2C for ; Tue, 28 Sep 2021 06:59:30 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 5AEBF3858C2C Received: from relay1.suse.de (relay1.suse.de [149.44.160.133]) by smtp-out2.suse.de (Postfix) with ESMTP id 0F7A9201C9; Tue, 28 Sep 2021 06:59:29 +0000 (UTC) Received: from murzim.suse.de (murzim.suse.de [10.160.4.192]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by relay1.suse.de (Postfix) with ESMTPS id DED4D25D82; Tue, 28 Sep 2021 06:59:28 +0000 (UTC) Date: Tue, 28 Sep 2021 08:59:28 +0200 (CEST) From: Richard Biener To: "sunil.k.pandey" cc: gcc-patches@gcc.gnu.org, richard.sandiford@arm.com, ubizjak@gmail.com Subject: Re: [r12-3893 Regression] FAIL: gcc.target/i386/vect-pr97352.c scan-assembler-times vmov.pd 4 on Linux/x86_64 In-Reply-To: <20210927182833.03B56286470F@gskx-2.sc.intel.com> Message-ID: <543s464s-6282-so3o-rqr2-o9o78851o1@fhfr.qr> References: <20210927182833.03B56286470F@gskx-2.sc.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII X-Spam-Status: No, score=-4.7 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_NUMSUBJECT, KAM_SHORT, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=no autolearn_force=no version=3.4.4 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Sep 2021 06:59:31 -0000 On Mon, 27 Sep 2021, sunil.k.pandey wrote: > On Linux/x86_64, > > 6390c5047adb75960f86d56582e6322aaa4d9281 is the first bad commit > commit 6390c5047adb75960f86d56582e6322aaa4d9281 > Author: Richard Biener > Date: Wed Nov 18 09:36:57 2020 +0100 > > Allow different vector types for stmt groups > > caused > > FAIL: gcc.dg/vect/bb-slp-17.c -flto -ffat-lto-objects scan-tree-dump-times slp2 "optimized: basic block" 1 > FAIL: gcc.dg/vect/bb-slp-17.c scan-tree-dump-times slp2 "optimized: basic block" 1 This shows that it is maybe a bad idea to support V2SImode vectorization with -m32 when we refuse to implement even plus. OTOH it's just the mode that's available, autovectorize_vector_modes doesn't include the corresponding mode but we still pick it up via the related vector mode for group-size == 2. > FAIL: gcc.dg/vect/bb-slp-pr65935.c -flto -ffat-lto-objects scan-tree-dump-times slp1 "optimized: basic block" 10 > FAIL: gcc.dg/vect/bb-slp-pr65935.c scan-tree-dump-times slp1 "optimized: basic block" 10 We are now vectorizing the SSE tail when vectorizing with AVX. I'll adjust the testcase to prefer SSE. > FAIL: gcc.target/i386/vect-pr97352.c scan-assembler-times vmov.pd 4 With -mach=cascadelake we get vpermpd $68, c, %ymm0 vpermpd $238, c, %ymm0 instead of vmovapd c, %ymm1 vinsertf128 $1, %xmm1, %ymm1, %ymm0 vperm2f128 $49, %ymm1, %ymm1, %ymm0 what's a way to disallow additional -march= from taking effect? It's really impossible to cater for all possible ISA variants in these kind of testcases. Richard.