From mboxrd@z Thu Jan 1 00:00:00 1970
From: Hongtao Liu
Date: Tue, 28 Sep 2021 15:43:52 +0800
Subject: Re: [r12-3893 Regression] FAIL: gcc.target/i386/vect-pr97352.c scan-assembler-times vmov.pd 4 on Linux/x86_64
To: Richard Biener
Cc: "sunil.k.pandey", Richard Sandiford, GCC Patches
References: <20210927182833.03B56286470F@gskx-2.sc.intel.com> <543s464s-6282-so3o-rqr2-o9o78851o1@fhfr.qr>
In-Reply-To: <543s464s-6282-so3o-rqr2-o9o78851o1@fhfr.qr>
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
List-Id: Gcc-patches mailing list

On Tue, Sep 28, 2021 at 2:59 PM Richard Biener via Gcc-patches wrote:
>
> On Mon, 27 Sep 2021, sunil.k.pandey wrote:
>
> > On Linux/x86_64,
> >
> > 6390c5047adb75960f86d56582e6322aaa4d9281 is the first bad commit
> > commit 6390c5047adb75960f86d56582e6322aaa4d9281
> > Author: Richard Biener
> > Date: Wed Nov 18 09:36:57 2020 +0100
> >
> > Allow different vector types for stmt groups
> >
> > caused
> >
> > FAIL: gcc.dg/vect/bb-slp-17.c -flto -ffat-lto-objects scan-tree-dump-times slp2 "optimized: basic block" 1
> > FAIL: gcc.dg/vect/bb-slp-17.c scan-tree-dump-times slp2 "optimized: basic block" 1
>
> This shows that it is maybe a bad idea to support V2SImode vectorization
> with -m32 when we refuse to implement even plus.
>
> OTOH it's just the mode that's available, autovectorize_vector_modes
> doesn't include the corresponding mode but we still pick it up via
> the related vector mode for group-size == 2.
>
> > FAIL: gcc.dg/vect/bb-slp-pr65935.c -flto -ffat-lto-objects scan-tree-dump-times slp1 "optimized: basic block" 10
> > FAIL: gcc.dg/vect/bb-slp-pr65935.c scan-tree-dump-times slp1 "optimized: basic block" 10
>
> We are now vectorizing the SSE tail when vectorizing with AVX. I'll
> adjust the testcase to prefer SSE.
>
> > FAIL: gcc.target/i386/vect-pr97352.c scan-assembler-times vmov.pd 4
>
> With -march=cascadelake we get
>
>     vpermpd $68, c, %ymm0
>     vpermpd $238, c, %ymm0
>
> instead of
>
>     vmovapd c, %ymm1
>     vinsertf128 $1, %xmm1, %ymm1, %ymm0
>     vperm2f128 $49, %ymm1, %ymm1, %ymm0
>
> what's a way to disallow additional -march= from taking effect? It's

I usually add -mno-{avx,avx512f} and -mtune=generic, or sometimes
-mprefer-vector-width=*, to the testcases, or use
(?:vinsertf128|vpermpd) to match alternative instructions.

> really impossible to cater for all possible ISA variants in these kind
> of testcases.

Testing with an additional -march=cascadelake option can sometimes find
real regressions.

>
> Richard.

--
BR,
Hongtao
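For illustration, the suggestions above might look like this in a testcase. This is only a minimal sketch and not the actual vect-pr97352.c: the function body, the array names, and the exact option set are assumptions made up for the example, and whether GCC emits either instruction for this toy function has not been checked here.

```c
/* Hypothetical testcase sketch; NOT the real vect-pr97352.c.  */
/* { dg-do compile } */
/* Illustrative options only: -mtune=generic keeps tuning independent of
   any extra -march= passed by the test run.  */
/* { dg-options "-O3 -mavx -mtune=generic" } */

double a[4], c[2];

/* Store the two-element load from c twice, so the vectorizer may
   build a 256-bit vector by duplicating a 128-bit half.  */
void
foo (void)
{
  a[0] = c[0];
  a[1] = c[1];
  a[2] = c[0];
  a[3] = c[1];
}

/* Accept either way of duplicating the 128-bit half.  The (?:...)
   group is non-capturing, so it does not affect how matches are
   counted.  */
/* { dg-final { scan-assembler "(?:vinsertf128|vpermpd)" } } */
```

The sketch uses scan-assembler rather than scan-assembler-times because the two sequences quoted above contain different numbers of the matched instructions (two vpermpd versus one vinsertf128), so a single fixed count would not fit both.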