From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 106461 invoked by alias); 9 Apr 2015 15:50:44 -0000 Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-patches-owner@gcc.gnu.org Received: (qmail 106450 invoked by uid 89); 9 Apr 2015 15:50:43 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-2.2 required=5.0 tests=AWL,BAYES_00,FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS autolearn=ham version=3.3.2 X-HELO: mail-ob0-f169.google.com Received: from mail-ob0-f169.google.com (HELO mail-ob0-f169.google.com) (209.85.214.169) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with (AES128-GCM-SHA256 encrypted) ESMTPS; Thu, 09 Apr 2015 15:50:41 +0000 Received: by obbfy7 with SMTP id fy7so20671034obb.2 for ; Thu, 09 Apr 2015 08:50:39 -0700 (PDT) MIME-Version: 1.0 X-Received: by 10.182.19.167 with SMTP id g7mr2688497obe.75.1428594638661; Thu, 09 Apr 2015 08:50:38 -0700 (PDT) Received: by 10.60.168.19 with HTTP; Thu, 9 Apr 2015 08:50:38 -0700 (PDT) In-Reply-To: <20150409143658.GA48753@msticlxl57.ims.intel.com> References: <20150409143658.GA48753@msticlxl57.ims.intel.com> Date: Thu, 09 Apr 2015 15:50:00 -0000 Message-ID: Subject: Re: [PATCH, i386] Fix PR target/65671. Generate 32x4 extract even for DF in absence of AVX-512DQ. From: Uros Bizjak To: Kirill Yukhin Cc: GCC Patches Content-Type: text/plain; charset=UTF-8 X-SW-Source: 2015-04/txt/msg00402.txt.bz2 On Thu, Apr 9, 2015 at 4:37 PM, Kirill Yukhin wrote: > Patch in the bottom fixes PR target/65671. > > It simply generates vextract32x4 (float form) for double extract. > > Bootstrap & regtesting in progress. > > I'll check it in if pass and back port to 4.9.x. > Feel free comment. > > gcc/ > * config/i386/sse.md: Generate vextract32x4 if AVX-512DQ > is disabled. > > gcc/testsuite/ > * gcc.target/i386/pr65671.c: New. > OK with fixed ChangeLogs. (Please add PR target/65671 to both ChangeLogs). Thanks, Uros. > commit cb8d5b1c3156d81ae81600217d0861be1aade0ec > Author: Kirill Yukhin > Date: Thu Apr 9 13:05:54 2015 +0300 > > Fix PR target/65671. Generate 32x4 extract even for DF in absence of AVX-512DQ. > > diff --git a/gcc/config/i386/sse.md b/gcc/config/i386/sse.md > index 490fd6b..6d3b54a 100644 > --- a/gcc/config/i386/sse.md > +++ b/gcc/config/i386/sse.md > @@ -7015,10 +7015,15 @@ > (vec_select: > (match_operand:VI8F_256 1 "register_operand" "v,v") > (parallel [(const_int 2) (const_int 3)])))] > - "TARGET_AVX" > + "TARGET_AVX && && " > { > - if (TARGET_AVX512DQ && TARGET_AVX512VL) > - return "vextract64x2\t{$0x1, %1, %0|%0, %1, 0x1}"; > + if (TARGET_AVX512VL) > + { > + if (TARGET_AVX512DQ) > + return "vextract64x2\t{$0x1, %1, %0|%0, %1, 0x1}"; > + else > + return "vextract32x4\t{$0x1, %1, %0|%0, %1, 0x1}"; > + } > else > return "vextract\t{$0x1, %1, %0|%0, %1, 0x1}"; > } > diff --git a/gcc/testsuite/gcc.target/i386/pr65671.c b/gcc/testsuite/gcc.target/i386/pr65671.c > new file mode 100644 > index 0000000..8e5d00d > --- /dev/null > +++ b/gcc/testsuite/gcc.target/i386/pr65671.c > @@ -0,0 +1,15 @@ > +/* PR target/65671 */ > +/* { dg-do assemble } */ > +/* { dg-require-effective-target lp64 } */ > +/* { dg-options "-O2 -mavx512vl -ffixed-ymm16" } */ > + > +#include > + > +register __m256d a asm ("ymm16"); > +__m128d b; > + > +void > +foo () > +{ > + b = _mm256_extractf128_pd (a, 1); > +}