From: Ilya Enkovich
To: Richard Biener
Cc: Yuri Rumyantsev, gcc-patches, Jeff Law, Igor Zamyatin
Date: Tue, 10 Nov 2015 13:02:00 -0000
Subject: Re: [RFC] Combine vectorized loops with its scalar remainder.

2015-11-10 15:30 GMT+03:00 Richard Biener:
> On Tue, Nov 3, 2015 at 1:08 PM, Yuri Rumyantsev wrote:
>> Richard,
>>
>> It looks like misunderstanding - we assume that for GCCv6 the simple
>> scheme of remainder will be used through introducing new IV :
>> https://gcc.gnu.org/ml/gcc-patches/2015-09/msg01435.html
>>
>> Is it true or we missed something?
>
>
>> > Do you have an idea how "masking" is better be organized to be usable
>> > for both 4b and 4c?
>>
>> Do 2a ...
> Okay.
>

2a was 'transform already vectorized loop as a separate post-processing'.
Isn't that what this prototype patch implements?  The current version
masks only the loop body, which in practice is applicable only to AVX-512
in most cases.  With AVX-512 it's easier to see how profitable masking
might be, and it is the main target for the first masking version.
Extending it to prologues/epilogues, and thus making it more profitable
for other targets, is the next step and is out of the scope of this patch.

Thanks,
Ilya

>
> Richard.
>
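
For readers following the thread, here is a rough sketch of what "masking the loop body" means at the source level. It is not taken from the patch and does not reflect how GCC represents the transformation internally. The names add_with_remainder, add_masked, shapes_agree and the factor VF are hypothetical, and the inner per-lane loop only emulates in scalar C what a masked vector load/store (for example an AVX-512 opmask operation) would do in hardware.

```c
#include <assert.h>
#include <stddef.h>

#define VF 8  /* hypothetical vectorization factor (lanes per vector) */

/* Conventional shape: a vector main loop followed by a scalar remainder. */
void add_with_remainder(const int *a, const int *b, int *out, size_t n)
{
    size_t i = 0;
    for (; i + VF <= n; i += VF)                 /* full vectors only */
        for (size_t lane = 0; lane < VF; lane++)
            out[i + lane] = a[i + lane] + b[i + lane];
    for (; i < n; i++)                           /* scalar remainder */
        out[i] = a[i] + b[i];
}

/* Masked shape: a single loop; lanes past n are switched off by a mask,
   so the final partial vector needs no separate remainder loop. */
void add_masked(const int *a, const int *b, int *out, size_t n)
{
    for (size_t i = 0; i < n; i += VF) {
        for (size_t lane = 0; lane < VF; lane++) {
            int mask = (i + lane < n);           /* per-lane predicate */
            if (mask)                            /* emulates masked load/store */
                out[i + lane] = a[i + lane] + b[i + lane];
        }
    }
}

/* Returns 1 if both shapes produce identical results for a trip count
   that is not a multiple of VF, 0 otherwise. */
int shapes_agree(void)
{
    enum { N = 19 };
    int a[N], b[N], r1[N], r2[N];

    assert(N % VF != 0);                         /* remainder path is exercised */
    for (int i = 0; i < N; i++) {
        a[i] = i;
        b[i] = 2 * i;
        r1[i] = r2[i] = -1;
    }
    add_with_remainder(a, b, r1, N);
    add_masked(a, b, r2, N);
    for (int i = 0; i < N; i++)
        if (r1[i] != r2[i] || r1[i] != 3 * i)
            return 0;
    return 1;
}
```

In the masked shape every iteration pays for computing the predicate, which is why the message treats profitability as target-dependent: it is easiest to evaluate where masked operations are cheap, as with AVX-512.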