From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 66659 invoked by alias); 11 Nov 2015 12:06:44 -0000 Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-patches-owner@gcc.gnu.org Received: (qmail 66648 invoked by uid 89); 11 Nov 2015 12:06:43 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-1.8 required=5.0 tests=AWL,BAYES_00,RP_MATCHES_RCVD,SPF_HELO_PASS autolearn=ham version=3.3.2 X-HELO: mx1.redhat.com Received: from mx1.redhat.com (HELO mx1.redhat.com) (209.132.183.28) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with (AES256-GCM-SHA384 encrypted) ESMTPS; Wed, 11 Nov 2015 12:06:43 +0000 Received: from int-mx09.intmail.prod.int.phx2.redhat.com (int-mx09.intmail.prod.int.phx2.redhat.com [10.5.11.22]) by mx1.redhat.com (Postfix) with ESMTPS id C579A79; Wed, 11 Nov 2015 12:06:41 +0000 (UTC) Received: from localhost.localdomain (vpn1-7-11.ams2.redhat.com [10.36.7.11]) by int-mx09.intmail.prod.int.phx2.redhat.com (8.14.4/8.14.4) with ESMTP id tABC6eSg015359; Wed, 11 Nov 2015 07:06:41 -0500 Subject: Re: [ptx] partitioning optimization To: Nathan Sidwell , GCC Patches References: <564270D6.6090303@acm.org> From: Bernd Schmidt Message-ID: <56432F50.6000208@redhat.com> Date: Wed, 11 Nov 2015 12:06:00 -0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.3.0 MIME-Version: 1.0 In-Reply-To: <564270D6.6090303@acm.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-IsSubscribed: yes X-SW-Source: 2015-11/txt/msg01346.txt.bz2 On 11/10/2015 11:33 PM, Nathan Sidwell wrote: > I've committed this patch to trunk. It implements a partitioning > optimization for a loop partitioned over both vector and worker axes. > We can elide the inner vector partitioning state propagation, if there > are no intervening instructions in the worker-partitioned outer loop > other than the forking and joining. We simply execute the worker > propagation on all vectors. Patch LGTM, although I wonder if you really need the extra option rather than just optimize. > I've been unable to introduce a testcase for this. The difficulty is we > want to check an rtl dump from the acceleration compiler, and there > doesn't appear to be existing machinery for that in the testsuite. > Perhaps something to be added later? What's the difficulty exactly? Getting a dump should be possible with -foffload=-fdump-whatever, does the testsuite have a problem finding the right filename? Bernd