From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <gcc-patches-return-386998-listarch-gcc-patches=gcc.gnu.org@gcc.gnu.org>
Received: (qmail 11420 invoked by alias); 9 Dec 2014 19:15:43 -0000
Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
List-Id: <gcc-patches.gcc.gnu.org>
List-Archive: <http://gcc.gnu.org/ml/gcc-patches/>
List-Post: <mailto:gcc-patches@gcc.gnu.org>
List-Help: <mailto:gcc-patches-help@gcc.gnu.org>
Sender: gcc-patches-owner@gcc.gnu.org
Received: (qmail 11349 invoked by uid 89); 9 Dec 2014 19:15:37 -0000
Authentication-Results: sourceware.org; auth=none
X-Virus-Found: No
X-Spam-SWARE-Status: No, score=-1.8 required=5.0 tests=AWL,BAYES_00,SPF_HELO_PASS,SPF_PASS,T_RP_MATCHES_RCVD autolearn=ham version=3.3.2
X-HELO: mx1.redhat.com
Received: from mx1.redhat.com (HELO mx1.redhat.com) (209.132.183.28) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with (AES256-GCM-SHA384 encrypted) ESMTPS; Tue, 09 Dec 2014 19:15:36 +0000
Received: from int-mx13.intmail.prod.int.phx2.redhat.com (int-mx13.intmail.prod.int.phx2.redhat.com [10.5.11.26])	by mx1.redhat.com (8.14.4/8.14.4) with ESMTP id sB9JFVdZ015805	(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=FAIL);	Tue, 9 Dec 2014 14:15:31 -0500
Received: from [10.3.113.190] (ovpn-113-190.phx2.redhat.com [10.3.113.190])	by int-mx13.intmail.prod.int.phx2.redhat.com (8.14.4/8.14.4) with ESMTP id sB9JFUfj031999;	Tue, 9 Dec 2014 14:15:31 -0500
Message-ID: <54874A52.3030304@redhat.com>
Date: Tue, 09 Dec 2014 19:15:00 -0000
From: Jeff Law <law@redhat.com>
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.2.0
MIME-Version: 1.0
To: Segher Boessenkool <segher@kernel.crashing.org>,        Zhenqiang Chen <zhenqiang.chen@arm.com>
CC: gcc-patches@gcc.gnu.org
Subject: Re: [PATCH] Fix PR 61225
References: <CACgzC7A7+JXBQ-HCcoqjhmbjoALCi8nngJy2J8SXHF1GRvyQPQ@mail.gmail.com> <CABu31nP5TqCU3iB-3jBTU93bORrCAmohq5nzcVBdjx2cqabrzA@mail.gmail.com> <CACgzC7D40=bxqag59GYQuXtze_ZAquo3PhpLmh0EGxZte5Mw4A@mail.gmail.com> <53C73EBD.9070909@redhat.com> <CACgzC7CJnoaOsFMDjXDZ0CLiECWPbejxDHg2GOk_AUtjrzC80w@mail.gmail.com> <547CE75B.6090709@redhat.com> <000001d00f9e$64f43600$2edca200$@arm.com> <54861817.4030409@redhat.com> <000301d01395$675d5890$361809b0$@arm.com> <20141209190744.GA8314@gate.crashing.org>
In-Reply-To: <20141209190744.GA8314@gate.crashing.org>
Content-Type: text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding: 7bit
X-IsSubscribed: yes
X-SW-Source: 2014-12/txt/msg00827.txt.bz2

On 12/09/14 12:07, Segher Boessenkool wrote:
> On Tue, Dec 09, 2014 at 05:49:18PM +0800, Zhenqiang Chen wrote:
>>> Do you need to verify SETA and SETB satisfy single_set?  Or has that
>>> already been done elsewhere?
>>
>> A is NEXT_INSN (insn)
>> B is prev_nonnote_nondebug_insn (insn),
>>
>> For I1 -> I2 -> B; I2 -> A;
>> LOG_LINK can make sure I1 and I2 are single_set,
>
> It cannot, not anymore anyway.  LOG_LINKs can point to an insn with multiple
> SETs; multiple LOG_LINKs can point to such an insn.
So let's go ahead and put a single_set test in this function.

>>>> Is this fragment really needed?  Does it ever trigger?  I'd think that
>>> for > 2 uses punting would be fine.  Do we really commonly have cases
>>> with > 2 uses, but where they're all in SETA and SETB?
>
> Can't you just check for a death note on the second insn?  Together with
> reg_used_between_p?
Yeah, I think that'd accomplish the same thing Zhenqiang is trying to 
catch, and it's simpler than walking the lists.
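
To make the suggested check concrete, here is a minimal sketch on a toy 
model, not GCC's real data structures.  struct toy_insn, 
toy_reg_used_between_p and toy_only_used_by_pair_p are hypothetical names 
made up for illustration; registers are bit positions in a mask and the 
"dies" field stands in for a REG_DEAD note.  In combine itself the test 
would presumably be built from a death-note lookup plus reg_used_between_p.

```c
#include <stdbool.h>
#include <stddef.h>

/* Toy stand-in for an insn.  This is NOT GCC's rtx_insn.  */
struct toy_insn
{
  unsigned uses;  /* Bitmask of registers this insn reads.  */
  unsigned dies;  /* Bitmask of registers that die here (REG_DEAD-like).  */
};

/* Return true if register REG is read by any insn strictly between
   positions FROM and TO of INSNS.  */
static bool
toy_reg_used_between_p (const struct toy_insn *insns, size_t from,
                        size_t to, unsigned reg)
{
  for (size_t i = from + 1; i < to; i++)
    if (insns[i].uses & (1u << reg))
      return true;
  return false;
}

/* Return true if REG dies at position SECOND and is not read by any
   insn strictly between FIRST and SECOND.  Together these mean nothing
   other than the two insns themselves reads the value in that range.  */
static bool
toy_only_used_by_pair_p (const struct toy_insn *insns, size_t first,
                         size_t second, unsigned reg)
{
  if (!(insns[second].dies & (1u << reg)))
    return false;
  return !toy_reg_used_between_p (insns, first, second, reg);
}
```

The point of the two-part test is that the death note rules out uses after 
the second insn and the between check rules out uses in the middle, so no 
use-list walk is needed.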

>
>>>> +	  /* Try to combine a compare insn that sets CC
>>>> +	     with a preceding insn that can set CC, and maybe with its
>>>> +	     logical predecessor as well.
>>>> +	     We need this special code because data flow connections
>>>> +	     do not get entered in LOG_LINKS.  */
>
> I think you mean "not _all_ data flow connections"?
I almost said something about this comment, but figured I was nitpicking 
too much :-)

>>> So you've got two new combine cases here, but I think the testcase only
>>> tests one of them.  Can you include a testcase for both of the major
>>> paths above (I1->I2->I3; I2->insn and I2->I3; I2->INSN)
>>
>> pr61225.c is the case to cover I1->I2->I3; I2->insn.
>>
>> For I2 -> I3; I2 -> insn, I tried my test cases and found peephole2 can also
>> handle them. So I removed the code from the patch.
>
> Why?  The simpler case has much better chances of being used.
The question is, does it actually catch anything not already handled?  I 
guess you could argue that doing it in combine is better than peep2 and 
I'd agree with that.

>
> In fact, there are many more cases you could handle:
>
> You handle
>
> I1 -> I2 -> I3; I2 -> insn
>        I2 -> I3; I2 -> insn
>
> but there are also
>
>     I1,I2 -> I3; I2 -> insn
>
> and the many 4-insn combos, too.
Yes, but I wonder how much of this is really necessary in practice.  We 
could do exhaustive testing here, but I suspect the payoff isn't all 
that great.  Thus I'm comfortable with faulting in the cases we actually 
find are useful in practice.

>
>> +/* A is a compare (reg1, 0) and B is SINGLE_SET which SET_SRC is reg2.
>> +   It returns TRUE, if reg1 == reg2, and no other refer of reg1
>> +   except A and B.  */
>
> That sounds like the only correct inputs are such a compare etc., but the
> routine tests whether that is true.
Correct, the RTL has to have a specific form and that is tested for. 
Comment updates can't hurt.


>
>> +static bool
>> +can_reuse_cc_set_p (rtx_insn *a, rtx_insn *b)
>> +{
>> +  rtx seta = single_set (a);
>> +  rtx setb = single_set (b);
>> +
>> +  if (BLOCK_FOR_INSN (a) != BLOCK_FOR_INSN (b)
>
> Neither the comment nor the function name mention this.  This test is
> better placed in the caller of this function, anyway.
Didn't consider it terribly important.  Moving it to the caller doesn't 
change anything significantly, though I would agree it's marginally cleaner.

>
>> @@ -3323,7 +3396,11 @@ try_combine (rtx_insn *i3, rtx_insn *i2, rtx_insn
>> *i1, rtx_insn *i0,
>>   	  rtx old = newpat;
>>   	  total_sets = 1 + extra_sets;
>>   	  newpat = gen_rtx_PARALLEL (VOIDmode, rtvec_alloc (total_sets));
>> -	  XVECEXP (newpat, 0, 0) = old;
>> +
>> +	  if (to_combined_insn)
>> +	    XVECEXP (newpat, 0, --total_sets) = old;
>> +	  else
>> +	    XVECEXP (newpat, 0, 0) = old;
>>   	}
>
> Is this correct?  If so, it needs a big fat comment, because it is
> not exactly obvious :-)
>
> Also, it doesn't handle at all the case where the new pattern already is
> a PARALLEL; can that never happen?
I'd convinced myself it was.  But yes, a comment here would be good.
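
For what such a comment would need to explain, here is a rough sketch of 
the slot arithmetic on a plain int array rather than a real PARALLEL. 
toy_build_parallel and its parameters are hypothetical.  It also assumes 
the remaining sets are stored afterwards with the same pre-decrement 
idiom, from the highest free index downward; that later code is not part 
of the quoted hunk.

```c
#include <stddef.h>

/* Fill VEC, which must have room for 1 + N_EXTRA entries.  OLD stands
   for the original pattern and EXTRA[] for the added sets.  If OLD_LAST
   is nonzero, OLD goes in the last slot (the to_combined_insn branch of
   the quoted hunk); otherwise it goes in slot 0.  */
static void
toy_build_parallel (int *vec, int old, const int *extra, size_t n_extra,
                    int old_last)
{
  size_t total_sets = 1 + n_extra;

  if (old_last)
    /* The pre-decrement both selects the last slot and shrinks the
       count, so the stores below cannot overwrite OLD.  */
    vec[--total_sets] = old;
  else
    vec[0] = old;

  /* Assumption: the added sets fill the free slots from the top down.  */
  for (size_t i = 0; i < n_extra; i++)
    vec[--total_sets] = extra[i];
}
```

With old = 1 and extra = {10, 20}, the first-slot variant yields 
{1, 20, 10} and the last-slot variant yields {20, 10, 1}.  Under that 
assumption every slot is written exactly once in both cases.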

Presumably you're thinking about a PARALLEL that satisfies single_set?

jeff