From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <gcc-patches-return-472952-listarch-gcc-patches=gcc.gnu.org@gcc.gnu.org>
Received: (qmail 55009 invoked by alias); 9 Feb 2018 14:50:11 -0000
Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
List-Id: <gcc-patches.gcc.gnu.org>
List-Archive: <http://gcc.gnu.org/ml/gcc-patches/>
List-Post: <mailto:gcc-patches@gcc.gnu.org>
List-Help: <mailto:gcc-patches-help@gcc.gnu.org>
Sender: gcc-patches-owner@gcc.gnu.org
Received: (qmail 54996 invoked by uid 89); 9 Feb 2018 14:50:11 -0000
Authentication-Results: sourceware.org; auth=none
X-Virus-Found: No
X-Spam-SWARE-Status: No, score=-0.9 required=5.0 tests=BAYES_00,KAM_LAZY_DOMAIN_SECURITY,T_RP_MATCHES_RCVD autolearn=no version=3.3.2 spammy=(unknown)
X-HELO: foss.arm.com
Received: from usa-sjc-mx-foss1.foss.arm.com (HELO foss.arm.com) (217.140.101.70) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Fri, 09 Feb 2018 14:50:08 +0000
Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249])	by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 1ABC41435;	Fri,  9 Feb 2018 06:50:07 -0800 (PST)
Received: from [10.2.207.77] (e100706-lin.cambridge.arm.com [10.2.207.77])	by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 501F23F487;	Fri,  9 Feb 2018 06:50:06 -0800 (PST)
Message-ID: <5A7DB51C.20108@foss.arm.com>
Date: Fri, 09 Feb 2018 14:50:00 -0000
From: Kyrill  Tkachov <kyrylo.tkachov@foss.arm.com>
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.2.0
MIME-Version: 1.0
To: Uros Bizjak <ubizjak@gmail.com>
CC: "gcc-patches@gcc.gnu.org" <gcc-patches@gcc.gnu.org>,  Jeff Law <law@redhat.com>, richard.sandiford@linaro.org
Subject: Re: [PATCH][i386][3/3] PR target/84164: Make *cmpqi_ext_<n> patterns accept more zero_extract modes
References: <5A7C84A7.9090500@foss.arm.com> <CAFULd4aEk4_t=4KqyStu1QFdTiscLeuezd4D9pGsewkVNfGveQ@mail.gmail.com>
In-Reply-To: <CAFULd4aEk4_t=4KqyStu1QFdTiscLeuezd4D9pGsewkVNfGveQ@mail.gmail.com>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
X-SW-Source: 2018-02/txt/msg00520.txt.bz2

Hi Uros,

On 08/02/18 22:54, Uros Bizjak wrote:
> On Thu, Feb 8, 2018 at 6:11 PM, Kyrill  Tkachov
> <kyrylo.tkachov@foss.arm.com> wrote:
>> Hi all,
>>
>> This patch fixes some fallout in the i386 testsuite that occurs after the
>> simplification in patch [1/3] [1].
>> The gcc.target/i386/extract-2.c FAILs because it expects to match:
>> (set (reg:CC 17 flags)
>>      (compare:CC (subreg:QI (zero_extract:SI (reg:HI 98)
>>                  (const_int 8 [0x8])
>>                  (const_int 8 [0x8])) 0)
>>          (const_int 4 [0x4])))
>>
>> which is the *cmpqi_ext_2 pattern in i386.md but with the new simplification
>> the combine/simplify-rtx
>> machinery produces:
>> (set (reg:CC 17 flags)
>>      (compare:CC (subreg:QI (zero_extract:HI (reg:HI 98)
>>                  (const_int 8 [0x8])
>>                  (const_int 8 [0x8])) 0)
>>          (const_int 4 [0x4])))
>>
>> Notice that the zero_extract now has HImode like the register source rather
>> than SImode.
>> The existing *cmpqi_ext_<n> patterns however explicitly demand an SImode on
>> the zero_extract.
>> I'm not overly familiar with the i386 port but I think that's too
>> restrictive.
>> The RTL documentation says:
>> For (zero_extract:m loc size pos) "The mode m is the same as the mode that
>> would be used for loc if it were a register."
>> I'm not sure if that means that the mode of the zero_extract and the source
>> register must always match (as is the
>> case after patch [1/3]) but in any case it shouldn't matter semantically
>> since we're taking a QImode subreg of the whole
>> thing anyway.
>>
>> So the proposed solution in this patch is to allow HI, SI and DImode
>> zero_extracts in these patterns as these are the
>> modes that the ext_register_operand predicate accepts, so that the patterns
>> can match the new form above.
>>
>> With this patch the aforementioned test passes again and bootstrap and
>> testing on x86_64-unknown-linux-gnu shows
>> no regressions.
>>
>> Is this ok for trunk if the first patch is accepted?
> Huh, there are many other zero-extract patterns besides cmpqi_ext_*
> with QImode subreg of SImode zero_extract in i386.md, used to access
> high QImode register of HImode pair. A quick grep shows these that
> have _ext_ in their name:
>
> (define_insn "*cmpqi_ext_1"
> (define_insn "*cmpqi_ext_2"
> (define_expand "cmpqi_ext_3"
> (define_insn "*cmpqi_ext_3"
> (define_insn "*cmpqi_ext_4"
> (define_insn "addqi_ext_1"
> (define_insn "*addqi_ext_2"
> (define_expand "testqi_ext_1_ccno"
> (define_insn "*testqi_ext_1"
> (define_insn "*testqi_ext_2"
> (define_insn_and_split "*testqi_ext_3"
> (define_insn "andqi_ext_1"
> (define_insn "*andqi_ext_1_cc"
> (define_insn "*andqi_ext_2"
> (define_insn "*<code>qi_ext_1"
> (define_insn "*<code>qi_ext_2"
> (define_expand "xorqi_ext_1_cc"
> (define_insn "*xorqi_ext_1_cc"
>
> There are also relevant splitters and peephole2 patterns.

I see. Another approach I've looked at is removing the mode specifier from
the zero_extract in these patterns. This means that they can be of any mode
so they will match all of these modes without creating new patterns through
iterators. That also works for the testcase and passes bootstrap and testing
however there is the snag that the define_insns that don't start with a "*"
are used to generate RTL through the gen_* mechanism and in that context
the absence of a mode on the zero_extract would mean a VOIDmode zero_extract
would be created, which I'm fairly sure is not good. So in my experiments I left
those patterns alone (with an explicit SI on the zero_extract).

> IIRC, SImode zero_extract was enough to catch all high-register uses.
> There will be a pattern explosion if we want to handle all other
> integer modes here. However, I'm not a RTL expert, so someone will
> have to say what is the correct RTX form here.

Jeff, Richard, could you please give us some guidance on this issue?
Sorry for the trouble.

Thanks,
Kyrill

> Uros.