From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <gcc-patches-return-456070-listarch-gcc-patches=gcc.gnu.org@gcc.gnu.org>
Received: (qmail 54685 invoked by alias); 16 Jun 2017 07:53:00 -0000
Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
List-Id: <gcc-patches.gcc.gnu.org>
List-Archive: <http://gcc.gnu.org/ml/gcc-patches/>
List-Post: <mailto:gcc-patches@gcc.gnu.org>
List-Help: <mailto:gcc-patches-help@gcc.gnu.org>
Sender: gcc-patches-owner@gcc.gnu.org
Received: (qmail 54386 invoked by uid 89); 16 Jun 2017 07:53:00 -0000
From: Richard Sandiford <richard.sandiford@linaro.org>
To: Tamar Christina <Tamar.Christina@arm.com>
Mail-Followup-To: Tamar Christina <Tamar.Christina@arm.com>,GCC Patches <gcc-patches@gcc.gnu.org>,  nd <nd@arm.com>,  James Greenhalgh	<James.Greenhalgh@arm.com>,  Marcus Shawcroft <Marcus.Shawcroft@arm.com>, 	Richard Earnshaw <Richard.Earnshaw@arm.com>, richard.sandiford@linaro.org
Cc: GCC Patches <gcc-patches@gcc.gnu.org>,  nd <nd@arm.com>,  James Greenhalgh	<James.Greenhalgh@arm.com>,  Marcus Shawcroft <Marcus.Shawcroft@arm.com>, 	Richard Earnshaw <Richard.Earnshaw@arm.com>
Subject: Re: [PATCH][GCC][AArch64] optimize float immediate moves (2 /4) - HF/DF/SF mode.
References: <VI1PR0801MB203117D353E8A26AA9909037FFC80@VI1PR0801MB2031.eurprd08.prod.outlook.com>	<VI1PR0801MB20316A7D3165E0EDD14CD43DFFCD0@VI1PR0801MB2031.eurprd08.prod.outlook.com>	<87mv9adgtb.fsf@linaro.org>	<VI1PR0801MB2031B9D822E987A87E7EB218FFC00@VI1PR0801MB2031.eurprd08.prod.outlook.com>	<VI1PR0801MB20319CC7B60446DEDF77EE72FFC00@VI1PR0801MB2031.eurprd08.prod.outlook.com>
Date: Fri, 16 Jun 2017 07:53:00 -0000
In-Reply-To: <VI1PR0801MB20319CC7B60446DEDF77EE72FFC00@VI1PR0801MB2031.eurprd08.prod.outlook.com>	(Tamar Christina's message of "Thu, 15 Jun 2017 14:28:27 +0000")
Message-ID: <87fuf0z7vq.fsf@linaro.org>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.2 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain
X-SW-Source: 2017-06/txt/msg01165.txt.bz2

Tamar Christina <Tamar.Christina@arm.com> writes:
> Hi Richard,
>> > > +    rtx tmp = gen_reg_rtx (SImode);
>> > > +    aarch64_expand_mov_immediate (tmp, gen_int_mode (ival, SImode));
>> > > +    tmp = simplify_gen_subreg (HImode, tmp, SImode, 0);
>> >
>> > This looks wrong for big-endian, and...
>> >
>> > > +    emit_move_insn (operands[0], gen_lowpart (HFmode, tmp));
>> >
>> > ...either it should be OK to go directly from tmp to the HFmode
>> > lowpart, or we should move the HImode temporary into a fresh REG.
>> > Current validate_subreg seems to suggest that we need the latter.
>> >
>> > Isn't it possible to use a HImode move immediate instead of an SImode
>> > one?
>> 
>> We don't really have a movhi pattern; currently a movhi would end up
>> in the general mov<mode>_aarch64 pattern

movqi and movhi patterns are defined from the same mov<mode> template,
but they're still "proper" move patterns.

>> which would then end up using a w register as well.

Isn't that what you want though?  f16_mov_immediate_1.c is testing for:

/* { dg-final { scan-assembler-times "mov\tw\[0-9\]+, #?19520"           3 } } */
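
For reference, 19520 is 0x4c40, which read as an IEEE 754 binary16 bit
pattern is 17.0.  A minimal host-side sketch of that decoding follows;
decode_binary16 is a hypothetical helper written for illustration, not
something in GCC or the testsuite:

```c
#include <math.h>
#include <stdint.h>

/* Decode an IEEE 754 binary16 bit pattern into a float.
   Hypothetical helper for illustration only.  */
static float
decode_binary16 (uint16_t bits)
{
  int sign = (bits >> 15) & 1;
  int exponent = (bits >> 10) & 0x1f;
  int mantissa = bits & 0x3ff;
  float value;

  if (exponent == 0)
    /* Zero or subnormal: no implicit leading one.  */
    value = ldexpf ((float) mantissa, -24);
  else if (exponent == 0x1f)
    /* All-ones exponent encodes infinity or NaN.  */
    value = mantissa ? NAN : INFINITY;
  else
    /* Normal: (1 + mantissa / 1024) * 2^(exponent - 15).  */
    value = ldexpf ((float) (mantissa | 0x400), exponent - 25);

  return sign ? -value : value;
}
```

Here decode_binary16 (19520) evaluates to 17.0f: the exponent field is
19 and the mantissa field is 64, giving 16 * (1 + 64/1024).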

> Also aarch64_expand_mov_immediate doesn't allow HImode moves, only SI and DI.

It doesn't need to, because all HImode CONST_INTs are already legitimate.
You can just use emit_move_insn instead.
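
One way to picture why a 16-bit constant is cheap (an illustration
only, not how GCC decides legitimacy): a MOVZ carries a 16-bit
immediate, so any value that fits in 16 bits needs a single
instruction.  The sketch below counts the non-zero 16-bit chunks of a
32-bit value as a naive MOVZ/MOVK estimate.  movz_movk_count is a
hypothetical name, and the estimate deliberately ignores MOVN and
bitmask immediates, which can only lower the count:

```c
#include <stdint.h>

/* Naive estimate of the MOVZ/MOVK instructions needed to build VALUE
   in a w register: one per non-zero 16-bit chunk, and one for zero.
   Hypothetical helper; real compilers have cheaper alternatives.  */
static int
movz_movk_count (uint32_t value)
{
  int chunks = 0;

  for (int shift = 0; shift < 32; shift += 16)
    if ((value >> shift) & 0xffff)
      chunks++;

  /* Zero still takes one instruction to materialize.  */
  return chunks ? chunks : 1;
}
```

Under this estimate every value in the range 0..65535, including 19520,
costs one instruction, while something like 0x12345678 costs two.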

FWIW, the following seems to pass the same tests and avoids the subreg
dance.  Just a proof of concept, and I'm not attached to the new
iterator name.
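
As a rough host-side analogy of what the split does (plain C, not GCC
internals): take the floating-point constant's bit pattern as an
integer of the same width, move that integer, then view the result in
the float mode again.  float_bits and double_bits are hypothetical
names, and the sketch assumes the host uses IEEE 754 binary32/binary64
for float/double:

```c
#include <stdint.h>
#include <string.h>

/* Return the bit pattern of VALUE as a 32-bit integer.  memcpy avoids
   the aliasing problems of a pointer cast.  Assumes float is IEEE 754
   binary32 on the host.  */
static uint32_t
float_bits (float value)
{
  uint32_t bits;
  memcpy (&bits, &value, sizeof bits);
  return bits;
}

/* Likewise for a 64-bit double (assumed IEEE 754 binary64).  */
static uint64_t
double_bits (double value)
{
  uint64_t bits;
  memcpy (&bits, &value, sizeof bits);
  return bits;
}
```

Because the integer and the float have the same size here, no
narrowing step is involved, which is the property the same-width
integer temporary in the patch relies on.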

Thanks,
Richard


Index: gcc/gcc/config/aarch64/aarch64.md
===================================================================
--- gcc.orig/gcc/config/aarch64/aarch64.md
+++ gcc/gcc/config/aarch64/aarch64.md
@@ -1063,7 +1063,28 @@
   }
 )
 
-(define_insn_and_split "*movhf_aarch64"
+(define_split
+  [(set (match_operand:GPF_MOV_F16 0 "nonimmediate_operand")
+        (match_operand:GPF_MOV_F16 1 "immediate_operand"))]
+  "TARGET_FLOAT
+   && can_create_pseudo_p ()
+   && !aarch64_can_const_movi_rtx_p (operands[1], <MODE>mode)
+   && !aarch64_float_const_representable_p (operands[1])
+   && aarch64_float_const_rtx_p (operands[1])"
+  [(const_int 0)]
+  {
+    unsigned HOST_WIDE_INT ival;
+    if (!aarch64_reinterpret_float_as_int (operands[1], &ival))
+      FAIL;
+
+    rtx tmp = gen_reg_rtx (<FCVT_TARGET>mode);
+    emit_move_insn (tmp, gen_int_mode (ival, <FCVT_TARGET>mode));
+    emit_move_insn (operands[0], gen_lowpart (<MODE>mode, tmp));
+    DONE;
+  }
+)
+
+(define_insn "*movhf_aarch64"
   [(set (match_operand:HF 0 "nonimmediate_operand" "=w,w  ,?r,w,w  ,w  ,w,m,r,m ,r")
 	(match_operand:HF 1 "general_operand"      "Y ,?rY, w,w,Ufc,Uvi,m,w,m,rY,r"))]
   "TARGET_FLOAT && (register_operand (operands[0], HFmode)
@@ -1080,28 +1101,12 @@
    ldrh\\t%w0, %1
    strh\\t%w1, %0
    mov\\t%w0, %w1"
-  "&& can_create_pseudo_p ()
-   && !aarch64_can_const_movi_rtx_p (operands[1], HFmode)
-   && !aarch64_float_const_representable_p (operands[1])
-   &&  aarch64_float_const_rtx_p (operands[1])"
-  [(const_int 0)]
-  "{
-    unsigned HOST_WIDE_INT ival;
-    if (!aarch64_reinterpret_float_as_int (operands[1], &ival))
-      FAIL;
-
-    rtx tmp = gen_reg_rtx (SImode);
-    aarch64_expand_mov_immediate (tmp, GEN_INT (ival));
-    tmp = simplify_gen_subreg (HImode, tmp, SImode, 0);
-    emit_move_insn (operands[0], gen_lowpart (HFmode, tmp));
-    DONE;
-  }"
   [(set_attr "type" "neon_move,f_mcr,neon_to_gp,neon_move,fconsts, \
 		     neon_move,f_loads,f_stores,load1,store1,mov_reg")
    (set_attr "simd" "yes,*,yes,yes,*,yes,*,*,*,*,*")]
 )
 
-(define_insn_and_split "*movsf_aarch64"
+(define_insn "*movsf_aarch64"
   [(set (match_operand:SF 0 "nonimmediate_operand" "=w,w  ,?r,w,w  ,w  ,w,m,r,m ,r,r")
 	(match_operand:SF 1 "general_operand"      "Y ,?rY, w,w,Ufc,Uvi,m,w,m,rY,r,M"))]
   "TARGET_FLOAT && (register_operand (operands[0], SFmode)
@@ -1119,28 +1124,13 @@
    str\\t%w1, %0
    mov\\t%w0, %w1
    mov\\t%w0, %1"
-  "&& can_create_pseudo_p ()
-   && !aarch64_can_const_movi_rtx_p (operands[1], SFmode)
-   && !aarch64_float_const_representable_p (operands[1])
-   &&  aarch64_float_const_rtx_p (operands[1])"
-  [(const_int 0)]
-  "{
-    unsigned HOST_WIDE_INT ival;
-    if (!aarch64_reinterpret_float_as_int (operands[1], &ival))
-      FAIL;
-
-    rtx tmp = gen_reg_rtx (SImode);
-    aarch64_expand_mov_immediate (tmp, GEN_INT (ival));
-    emit_move_insn (operands[0], gen_lowpart (SFmode, tmp));
-    DONE;
-  }"
   [(set_attr "type" "neon_move,f_mcr,f_mrc,fmov,fconsts,neon_move,\
 		     f_loads,f_stores,load1,store1,mov_reg,\
 		     fconsts")
    (set_attr "simd" "yes,*,*,*,*,yes,*,*,*,*,*,*")]
 )
 
-(define_insn_and_split "*movdf_aarch64"
+(define_insn "*movdf_aarch64"
   [(set (match_operand:DF 0 "nonimmediate_operand" "=w, w  ,?r,w,w  ,w  ,w,m,r,m ,r,r")
 	(match_operand:DF 1 "general_operand"      "Y , ?rY, w,w,Ufc,Uvi,m,w,m,rY,r,N"))]
   "TARGET_FLOAT && (register_operand (operands[0], DFmode)
@@ -1158,21 +1148,6 @@
    str\\t%x1, %0
    mov\\t%x0, %x1
    mov\\t%x0, %1"
-  "&& can_create_pseudo_p ()
-   && !aarch64_can_const_movi_rtx_p (operands[1], DFmode)
-   && !aarch64_float_const_representable_p (operands[1])
-   &&  aarch64_float_const_rtx_p (operands[1])"
-  [(const_int 0)]
-  "{
-    unsigned HOST_WIDE_INT ival;
-    if (!aarch64_reinterpret_float_as_int (operands[1], &ival))
-      FAIL;
-
-    rtx tmp = gen_reg_rtx (DImode);
-    aarch64_expand_mov_immediate (tmp, GEN_INT (ival));
-    emit_move_insn (operands[0], gen_lowpart (DFmode, tmp));
-    DONE;
-  }"
   [(set_attr "type" "neon_move,f_mcr,f_mrc,fmov,fconstd,neon_move,\
 		     f_loadd,f_stored,load1,store1,mov_reg,\
 		     fconstd")
Index: gcc/gcc/config/aarch64/iterators.md
===================================================================
--- gcc.orig/gcc/config/aarch64/iterators.md
+++ gcc/gcc/config/aarch64/iterators.md
@@ -44,6 +44,10 @@
 ;; Iterator for all scalar floating point modes (HF, SF, DF)
 (define_mode_iterator GPF_F16 [(HF "AARCH64_ISA_F16") SF DF])
 
+;; Iterator for all scalar floating point modes (HF, SF, DF), without
+;; requiring AARCH64_ISA_F16 for HF.
+(define_mode_iterator GPF_MOV_F16 [HF SF DF])
+
 ;; Iterator for all scalar floating point modes (HF, SF, DF and TF)
 (define_mode_iterator GPF_TF_F16 [HF SF DF TF])