[PATCH AArch64/V3]Add new patterns for vcond_mask and vec

public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed

* [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp
@ 2016-08-01 13:19 Bin Cheng
  2016-08-08 10:40 ` James Greenhalgh
  2017-06-28  5:10 ` Andrew Pinski
  0 siblings, 2 replies; 8+ messages in thread
From: Bin Cheng @ 2016-08-01 13:19 UTC (permalink / raw)
  To: gcc-patches; +Cc: nd

[-- Attachment #1: Type: text/plain, Size: 507 bytes --]

Hi,
This is the 3rd version patch implementing vcond_mask and vec_cmp patterns on AArch64.
Bootstrap and test along with next patch on AArch64, is it OK?

Thanks,
bin

2016-07-28  Alan Lawrence  <alan.lawrence@arm.com>
	    Renlin Li  <renlin.li@arm.com>
	    Bin Cheng  <bin.cheng@arm.com>

	* config/aarch64/aarch64-simd.md (vec_cmp<mode><mode>): New pattern.
	(vec_cmp<mode><v_cmp_result>): New pattern.
	(vec_cmpu<mode><mode>): New pattern.
	(vcond_mask_<mode><v_cmp_result>): New pattern.

[-- Attachment #2: vcond_mask-vec_cmp-20160703.txt --]
[-- Type: text/plain, Size: 7926 bytes --]

diff --git a/gcc/config/aarch64/aarch64-simd.md b/gcc/config/aarch64/aarch64-simd.md
index a19d171..3918136 100644
--- a/gcc/config/aarch64/aarch64-simd.md
+++ b/gcc/config/aarch64/aarch64-simd.md
@@ -2271,6 +2271,277 @@
   DONE;
 })
 
+(define_expand "vcond_mask_<mode><v_cmp_result>"
+  [(match_operand:VALLDI 0 "register_operand")
+   (match_operand:VALLDI 1 "nonmemory_operand")
+   (match_operand:VALLDI 2 "nonmemory_operand")
+   (match_operand:<V_cmp_result> 3 "register_operand")]
+  "TARGET_SIMD"
+{
+  /* If we have (a = (P) ? -1 : 0);
+     Then we can simply move the generated mask (result must be int).  */
+  if (operands[1] == CONSTM1_RTX (<MODE>mode)
+      && operands[2] == CONST0_RTX (<MODE>mode))
+    emit_move_insn (operands[0], operands[3]);
+  /* Similarly, (a = (P) ? 0 : -1) is just inverting the generated mask.  */
+  else if (operands[1] == CONST0_RTX (<MODE>mode)
+	   && operands[2] == CONSTM1_RTX (<MODE>mode))
+    emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[3]));
+  else
+    {
+      if (!REG_P (operands[1]))
+	operands[1] = force_reg (<MODE>mode, operands[1]);
+      if (!REG_P (operands[2]))
+	operands[2] = force_reg (<MODE>mode, operands[2]);
+      emit_insn (gen_aarch64_simd_bsl<mode> (operands[0], operands[3],
+					     operands[1], operands[2]));
+    }
+
+  DONE;
+})
+
+;; Patterns comparing two vectors to produce a mask.
+
+(define_expand "vec_cmp<mode><mode>"
+  [(set (match_operand:VSDQ_I_DI 0 "register_operand")
+	  (match_operator 1 "comparison_operator"
+	    [(match_operand:VSDQ_I_DI 2 "register_operand")
+	     (match_operand:VSDQ_I_DI 3 "nonmemory_operand")]))]
+  "TARGET_SIMD"
+{
+  rtx mask = operands[0];
+  enum rtx_code code = GET_CODE (operands[1]);
+
+  switch (code)
+    {
+    case NE:
+    case LE:
+    case LT:
+    case GE:
+    case GT:
+    case EQ:
+      if (operands[3] == CONST0_RTX (<MODE>mode))
+	break;
+
+      /* Fall through.  */
+    default:
+      if (!REG_P (operands[3]))
+	operands[3] = force_reg (<MODE>mode, operands[3]);
+
+      break;
+    }
+
+  switch (code)
+    {
+    case LT:
+      emit_insn (gen_aarch64_cmlt<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case GE:
+      emit_insn (gen_aarch64_cmge<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case LE:
+      emit_insn (gen_aarch64_cmle<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case GT:
+      emit_insn (gen_aarch64_cmgt<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case LTU:
+      emit_insn (gen_aarch64_cmgtu<mode> (mask, operands[3], operands[2]));
+      break;
+
+    case GEU:
+      emit_insn (gen_aarch64_cmgeu<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case LEU:
+      emit_insn (gen_aarch64_cmgeu<mode> (mask, operands[3], operands[2]));
+      break;
+
+    case GTU:
+      emit_insn (gen_aarch64_cmgtu<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case NE:
+      /* Handle NE as !EQ.  */
+      emit_insn (gen_aarch64_cmeq<mode> (mask, operands[2], operands[3]));
+      emit_insn (gen_one_cmpl<v_cmp_result>2 (mask, mask));
+      break;
+
+    case EQ:
+      emit_insn (gen_aarch64_cmeq<mode> (mask, operands[2], operands[3]));
+      break;
+
+    default:
+      gcc_unreachable ();
+    }
+
+  DONE;
+})
+
+(define_expand "vec_cmp<mode><v_cmp_result>"
+  [(set (match_operand:<V_cmp_result> 0 "register_operand")
+	(match_operator 1 "comparison_operator"
+	    [(match_operand:VDQF 2 "register_operand")
+	     (match_operand:VDQF 3 "nonmemory_operand")]))]
+  "TARGET_SIMD"
+{
+  int use_zero_form = 0;
+  enum rtx_code code = GET_CODE (operands[1]);
+  rtx tmp = gen_reg_rtx (<V_cmp_result>mode);
+
+  rtx (*comparison) (rtx, rtx, rtx);
+
+  switch (code)
+    {
+    case LE:
+    case LT:
+    case GE:
+    case GT:
+    case EQ:
+      if (operands[3] == CONST0_RTX (<MODE>mode))
+	{
+	  use_zero_form = 1;
+	  break;
+	}
+      /* Fall through.  */
+    default:
+      if (!REG_P (operands[3]))
+	operands[3] = force_reg (<MODE>mode, operands[3]);
+
+      break;
+    }
+
+  switch (code)
+    {
+    case LT:
+      if (use_zero_form)
+	{
+	  comparison = gen_aarch64_cmlt<mode>;
+	  break;
+	}
+      /* Else, fall through.  */
+    case UNGE:
+      std::swap (operands[2], operands[3]);
+      /* Fall through.  */
+    case UNLE:
+    case GT:
+      comparison = gen_aarch64_cmgt<mode>;
+      break;
+    case LE:
+      if (use_zero_form)
+	{
+	  comparison = gen_aarch64_cmle<mode>;
+	  break;
+	}
+      /* Else, fall through.  */
+    case UNGT:
+      std::swap (operands[2], operands[3]);
+      /* Fall through.  */
+    case UNLT:
+    case GE:
+      comparison = gen_aarch64_cmge<mode>;
+      break;
+    case NE:
+    case EQ:
+      comparison = gen_aarch64_cmeq<mode>;
+      break;
+    case UNEQ:
+    case ORDERED:
+    case UNORDERED:
+      break;
+    default:
+      gcc_unreachable ();
+    }
+
+  switch (code)
+    {
+    case UNGE:
+    case UNGT:
+    case UNLE:
+    case UNLT:
+    case NE:
+      /* FCM returns false for lanes which are unordered, so if we use
+	 the inverse of the comparison we actually want to emit, then
+	 revert the result, we will end up with the correct result.
+	 Note that a NE NaN and NaN NE b are true for all a, b.
+
+	 Our transformations are:
+	 a UNGE b -> !(b GT a)
+	 a UNGT b -> !(b GE a)
+	 a UNLE b -> !(a GT b)
+	 a UNLT b -> !(a GE b)
+	 a   NE b -> !(a EQ b)  */
+      emit_insn (comparison (operands[0], operands[2], operands[3]));
+      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
+      break;
+
+    case LT:
+    case LE:
+    case GT:
+    case GE:
+    case EQ:
+      /* The easy case.  Here we emit one of FCMGE, FCMGT or FCMEQ.
+	 As a LT b <=> b GE a && a LE b <=> b GT a.  Our transformations are:
+	 a GE b -> a GE b
+	 a GT b -> a GT b
+	 a LE b -> b GE a
+	 a LT b -> b GT a
+	 a EQ b -> a EQ b
+	 Note that there also exist direct comparison against 0 forms,
+	 so catch those as a special case.  */
+      emit_insn (comparison (operands[0], operands[2], operands[3]));
+      break;
+
+    case UNEQ:
+      /* We first check (a > b ||  b > a) which is !UNEQ, inverting
+	 this result will then give us (a == b || a UNORDERED b).  */
+      emit_insn (gen_aarch64_cmgt<mode> (operands[0],
+					 operands[2], operands[3]));
+      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[3], operands[2]));
+      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
+      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
+      break;
+
+    case UNORDERED:
+      /* Operands are ORDERED iff (a > b || b >= a), so we can compute
+	 UNORDERED as !ORDERED.  */
+      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[2], operands[3]));
+      emit_insn (gen_aarch64_cmge<mode> (operands[0],
+					 operands[3], operands[2]));
+      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
+      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
+      break;
+
+    case ORDERED:
+      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[2], operands[3]));
+      emit_insn (gen_aarch64_cmge<mode> (operands[0],
+					 operands[3], operands[2]));
+      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
+      break;
+
+    default:
+      gcc_unreachable ();
+    }
+
+  DONE;
+})
+
+(define_expand "vec_cmpu<mode><mode>"
+  [(set (match_operand:VSDQ_I_DI 0 "register_operand")
+	  (match_operator 1 "comparison_operator"
+	    [(match_operand:VSDQ_I_DI 2 "register_operand")
+	     (match_operand:VSDQ_I_DI 3 "nonmemory_operand")]))]
+  "TARGET_SIMD"
+{
+  emit_insn (gen_vec_cmp<mode><mode> (operands[0], operands[1],
+				      operands[2], operands[3]));
+  DONE;
+})
+
 (define_expand "aarch64_vcond_internal<mode><mode>"
   [(set (match_operand:VSDQ_I_DI 0 "register_operand")
 	(if_then_else:VSDQ_I_DI

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp
  2016-08-01 13:19 [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp Bin Cheng
@ 2016-08-08 10:40 ` James Greenhalgh
  2016-08-08 11:10   ` Bin.Cheng
  2017-06-28  5:10 ` Andrew Pinski
  1 sibling, 1 reply; 8+ messages in thread
From: James Greenhalgh @ 2016-08-08 10:40 UTC (permalink / raw)
  To: Bin Cheng; +Cc: gcc-patches, nd

On Mon, Aug 01, 2016 at 01:18:53PM +0000, Bin Cheng wrote:
> Hi,
> This is the 3rd version patch implementing vcond_mask and vec_cmp patterns on
> AArch64.  Bootstrap and test along with next patch on AArch64, is it OK?

OK, with a couple of comments below, one on an extension and once style nit.

> 2016-07-28  Alan Lawrence  <alan.lawrence@arm.com>
> 	    Renlin Li  <renlin.li@arm.com>
> 	    Bin Cheng  <bin.cheng@arm.com>
> 
> 	* config/aarch64/aarch64-simd.md (vec_cmp<mode><mode>): New pattern.
> 	(vec_cmp<mode><v_cmp_result>): New pattern.
> 	(vec_cmpu<mode><mode>): New pattern.
> 	(vcond_mask_<mode><v_cmp_result>): New pattern.

> +(define_expand "vcond_mask_<mode><v_cmp_result>"
> +  [(match_operand:VALLDI 0 "register_operand")
> +   (match_operand:VALLDI 1 "nonmemory_operand")
> +   (match_operand:VALLDI 2 "nonmemory_operand")
> +   (match_operand:<V_cmp_result> 3 "register_operand")]
> +  "TARGET_SIMD"
> +{
> +  /* If we have (a = (P) ? -1 : 0);
> +     Then we can simply move the generated mask (result must be int).  */
> +  if (operands[1] == CONSTM1_RTX (<MODE>mode)
> +      && operands[2] == CONST0_RTX (<MODE>mode))
> +    emit_move_insn (operands[0], operands[3]);
> +  /* Similarly, (a = (P) ? 0 : -1) is just inverting the generated mask.  */
> +  else if (operands[1] == CONST0_RTX (<MODE>mode)
> +	   && operands[2] == CONSTM1_RTX (<MODE>mode))
> +    emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[3]));

Should we be also be catching these in a generic way before expanding?

<snip>

> +(define_expand "vec_cmp<mode><v_cmp_result>"
> +  [(set (match_operand:<V_cmp_result> 0 "register_operand")
> +	(match_operator 1 "comparison_operator"
> +	    [(match_operand:VDQF 2 "register_operand")
> +	     (match_operand:VDQF 3 "nonmemory_operand")]))]
> +  "TARGET_SIMD"
> +{
> +  int use_zero_form = 0;
> +  enum rtx_code code = GET_CODE (operands[1]);
> +  rtx tmp = gen_reg_rtx (<V_cmp_result>mode);
> +
> +  rtx (*comparison) (rtx, rtx, rtx);
> +
> +  switch (code)
> +    {
> +    case LE:
> +    case LT:
> +    case GE:
> +    case GT:
> +    case EQ:
> +      if (operands[3] == CONST0_RTX (<MODE>mode))
> +	{
> +	  use_zero_form = 1;
> +	  break;
> +	}
> +      /* Fall through.  */
> +    default:
> +      if (!REG_P (operands[3]))
> +	operands[3] = force_reg (<MODE>mode, operands[3]);
> +
> +      break;
> +    }
> +
> +  switch (code)
> +    {
> +    case LT:
> +      if (use_zero_form)
> +	{
> +	  comparison = gen_aarch64_cmlt<mode>;
> +	  break;
> +	}
> +      /* Else, fall through.  */
> +    case UNGE:
> +      std::swap (operands[2], operands[3]);
> +      /* Fall through.  */
> +    case UNLE:
> +    case GT:
> +      comparison = gen_aarch64_cmgt<mode>;
> +      break;
> +    case LE:
> +      if (use_zero_form)
> +	{
> +	  comparison = gen_aarch64_cmle<mode>;
> +	  break;
> +	}
> +      /* Else, fall through.  */
> +    case UNGT:
> +      std::swap (operands[2], operands[3]);
> +      /* Fall through.  */
> +    case UNLT:
> +    case GE:
> +      comparison = gen_aarch64_cmge<mode>;
> +      break;
> +    case NE:
> +    case EQ:
> +      comparison = gen_aarch64_cmeq<mode>;
> +      break;
> +    case UNEQ:
> +    case ORDERED:
> +    case UNORDERED:
> +      break;
> +    default:
> +      gcc_unreachable ();
> +    }
> +
> +  switch (code)
> +    {
> +    case UNGE:
> +    case UNGT:
> +    case UNLE:
> +    case UNLT:
> +    case NE:
> +      /* FCM returns false for lanes which are unordered, so if we use
> +	 the inverse of the comparison we actually want to emit, then
> +	 revert the result, we will end up with the correct result.

s/revert/invert/

> +	 Note that a NE NaN and NaN NE b are true for all a, b.
> +
> +	 Our transformations are:
> +	 a UNGE b -> !(b GT a)
> +	 a UNGT b -> !(b GE a)
> +	 a UNLE b -> !(a GT b)
> +	 a UNLT b -> !(a GE b)
> +	 a   NE b -> !(a EQ b)  */
> +      emit_insn (comparison (operands[0], operands[2], operands[3]));
> +      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
> +      break;
> +
> +    case LT:
> +    case LE:
> +    case GT:
> +    case GE:
> +    case EQ:
> +      /* The easy case.  Here we emit one of FCMGE, FCMGT or FCMEQ.
> +	 As a LT b <=> b GE a && a LE b <=> b GT a.  Our transformations are:
> +	 a GE b -> a GE b
> +	 a GT b -> a GT b
> +	 a LE b -> b GE a
> +	 a LT b -> b GT a
> +	 a EQ b -> a EQ b

> +	 Note that there also exist direct comparison against 0 forms,
> +	 so catch those as a special case.  */

This part of the comment is no longer true, there is no special casing here.

> +      emit_insn (comparison (operands[0], operands[2], operands[3]));
> +      break;
> +
> +    case UNEQ:
> +      /* We first check (a > b ||  b > a) which is !UNEQ, inverting
> +	 this result will then give us (a == b || a UNORDERED b).  */
> +      emit_insn (gen_aarch64_cmgt<mode> (operands[0],
> +					 operands[2], operands[3]));
> +      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[3], operands[2]));
> +      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
> +      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
> +      break;
> +
> +    case UNORDERED:
> +      /* Operands are ORDERED iff (a > b || b >= a), so we can compute
> +	 UNORDERED as !ORDERED.  */
> +      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[2], operands[3]));
> +      emit_insn (gen_aarch64_cmge<mode> (operands[0],
> +					 operands[3], operands[2]));
> +      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
> +      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
> +      break;
> +
> +    case ORDERED:
> +      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[2], operands[3]));
> +      emit_insn (gen_aarch64_cmge<mode> (operands[0],
> +					 operands[3], operands[2]));
> +      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
> +      break;
> +
> +    default:
> +      gcc_unreachable ();
> +    }
> +
> +  DONE;
> +})

Thanks,
James

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp
  2016-08-08 10:40 ` James Greenhalgh
@ 2016-08-08 11:10   ` Bin.Cheng
  2016-08-09 14:05     ` James Greenhalgh
  2016-08-11  7:08     ` Andreas Schwab
  0 siblings, 2 replies; 8+ messages in thread
From: Bin.Cheng @ 2016-08-08 11:10 UTC (permalink / raw)
  To: James Greenhalgh; +Cc: Bin Cheng, gcc-patches, nd

[-- Attachment #1: Type: text/plain, Size: 6924 bytes --]

On Mon, Aug 8, 2016 at 11:40 AM, James Greenhalgh
<james.greenhalgh@arm.com> wrote:
> On Mon, Aug 01, 2016 at 01:18:53PM +0000, Bin Cheng wrote:
>> Hi,
>> This is the 3rd version patch implementing vcond_mask and vec_cmp patterns on
>> AArch64.  Bootstrap and test along with next patch on AArch64, is it OK?
>
> OK, with a couple of comments below, one on an extension and once style nit.
>
>> 2016-07-28  Alan Lawrence  <alan.lawrence@arm.com>
>>           Renlin Li  <renlin.li@arm.com>
>>           Bin Cheng  <bin.cheng@arm.com>
>>
>>       * config/aarch64/aarch64-simd.md (vec_cmp<mode><mode>): New pattern.
>>       (vec_cmp<mode><v_cmp_result>): New pattern.
>>       (vec_cmpu<mode><mode>): New pattern.
>>       (vcond_mask_<mode><v_cmp_result>): New pattern.
>
>> +(define_expand "vcond_mask_<mode><v_cmp_result>"
>> +  [(match_operand:VALLDI 0 "register_operand")
>> +   (match_operand:VALLDI 1 "nonmemory_operand")
>> +   (match_operand:VALLDI 2 "nonmemory_operand")
>> +   (match_operand:<V_cmp_result> 3 "register_operand")]
>> +  "TARGET_SIMD"
>> +{
>> +  /* If we have (a = (P) ? -1 : 0);
>> +     Then we can simply move the generated mask (result must be int).  */
>> +  if (operands[1] == CONSTM1_RTX (<MODE>mode)
>> +      && operands[2] == CONST0_RTX (<MODE>mode))
>> +    emit_move_insn (operands[0], operands[3]);
>> +  /* Similarly, (a = (P) ? 0 : -1) is just inverting the generated mask.  */
>> +  else if (operands[1] == CONST0_RTX (<MODE>mode)
>> +        && operands[2] == CONSTM1_RTX (<MODE>mode))
>> +    emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[3]));
Hi,
Thanks for reviewing, here is updated patch.
>
> Should we be also be catching these in a generic way before expanding?
This pattern was copied unchanged from Alan's original patch.
Yes, standard match-and-simplify pattern may be added to check such
special simplifications.  Not sure what the current behavior is, we'd
better keep this at expanding too?

Comment issues fixed.  Is this version OK?

Thanks,
bin
>
> <snip>
>
>> +(define_expand "vec_cmp<mode><v_cmp_result>"
>> +  [(set (match_operand:<V_cmp_result> 0 "register_operand")
>> +     (match_operator 1 "comparison_operator"
>> +         [(match_operand:VDQF 2 "register_operand")
>> +          (match_operand:VDQF 3 "nonmemory_operand")]))]
>> +  "TARGET_SIMD"
>> +{
>> +  int use_zero_form = 0;
>> +  enum rtx_code code = GET_CODE (operands[1]);
>> +  rtx tmp = gen_reg_rtx (<V_cmp_result>mode);
>> +
>> +  rtx (*comparison) (rtx, rtx, rtx);
>> +
>> +  switch (code)
>> +    {
>> +    case LE:
>> +    case LT:
>> +    case GE:
>> +    case GT:
>> +    case EQ:
>> +      if (operands[3] == CONST0_RTX (<MODE>mode))
>> +     {
>> +       use_zero_form = 1;
>> +       break;
>> +     }
>> +      /* Fall through.  */
>> +    default:
>> +      if (!REG_P (operands[3]))
>> +     operands[3] = force_reg (<MODE>mode, operands[3]);
>> +
>> +      break;
>> +    }
>> +
>> +  switch (code)
>> +    {
>> +    case LT:
>> +      if (use_zero_form)
>> +     {
>> +       comparison = gen_aarch64_cmlt<mode>;
>> +       break;
>> +     }
>> +      /* Else, fall through.  */
>> +    case UNGE:
>> +      std::swap (operands[2], operands[3]);
>> +      /* Fall through.  */
>> +    case UNLE:
>> +    case GT:
>> +      comparison = gen_aarch64_cmgt<mode>;
>> +      break;
>> +    case LE:
>> +      if (use_zero_form)
>> +     {
>> +       comparison = gen_aarch64_cmle<mode>;
>> +       break;
>> +     }
>> +      /* Else, fall through.  */
>> +    case UNGT:
>> +      std::swap (operands[2], operands[3]);
>> +      /* Fall through.  */
>> +    case UNLT:
>> +    case GE:
>> +      comparison = gen_aarch64_cmge<mode>;
>> +      break;
>> +    case NE:
>> +    case EQ:
>> +      comparison = gen_aarch64_cmeq<mode>;
>> +      break;
>> +    case UNEQ:
>> +    case ORDERED:
>> +    case UNORDERED:
>> +      break;
>> +    default:
>> +      gcc_unreachable ();
>> +    }
>> +
>> +  switch (code)
>> +    {
>> +    case UNGE:
>> +    case UNGT:
>> +    case UNLE:
>> +    case UNLT:
>> +    case NE:
>> +      /* FCM returns false for lanes which are unordered, so if we use
>> +      the inverse of the comparison we actually want to emit, then
>> +      revert the result, we will end up with the correct result.
>
> s/revert/invert/
>
>> +      Note that a NE NaN and NaN NE b are true for all a, b.
>> +
>> +      Our transformations are:
>> +      a UNGE b -> !(b GT a)
>> +      a UNGT b -> !(b GE a)
>> +      a UNLE b -> !(a GT b)
>> +      a UNLT b -> !(a GE b)
>> +      a   NE b -> !(a EQ b)  */
>> +      emit_insn (comparison (operands[0], operands[2], operands[3]));
>> +      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
>> +      break;
>> +
>> +    case LT:
>> +    case LE:
>> +    case GT:
>> +    case GE:
>> +    case EQ:
>> +      /* The easy case.  Here we emit one of FCMGE, FCMGT or FCMEQ.
>> +      As a LT b <=> b GE a && a LE b <=> b GT a.  Our transformations are:
>> +      a GE b -> a GE b
>> +      a GT b -> a GT b
>> +      a LE b -> b GE a
>> +      a LT b -> b GT a
>> +      a EQ b -> a EQ b
>
>> +      Note that there also exist direct comparison against 0 forms,
>> +      so catch those as a special case.  */
>
> This part of the comment is no longer true, there is no special casing here.
>
>> +      emit_insn (comparison (operands[0], operands[2], operands[3]));
>> +      break;
>> +
>> +    case UNEQ:
>> +      /* We first check (a > b ||  b > a) which is !UNEQ, inverting
>> +      this result will then give us (a == b || a UNORDERED b).  */
>> +      emit_insn (gen_aarch64_cmgt<mode> (operands[0],
>> +                                      operands[2], operands[3]));
>> +      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[3], operands[2]));
>> +      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
>> +      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
>> +      break;
>> +
>> +    case UNORDERED:
>> +      /* Operands are ORDERED iff (a > b || b >= a), so we can compute
>> +      UNORDERED as !ORDERED.  */
>> +      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[2], operands[3]));
>> +      emit_insn (gen_aarch64_cmge<mode> (operands[0],
>> +                                      operands[3], operands[2]));
>> +      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
>> +      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
>> +      break;
>> +
>> +    case ORDERED:
>> +      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[2], operands[3]));
>> +      emit_insn (gen_aarch64_cmge<mode> (operands[0],
>> +                                      operands[3], operands[2]));
>> +      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
>> +      break;
>> +
>> +    default:
>> +      gcc_unreachable ();
>> +    }
>> +
>> +  DONE;
>> +})
>
> Thanks,
> James
>

[-- Attachment #2: vcond_mask-vec_cmp-20160808.txt --]
[-- Type: text/plain, Size: 7824 bytes --]

diff --git a/gcc/config/aarch64/aarch64-simd.md b/gcc/config/aarch64/aarch64-simd.md
index f2575a0..41cc60a 100644
--- a/gcc/config/aarch64/aarch64-simd.md
+++ b/gcc/config/aarch64/aarch64-simd.md
@@ -2291,6 +2291,275 @@
   DONE;
 })
 
+(define_expand "vcond_mask_<mode><v_cmp_result>"
+  [(match_operand:VALLDI 0 "register_operand")
+   (match_operand:VALLDI 1 "nonmemory_operand")
+   (match_operand:VALLDI 2 "nonmemory_operand")
+   (match_operand:<V_cmp_result> 3 "register_operand")]
+  "TARGET_SIMD"
+{
+  /* If we have (a = (P) ? -1 : 0);
+     Then we can simply move the generated mask (result must be int).  */
+  if (operands[1] == CONSTM1_RTX (<MODE>mode)
+      && operands[2] == CONST0_RTX (<MODE>mode))
+    emit_move_insn (operands[0], operands[3]);
+  /* Similarly, (a = (P) ? 0 : -1) is just inverting the generated mask.  */
+  else if (operands[1] == CONST0_RTX (<MODE>mode)
+	   && operands[2] == CONSTM1_RTX (<MODE>mode))
+    emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[3]));
+  else
+    {
+      if (!REG_P (operands[1]))
+	operands[1] = force_reg (<MODE>mode, operands[1]);
+      if (!REG_P (operands[2]))
+	operands[2] = force_reg (<MODE>mode, operands[2]);
+      emit_insn (gen_aarch64_simd_bsl<mode> (operands[0], operands[3],
+					     operands[1], operands[2]));
+    }
+
+  DONE;
+})
+
+;; Patterns comparing two vectors to produce a mask.
+
+(define_expand "vec_cmp<mode><mode>"
+  [(set (match_operand:VSDQ_I_DI 0 "register_operand")
+	  (match_operator 1 "comparison_operator"
+	    [(match_operand:VSDQ_I_DI 2 "register_operand")
+	     (match_operand:VSDQ_I_DI 3 "nonmemory_operand")]))]
+  "TARGET_SIMD"
+{
+  rtx mask = operands[0];
+  enum rtx_code code = GET_CODE (operands[1]);
+
+  switch (code)
+    {
+    case NE:
+    case LE:
+    case LT:
+    case GE:
+    case GT:
+    case EQ:
+      if (operands[3] == CONST0_RTX (<MODE>mode))
+	break;
+
+      /* Fall through.  */
+    default:
+      if (!REG_P (operands[3]))
+	operands[3] = force_reg (<MODE>mode, operands[3]);
+
+      break;
+    }
+
+  switch (code)
+    {
+    case LT:
+      emit_insn (gen_aarch64_cmlt<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case GE:
+      emit_insn (gen_aarch64_cmge<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case LE:
+      emit_insn (gen_aarch64_cmle<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case GT:
+      emit_insn (gen_aarch64_cmgt<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case LTU:
+      emit_insn (gen_aarch64_cmgtu<mode> (mask, operands[3], operands[2]));
+      break;
+
+    case GEU:
+      emit_insn (gen_aarch64_cmgeu<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case LEU:
+      emit_insn (gen_aarch64_cmgeu<mode> (mask, operands[3], operands[2]));
+      break;
+
+    case GTU:
+      emit_insn (gen_aarch64_cmgtu<mode> (mask, operands[2], operands[3]));
+      break;
+
+    case NE:
+      /* Handle NE as !EQ.  */
+      emit_insn (gen_aarch64_cmeq<mode> (mask, operands[2], operands[3]));
+      emit_insn (gen_one_cmpl<v_cmp_result>2 (mask, mask));
+      break;
+
+    case EQ:
+      emit_insn (gen_aarch64_cmeq<mode> (mask, operands[2], operands[3]));
+      break;
+
+    default:
+      gcc_unreachable ();
+    }
+
+  DONE;
+})
+
+(define_expand "vec_cmp<mode><v_cmp_result>"
+  [(set (match_operand:<V_cmp_result> 0 "register_operand")
+	(match_operator 1 "comparison_operator"
+	    [(match_operand:VDQF 2 "register_operand")
+	     (match_operand:VDQF 3 "nonmemory_operand")]))]
+  "TARGET_SIMD"
+{
+  int use_zero_form = 0;
+  enum rtx_code code = GET_CODE (operands[1]);
+  rtx tmp = gen_reg_rtx (<V_cmp_result>mode);
+
+  rtx (*comparison) (rtx, rtx, rtx);
+
+  switch (code)
+    {
+    case LE:
+    case LT:
+    case GE:
+    case GT:
+    case EQ:
+      if (operands[3] == CONST0_RTX (<MODE>mode))
+	{
+	  use_zero_form = 1;
+	  break;
+	}
+      /* Fall through.  */
+    default:
+      if (!REG_P (operands[3]))
+	operands[3] = force_reg (<MODE>mode, operands[3]);
+
+      break;
+    }
+
+  switch (code)
+    {
+    case LT:
+      if (use_zero_form)
+	{
+	  comparison = gen_aarch64_cmlt<mode>;
+	  break;
+	}
+      /* Else, fall through.  */
+    case UNGE:
+      std::swap (operands[2], operands[3]);
+      /* Fall through.  */
+    case UNLE:
+    case GT:
+      comparison = gen_aarch64_cmgt<mode>;
+      break;
+    case LE:
+      if (use_zero_form)
+	{
+	  comparison = gen_aarch64_cmle<mode>;
+	  break;
+	}
+      /* Else, fall through.  */
+    case UNGT:
+      std::swap (operands[2], operands[3]);
+      /* Fall through.  */
+    case UNLT:
+    case GE:
+      comparison = gen_aarch64_cmge<mode>;
+      break;
+    case NE:
+    case EQ:
+      comparison = gen_aarch64_cmeq<mode>;
+      break;
+    case UNEQ:
+    case ORDERED:
+    case UNORDERED:
+      break;
+    default:
+      gcc_unreachable ();
+    }
+
+  switch (code)
+    {
+    case UNGE:
+    case UNGT:
+    case UNLE:
+    case UNLT:
+    case NE:
+      /* FCM returns false for lanes which are unordered, so if we use
+	 the inverse of the comparison we actually want to emit, then
+	 invert the result, we will end up with the correct result.
+	 Note that a NE NaN and NaN NE b are true for all a, b.
+
+	 Our transformations are:
+	 a UNGE b -> !(b GT a)
+	 a UNGT b -> !(b GE a)
+	 a UNLE b -> !(a GT b)
+	 a UNLT b -> !(a GE b)
+	 a   NE b -> !(a EQ b)  */
+      emit_insn (comparison (operands[0], operands[2], operands[3]));
+      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
+      break;
+
+    case LT:
+    case LE:
+    case GT:
+    case GE:
+    case EQ:
+      /* The easy case.  Here we emit one of FCMGE, FCMGT or FCMEQ.
+	 As a LT b <=> b GE a && a LE b <=> b GT a.  Our transformations are:
+	 a GE b -> a GE b
+	 a GT b -> a GT b
+	 a LE b -> b GE a
+	 a LT b -> b GT a
+	 a EQ b -> a EQ b  */
+      emit_insn (comparison (operands[0], operands[2], operands[3]));
+      break;
+
+    case UNEQ:
+      /* We first check (a > b ||  b > a) which is !UNEQ, inverting
+	 this result will then give us (a == b || a UNORDERED b).  */
+      emit_insn (gen_aarch64_cmgt<mode> (operands[0],
+					 operands[2], operands[3]));
+      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[3], operands[2]));
+      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
+      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
+      break;
+
+    case UNORDERED:
+      /* Operands are ORDERED iff (a > b || b >= a), so we can compute
+	 UNORDERED as !ORDERED.  */
+      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[2], operands[3]));
+      emit_insn (gen_aarch64_cmge<mode> (operands[0],
+					 operands[3], operands[2]));
+      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
+      emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[0]));
+      break;
+
+    case ORDERED:
+      emit_insn (gen_aarch64_cmgt<mode> (tmp, operands[2], operands[3]));
+      emit_insn (gen_aarch64_cmge<mode> (operands[0],
+					 operands[3], operands[2]));
+      emit_insn (gen_ior<v_cmp_result>3 (operands[0], operands[0], tmp));
+      break;
+
+    default:
+      gcc_unreachable ();
+    }
+
+  DONE;
+})
+
+(define_expand "vec_cmpu<mode><mode>"
+  [(set (match_operand:VSDQ_I_DI 0 "register_operand")
+	  (match_operator 1 "comparison_operator"
+	    [(match_operand:VSDQ_I_DI 2 "register_operand")
+	     (match_operand:VSDQ_I_DI 3 "nonmemory_operand")]))]
+  "TARGET_SIMD"
+{
+  emit_insn (gen_vec_cmp<mode><mode> (operands[0], operands[1],
+				      operands[2], operands[3]));
+  DONE;
+})
+
 (define_expand "aarch64_vcond_internal<mode><mode>"
   [(set (match_operand:VSDQ_I_DI 0 "register_operand")
 	(if_then_else:VSDQ_I_DI

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp
  2016-08-08 11:10   ` Bin.Cheng
@ 2016-08-09 14:05     ` James Greenhalgh
  2016-08-11  7:08     ` Andreas Schwab
  1 sibling, 0 replies; 8+ messages in thread
From: James Greenhalgh @ 2016-08-09 14:05 UTC (permalink / raw)
  To: Bin.Cheng; +Cc: Bin Cheng, gcc-patches, nd

On Mon, Aug 08, 2016 at 12:10:00PM +0100, Bin.Cheng wrote:
> On Mon, Aug 8, 2016 at 11:40 AM, James Greenhalgh
> <james.greenhalgh@arm.com> wrote:
> > On Mon, Aug 01, 2016 at 01:18:53PM +0000, Bin Cheng wrote:
> >> Hi,
> >> This is the 3rd version patch implementing vcond_mask and vec_cmp patterns on
> >> AArch64.  Bootstrap and test along with next patch on AArch64, is it OK?
> >
> > OK, with a couple of comments below, one on an extension and once style nit.
> >
> >> 2016-07-28  Alan Lawrence  <alan.lawrence@arm.com>
> >>           Renlin Li  <renlin.li@arm.com>
> >>           Bin Cheng  <bin.cheng@arm.com>
> >>
> >>       * config/aarch64/aarch64-simd.md (vec_cmp<mode><mode>): New pattern.
> >>       (vec_cmp<mode><v_cmp_result>): New pattern.
> >>       (vec_cmpu<mode><mode>): New pattern.
> >>       (vcond_mask_<mode><v_cmp_result>): New pattern.
> >
> >> +(define_expand "vcond_mask_<mode><v_cmp_result>"
> >> +  [(match_operand:VALLDI 0 "register_operand")
> >> +   (match_operand:VALLDI 1 "nonmemory_operand")
> >> +   (match_operand:VALLDI 2 "nonmemory_operand")
> >> +   (match_operand:<V_cmp_result> 3 "register_operand")]
> >> +  "TARGET_SIMD"
> >> +{
> >> +  /* If we have (a = (P) ? -1 : 0);
> >> +     Then we can simply move the generated mask (result must be int).  */
> >> +  if (operands[1] == CONSTM1_RTX (<MODE>mode)
> >> +      && operands[2] == CONST0_RTX (<MODE>mode))
> >> +    emit_move_insn (operands[0], operands[3]);
> >> +  /* Similarly, (a = (P) ? 0 : -1) is just inverting the generated mask.  */
> >> +  else if (operands[1] == CONST0_RTX (<MODE>mode)
> >> +        && operands[2] == CONSTM1_RTX (<MODE>mode))
> >> +    emit_insn (gen_one_cmpl<v_cmp_result>2 (operands[0], operands[3]));
> Hi,
> Thanks for reviewing, here is updated patch.
> >
> > Should we be also be catching these in a generic way before expanding?
> This pattern was copied unchanged from Alan's original patch.
> Yes, standard match-and-simplify pattern may be added to check such
> special simplifications.  Not sure what the current behavior is, we'd
> better keep this at expanding too?

Yes, that's what I'd expect.

> Comment issues fixed.  Is this version OK?

Yes this is OK.

Thanks,
James

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp
  2016-08-08 11:10   ` Bin.Cheng
  2016-08-09 14:05     ` James Greenhalgh
@ 2016-08-11  7:08     ` Andreas Schwab
  2016-08-11  8:37       ` Bin.Cheng
  1 sibling, 1 reply; 8+ messages in thread
From: Andreas Schwab @ 2016-08-11  7:08 UTC (permalink / raw)
  To: Bin.Cheng; +Cc: James Greenhalgh, Bin Cheng, gcc-patches, nd

../../gcc/config/aarch64/aarch64-simd.md: In function 'rtx_def* gen_vec_cmpv2sfv2si(rtx, rtx, rtx, rtx)':
../../gcc/config/aarch64/aarch64-simd.md:2498:17: error: 'comparison' may be used uninitialized in this function [-Werror=maybe-uninitialized]
       emit_insn (comparison (operands[0], operands[2], operands[3]));
       ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
../../gcc/config/aarch64/aarch64-simd.md:2416:9: note: 'comparison' was declared here
   rtx (*comparison) (rtx, rtx, rtx);
         ^~~~~~~~~~
../../gcc/config/aarch64/aarch64-simd.md: In function 'rtx_def* gen_vec_cmpv4sfv4si(rtx, rtx, rtx, rtx)':
../../gcc/config/aarch64/aarch64-simd.md:2498:17: error: 'comparison' may be used uninitialized in this function [-Werror=maybe-uninitialized]
       emit_insn (comparison (operands[0], operands[2], operands[3]));
       ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
../../gcc/config/aarch64/aarch64-simd.md:2416:9: note: 'comparison' was declared here
   rtx (*comparison) (rtx, rtx, rtx);
         ^~~~~~~~~~
../../gcc/config/aarch64/aarch64-simd.md: In function 'rtx_def* gen_vec_cmpv2dfv2di(rtx, rtx, rtx, rtx)':
../../gcc/config/aarch64/aarch64-simd.md:2498:17: error: 'comparison' may be used uninitialized in this function [-Werror=maybe-uninitialized]
       emit_insn (comparison (operands[0], operands[2], operands[3]));
       ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
../../gcc/config/aarch64/aarch64-simd.md:2416:9: note: 'comparison' was declared here
   rtx (*comparison) (rtx, rtx, rtx);
         ^~~~~~~~~~

Andreas.

-- 
Andreas Schwab, SUSE Labs, schwab@suse.de
GPG Key fingerprint = 0196 BAD8 1CE9 1970 F4BE  1748 E4D4 88E3 0EEA B9D7
"And now for something completely different."

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp
  2016-08-11  7:08     ` Andreas Schwab
@ 2016-08-11  8:37       ` Bin.Cheng
  2016-08-11  8:43         ` Bin.Cheng
  0 siblings, 1 reply; 8+ messages in thread
From: Bin.Cheng @ 2016-08-11  8:37 UTC (permalink / raw)
  To: Andreas Schwab; +Cc: James Greenhalgh, Bin Cheng, gcc-patches, nd

On Thu, Aug 11, 2016 at 8:08 AM, Andreas Schwab <schwab@suse.de> wrote:
> ../../gcc/config/aarch64/aarch64-simd.md: In function 'rtx_def* gen_vec_cmpv2sfv2si(rtx, rtx, rtx, rtx)':
> ../../gcc/config/aarch64/aarch64-simd.md:2498:17: error: 'comparison' may be used uninitialized in this function [-Werror=maybe-uninitialized]
>        emit_insn (comparison (operands[0], operands[2], operands[3]));
>        ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> ../../gcc/config/aarch64/aarch64-simd.md:2416:9: note: 'comparison' was declared here
>    rtx (*comparison) (rtx, rtx, rtx);
>          ^~~~~~~~~~
> ../../gcc/config/aarch64/aarch64-simd.md: In function 'rtx_def* gen_vec_cmpv4sfv4si(rtx, rtx, rtx, rtx)':
> ../../gcc/config/aarch64/aarch64-simd.md:2498:17: error: 'comparison' may be used uninitialized in this function [-Werror=maybe-uninitialized]
>        emit_insn (comparison (operands[0], operands[2], operands[3]));
>        ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> ../../gcc/config/aarch64/aarch64-simd.md:2416:9: note: 'comparison' was declared here
>    rtx (*comparison) (rtx, rtx, rtx);
>          ^~~~~~~~~~
> ../../gcc/config/aarch64/aarch64-simd.md: In function 'rtx_def* gen_vec_cmpv2dfv2di(rtx, rtx, rtx, rtx)':
> ../../gcc/config/aarch64/aarch64-simd.md:2498:17: error: 'comparison' may be used uninitialized in this function [-Werror=maybe-uninitialized]
>        emit_insn (comparison (operands[0], operands[2], operands[3]));
>        ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> ../../gcc/config/aarch64/aarch64-simd.md:2416:9: note: 'comparison' was declared here
>    rtx (*comparison) (rtx, rtx, rtx);
>          ^~~~~~~~~~

Hi Andreas,
Thanks very much for reporting this.

I also got below error messages:
../../gcc/gcc/config/aarch64/aarch64-simd.md: In function ‘rtx_def*
gen_vcondv8qiv8qi(rtx, rtx, rtx, rtx, rtx, rtx)’:
../../gcc/gcc/config/aarch64/aarch64-simd.md:2574:17: error: unused
variable ‘code’ [-Werror=unused-variable]
   enum rtx_code code = GET_CODE (operands[3]);
                 ^~~~
../../gcc/gcc/config/aarch64/aarch64-simd.md: In function ‘rtx_def*
gen_vcondv16qiv16qi(rtx, rtx, rtx, rtx, rtx, rtx)’:
../../gcc/gcc/config/aarch64/aarch64-simd.md:2574:17: error: unused
variable ‘code’ [-Werror=unused-variable]
   enum rtx_code code = GET_CODE (operands[3]);
                 ^~~~
../../gcc/gcc/config/aarch64/aarch64-simd.md: In function ‘rtx_def*
gen_vcondv4hiv4hi(rtx, rtx, rtx, rtx, rtx, rtx)’:
../../gcc/gcc/config/aarch64/aarch64-simd.md:2574:17: error: unused
variable ‘code’ [-Werror=unused-variable]
   enum rtx_code code = GET_CODE (operands[3]);
                 ^~~~
My bad forgetting to remove this when separating the patch set, the
following patch
@https://gcc.gnu.org/ml/gcc-patches/2016-08/msg00842.html will fix
this.

For the reported error message, I think it's a bug in
maybe-unintialized warning, because all uses of "comparison" are
initialized in the previous switch statement.  I will initialize it
explicitly, also file a PR for recording.

Thanks,
bin
>
> Andreas.
>
> --
> Andreas Schwab, SUSE Labs, schwab@suse.de
> GPG Key fingerprint = 0196 BAD8 1CE9 1970 F4BE  1748 E4D4 88E3 0EEA B9D7
> "And now for something completely different."

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp
  2016-08-11  8:37       ` Bin.Cheng
@ 2016-08-11  8:43         ` Bin.Cheng
  0 siblings, 0 replies; 8+ messages in thread
From: Bin.Cheng @ 2016-08-11  8:43 UTC (permalink / raw)
  To: Andreas Schwab; +Cc: James Greenhalgh, Bin Cheng, gcc-patches, nd

On Thu, Aug 11, 2016 at 9:36 AM, Bin.Cheng <amker.cheng@gmail.com> wrote:
> On Thu, Aug 11, 2016 at 8:08 AM, Andreas Schwab <schwab@suse.de> wrote:
>> ../../gcc/config/aarch64/aarch64-simd.md: In function 'rtx_def* gen_vec_cmpv2sfv2si(rtx, rtx, rtx, rtx)':
>> ../../gcc/config/aarch64/aarch64-simd.md:2498:17: error: 'comparison' may be used uninitialized in this function [-Werror=maybe-uninitialized]
>>        emit_insn (comparison (operands[0], operands[2], operands[3]));
>>        ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>> ../../gcc/config/aarch64/aarch64-simd.md:2416:9: note: 'comparison' was declared here
>>    rtx (*comparison) (rtx, rtx, rtx);
>>          ^~~~~~~~~~
>> ../../gcc/config/aarch64/aarch64-simd.md: In function 'rtx_def* gen_vec_cmpv4sfv4si(rtx, rtx, rtx, rtx)':
>> ../../gcc/config/aarch64/aarch64-simd.md:2498:17: error: 'comparison' may be used uninitialized in this function [-Werror=maybe-uninitialized]
>>        emit_insn (comparison (operands[0], operands[2], operands[3]));
>>        ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>> ../../gcc/config/aarch64/aarch64-simd.md:2416:9: note: 'comparison' was declared here
>>    rtx (*comparison) (rtx, rtx, rtx);
>>          ^~~~~~~~~~
>> ../../gcc/config/aarch64/aarch64-simd.md: In function 'rtx_def* gen_vec_cmpv2dfv2di(rtx, rtx, rtx, rtx)':
>> ../../gcc/config/aarch64/aarch64-simd.md:2498:17: error: 'comparison' may be used uninitialized in this function [-Werror=maybe-uninitialized]
>>        emit_insn (comparison (operands[0], operands[2], operands[3]));
>>        ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>> ../../gcc/config/aarch64/aarch64-simd.md:2416:9: note: 'comparison' was declared here
>>    rtx (*comparison) (rtx, rtx, rtx);
>>          ^~~~~~~~~~
>
> Hi Andreas,
> Thanks very much for reporting this.
>
> I also got below error messages:
> ../../gcc/gcc/config/aarch64/aarch64-simd.md: In function ‘rtx_def*
> gen_vcondv8qiv8qi(rtx, rtx, rtx, rtx, rtx, rtx)’:
> ../../gcc/gcc/config/aarch64/aarch64-simd.md:2574:17: error: unused
> variable ‘code’ [-Werror=unused-variable]
>    enum rtx_code code = GET_CODE (operands[3]);
>                  ^~~~
> ../../gcc/gcc/config/aarch64/aarch64-simd.md: In function ‘rtx_def*
> gen_vcondv16qiv16qi(rtx, rtx, rtx, rtx, rtx, rtx)’:
> ../../gcc/gcc/config/aarch64/aarch64-simd.md:2574:17: error: unused
> variable ‘code’ [-Werror=unused-variable]
>    enum rtx_code code = GET_CODE (operands[3]);
>                  ^~~~
> ../../gcc/gcc/config/aarch64/aarch64-simd.md: In function ‘rtx_def*
> gen_vcondv4hiv4hi(rtx, rtx, rtx, rtx, rtx, rtx)’:
> ../../gcc/gcc/config/aarch64/aarch64-simd.md:2574:17: error: unused
> variable ‘code’ [-Werror=unused-variable]
>    enum rtx_code code = GET_CODE (operands[3]);
>                  ^~~~
> My bad forgetting to remove this when separating the patch set, the
> following patch
> @https://gcc.gnu.org/ml/gcc-patches/2016-08/msg00842.html will fix
> this.
>
> For the reported error message, I think it's a bug in
> maybe-unintialized warning, because all uses of "comparison" are
> initialized in the previous switch statement.  I will initialize it
> explicitly, also file a PR for recording.
PR73550 filed.

Thanks,
bin

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp
  2016-08-01 13:19 [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp Bin Cheng
  2016-08-08 10:40 ` James Greenhalgh
@ 2017-06-28  5:10 ` Andrew Pinski
  1 sibling, 0 replies; 8+ messages in thread
From: Andrew Pinski @ 2017-06-28  5:10 UTC (permalink / raw)
  To: Bin Cheng; +Cc: gcc-patches, nd

On Mon, Aug 1, 2016 at 6:18 AM, Bin Cheng <Bin.Cheng@arm.com> wrote:
> Hi,
> This is the 3rd version patch implementing vcond_mask and vec_cmp patterns on AArch64.
> Bootstrap and test along with next patch on AArch64, is it OK?
>
> Thanks,
> bin
>
> 2016-07-28  Alan Lawrence  <alan.lawrence@arm.com>
>             Renlin Li  <renlin.li@arm.com>
>             Bin Cheng  <bin.cheng@arm.com>
>
>         * config/aarch64/aarch64-simd.md (vec_cmp<mode><mode>): New pattern.
>         (vec_cmp<mode><v_cmp_result>): New pattern.
>         (vec_cmpu<mode><mode>): New pattern.
>         (vcond_mask_<mode><v_cmp_result>): New pattern.

LTGT support is missing and can be generated via __builtin_islessgreater .
See PR 81228.

Thanks,
Andrew Pinski

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2017-06-28  5:10 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-08-01 13:19 [PATCH AArch64/V3]Add new patterns for vcond_mask and vec_cmp Bin Cheng
2016-08-08 10:40 ` James Greenhalgh
2016-08-08 11:10   ` Bin.Cheng
2016-08-09 14:05     ` James Greenhalgh
2016-08-11  7:08     ` Andreas Schwab
2016-08-11  8:37       ` Bin.Cheng
2016-08-11  8:43         ` Bin.Cheng
2017-06-28  5:10 ` Andrew Pinski

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).