public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
* GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'
@ 2024-01-31 10:36 Thomas Schwinge
  2024-01-31 11:31 ` Andrew Stubbs
  0 siblings, 1 reply; 4+ messages in thread
From: Thomas Schwinge @ 2024-01-31 10:36 UTC (permalink / raw)
  To: Andrew Stubbs, gcc-patches

[-- Attachment #1: Type: text/plain, Size: 626 bytes --]

Hi!

OK to push "GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'",
see attached?

In pre-RDNA 3 ISA manuals, there are notes for 'DS_CMPST_[...]', like:

    Caution, the order of src and cmp are the *opposite* of the BUFFER_ATOMIC_CMPSWAP opcode.

..., and conversely in the RDNA 3 ISA manual, for 'DS_CMPSTORE_[...]':

    In this architecture the order of src and cmp agree with the BUFFER_ATOMIC_CMPSWAP opcode.

Is my understanding correct, that this isn't something we have to worry
about at the GCC machine description level; that's resolved at the
assembler level?


Grüße
 Thomas



[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #2: 0001-GCN-RDNA-3-Adjust-sync_compare_and_swap-mode-_lds_in.patch --]
[-- Type: text/x-diff, Size: 1302 bytes --]

From df6e031bf4b46d9e5b2de117fecd66b8b9b6dd20 Mon Sep 17 00:00:00 2001
From: Thomas Schwinge <tschwinge@baylibre.com>
Date: Wed, 31 Jan 2024 10:19:00 +0100
Subject: [PATCH] GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'

For OpenACC/GCN '-march=gfx1100', a lot of test cases FAIL:

    /tmp/ccGfLJ8a.mkoffload.2.s:406:2: error: instruction not supported on this GPU
            ds_cmpst_rtn_b32 v0, v0, v4, v3
            ^

Apparently, in RDNA 3, 'ds_cmpst_[...]' has been replaced by
'ds_cmpstore_[...]'.

	gcc/
	* config/gcn/gcn.md (sync_compare_and_swap<mode>_lds_insn)
	[TARGET_RDNA3]: Adjust.
---
 gcc/config/gcn/gcn.md | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/gcc/config/gcn/gcn.md b/gcc/config/gcn/gcn.md
index 8abaef3bbdec..bbb75704140b 100644
--- a/gcc/config/gcn/gcn.md
+++ b/gcc/config/gcn/gcn.md
@@ -2095,7 +2095,12 @@
 	   (match_operand:SIDI 3 "register_operand" "  v")]
 	  UNSPECV_ATOMIC))]
   ""
-  "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)"
+  {
+    if (TARGET_RDNA3)
+      return "ds_cmpstore_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)";
+    else
+      return "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)";
+  }
   [(set_attr "type" "ds")
    (set_attr "length" "12")])
 
-- 
2.43.0


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'
  2024-01-31 10:36 GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn' Thomas Schwinge
@ 2024-01-31 11:31 ` Andrew Stubbs
  2024-02-01 11:36   ` Thomas Schwinge
  0 siblings, 1 reply; 4+ messages in thread
From: Andrew Stubbs @ 2024-01-31 11:31 UTC (permalink / raw)
  To: Thomas Schwinge, gcc-patches

On 31/01/2024 10:36, Thomas Schwinge wrote:
> Hi!
> 
> OK to push "GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'",
> see attached?
> 
> In pre-RDNA 3 ISA manuals, there are notes for 'DS_CMPST_[...]', like:
> 
>      Caution, the order of src and cmp are the *opposite* of the BUFFER_ATOMIC_CMPSWAP opcode.
> 
> ..., and conversely in the RDNA 3 ISA manual, for 'DS_CMPSTORE_[...]':
> 
>      In this architecture the order of src and cmp agree with the BUFFER_ATOMIC_CMPSWAP opcode.
> 
> Is my understanding correct, that this isn't something we have to worry
> about at the GCC machine description level; that's resolved at the
> assembler level?

Right, the IR uses GCC's operand order and has nothing to do with the 
assembler syntax; the output template does the mapping.

> --- a/gcc/config/gcn/gcn.md
> +++ b/gcc/config/gcn/gcn.md
> @@ -2095,7 +2095,12 @@
>  	   (match_operand:SIDI 3 "register_operand" "  v")]
>  	  UNSPECV_ATOMIC))]
>    ""
> -  "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)"
> +  {
> +    if (TARGET_RDNA3)
> +      return "ds_cmpstore_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)";
> +    else
> +      return "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)";
> +  }
>    [(set_attr "type" "ds")
>     (set_attr "length" "12")])

I think you need to swap %2 and %3 in the new format. ds_cmpst matches 
GCC operand order, but ds_cmpstore has "cmp" and "src" reversed.

Andrew

^ permalink raw reply	[flat|nested] 4+ messages in thread

* GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'
  2024-01-31 11:31 ` Andrew Stubbs
@ 2024-02-01 11:36   ` Thomas Schwinge
  2024-02-01 11:40     ` Andrew Stubbs
  0 siblings, 1 reply; 4+ messages in thread
From: Thomas Schwinge @ 2024-02-01 11:36 UTC (permalink / raw)
  To: Andrew Stubbs, gcc-patches

[-- Attachment #1: Type: text/plain, Size: 1963 bytes --]

Hi!

On 2024-01-31T11:31:00+0000, Andrew Stubbs <ams@baylibre.com> wrote:
> On 31/01/2024 10:36, Thomas Schwinge wrote:
>> OK to push "GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'",
>> see attached?
>> 
>> In pre-RDNA 3 ISA manuals, there are notes for 'DS_CMPST_[...]', like:
>> 
>>      Caution, the order of src and cmp are the *opposite* of the BUFFER_ATOMIC_CMPSWAP opcode.
>> 
>> ..., and conversely in the RDNA 3 ISA manual, for 'DS_CMPSTORE_[...]':
>> 
>>      In this architecture the order of src and cmp agree with the BUFFER_ATOMIC_CMPSWAP opcode.
>> 
>> Is my understanding correct, that this isn't something we have to worry
>> about at the GCC machine description level; that's resolved at the
>> assembler level?
>
> Right, the IR uses GCC's operand order and has nothing to do with the 
> assembler syntax; the output template does the mapping.
>
>> --- a/gcc/config/gcn/gcn.md
>> +++ b/gcc/config/gcn/gcn.md
>> @@ -2095,7 +2095,12 @@
>>  	   (match_operand:SIDI 3 "register_operand" "  v")]
>>  	  UNSPECV_ATOMIC))]
>>    ""
>> -  "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)"
>> +  {
>> +    if (TARGET_RDNA3)
>> +      return "ds_cmpstore_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)";
>> +    else
>> +      return "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)";
>> +  }
>>    [(set_attr "type" "ds")
>>     (set_attr "length" "12")])
>
> I think you need to swap %2 and %3 in the new format. ds_cmpst matches 
> GCC operand order, but ds_cmpstore has "cmp" and "src" reversed.

OK, thanks.  That was my actual question -- so, we do need to swap, and
indeed, most of the affected libgomp OpenACC test cases then PASS their
execution test.  With that changed, I've pushed to master branch
commit 6c2a40f4f4577f5d0f7bd1cfda48a5701b75744c
"GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'", see
attached.


Grüße
 Thomas



[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #2: 0001-GCN-RDNA-3-Adjust-sync_compare_and_swap-mode-_lds_in.patch --]
[-- Type: text/x-diff, Size: 1792 bytes --]

From 6c2a40f4f4577f5d0f7bd1cfda48a5701b75744c Mon Sep 17 00:00:00 2001
From: Thomas Schwinge <tschwinge@baylibre.com>
Date: Wed, 31 Jan 2024 10:19:00 +0100
Subject: [PATCH] GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'

For OpenACC/GCN '-march=gfx1100', a lot of libgomp OpenACC test cases FAIL:

    /tmp/ccGfLJ8a.mkoffload.2.s:406:2: error: instruction not supported on this GPU
            ds_cmpst_rtn_b32 v0, v0, v4, v3
            ^

In RDNA 3, 'ds_cmpst_[...]' has been replaced by 'ds_cmpstore_[...]', and the
notes for 'ds_cmpst_[...]' in pre-RDNA 3 ISA manuals:

    Caution, the order of src and cmp are the *opposite* of the BUFFER_ATOMIC_CMPSWAP opcode.

..., have been resolved for 'ds_cmpstore_[...]' in the RDNA 3 ISA manual:

    In this architecture the order of src and cmp agree with the BUFFER_ATOMIC_CMPSWAP opcode.

..., and therefore '%2', '%3' now swapped with regards to GCC operand order.
Most of the affected libgomp OpenACC test cases then PASS their execution test.

	gcc/
	* config/gcn/gcn.md (sync_compare_and_swap<mode>_lds_insn)
	[TARGET_RDNA3]: Adjust.
---
 gcc/config/gcn/gcn.md | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/gcc/config/gcn/gcn.md b/gcc/config/gcn/gcn.md
index 1f3c692b7a67..925e2cea4895 100644
--- a/gcc/config/gcn/gcn.md
+++ b/gcc/config/gcn/gcn.md
@@ -2074,7 +2074,12 @@
 	   (match_operand:SIDI 3 "register_operand" "  v")]
 	  UNSPECV_ATOMIC))]
   ""
-  "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)"
+  {
+    if (TARGET_RDNA3)
+      return "ds_cmpstore_rtn_b<bitsize> %0, %1, %3, %2\;s_waitcnt\tlgkmcnt(0)";
+    else
+      return "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)";
+  }
   [(set_attr "type" "ds")
    (set_attr "length" "12")])
 
-- 
2.43.0


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'
  2024-02-01 11:36   ` Thomas Schwinge
@ 2024-02-01 11:40     ` Andrew Stubbs
  0 siblings, 0 replies; 4+ messages in thread
From: Andrew Stubbs @ 2024-02-01 11:40 UTC (permalink / raw)
  To: Thomas Schwinge, gcc-patches

On 01/02/2024 11:36, Thomas Schwinge wrote:
> Hi!
> 
> On 2024-01-31T11:31:00+0000, Andrew Stubbs <ams@baylibre.com> wrote:
>> On 31/01/2024 10:36, Thomas Schwinge wrote:
>>> OK to push "GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'",
>>> see attached?
>>>
>>> In pre-RDNA 3 ISA manuals, there are notes for 'DS_CMPST_[...]', like:
>>>
>>>       Caution, the order of src and cmp are the *opposite* of the BUFFER_ATOMIC_CMPSWAP opcode.
>>>
>>> ..., and conversely in the RDNA 3 ISA manual, for 'DS_CMPSTORE_[...]':
>>>
>>>       In this architecture the order of src and cmp agree with the BUFFER_ATOMIC_CMPSWAP opcode.
>>>
>>> Is my understanding correct, that this isn't something we have to worry
>>> about at the GCC machine description level; that's resolved at the
>>> assembler level?
>>
>> Right, the IR uses GCC's operand order and has nothing to do with the
>> assembler syntax; the output template does the mapping.
>>
>>> --- a/gcc/config/gcn/gcn.md
>>> +++ b/gcc/config/gcn/gcn.md
>>> @@ -2095,7 +2095,12 @@
>>>   	   (match_operand:SIDI 3 "register_operand" "  v")]
>>>   	  UNSPECV_ATOMIC))]
>>>     ""
>>> -  "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)"
>>> +  {
>>> +    if (TARGET_RDNA3)
>>> +      return "ds_cmpstore_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)";
>>> +    else
>>> +      return "ds_cmpst_rtn_b<bitsize> %0, %1, %2, %3\;s_waitcnt\tlgkmcnt(0)";
>>> +  }
>>>     [(set_attr "type" "ds")
>>>      (set_attr "length" "12")])
>>
>> I think you need to swap %2 and %3 in the new format. ds_cmpst matches
>> GCC operand order, but ds_cmpstore has "cmp" and "src" reversed.
> 
> OK, thanks.  That was my actual question -- so, we do need to swap, and
> indeed, most of the affected libgomp OpenACC test cases then PASS their
> execution test.  With that changed, I've pushed to master branch
> commit 6c2a40f4f4577f5d0f7bd1cfda48a5701b75744c
> "GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn'", see
> attached.

OK to commit.

Andrew

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2024-02-01 11:40 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2024-01-31 10:36 GCN, RDNA 3: Adjust 'sync_compare_and_swap<mode>_lds_insn' Thomas Schwinge
2024-01-31 11:31 ` Andrew Stubbs
2024-02-01 11:36   ` Thomas Schwinge
2024-02-01 11:40     ` Andrew Stubbs

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).