public inbox for gcc-patches@gcc.gnu.org
 help / color / mirror / Atom feed
* [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic operations.
@ 2022-06-08 14:49 Tamar Christina
  2022-06-08 14:49 ` [PATCH 2/2][AArch32] " Tamar Christina
                   ` (2 more replies)
  0 siblings, 3 replies; 8+ messages in thread
From: Tamar Christina @ 2022-06-08 14:49 UTC (permalink / raw)
  To: gcc-patches
  Cc: nd, Richard.Earnshaw, Marcus.Shawcroft, Kyrylo.Tkachov,
	richard.sandiford

[-- Attachment #1: Type: text/plain, Size: 4648 bytes --]

Hi All,

The AArch64 implementation of 128-bit atomics is broken.

For 128-bit atomics we rely on pthread barriers to correct guard the address
in the pointer to get correct memory ordering.  However for 128-bit atomics the
address under the lock is different from the original pointer.

This means that one of the values under the atomic operation is not protected
properly and so we fail during when the user has requested sequential
consistency as there's no barrier to enforce this requirement.

As such users have resorted to adding an

#ifdef GCC
<emit barrier>
#endif

around the use of these atomics.

This corrects the issue by issuing a barrier only when __ATOMIC_SEQ_CST was
requested.  To remedy this performance hit I think we should revisit using a
similar approach to out-line-atomics for the 128-bit atomics.

Note that I believe I need the empty file due to the include_next chain but
I am not entirely sure.  I have hand verified that the barriers are inserted
for atomic seq cst.

Bootstrapped Regtested on aarch64-none-linux-gnu and no issues.

Ok for master? and for backporting to GCC 12, 11 and 10?

Thanks,
Tamar

libatomic/ChangeLog:

	PR target/102218
	* config/aarch64/aarch64-config.h: New file.
	* config/aarch64/host-config.h: New file.

--- inline copy of patch -- 
diff --git a/libatomic/config/aarch64/aarch64-config.h b/libatomic/config/aarch64/aarch64-config.h
new file mode 100644
index 0000000000000000000000000000000000000000..d3474fa8ff80cb0c3ddbf8c48acd931d2339d33d
--- /dev/null
+++ b/libatomic/config/aarch64/aarch64-config.h
@@ -0,0 +1,23 @@
+/* Copyright (C) 2022 Free Software Foundation, Inc.
+
+   This file is part of the GNU Atomic Library (libatomic).
+
+   Libatomic is free software; you can redistribute it and/or modify it
+   under the terms of the GNU General Public License as published by
+   the Free Software Foundation; either version 3 of the License, or
+   (at your option) any later version.
+
+   Libatomic is distributed in the hope that it will be useful, but WITHOUT ANY
+   WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
+   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
+   more details.
+
+   Under Section 7 of GPL version 3, you are granted additional
+   permissions described in the GCC Runtime Library Exception, version
+   3.1, as published by the Free Software Foundation.
+
+   You should have received a copy of the GNU General Public License and
+   a copy of the GCC Runtime Library Exception along with this program;
+   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
+   <http://www.gnu.org/licenses/>.  */
+
diff --git a/libatomic/config/aarch64/host-config.h b/libatomic/config/aarch64/host-config.h
new file mode 100644
index 0000000000000000000000000000000000000000..f445a47d25ef5cc51cd2167069500245d07bf1bc
--- /dev/null
+++ b/libatomic/config/aarch64/host-config.h
@@ -0,0 +1,46 @@
+/* Copyright (C) 2022 Free Software Foundation, Inc.
+
+   This file is part of the GNU Atomic Library (libatomic).
+
+   Libatomic is free software; you can redistribute it and/or modify it
+   under the terms of the GNU General Public License as published by
+   the Free Software Foundation; either version 3 of the License, or
+   (at your option) any later version.
+
+   Libatomic is distributed in the hope that it will be useful, but WITHOUT ANY
+   WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
+   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
+   more details.
+
+   Under Section 7 of GPL version 3, you are granted additional
+   permissions described in the GCC Runtime Library Exception, version
+   3.1, as published by the Free Software Foundation.
+
+   You should have received a copy of the GNU General Public License and
+   a copy of the GCC Runtime Library Exception along with this program;
+   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
+   <http://www.gnu.org/licenses/>.  */
+
+/* Avoiding the DMB (or kernel helper) can be a good thing.  */
+#define WANT_SPECIALCASE_RELAXED
+
+/* Glibc, at least, uses acq_rel in its pthread mutex
+   implementation.  If the user is asking for seq_cst,
+   this is insufficient.  */
+
+static inline void __attribute__((always_inline, artificial))
+pre_seq_barrier(int model)
+{
+  if (model == __ATOMIC_SEQ_CST)
+    __atomic_thread_fence (__ATOMIC_SEQ_CST);
+}
+
+static inline void __attribute__((always_inline, artificial))
+post_seq_barrier(int model)
+{
+  pre_seq_barrier(model);
+}
+
+#define pre_post_seq_barrier 1
+
+#include_next <host-config.h>




-- 

[-- Attachment #2: rb15782.patch --]
[-- Type: text/plain, Size: 3346 bytes --]

diff --git a/libatomic/config/aarch64/aarch64-config.h b/libatomic/config/aarch64/aarch64-config.h
new file mode 100644
index 0000000000000000000000000000000000000000..d3474fa8ff80cb0c3ddbf8c48acd931d2339d33d
--- /dev/null
+++ b/libatomic/config/aarch64/aarch64-config.h
@@ -0,0 +1,23 @@
+/* Copyright (C) 2022 Free Software Foundation, Inc.
+
+   This file is part of the GNU Atomic Library (libatomic).
+
+   Libatomic is free software; you can redistribute it and/or modify it
+   under the terms of the GNU General Public License as published by
+   the Free Software Foundation; either version 3 of the License, or
+   (at your option) any later version.
+
+   Libatomic is distributed in the hope that it will be useful, but WITHOUT ANY
+   WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
+   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
+   more details.
+
+   Under Section 7 of GPL version 3, you are granted additional
+   permissions described in the GCC Runtime Library Exception, version
+   3.1, as published by the Free Software Foundation.
+
+   You should have received a copy of the GNU General Public License and
+   a copy of the GCC Runtime Library Exception along with this program;
+   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
+   <http://www.gnu.org/licenses/>.  */
+
diff --git a/libatomic/config/aarch64/host-config.h b/libatomic/config/aarch64/host-config.h
new file mode 100644
index 0000000000000000000000000000000000000000..f445a47d25ef5cc51cd2167069500245d07bf1bc
--- /dev/null
+++ b/libatomic/config/aarch64/host-config.h
@@ -0,0 +1,46 @@
+/* Copyright (C) 2022 Free Software Foundation, Inc.
+
+   This file is part of the GNU Atomic Library (libatomic).
+
+   Libatomic is free software; you can redistribute it and/or modify it
+   under the terms of the GNU General Public License as published by
+   the Free Software Foundation; either version 3 of the License, or
+   (at your option) any later version.
+
+   Libatomic is distributed in the hope that it will be useful, but WITHOUT ANY
+   WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
+   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
+   more details.
+
+   Under Section 7 of GPL version 3, you are granted additional
+   permissions described in the GCC Runtime Library Exception, version
+   3.1, as published by the Free Software Foundation.
+
+   You should have received a copy of the GNU General Public License and
+   a copy of the GCC Runtime Library Exception along with this program;
+   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
+   <http://www.gnu.org/licenses/>.  */
+
+/* Avoiding the DMB (or kernel helper) can be a good thing.  */
+#define WANT_SPECIALCASE_RELAXED
+
+/* Glibc, at least, uses acq_rel in its pthread mutex
+   implementation.  If the user is asking for seq_cst,
+   this is insufficient.  */
+
+static inline void __attribute__((always_inline, artificial))
+pre_seq_barrier(int model)
+{
+  if (model == __ATOMIC_SEQ_CST)
+    __atomic_thread_fence (__ATOMIC_SEQ_CST);
+}
+
+static inline void __attribute__((always_inline, artificial))
+post_seq_barrier(int model)
+{
+  pre_seq_barrier(model);
+}
+
+#define pre_post_seq_barrier 1
+
+#include_next <host-config.h>




^ permalink raw reply	[flat|nested] 8+ messages in thread

* [PATCH 2/2][AArch32] Fix 128-bit sequential consistency atomic operations.
  2022-06-08 14:49 [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic operations Tamar Christina
@ 2022-06-08 14:49 ` Tamar Christina
  2022-06-16  9:14   ` Tamar Christina
  2022-08-08 12:53   ` Kyrylo Tkachov
  2022-06-16  9:14 ` [PATCH 1/2]AArch64 " Tamar Christina
  2022-07-12 13:46 ` Kyrylo Tkachov
  2 siblings, 2 replies; 8+ messages in thread
From: Tamar Christina @ 2022-06-08 14:49 UTC (permalink / raw)
  To: gcc-patches
  Cc: nd, Ramana.Radhakrishnan, Richard.Earnshaw, nickc, Kyrylo.Tkachov

[-- Attachment #1: Type: text/plain, Size: 2026 bytes --]

Hi All,

Similar to AArch64 the Arm implementation of 128-bit atomics is broken.

For 128-bit atomics we rely on pthread barriers to correct guard the address
in the pointer to get correct memory ordering.  However for 128-bit atomics the
address under the lock is different from the original pointer.

This means that one of the values under the atomic operation is not protected
properly and so we fail during when the user has requested sequential
consistency as there's no barrier to enforce this requirement.

As such users have resorted to adding an

#ifdef GCC
<emit barrier>
#endif

around the use of these atomics.

This corrects the issue by issuing a barrier only when __ATOMIC_SEQ_CST was
requested.  I have hand verified that the barriers are inserted
for atomic seq cst.


Bootstrapped Regtested on arm-none-linux-gnueabihf and no issues.

Ok for master? and for backporting to GCC 12, 11 and 10?

Thanks,
Tamar

libatomic/ChangeLog:

	PR target/102218
	* config/arm/host-config.h (pre_seq_barrier, post_seq_barrier,
	pre_post_seq_barrier): Require barrier on __ATOMIC_SEQ_CST.

--- inline copy of patch -- 
diff --git a/libatomic/config/arm/host-config.h b/libatomic/config/arm/host-config.h
index bbf4a3f84c3f3ae21fb2162aab68bdedf3fbdcb4..ef16fad2a35ec9055e918849e69a1a0e23b62838 100644
--- a/libatomic/config/arm/host-config.h
+++ b/libatomic/config/arm/host-config.h
@@ -1,4 +1,23 @@
 /* Avoiding the DMB (or kernel helper) can be a good thing.  */
 #define WANT_SPECIALCASE_RELAXED
 
+/* Glibc, at least, uses acq_rel in its pthread mutex
+   implementation.  If the user is asking for seq_cst,
+   this is insufficient.  */
+
+static inline void __attribute__((always_inline, artificial))
+pre_seq_barrier(int model)
+{
+  if (model == __ATOMIC_SEQ_CST)
+    __atomic_thread_fence (__ATOMIC_SEQ_CST);
+}
+
+static inline void __attribute__((always_inline, artificial))
+post_seq_barrier(int model)
+{
+  pre_seq_barrier(model);
+}
+
+#define pre_post_seq_barrier 1
+
 #include_next <host-config.h>




-- 

[-- Attachment #2: rb15783.patch --]
[-- Type: text/plain, Size: 899 bytes --]

diff --git a/libatomic/config/arm/host-config.h b/libatomic/config/arm/host-config.h
index bbf4a3f84c3f3ae21fb2162aab68bdedf3fbdcb4..ef16fad2a35ec9055e918849e69a1a0e23b62838 100644
--- a/libatomic/config/arm/host-config.h
+++ b/libatomic/config/arm/host-config.h
@@ -1,4 +1,23 @@
 /* Avoiding the DMB (or kernel helper) can be a good thing.  */
 #define WANT_SPECIALCASE_RELAXED
 
+/* Glibc, at least, uses acq_rel in its pthread mutex
+   implementation.  If the user is asking for seq_cst,
+   this is insufficient.  */
+
+static inline void __attribute__((always_inline, artificial))
+pre_seq_barrier(int model)
+{
+  if (model == __ATOMIC_SEQ_CST)
+    __atomic_thread_fence (__ATOMIC_SEQ_CST);
+}
+
+static inline void __attribute__((always_inline, artificial))
+post_seq_barrier(int model)
+{
+  pre_seq_barrier(model);
+}
+
+#define pre_post_seq_barrier 1
+
 #include_next <host-config.h>




^ permalink raw reply	[flat|nested] 8+ messages in thread

* RE: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic operations.
  2022-06-08 14:49 [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic operations Tamar Christina
  2022-06-08 14:49 ` [PATCH 2/2][AArch32] " Tamar Christina
@ 2022-06-16  9:14 ` Tamar Christina
  2022-07-12 13:46 ` Kyrylo Tkachov
  2 siblings, 0 replies; 8+ messages in thread
From: Tamar Christina @ 2022-06-16  9:14 UTC (permalink / raw)
  To: gcc-patches
  Cc: nd, Richard Earnshaw, Marcus Shawcroft, Kyrylo Tkachov,
	Richard Sandiford

ping

> -----Original Message-----
> From: Tamar Christina <tamar.christina@arm.com>
> Sent: Wednesday, June 8, 2022 3:49 PM
> To: gcc-patches@gcc.gnu.org
> Cc: nd <nd@arm.com>; Richard Earnshaw <Richard.Earnshaw@arm.com>;
> Marcus Shawcroft <Marcus.Shawcroft@arm.com>; Kyrylo Tkachov
> <Kyrylo.Tkachov@arm.com>; Richard Sandiford
> <Richard.Sandiford@arm.com>
> Subject: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic
> operations.
> 
> Hi All,
> 
> The AArch64 implementation of 128-bit atomics is broken.
> 
> For 128-bit atomics we rely on pthread barriers to correct guard the address
> in the pointer to get correct memory ordering.  However for 128-bit atomics
> the address under the lock is different from the original pointer.
> 
> This means that one of the values under the atomic operation is not
> protected properly and so we fail during when the user has requested
> sequential consistency as there's no barrier to enforce this requirement.
> 
> As such users have resorted to adding an
> 
> #ifdef GCC
> <emit barrier>
> #endif
> 
> around the use of these atomics.
> 
> This corrects the issue by issuing a barrier only when __ATOMIC_SEQ_CST
> was requested.  To remedy this performance hit I think we should revisit
> using a similar approach to out-line-atomics for the 128-bit atomics.
> 
> Note that I believe I need the empty file due to the include_next chain but I
> am not entirely sure.  I have hand verified that the barriers are inserted for
> atomic seq cst.
> 
> Bootstrapped Regtested on aarch64-none-linux-gnu and no issues.
> 
> Ok for master? and for backporting to GCC 12, 11 and 10?
> 
> Thanks,
> Tamar
> 
> libatomic/ChangeLog:
> 
> 	PR target/102218
> 	* config/aarch64/aarch64-config.h: New file.
> 	* config/aarch64/host-config.h: New file.
> 
> --- inline copy of patch --
> diff --git a/libatomic/config/aarch64/aarch64-config.h
> b/libatomic/config/aarch64/aarch64-config.h
> new file mode 100644
> index
> 0000000000000000000000000000000000000000..d3474fa8ff80cb0c3ddbf8c48ac
> d931d2339d33d
> --- /dev/null
> +++ b/libatomic/config/aarch64/aarch64-config.h
> @@ -0,0 +1,23 @@
> +/* Copyright (C) 2022 Free Software Foundation, Inc.
> +
> +   This file is part of the GNU Atomic Library (libatomic).
> +
> +   Libatomic is free software; you can redistribute it and/or modify it
> +   under the terms of the GNU General Public License as published by
> +   the Free Software Foundation; either version 3 of the License, or
> +   (at your option) any later version.
> +
> +   Libatomic is distributed in the hope that it will be useful, but WITHOUT
> ANY
> +   WARRANTY; without even the implied warranty of MERCHANTABILITY or
> FITNESS
> +   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
> +   more details.
> +
> +   Under Section 7 of GPL version 3, you are granted additional
> +   permissions described in the GCC Runtime Library Exception, version
> +   3.1, as published by the Free Software Foundation.
> +
> +   You should have received a copy of the GNU General Public License and
> +   a copy of the GCC Runtime Library Exception along with this program;
> +   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
> +   <http://www.gnu.org/licenses/>.  */
> +
> diff --git a/libatomic/config/aarch64/host-config.h
> b/libatomic/config/aarch64/host-config.h
> new file mode 100644
> index
> 0000000000000000000000000000000000000000..f445a47d25ef5cc51cd2167069
> 500245d07bf1bc
> --- /dev/null
> +++ b/libatomic/config/aarch64/host-config.h
> @@ -0,0 +1,46 @@
> +/* Copyright (C) 2022 Free Software Foundation, Inc.
> +
> +   This file is part of the GNU Atomic Library (libatomic).
> +
> +   Libatomic is free software; you can redistribute it and/or modify it
> +   under the terms of the GNU General Public License as published by
> +   the Free Software Foundation; either version 3 of the License, or
> +   (at your option) any later version.
> +
> +   Libatomic is distributed in the hope that it will be useful, but WITHOUT
> ANY
> +   WARRANTY; without even the implied warranty of MERCHANTABILITY or
> FITNESS
> +   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
> +   more details.
> +
> +   Under Section 7 of GPL version 3, you are granted additional
> +   permissions described in the GCC Runtime Library Exception, version
> +   3.1, as published by the Free Software Foundation.
> +
> +   You should have received a copy of the GNU General Public License and
> +   a copy of the GCC Runtime Library Exception along with this program;
> +   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
> +   <http://www.gnu.org/licenses/>.  */
> +
> +/* Avoiding the DMB (or kernel helper) can be a good thing.  */ #define
> +WANT_SPECIALCASE_RELAXED
> +
> +/* Glibc, at least, uses acq_rel in its pthread mutex
> +   implementation.  If the user is asking for seq_cst,
> +   this is insufficient.  */
> +
> +static inline void __attribute__((always_inline, artificial))
> +pre_seq_barrier(int model) {
> +  if (model == __ATOMIC_SEQ_CST)
> +    __atomic_thread_fence (__ATOMIC_SEQ_CST); }
> +
> +static inline void __attribute__((always_inline, artificial))
> +post_seq_barrier(int model) {
> +  pre_seq_barrier(model);
> +}
> +
> +#define pre_post_seq_barrier 1
> +
> +#include_next <host-config.h>
> 
> 
> 
> 
> --

^ permalink raw reply	[flat|nested] 8+ messages in thread

* RE: [PATCH 2/2][AArch32] Fix 128-bit sequential consistency atomic operations.
  2022-06-08 14:49 ` [PATCH 2/2][AArch32] " Tamar Christina
@ 2022-06-16  9:14   ` Tamar Christina
  2022-08-08 12:53   ` Kyrylo Tkachov
  1 sibling, 0 replies; 8+ messages in thread
From: Tamar Christina @ 2022-06-16  9:14 UTC (permalink / raw)
  To: gcc-patches
  Cc: nd, Ramana Radhakrishnan, Richard Earnshaw, nickc, Kyrylo Tkachov

ping

> -----Original Message-----
> From: Tamar Christina <tamar.christina@arm.com>
> Sent: Wednesday, June 8, 2022 3:50 PM
> To: gcc-patches@gcc.gnu.org
> Cc: nd <nd@arm.com>; Ramana Radhakrishnan
> <Ramana.Radhakrishnan@arm.com>; Richard Earnshaw
> <Richard.Earnshaw@arm.com>; nickc@redhat.com; Kyrylo Tkachov
> <Kyrylo.Tkachov@arm.com>
> Subject: [PATCH 2/2][AArch32] Fix 128-bit sequential consistency atomic
> operations.
> 
> Hi All,
> 
> Similar to AArch64 the Arm implementation of 128-bit atomics is broken.
> 
> For 128-bit atomics we rely on pthread barriers to correct guard the address
> in the pointer to get correct memory ordering.  However for 128-bit atomics
> the address under the lock is different from the original pointer.
> 
> This means that one of the values under the atomic operation is not
> protected properly and so we fail during when the user has requested
> sequential consistency as there's no barrier to enforce this requirement.
> 
> As such users have resorted to adding an
> 
> #ifdef GCC
> <emit barrier>
> #endif
> 
> around the use of these atomics.
> 
> This corrects the issue by issuing a barrier only when __ATOMIC_SEQ_CST
> was requested.  I have hand verified that the barriers are inserted for atomic
> seq cst.
> 
> 
> Bootstrapped Regtested on arm-none-linux-gnueabihf and no issues.
> 
> Ok for master? and for backporting to GCC 12, 11 and 10?
> 
> Thanks,
> Tamar
> 
> libatomic/ChangeLog:
> 
> 	PR target/102218
> 	* config/arm/host-config.h (pre_seq_barrier, post_seq_barrier,
> 	pre_post_seq_barrier): Require barrier on __ATOMIC_SEQ_CST.
> 
> --- inline copy of patch --
> diff --git a/libatomic/config/arm/host-config.h b/libatomic/config/arm/host-
> config.h
> index
> bbf4a3f84c3f3ae21fb2162aab68bdedf3fbdcb4..ef16fad2a35ec9055e918849e6
> 9a1a0e23b62838 100644
> --- a/libatomic/config/arm/host-config.h
> +++ b/libatomic/config/arm/host-config.h
> @@ -1,4 +1,23 @@
>  /* Avoiding the DMB (or kernel helper) can be a good thing.  */  #define
> WANT_SPECIALCASE_RELAXED
> 
> +/* Glibc, at least, uses acq_rel in its pthread mutex
> +   implementation.  If the user is asking for seq_cst,
> +   this is insufficient.  */
> +
> +static inline void __attribute__((always_inline, artificial))
> +pre_seq_barrier(int model) {
> +  if (model == __ATOMIC_SEQ_CST)
> +    __atomic_thread_fence (__ATOMIC_SEQ_CST); }
> +
> +static inline void __attribute__((always_inline, artificial))
> +post_seq_barrier(int model) {
> +  pre_seq_barrier(model);
> +}
> +
> +#define pre_post_seq_barrier 1
> +
>  #include_next <host-config.h>
> 
> 
> 
> 
> --

^ permalink raw reply	[flat|nested] 8+ messages in thread

* RE: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic operations.
  2022-06-08 14:49 [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic operations Tamar Christina
  2022-06-08 14:49 ` [PATCH 2/2][AArch32] " Tamar Christina
  2022-06-16  9:14 ` [PATCH 1/2]AArch64 " Tamar Christina
@ 2022-07-12 13:46 ` Kyrylo Tkachov
  2022-08-08  9:27   ` Tamar Christina
  2 siblings, 1 reply; 8+ messages in thread
From: Kyrylo Tkachov @ 2022-07-12 13:46 UTC (permalink / raw)
  To: Tamar Christina, gcc-patches
  Cc: nd, Richard Earnshaw, Marcus Shawcroft, Richard Sandiford

Hi Tamar,

Let me be the latest to offer my apologies for the slow review.

> -----Original Message-----
> From: Tamar Christina <Tamar.Christina@arm.com>
> Sent: Wednesday, June 8, 2022 3:49 PM
> To: gcc-patches@gcc.gnu.org
> Cc: nd <nd@arm.com>; Richard Earnshaw <Richard.Earnshaw@arm.com>;
> Marcus Shawcroft <Marcus.Shawcroft@arm.com>; Kyrylo Tkachov
> <Kyrylo.Tkachov@arm.com>; Richard Sandiford
> <Richard.Sandiford@arm.com>
> Subject: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic
> operations.
> 
> Hi All,
> 
> The AArch64 implementation of 128-bit atomics is broken.
> 
> For 128-bit atomics we rely on pthread barriers to correct guard the address
> in the pointer to get correct memory ordering.  However for 128-bit atomics
> the
> address under the lock is different from the original pointer.
> 
> This means that one of the values under the atomic operation is not
> protected
> properly and so we fail during when the user has requested sequential
> consistency as there's no barrier to enforce this requirement.
> 
> As such users have resorted to adding an
> 
> #ifdef GCC
> <emit barrier>
> #endif
> 
> around the use of these atomics.
> 
> This corrects the issue by issuing a barrier only when __ATOMIC_SEQ_CST
> was
> requested.  To remedy this performance hit I think we should revisit using a
> similar approach to out-line-atomics for the 128-bit atomics.
> 
> Note that I believe I need the empty file due to the include_next chain but
> I am not entirely sure.  I have hand verified that the barriers are inserted
> for atomic seq cst.
> 
> Bootstrapped Regtested on aarch64-none-linux-gnu and no issues.
> 
> Ok for master? and for backporting to GCC 12, 11 and 10?

I'll admit I'm not too familiar with the mechanics of libatomic but...

> 
> Thanks,
> Tamar
> 
> libatomic/ChangeLog:
> 
> 	PR target/102218
> 	* config/aarch64/aarch64-config.h: New file.
> 	* config/aarch64/host-config.h: New file.
> 
> --- inline copy of patch --
> diff --git a/libatomic/config/aarch64/aarch64-config.h
> b/libatomic/config/aarch64/aarch64-config.h
> new file mode 100644
> index
> 0000000000000000000000000000000000000000..d3474fa8ff80cb0c3ddbf8c4
> 8acd931d2339d33d
> --- /dev/null
> +++ b/libatomic/config/aarch64/aarch64-config.h
> @@ -0,0 +1,23 @@
> +/* Copyright (C) 2022 Free Software Foundation, Inc.
> +
> +   This file is part of the GNU Atomic Library (libatomic).
> +
> +   Libatomic is free software; you can redistribute it and/or modify it
> +   under the terms of the GNU General Public License as published by
> +   the Free Software Foundation; either version 3 of the License, or
> +   (at your option) any later version.
> +
> +   Libatomic is distributed in the hope that it will be useful, but WITHOUT
> ANY
> +   WARRANTY; without even the implied warranty of MERCHANTABILITY or
> FITNESS
> +   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
> +   more details.
> +
> +   Under Section 7 of GPL version 3, you are granted additional
> +   permissions described in the GCC Runtime Library Exception, version
> +   3.1, as published by the Free Software Foundation.
> +
> +   You should have received a copy of the GNU General Public License and
> +   a copy of the GCC Runtime Library Exception along with this program;
> +   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
> +   <http://www.gnu.org/licenses/>.  */
> +
> diff --git a/libatomic/config/aarch64/host-config.h
> b/libatomic/config/aarch64/host-config.h
> new file mode 100644
> index
> 0000000000000000000000000000000000000000..f445a47d25ef5cc51cd21670
> 69500245d07bf1bc
> --- /dev/null
> +++ b/libatomic/config/aarch64/host-config.h
> @@ -0,0 +1,46 @@
> +/* Copyright (C) 2022 Free Software Foundation, Inc.
> +
> +   This file is part of the GNU Atomic Library (libatomic).
> +
> +   Libatomic is free software; you can redistribute it and/or modify it
> +   under the terms of the GNU General Public License as published by
> +   the Free Software Foundation; either version 3 of the License, or
> +   (at your option) any later version.
> +
> +   Libatomic is distributed in the hope that it will be useful, but WITHOUT
> ANY
> +   WARRANTY; without even the implied warranty of MERCHANTABILITY or
> FITNESS
> +   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
> +   more details.
> +
> +   Under Section 7 of GPL version 3, you are granted additional
> +   permissions described in the GCC Runtime Library Exception, version
> +   3.1, as published by the Free Software Foundation.
> +
> +   You should have received a copy of the GNU General Public License and
> +   a copy of the GCC Runtime Library Exception along with this program;
> +   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
> +   <http://www.gnu.org/licenses/>.  */
> +
> +/* Avoiding the DMB (or kernel helper) can be a good thing.  */
> +#define WANT_SPECIALCASE_RELAXED
> +
> +/* Glibc, at least, uses acq_rel in its pthread mutex
> +   implementation.  If the user is asking for seq_cst,
> +   this is insufficient.  */
> +
> +static inline void __attribute__((always_inline, artificial))
> +pre_seq_barrier(int model)
> +{
> +  if (model == __ATOMIC_SEQ_CST)
> +    __atomic_thread_fence (__ATOMIC_SEQ_CST);
> +}
> +
> +static inline void __attribute__((always_inline, artificial))
> +post_seq_barrier(int model)
> +{
> +  pre_seq_barrier(model);
> +}
> +
> +#define pre_post_seq_barrier 1
> +
> +#include_next <host-config.h>

... This does looks sensible and similar to what's done on powerpc, which is similar to the aarch64 target in this regard.
However, there is already a host-config.h in config/linux/aarch64/host-config.h . Does this file end up including the one in config/linux?
If so, does this mean that this works correctly (i.e. was tested) for aarch64-none-elf as well as Linux? 

Thanks,
Kyrill

> 
> 
> 
> 
> --

^ permalink raw reply	[flat|nested] 8+ messages in thread

* RE: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic operations.
  2022-07-12 13:46 ` Kyrylo Tkachov
@ 2022-08-08  9:27   ` Tamar Christina
  2022-08-08 12:51     ` Kyrylo Tkachov
  0 siblings, 1 reply; 8+ messages in thread
From: Tamar Christina @ 2022-08-08  9:27 UTC (permalink / raw)
  To: Kyrylo Tkachov, gcc-patches
  Cc: nd, Richard Earnshaw, Marcus Shawcroft, Richard Sandiford


> -----Original Message-----
> From: Kyrylo Tkachov <Kyrylo.Tkachov@arm.com>
> Sent: Tuesday, July 12, 2022 2:46 PM
> To: Tamar Christina <Tamar.Christina@arm.com>; gcc-patches@gcc.gnu.org
> Cc: nd <nd@arm.com>; Richard Earnshaw <Richard.Earnshaw@arm.com>;
> Marcus Shawcroft <Marcus.Shawcroft@arm.com>; Richard Sandiford
> <Richard.Sandiford@arm.com>
> Subject: RE: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic
> operations.
> 
> Hi Tamar,
> 
> Let me be the latest to offer my apologies for the slow review.
> 
> > -----Original Message-----
> > From: Tamar Christina <Tamar.Christina@arm.com>
> > Sent: Wednesday, June 8, 2022 3:49 PM
> > To: gcc-patches@gcc.gnu.org
> > Cc: nd <nd@arm.com>; Richard Earnshaw <Richard.Earnshaw@arm.com>;
> > Marcus Shawcroft <Marcus.Shawcroft@arm.com>; Kyrylo Tkachov
> > <Kyrylo.Tkachov@arm.com>; Richard Sandiford
> > <Richard.Sandiford@arm.com>
> > Subject: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic
> > operations.
> >
> > Hi All,
> >
> > The AArch64 implementation of 128-bit atomics is broken.
> >
> > For 128-bit atomics we rely on pthread barriers to correct guard the
> > address in the pointer to get correct memory ordering.  However for
> > 128-bit atomics the address under the lock is different from the
> > original pointer.
> >
> > This means that one of the values under the atomic operation is not
> > protected properly and so we fail during when the user has requested
> > sequential consistency as there's no barrier to enforce this
> > requirement.
> >
> > As such users have resorted to adding an
> >
> > #ifdef GCC
> > <emit barrier>
> > #endif
> >
> > around the use of these atomics.
> >
> > This corrects the issue by issuing a barrier only when
> > __ATOMIC_SEQ_CST was requested.  To remedy this performance hit I
> > think we should revisit using a similar approach to out-line-atomics
> > for the 128-bit atomics.
> >
> > Note that I believe I need the empty file due to the include_next
> > chain but I am not entirely sure.  I have hand verified that the
> > barriers are inserted for atomic seq cst.
> >
> > Bootstrapped Regtested on aarch64-none-linux-gnu and no issues.
> >
> > Ok for master? and for backporting to GCC 12, 11 and 10?
> 
> I'll admit I'm not too familiar with the mechanics of libatomic but...
> 
> >
> > Thanks,
> > Tamar
> >
> > libatomic/ChangeLog:
> >
> > 	PR target/102218
> > 	* config/aarch64/aarch64-config.h: New file.
> > 	* config/aarch64/host-config.h: New file.
> >
> > --- inline copy of patch --
> > diff --git a/libatomic/config/aarch64/aarch64-config.h
> > b/libatomic/config/aarch64/aarch64-config.h
> > new file mode 100644
> > index
> > 0000000000000000000000000000000000000000..d3474fa8ff80cb0c3ddbf8c4
> > 8acd931d2339d33d
> > --- /dev/null
> > +++ b/libatomic/config/aarch64/aarch64-config.h
> > @@ -0,0 +1,23 @@
> > +/* Copyright (C) 2022 Free Software Foundation, Inc.
> > +
> > +   This file is part of the GNU Atomic Library (libatomic).
> > +
> > +   Libatomic is free software; you can redistribute it and/or modify it
> > +   under the terms of the GNU General Public License as published by
> > +   the Free Software Foundation; either version 3 of the License, or
> > +   (at your option) any later version.
> > +
> > +   Libatomic is distributed in the hope that it will be useful, but
> > + WITHOUT
> > ANY
> > +   WARRANTY; without even the implied warranty of MERCHANTABILITY or
> > FITNESS
> > +   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
> > +   more details.
> > +
> > +   Under Section 7 of GPL version 3, you are granted additional
> > +   permissions described in the GCC Runtime Library Exception, version
> > +   3.1, as published by the Free Software Foundation.
> > +
> > +   You should have received a copy of the GNU General Public License and
> > +   a copy of the GCC Runtime Library Exception along with this program;
> > +   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
> > +   <http://www.gnu.org/licenses/>.  */
> > +
> > diff --git a/libatomic/config/aarch64/host-config.h
> > b/libatomic/config/aarch64/host-config.h
> > new file mode 100644
> > index
> > 0000000000000000000000000000000000000000..f445a47d25ef5cc51cd21670
> > 69500245d07bf1bc
> > --- /dev/null
> > +++ b/libatomic/config/aarch64/host-config.h
> > @@ -0,0 +1,46 @@
> > +/* Copyright (C) 2022 Free Software Foundation, Inc.
> > +
> > +   This file is part of the GNU Atomic Library (libatomic).
> > +
> > +   Libatomic is free software; you can redistribute it and/or modify it
> > +   under the terms of the GNU General Public License as published by
> > +   the Free Software Foundation; either version 3 of the License, or
> > +   (at your option) any later version.
> > +
> > +   Libatomic is distributed in the hope that it will be useful, but
> > + WITHOUT
> > ANY
> > +   WARRANTY; without even the implied warranty of MERCHANTABILITY or
> > FITNESS
> > +   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
> > +   more details.
> > +
> > +   Under Section 7 of GPL version 3, you are granted additional
> > +   permissions described in the GCC Runtime Library Exception, version
> > +   3.1, as published by the Free Software Foundation.
> > +
> > +   You should have received a copy of the GNU General Public License and
> > +   a copy of the GCC Runtime Library Exception along with this program;
> > +   see the files COPYING3 and COPYING.RUNTIME respectively.  If not, see
> > +   <http://www.gnu.org/licenses/>.  */
> > +
> > +/* Avoiding the DMB (or kernel helper) can be a good thing.  */
> > +#define WANT_SPECIALCASE_RELAXED
> > +
> > +/* Glibc, at least, uses acq_rel in its pthread mutex
> > +   implementation.  If the user is asking for seq_cst,
> > +   this is insufficient.  */
> > +
> > +static inline void __attribute__((always_inline, artificial))
> > +pre_seq_barrier(int model) {
> > +  if (model == __ATOMIC_SEQ_CST)
> > +    __atomic_thread_fence (__ATOMIC_SEQ_CST); }
> > +
> > +static inline void __attribute__((always_inline, artificial))
> > +post_seq_barrier(int model) {
> > +  pre_seq_barrier(model);
> > +}
> > +
> > +#define pre_post_seq_barrier 1
> > +
> > +#include_next <host-config.h>
> 
> ... This does looks sensible and similar to what's done on powerpc, which is
> similar to the aarch64 target in this regard.
> However, there is already a host-config.h in config/linux/aarch64/host-
> config.h . Does this file end up including the one in config/linux?
> If so, does this mean that this works correctly (i.e. was tested) for aarch64-
> none-elf as well as Linux?
>

Hi,

We don't build libatomic on any elf platforms.  It has a default unsupported
flag which we don't override.  Indeed we don't produce libatomic.a for elf
and a simple example fails to link as well..

Regards,
Tamar

> Thanks,
> Kyrill
> 
> >
> >
> >
> >
> > --

^ permalink raw reply	[flat|nested] 8+ messages in thread

* RE: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic operations.
  2022-08-08  9:27   ` Tamar Christina
@ 2022-08-08 12:51     ` Kyrylo Tkachov
  0 siblings, 0 replies; 8+ messages in thread
From: Kyrylo Tkachov @ 2022-08-08 12:51 UTC (permalink / raw)
  To: Tamar Christina, gcc-patches
  Cc: nd, Richard Earnshaw, Marcus Shawcroft, Richard Sandiford



> -----Original Message-----
> From: Tamar Christina <Tamar.Christina@arm.com>
> Sent: Monday, August 8, 2022 10:28 AM
> To: Kyrylo Tkachov <Kyrylo.Tkachov@arm.com>; gcc-patches@gcc.gnu.org
> Cc: nd <nd@arm.com>; Richard Earnshaw <Richard.Earnshaw@arm.com>;
> Marcus Shawcroft <Marcus.Shawcroft@arm.com>; Richard Sandiford
> <Richard.Sandiford@arm.com>
> Subject: RE: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic
> operations.
> 
> 
> > -----Original Message-----
> > From: Kyrylo Tkachov <Kyrylo.Tkachov@arm.com>
> > Sent: Tuesday, July 12, 2022 2:46 PM
> > To: Tamar Christina <Tamar.Christina@arm.com>; gcc-
> patches@gcc.gnu.org
> > Cc: nd <nd@arm.com>; Richard Earnshaw <Richard.Earnshaw@arm.com>;
> > Marcus Shawcroft <Marcus.Shawcroft@arm.com>; Richard Sandiford
> > <Richard.Sandiford@arm.com>
> > Subject: RE: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic
> > operations.
> >
> > Hi Tamar,
> >
> > Let me be the latest to offer my apologies for the slow review.
> >
> > > -----Original Message-----
> > > From: Tamar Christina <Tamar.Christina@arm.com>
> > > Sent: Wednesday, June 8, 2022 3:49 PM
> > > To: gcc-patches@gcc.gnu.org
> > > Cc: nd <nd@arm.com>; Richard Earnshaw
> <Richard.Earnshaw@arm.com>;
> > > Marcus Shawcroft <Marcus.Shawcroft@arm.com>; Kyrylo Tkachov
> > > <Kyrylo.Tkachov@arm.com>; Richard Sandiford
> > > <Richard.Sandiford@arm.com>
> > > Subject: [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic
> > > operations.
> > >
> > > Hi All,
> > >
> > > The AArch64 implementation of 128-bit atomics is broken.
> > >
> > > For 128-bit atomics we rely on pthread barriers to correct guard the
> > > address in the pointer to get correct memory ordering.  However for
> > > 128-bit atomics the address under the lock is different from the
> > > original pointer.
> > >
> > > This means that one of the values under the atomic operation is not
> > > protected properly and so we fail during when the user has requested
> > > sequential consistency as there's no barrier to enforce this
> > > requirement.
> > >
> > > As such users have resorted to adding an
> > >
> > > #ifdef GCC
> > > <emit barrier>
> > > #endif
> > >
> > > around the use of these atomics.
> > >
> > > This corrects the issue by issuing a barrier only when
> > > __ATOMIC_SEQ_CST was requested.  To remedy this performance hit I
> > > think we should revisit using a similar approach to out-line-atomics
> > > for the 128-bit atomics.
> > >
> > > Note that I believe I need the empty file due to the include_next
> > > chain but I am not entirely sure.  I have hand verified that the
> > > barriers are inserted for atomic seq cst.
> > >
> > > Bootstrapped Regtested on aarch64-none-linux-gnu and no issues.
> > >
> > > Ok for master? and for backporting to GCC 12, 11 and 10?
> >
> > I'll admit I'm not too familiar with the mechanics of libatomic but...
> >
> > >
> > > Thanks,
> > > Tamar
> > >
> > > libatomic/ChangeLog:
> > >
> > > 	PR target/102218
> > > 	* config/aarch64/aarch64-config.h: New file.
> > > 	* config/aarch64/host-config.h: New file.
> > >
> > > --- inline copy of patch --
> > > diff --git a/libatomic/config/aarch64/aarch64-config.h
> > > b/libatomic/config/aarch64/aarch64-config.h
> > > new file mode 100644
> > > index
> > >
> 0000000000000000000000000000000000000000..d3474fa8ff80cb0c3ddbf8c4
> > > 8acd931d2339d33d
> > > --- /dev/null
> > > +++ b/libatomic/config/aarch64/aarch64-config.h
> > > @@ -0,0 +1,23 @@
> > > +/* Copyright (C) 2022 Free Software Foundation, Inc.
> > > +
> > > +   This file is part of the GNU Atomic Library (libatomic).
> > > +
> > > +   Libatomic is free software; you can redistribute it and/or modify it
> > > +   under the terms of the GNU General Public License as published by
> > > +   the Free Software Foundation; either version 3 of the License, or
> > > +   (at your option) any later version.
> > > +
> > > +   Libatomic is distributed in the hope that it will be useful, but
> > > + WITHOUT
> > > ANY
> > > +   WARRANTY; without even the implied warranty of MERCHANTABILITY
> or
> > > FITNESS
> > > +   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
> > > +   more details.
> > > +
> > > +   Under Section 7 of GPL version 3, you are granted additional
> > > +   permissions described in the GCC Runtime Library Exception, version
> > > +   3.1, as published by the Free Software Foundation.
> > > +
> > > +   You should have received a copy of the GNU General Public License
> and
> > > +   a copy of the GCC Runtime Library Exception along with this program;
> > > +   see the files COPYING3 and COPYING.RUNTIME respectively.  If not,
> see
> > > +   <http://www.gnu.org/licenses/>.  */
> > > +
> > > diff --git a/libatomic/config/aarch64/host-config.h
> > > b/libatomic/config/aarch64/host-config.h
> > > new file mode 100644
> > > index
> > >
> 0000000000000000000000000000000000000000..f445a47d25ef5cc51cd21670
> > > 69500245d07bf1bc
> > > --- /dev/null
> > > +++ b/libatomic/config/aarch64/host-config.h
> > > @@ -0,0 +1,46 @@
> > > +/* Copyright (C) 2022 Free Software Foundation, Inc.
> > > +
> > > +   This file is part of the GNU Atomic Library (libatomic).
> > > +
> > > +   Libatomic is free software; you can redistribute it and/or modify it
> > > +   under the terms of the GNU General Public License as published by
> > > +   the Free Software Foundation; either version 3 of the License, or
> > > +   (at your option) any later version.
> > > +
> > > +   Libatomic is distributed in the hope that it will be useful, but
> > > + WITHOUT
> > > ANY
> > > +   WARRANTY; without even the implied warranty of MERCHANTABILITY
> or
> > > FITNESS
> > > +   FOR A PARTICULAR PURPOSE.  See the GNU General Public License for
> > > +   more details.
> > > +
> > > +   Under Section 7 of GPL version 3, you are granted additional
> > > +   permissions described in the GCC Runtime Library Exception, version
> > > +   3.1, as published by the Free Software Foundation.
> > > +
> > > +   You should have received a copy of the GNU General Public License
> and
> > > +   a copy of the GCC Runtime Library Exception along with this program;
> > > +   see the files COPYING3 and COPYING.RUNTIME respectively.  If not,
> see
> > > +   <http://www.gnu.org/licenses/>.  */
> > > +
> > > +/* Avoiding the DMB (or kernel helper) can be a good thing.  */
> > > +#define WANT_SPECIALCASE_RELAXED
> > > +
> > > +/* Glibc, at least, uses acq_rel in its pthread mutex
> > > +   implementation.  If the user is asking for seq_cst,
> > > +   this is insufficient.  */
> > > +
> > > +static inline void __attribute__((always_inline, artificial))
> > > +pre_seq_barrier(int model) {
> > > +  if (model == __ATOMIC_SEQ_CST)
> > > +    __atomic_thread_fence (__ATOMIC_SEQ_CST); }
> > > +
> > > +static inline void __attribute__((always_inline, artificial))
> > > +post_seq_barrier(int model) {
> > > +  pre_seq_barrier(model);
> > > +}
> > > +
> > > +#define pre_post_seq_barrier 1
> > > +
> > > +#include_next <host-config.h>
> >
> > ... This does looks sensible and similar to what's done on powerpc, which is
> > similar to the aarch64 target in this regard.
> > However, there is already a host-config.h in config/linux/aarch64/host-
> > config.h . Does this file end up including the one in config/linux?
> > If so, does this mean that this works correctly (i.e. was tested) for aarch64-
> > none-elf as well as Linux?
> >
> 
> Hi,
> 
> We don't build libatomic on any elf platforms.  It has a default unsupported
> flag which we don't override.  Indeed we don't produce libatomic.a for elf
> and a simple example fails to link as well..

Ok, then I think this patch is a step in the right direction.
Ok for trunk.
Thanks,
Kyrill

> 
> Regards,
> Tamar
> 
> > Thanks,
> > Kyrill
> >
> > >
> > >
> > >
> > >
> > > --

^ permalink raw reply	[flat|nested] 8+ messages in thread

* RE: [PATCH 2/2][AArch32] Fix 128-bit sequential consistency atomic operations.
  2022-06-08 14:49 ` [PATCH 2/2][AArch32] " Tamar Christina
  2022-06-16  9:14   ` Tamar Christina
@ 2022-08-08 12:53   ` Kyrylo Tkachov
  1 sibling, 0 replies; 8+ messages in thread
From: Kyrylo Tkachov @ 2022-08-08 12:53 UTC (permalink / raw)
  To: Tamar Christina, gcc-patches
  Cc: nd, Ramana Radhakrishnan, Richard Earnshaw, nickc



> -----Original Message-----
> From: Tamar Christina <Tamar.Christina@arm.com>
> Sent: Wednesday, June 8, 2022 3:50 PM
> To: gcc-patches@gcc.gnu.org
> Cc: nd <nd@arm.com>; Ramana Radhakrishnan
> <Ramana.Radhakrishnan@arm.com>; Richard Earnshaw
> <Richard.Earnshaw@arm.com>; nickc@redhat.com; Kyrylo Tkachov
> <Kyrylo.Tkachov@arm.com>
> Subject: [PATCH 2/2][AArch32] Fix 128-bit sequential consistency atomic
> operations.
> 
> Hi All,
> 
> Similar to AArch64 the Arm implementation of 128-bit atomics is broken.
> 
> For 128-bit atomics we rely on pthread barriers to correct guard the address
> in the pointer to get correct memory ordering.  However for 128-bit atomics
> the
> address under the lock is different from the original pointer.
> 
> This means that one of the values under the atomic operation is not
> protected
> properly and so we fail during when the user has requested sequential
> consistency as there's no barrier to enforce this requirement.
> 
> As such users have resorted to adding an
> 
> #ifdef GCC
> <emit barrier>
> #endif
> 
> around the use of these atomics.
> 
> This corrects the issue by issuing a barrier only when __ATOMIC_SEQ_CST
> was
> requested.  I have hand verified that the barriers are inserted
> for atomic seq cst.
> 
> 
> Bootstrapped Regtested on arm-none-linux-gnueabihf and no issues.
> 
> Ok for master? and for backporting to GCC 12, 11 and 10?

Ok, with backports after a couple weeks on master.
Thanks,
Kyrill

> 
> Thanks,
> Tamar
> 
> libatomic/ChangeLog:
> 
> 	PR target/102218
> 	* config/arm/host-config.h (pre_seq_barrier, post_seq_barrier,
> 	pre_post_seq_barrier): Require barrier on __ATOMIC_SEQ_CST.
> 
> --- inline copy of patch --
> diff --git a/libatomic/config/arm/host-config.h b/libatomic/config/arm/host-
> config.h
> index
> bbf4a3f84c3f3ae21fb2162aab68bdedf3fbdcb4..ef16fad2a35ec9055e918849e
> 69a1a0e23b62838 100644
> --- a/libatomic/config/arm/host-config.h
> +++ b/libatomic/config/arm/host-config.h
> @@ -1,4 +1,23 @@
>  /* Avoiding the DMB (or kernel helper) can be a good thing.  */
>  #define WANT_SPECIALCASE_RELAXED
> 
> +/* Glibc, at least, uses acq_rel in its pthread mutex
> +   implementation.  If the user is asking for seq_cst,
> +   this is insufficient.  */
> +
> +static inline void __attribute__((always_inline, artificial))
> +pre_seq_barrier(int model)
> +{
> +  if (model == __ATOMIC_SEQ_CST)
> +    __atomic_thread_fence (__ATOMIC_SEQ_CST);
> +}
> +
> +static inline void __attribute__((always_inline, artificial))
> +post_seq_barrier(int model)
> +{
> +  pre_seq_barrier(model);
> +}
> +
> +#define pre_post_seq_barrier 1
> +
>  #include_next <host-config.h>
> 
> 
> 
> 
> --

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2022-08-08 12:53 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-06-08 14:49 [PATCH 1/2]AArch64 Fix 128-bit sequential consistency atomic operations Tamar Christina
2022-06-08 14:49 ` [PATCH 2/2][AArch32] " Tamar Christina
2022-06-16  9:14   ` Tamar Christina
2022-08-08 12:53   ` Kyrylo Tkachov
2022-06-16  9:14 ` [PATCH 1/2]AArch64 " Tamar Christina
2022-07-12 13:46 ` Kyrylo Tkachov
2022-08-08  9:27   ` Tamar Christina
2022-08-08 12:51     ` Kyrylo Tkachov

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).