public inbox for gdb-patches@sourceware.org
 help / color / mirror / Atom feed
* [PATCH 0/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps
@ 2023-08-03  0:56 Kevin Buettner
  2023-08-03  0:56 ` [PATCH 1/1] " Kevin Buettner
  0 siblings, 1 reply; 7+ messages in thread
From: Kevin Buettner @ 2023-08-03  0:56 UTC (permalink / raw)
  To: gdb-patches; +Cc: zhiyong.yan, simon.marchi, Kevin Buettner

This patch fixes a problem with software single-stepping in gdbserver.
See the 1/1 patch for the details.

The fix proper is based on the work of Zhiyong Yan, who I've credited
with a Co-Authored-By tag in the 1/1 patch.  The test case was also
partly Zhiyong's work since I adapted the C file from the reproducer
in the bug that was filed for this problem.

Zhiyong, please check to make sure that this revised change to
gdbserver/linux-low.c still fixes the bug on your board.

I've added Simon as a CC since the .exp file for the new test is based
on gdb.threads/next-fork-other-thread.exp, which he authored.  Also,
from my reading of the commit log for that test, I gather that Simon
has looked at a related problem in the past.

Kevin Buettner (1):
  gdbserver: Reinstall software single-step breakpoints in
    resume_stopped_resumed_lwps

 .../gdb.threads/next-fork-exec-other-thread.c |  82 +++++++++++
 .../next-fork-exec-other-thread.exp           | 131 ++++++++++++++++++
 gdbserver/linux-low.cc                        |   7 +-
 3 files changed, 219 insertions(+), 1 deletion(-)
 create mode 100644 gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
 create mode 100644 gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp

-- 
2.41.0


^ permalink raw reply	[flat|nested] 7+ messages in thread

* [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps
  2023-08-03  0:56 [PATCH 0/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps Kevin Buettner
@ 2023-08-03  0:56 ` Kevin Buettner
  2023-08-07 10:54   ` Yan, Zhiyong
  2023-08-12  4:00   ` Kevin Buettner
  0 siblings, 2 replies; 7+ messages in thread
From: Kevin Buettner @ 2023-08-03  0:56 UTC (permalink / raw)
  To: gdb-patches; +Cc: zhiyong.yan, simon.marchi, Kevin Buettner

At the moment, while performing a software single-step, gdbserver fails
to reinsert software single-step breakpoints for a LWP when
interrupted by a signal in another thread.  This commit fixes this
problem by reinstalling software single-step breakpoints in
linux_process_target::resume_stopped_resumed_lwps in
gdbserver/linux-low.cc.

This bug was discovered due to a failing assert in maybe_hw_step()
in gdbserver/linux-low.cc.  Looking at the backtrace revealed
that the caller was linux_process_target::resume_stopped_resumed_lwps.
I was uncertain whether the assert should still be valid when called
from that method, so I tried hoisting the assert from maybe_hw_step
to all callers except resume_stopped_resumed_lwps.  But running the
new test case, described below, showed that merely eliminating the
assert for this case was NOT a good fix - a study of the log file for
the test showed that the single-step operation failed to occur.
Instead GDB (via gdbserver) stopped at the next breakpoint that was
hit.

Zhiyong Yan had proposed a fix which resinserted software single-step
breakpoints, albeit at a different location in linux-low.cc.  Testing
revealed that, while running gdb.threads/pending-fork-event-detach,
the executable associated with that test would die due to a SIGTRAP
after the test program was detached.  Examination of the core file(s)
showed that a breakpoint instruction had been left in program memory.
Test results were otherwise very good, so Zhiyong was definitely on
the right track!

This commit causes software single-step breakpoint(s) to be inserted
before the call to maybe_hw_step in resume_stopped_resumed_lwps.  This
will cause 'has_single_step_breakpoints (thread)' to be true, so that
the assert in maybe_hw_step...

      /* GDBserver must insert single-step breakpoint for software
	 single step.  */
      gdb_assert (has_single_step_breakpoints (thread));

...will no longer fail.  And better still, the single-step breakpoints
are reinstalled, so that stepping will actually work, even when
interrupted.

The C code for the test case was loosely adapted from the reproducer
provided in Zhiyong's bug report for this problem.  The .exp file was
copied from next-fork-other-thread.exp and then tweaked slightly.  As
noted in a comment in next-fork-exec-other-thread.exp, I had to remove
"on" from the loop for non-stop as it was failing on all architectures
(including x86-64) that I tested.  I have a feeling that it ought to
work, but this can be investigated separately and (re)enabled once it
works.  I also increased the number of iterations for the loop running
the "next" commands.  I've had some test runs which don't show the bug
until the loop counter exceeded 100 iterations.  The C file for the
new test uses shorter delays than next-fork-other-thread.c though, so
it doesn't take overly long (IMO) to run this new test.

Running the new test on a Raspberry Pi w/ a 32-bit (Arm) kernel and
userland using a gdbserver build without the fix in this commit shows
the following results:

FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=12: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=auto: non-stop=off: displaced-stepping=on: i=9: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=auto: non-stop=off: displaced-stepping=off: i=18: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=off: non-stop=off: displaced-stepping=auto: i=3: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=off: non-stop=off: displaced-stepping=on: i=11: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=1: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=auto: non-stop=off: displaced-stepping=on: i=3: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=auto: non-stop=off: displaced-stepping=off: i=1: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=auto: i=47: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=on: i=57: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=off: non-stop=off: displaced-stepping=auto: i=1: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=off: non-stop=off: displaced-stepping=on: i=10: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next to break here

		=== gdb Summary ===

 # of unexpected core files	12
 # of expected passes		3011
 # of unexpected failures	14

Each of the 12 core files were caused by the failed assertion in
maybe_hw_step in linux-low.c.  These correspond to 12 of the
unexpected failures.

When the tests are run using a gdbserver build which includes the fix
in this commit, the results are significantly better, but not perfect:

FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=auto: i=143: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=on: i=25: next to other line

		=== gdb Summary ===

 # of expected passes		10178
 # of unexpected failures	2

(I think that the two remaining failures are due to some different
problem.  And, FWIW, I don't see these failures on the Pi when using
--target_board=native-extended-gdbserver.)

Running the new test on x86-64 and aarch64, both native and
native-gdbserver shows no failures.

Also, I see no regressions when running the entire test suite for
armv7l-unknown-linux-gnueabihf (i.e.  the Raspberry Pi w/ 32-bit
kernel+userland) with --target_board=native-gdbserver.  Additionally,
using --target_board=native-gdbserver, I also see no regressions for
the entire test suite for x86-64 and aarch64 running Fedora 38.

Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=30387
Co-Authored-By: Zhiyong Yan <zhiyong.yan@windriver.com>
---
 .../gdb.threads/next-fork-exec-other-thread.c |  82 +++++++++++
 .../next-fork-exec-other-thread.exp           | 131 ++++++++++++++++++
 gdbserver/linux-low.cc                        |   7 +-
 3 files changed, 219 insertions(+), 1 deletion(-)
 create mode 100644 gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
 create mode 100644 gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp

diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
new file mode 100644
index 00000000000..884706c6c3c
--- /dev/null
+++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
@@ -0,0 +1,82 @@
+/* This testcase is part of GDB, the GNU debugger.
+
+   Copyright 2023 Free Software Foundation, Inc.
+
+   This program is free software; you can redistribute it and/or modify
+   it under the terms of the GNU General Public License as published by
+   the Free Software Foundation; either version 3 of the License, or
+   (at your option) any later version.
+
+   This program is distributed in the hope that it will be useful,
+   but WITHOUT ANY WARRANTY; without even the implied warranty of
+   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.	 See the
+   GNU General Public License for more details.
+
+   You should have received a copy of the GNU General Public License
+   along with this program.  If not, see <http://www.gnu.org/licenses/>.  */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <fcntl.h>
+#include <errno.h>
+#include <unistd.h>
+#include <pthread.h>
+#include <sys/wait.h>
+
+#define MAX_LOOP_ITER 10000
+
+static char *argv0;
+
+static void*
+worker_a (void *pArg)
+{
+  int iter = 0;
+  char *args[] = {argv0, "self-call", NULL };
+
+  while (iter++ < MAX_LOOP_ITER)
+    {
+      pid_t pid = FORK_FUNC ();
+      if (pid == 0)
+	{
+	  /* child */
+	  if (execvp (args[0], args) == -1)
+	    {
+	      fprintf (stderr, "execvp error: %d\n", errno);
+	      exit (1);
+	    }
+	}
+
+      waitpid (pid, NULL, 0);
+      usleep (5);
+    }
+}
+
+static void*
+worker_b (void *pArg)
+{
+  int iter = 0;
+  while (iter++ < MAX_LOOP_ITER)  /* for loop */
+    {
+      usleep (5);  /* break here */
+      usleep (5);  /* other line */
+    }
+}
+
+int
+main (int argc, char **argv)
+{
+  pthread_t wa_pid;
+  pthread_t wb_pid;
+
+  argv0 = argv[0];
+
+  if (argc > 1 && strcmp (argv[1], "self-call") == 0)
+    exit (0);
+
+  pthread_create (&wa_pid, NULL, worker_a, NULL);
+  pthread_create (&wb_pid, NULL, worker_b, NULL);
+  pthread_join (wa_pid, NULL);
+
+  exit (0);
+}
diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
new file mode 100644
index 00000000000..e4d6ff53ded
--- /dev/null
+++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
@@ -0,0 +1,131 @@
+# Copyright 2022-2023 Free Software Foundation, Inc.
+
+# This program is free software; you can redistribute it and/or modify
+# it under the terms of the GNU General Public License as published by
+# the Free Software Foundation; either version 3 of the License, or
+# (at your option) any later version.
+#
+# This program is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+# GNU General Public License for more details.
+#
+# You should have received a copy of the GNU General Public License
+# along with this program.  If not, see <http://www.gnu.org/licenses/>.
+
+# This test was adapted from next-fork-other-thread.exp. The .c file
+# was adapted from the reproducer for this bug:
+#
+# https://sourceware.org/bugzilla/show_bug.cgi?id=30387#
+#
+# That bug demonstrates a problem with software-singlestep in gdbserver.
+# Prior to being fixed, this test also demonstrated that bug for a
+# 32-bit ARM target.  (Use RUNTESTFLAGS="--target_board=native-gdbserver".)
+# It has been reproduced on a Raspberry Pi running Ubunutu server
+# 20.04.5 LTS with 32-bit kernel + 32-bit userland.  It was NOT reproducible
+# using a circa 2023 Raspberry Pi OS w/ 64-bit kernel and 32-bit userland.
+
+standard_testfile
+
+# Line where to stop the main thread.
+set break_here_line [gdb_get_line_number "break here"]
+
+# Build executables, one for each fork flavor.
+foreach_with_prefix fork_func {fork vfork} {
+    set opts [list debug pthreads additional_flags=-DFORK_FUNC=${fork_func}]
+    if { [build_executable "failed to prepare" \
+	    ${testfile}-${fork_func} ${srcfile} $opts] } {
+	return
+    }
+}
+
+# If testing against GDBserver, consume all it its output.
+
+proc drain_gdbserver_output { } {
+    if { [info exists ::server_spawn_id] } {
+	gdb_test_multiple "" "" {
+	    -i "$::server_spawn_id"
+	    -timeout 0
+	    -re ".+" {
+	      exp_continue
+	    }
+	}
+    }
+}
+
+# Run the test with the given parameters:
+#
+#   - FORK_FUNC: fork flavor, "fork" or "vfork".
+#   - TARGET-NON-STOP: "maintenance set target-non-stop" value, "auto", "on" or
+#     "off".
+#   - NON-STOP: "set non-stop" value, "on" or "off".
+#   - DISPLACED-STEPPING: "set displaced-stepping" value, "auto", "on" or "off".
+
+proc do_test { fork_func target-non-stop non-stop displaced-stepping } {
+    save_vars { ::GDBFLAGS } {
+	append ::GDBFLAGS " -ex \"maintenance set target-non-stop ${target-non-stop}\""
+	append ::GDBFLAGS " -ex \"set non-stop ${non-stop}\""
+	clean_restart ${::binfile}-${fork_func}
+    }
+
+    gdb_test_no_output "set displaced-stepping ${displaced-stepping}"
+
+    if { ![runto_main] } {
+	return
+    }
+
+    # The "Detached after (v)fork" messages get in the way in non-stop, disable
+    # them.
+    gdb_test_no_output "set print inferior-events off"
+
+    # Advance the next-ing thread to the point where we'll execute the nexts.
+    # Leave the breakpoint in: it will force GDB to step over it while next-ing,
+    # which exercises some additional code paths.
+    gdb_test "break $::break_here_line" "Breakpoint $::decimal at $::hex.*"
+    gdb_test "continue" "hit Breakpoint $::decimal, worker_b.*"
+
+    # Next an arbitrary number of times over the lines of the loop.
+    for { set i 0 } { $i < 200 } { incr i } {
+	# If testing against GDBserver, the forking threads cause a lot of
+	# "Detaching from process XYZ" messages to appear.  If we don't consume
+	# that output, GDBserver eventually blocks on a full stderr.  Drain it
+	# once every loop.  It may not be needed for 20 iterations, but it's
+	# needed if you increase to 200 iterations.
+	drain_gdbserver_output
+
+	with_test_prefix "i=$i" {
+	    if { [gdb_test "next" "other line.*" "next to other line"] != 0 } {
+		return
+	    }
+
+	    if { [gdb_test "next" "for loop.*" "next to for loop"] != 0 } {
+		return
+	    }
+
+	    if { [gdb_test "next" "break here.*" "next to break here"] != 0} {
+		return
+	    }
+	}
+    }
+}
+
+foreach_with_prefix fork_func {fork vfork} {
+    foreach_with_prefix target-non-stop {auto on off} {
+	# This file was copied from next-fork-other-thread.exp and
+	# then adapted for the a case which also involves an exec in
+	# addition to the fork.  Ideally, we should test non-stop "on"
+	# in addition to "off", but, for this test, that results in a
+	# number of failures occur preceded by the message:
+	#
+	# Cannot execute this command while the selected thread is running.
+	#
+	# That seems like correct behavior to me, but perhaps the
+	# non-stop case can be made to work; if so, simply add "on"
+	# after "off" on the line below...
+	foreach_with_prefix non-stop {off} {
+	    foreach_with_prefix displaced-stepping {auto on off} {
+		do_test ${fork_func} ${target-non-stop} ${non-stop} ${displaced-stepping}
+	    }
+	}
+    }
+}
diff --git a/gdbserver/linux-low.cc b/gdbserver/linux-low.cc
index 651f219b738..e1806ade82f 100644
--- a/gdbserver/linux-low.cc
+++ b/gdbserver/linux-low.cc
@@ -2463,7 +2463,12 @@ linux_process_target::resume_stopped_resumed_lwps (thread_info *thread)
       int step = 0;
 
       if (thread->last_resume_kind == resume_step)
-	step = maybe_hw_step (thread);
+	{
+	  if (supports_software_single_step ())
+	    install_software_single_step_breakpoints (lp);
+
+	  step = maybe_hw_step (thread);
+	}
 
       threads_debug_printf ("resuming stopped-resumed LWP %s at %s: step=%d",
 			    target_pid_to_str (ptid_of (thread)).c_str (),
-- 
2.41.0


^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps
  2023-08-03  0:56 ` [PATCH 1/1] " Kevin Buettner
@ 2023-08-07 10:54   ` Yan, Zhiyong
  2023-08-08  2:31     ` Kevin Buettner
  2023-08-12  4:00   ` Kevin Buettner
  1 sibling, 1 reply; 7+ messages in thread
From: Yan, Zhiyong @ 2023-08-07 10:54 UTC (permalink / raw)
  To: Kevin Buettner, gdb-patches; +Cc: simon.marchi

Hi Kevin,
     Today I backport below patch to gdb 13.0.5 which is used by our customer board.
     I verify this patch by customer's app where the assert error was firstly reported,  and verify it by test app "osm".
     This patch can fix the assert error in my tests.

Best Regards.
Zhiyong


-----Original Message-----
From: Kevin Buettner <kevinb@redhat.com> 
Sent: Thursday, August 3, 2023 8:56 AM
To: gdb-patches@sourceware.org
Cc: Yan, Zhiyong <Zhiyong.Yan@windriver.com>; simon.marchi@polymtl.ca; Kevin Buettner <kevinb@redhat.com>
Subject: [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps

CAUTION: This email comes from a non Wind River email account!
Do not click links or open attachments unless you recognize the sender and know the content is safe.

At the moment, while performing a software single-step, gdbserver fails to reinsert software single-step breakpoints for a LWP when interrupted by a signal in another thread.  This commit fixes this problem by reinstalling software single-step breakpoints in linux_process_target::resume_stopped_resumed_lwps in gdbserver/linux-low.cc.

This bug was discovered due to a failing assert in maybe_hw_step() in gdbserver/linux-low.cc.  Looking at the backtrace revealed that the caller was linux_process_target::resume_stopped_resumed_lwps.
I was uncertain whether the assert should still be valid when called from that method, so I tried hoisting the assert from maybe_hw_step to all callers except resume_stopped_resumed_lwps.  But running the new test case, described below, showed that merely eliminating the assert for this case was NOT a good fix - a study of the log file for the test showed that the single-step operation failed to occur.
Instead GDB (via gdbserver) stopped at the next breakpoint that was hit.

Zhiyong Yan had proposed a fix which resinserted software single-step breakpoints, albeit at a different location in linux-low.cc.  Testing revealed that, while running gdb.threads/pending-fork-event-detach,
the executable associated with that test would die due to a SIGTRAP after the test program was detached.  Examination of the core file(s) showed that a breakpoint instruction had been left in program memory.
Test results were otherwise very good, so Zhiyong was definitely on the right track!

This commit causes software single-step breakpoint(s) to be inserted before the call to maybe_hw_step in resume_stopped_resumed_lwps.  This will cause 'has_single_step_breakpoints (thread)' to be true, so that the assert in maybe_hw_step...

      /* GDBserver must insert single-step breakpoint for software
         single step.  */
      gdb_assert (has_single_step_breakpoints (thread));

...will no longer fail.  And better still, the single-step breakpoints are reinstalled, so that stepping will actually work, even when interrupted.

The C code for the test case was loosely adapted from the reproducer provided in Zhiyong's bug report for this problem.  The .exp file was copied from next-fork-other-thread.exp and then tweaked slightly.  As noted in a comment in next-fork-exec-other-thread.exp, I had to remove "on" from the loop for non-stop as it was failing on all architectures (including x86-64) that I tested.  I have a feeling that it ought to work, but this can be investigated separately and (re)enabled once it works.  I also increased the number of iterations for the loop running the "next" commands.  I've had some test runs which don't show the bug until the loop counter exceeded 100 iterations.  The C file for the new test uses shorter delays than next-fork-other-thread.c though, so it doesn't take overly long (IMO) to run this new test.

Running the new test on a Raspberry Pi w/ a 32-bit (Arm) kernel and userland using a gdbserver build without the fix in this commit shows the following results:

FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=12: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=auto: non-stop=off: displaced-stepping=on: i=9: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=auto: non-stop=off: displaced-stepping=off: i=18: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=off: non-stop=off: displaced-stepping=auto: i=3: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=off: non-stop=off: displaced-stepping=on: i=11: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=1: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=auto: non-stop=off: displaced-stepping=on: i=3: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=auto: non-stop=off: displaced-stepping=off: i=1: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=auto: i=47: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=on: i=57: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=off: non-stop=off: displaced-stepping=auto: i=1: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=off: non-stop=off: displaced-stepping=on: i=10: next to break here
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next to break here

                === gdb Summary ===

 # of unexpected core files     12
 # of expected passes           3011
 # of unexpected failures       14

Each of the 12 core files were caused by the failed assertion in maybe_hw_step in linux-low.c.  These correspond to 12 of the unexpected failures.

When the tests are run using a gdbserver build which includes the fix in this commit, the results are significantly better, but not perfect:

FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=auto: i=143: next to other line
FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=on: i=25: next to other line

                === gdb Summary ===

 # of expected passes           10178
 # of unexpected failures       2

(I think that the two remaining failures are due to some different problem.  And, FWIW, I don't see these failures on the Pi when using
--target_board=native-extended-gdbserver.)

Running the new test on x86-64 and aarch64, both native and native-gdbserver shows no failures.

Also, I see no regressions when running the entire test suite for armv7l-unknown-linux-gnueabihf (i.e.  the Raspberry Pi w/ 32-bit
kernel+userland) with --target_board=native-gdbserver.  Additionally,
using --target_board=native-gdbserver, I also see no regressions for the entire test suite for x86-64 and aarch64 running Fedora 38.

Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=30387
Co-Authored-By: Zhiyong Yan <zhiyong.yan@windriver.com>
---
 .../gdb.threads/next-fork-exec-other-thread.c |  82 +++++++++++
 .../next-fork-exec-other-thread.exp           | 131 ++++++++++++++++++
 gdbserver/linux-low.cc                        |   7 +-
 3 files changed, 219 insertions(+), 1 deletion(-)  create mode 100644 gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
 create mode 100644 gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp

diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
new file mode 100644
index 00000000000..884706c6c3c
--- /dev/null
+++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
@@ -0,0 +1,82 @@
+/* This testcase is part of GDB, the GNU debugger.
+
+   Copyright 2023 Free Software Foundation, Inc.
+
+   This program is free software; you can redistribute it and/or modify
+   it under the terms of the GNU General Public License as published by
+   the Free Software Foundation; either version 3 of the License, or
+   (at your option) any later version.
+
+   This program is distributed in the hope that it will be useful,
+   but WITHOUT ANY WARRANTY; without even the implied warranty of
+   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.         See the
+   GNU General Public License for more details.
+
+   You should have received a copy of the GNU General Public License
+   along with this program.  If not, see 
+ <http://www.gnu.org/licenses/>.  */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <fcntl.h>
+#include <errno.h>
+#include <unistd.h>
+#include <pthread.h>
+#include <sys/wait.h>
+
+#define MAX_LOOP_ITER 10000
+
+static char *argv0;
+
+static void*
+worker_a (void *pArg)
+{
+  int iter = 0;
+  char *args[] = {argv0, "self-call", NULL };
+
+  while (iter++ < MAX_LOOP_ITER)
+    {
+      pid_t pid = FORK_FUNC ();
+      if (pid == 0)
+       {
+         /* child */
+         if (execvp (args[0], args) == -1)
+           {
+             fprintf (stderr, "execvp error: %d\n", errno);
+             exit (1);
+           }
+       }
+
+      waitpid (pid, NULL, 0);
+      usleep (5);
+    }
+}
+
+static void*
+worker_b (void *pArg)
+{
+  int iter = 0;
+  while (iter++ < MAX_LOOP_ITER)  /* for loop */
+    {
+      usleep (5);  /* break here */
+      usleep (5);  /* other line */
+    }
+}
+
+int
+main (int argc, char **argv)
+{
+  pthread_t wa_pid;
+  pthread_t wb_pid;
+
+  argv0 = argv[0];
+
+  if (argc > 1 && strcmp (argv[1], "self-call") == 0)
+    exit (0);
+
+  pthread_create (&wa_pid, NULL, worker_a, NULL);  pthread_create 
+ (&wb_pid, NULL, worker_b, NULL);  pthread_join (wa_pid, NULL);
+
+  exit (0);
+}
diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
new file mode 100644
index 00000000000..e4d6ff53ded
--- /dev/null
+++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
@@ -0,0 +1,131 @@
+# Copyright 2022-2023 Free Software Foundation, Inc.
+
+# This program is free software; you can redistribute it and/or modify 
+# it under the terms of the GNU General Public License as published by 
+# the Free Software Foundation; either version 3 of the License, or # 
+(at your option) any later version.
+#
+# This program is distributed in the hope that it will be useful, # but 
+WITHOUT ANY WARRANTY; without even the implied warranty of # 
+MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the # GNU 
+General Public License for more details.
+#
+# You should have received a copy of the GNU General Public License # 
+along with this program.  If not, see <http://www.gnu.org/licenses/>.
+
+# This test was adapted from next-fork-other-thread.exp. The .c file # 
+was adapted from the reproducer for this bug:
+#
+# https://sourceware.org/bugzilla/show_bug.cgi?id=30387#
+#
+# That bug demonstrates a problem with software-singlestep in gdbserver.
+# Prior to being fixed, this test also demonstrated that bug for a # 
+32-bit ARM target.  (Use 
+RUNTESTFLAGS="--target_board=native-gdbserver".)
+# It has been reproduced on a Raspberry Pi running Ubunutu server # 
+20.04.5 LTS with 32-bit kernel + 32-bit userland.  It was NOT 
+reproducible # using a circa 2023 Raspberry Pi OS w/ 64-bit kernel and 32-bit userland.
+
+standard_testfile
+
+# Line where to stop the main thread.
+set break_here_line [gdb_get_line_number "break here"]
+
+# Build executables, one for each fork flavor.
+foreach_with_prefix fork_func {fork vfork} {
+    set opts [list debug pthreads additional_flags=-DFORK_FUNC=${fork_func}]
+    if { [build_executable "failed to prepare" \
+           ${testfile}-${fork_func} ${srcfile} $opts] } {
+       return
+    }
+}
+
+# If testing against GDBserver, consume all it its output.
+
+proc drain_gdbserver_output { } {
+    if { [info exists ::server_spawn_id] } {
+       gdb_test_multiple "" "" {
+           -i "$::server_spawn_id"
+           -timeout 0
+           -re ".+" {
+             exp_continue
+           }
+       }
+    }
+}
+
+# Run the test with the given parameters:
+#
+#   - FORK_FUNC: fork flavor, "fork" or "vfork".
+#   - TARGET-NON-STOP: "maintenance set target-non-stop" value, "auto", "on" or
+#     "off".
+#   - NON-STOP: "set non-stop" value, "on" or "off".
+#   - DISPLACED-STEPPING: "set displaced-stepping" value, "auto", "on" or "off".
+
+proc do_test { fork_func target-non-stop non-stop displaced-stepping } {
+    save_vars { ::GDBFLAGS } {
+       append ::GDBFLAGS " -ex \"maintenance set target-non-stop ${target-non-stop}\""
+       append ::GDBFLAGS " -ex \"set non-stop ${non-stop}\""
+       clean_restart ${::binfile}-${fork_func}
+    }
+
+    gdb_test_no_output "set displaced-stepping ${displaced-stepping}"
+
+    if { ![runto_main] } {
+       return
+    }
+
+    # The "Detached after (v)fork" messages get in the way in non-stop, disable
+    # them.
+    gdb_test_no_output "set print inferior-events off"
+
+    # Advance the next-ing thread to the point where we'll execute the nexts.
+    # Leave the breakpoint in: it will force GDB to step over it while next-ing,
+    # which exercises some additional code paths.
+    gdb_test "break $::break_here_line" "Breakpoint $::decimal at $::hex.*"
+    gdb_test "continue" "hit Breakpoint $::decimal, worker_b.*"
+
+    # Next an arbitrary number of times over the lines of the loop.
+    for { set i 0 } { $i < 200 } { incr i } {
+       # If testing against GDBserver, the forking threads cause a lot of
+       # "Detaching from process XYZ" messages to appear.  If we don't consume
+       # that output, GDBserver eventually blocks on a full stderr.  Drain it
+       # once every loop.  It may not be needed for 20 iterations, but it's
+       # needed if you increase to 200 iterations.
+       drain_gdbserver_output
+
+       with_test_prefix "i=$i" {
+           if { [gdb_test "next" "other line.*" "next to other line"] != 0 } {
+               return
+           }
+
+           if { [gdb_test "next" "for loop.*" "next to for loop"] != 0 } {
+               return
+           }
+
+           if { [gdb_test "next" "break here.*" "next to break here"] != 0} {
+               return
+           }
+       }
+    }
+}
+
+foreach_with_prefix fork_func {fork vfork} {
+    foreach_with_prefix target-non-stop {auto on off} {
+       # This file was copied from next-fork-other-thread.exp and
+       # then adapted for the a case which also involves an exec in
+       # addition to the fork.  Ideally, we should test non-stop "on"
+       # in addition to "off", but, for this test, that results in a
+       # number of failures occur preceded by the message:
+       #
+       # Cannot execute this command while the selected thread is running.
+       #
+       # That seems like correct behavior to me, but perhaps the
+       # non-stop case can be made to work; if so, simply add "on"
+       # after "off" on the line below...
+       foreach_with_prefix non-stop {off} {
+           foreach_with_prefix displaced-stepping {auto on off} {
+               do_test ${fork_func} ${target-non-stop} ${non-stop} ${displaced-stepping}
+           }
+       }
+    }
+}
diff --git a/gdbserver/linux-low.cc b/gdbserver/linux-low.cc index 651f219b738..e1806ade82f 100644
--- a/gdbserver/linux-low.cc
+++ b/gdbserver/linux-low.cc
@@ -2463,7 +2463,12 @@ linux_process_target::resume_stopped_resumed_lwps (thread_info *thread)
       int step = 0;

       if (thread->last_resume_kind == resume_step)
-       step = maybe_hw_step (thread);
+       {
+         if (supports_software_single_step ())
+           install_software_single_step_breakpoints (lp);
+
+         step = maybe_hw_step (thread);
+       }

       threads_debug_printf ("resuming stopped-resumed LWP %s at %s: step=%d",
                            target_pid_to_str (ptid_of (thread)).c_str (),
--
2.41.0


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps
  2023-08-07 10:54   ` Yan, Zhiyong
@ 2023-08-08  2:31     ` Kevin Buettner
  2023-08-08  2:39       ` Yan, Zhiyong
  0 siblings, 1 reply; 7+ messages in thread
From: Kevin Buettner @ 2023-08-08  2:31 UTC (permalink / raw)
  To: Yan, Zhiyong; +Cc: gdb-patches, simon.marchi

Hi Zhiyong,

Thanks for testing.  In addition to the Co-Authored-By tag, I've added
a Tested-By tag for you.

I'll wait a few more days for comments, after which, assuming there
are no objections, I'll push it.

Thank you for all of your contributions (bug report, test case,
initial patches, and testing) in resolving this problem!

Kevin

On Mon, 7 Aug 2023 10:54:16 +0000
"Yan, Zhiyong" <Zhiyong.Yan@windriver.com> wrote:

> Hi Kevin,
>      Today I backport below patch to gdb 13.0.5 which is used by our customer board.
>      I verify this patch by customer's app where the assert error was firstly reported,  and verify it by test app "osm".
>      This patch can fix the assert error in my tests.
> 
> Best Regards.
> Zhiyong
> 
> 
> -----Original Message-----
> From: Kevin Buettner <kevinb@redhat.com> 
> Sent: Thursday, August 3, 2023 8:56 AM
> To: gdb-patches@sourceware.org
> Cc: Yan, Zhiyong <Zhiyong.Yan@windriver.com>; simon.marchi@polymtl.ca; Kevin Buettner <kevinb@redhat.com>
> Subject: [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps
> 
> CAUTION: This email comes from a non Wind River email account!
> Do not click links or open attachments unless you recognize the sender and know the content is safe.
> 
> At the moment, while performing a software single-step, gdbserver fails to reinsert software single-step breakpoints for a LWP when interrupted by a signal in another thread.  This commit fixes this problem by reinstalling software single-step breakpoints in linux_process_target::resume_stopped_resumed_lwps in gdbserver/linux-low.cc.
> 
> This bug was discovered due to a failing assert in maybe_hw_step() in gdbserver/linux-low.cc.  Looking at the backtrace revealed that the caller was linux_process_target::resume_stopped_resumed_lwps.
> I was uncertain whether the assert should still be valid when called from that method, so I tried hoisting the assert from maybe_hw_step to all callers except resume_stopped_resumed_lwps.  But running the new test case, described below, showed that merely eliminating the assert for this case was NOT a good fix - a study of the log file for the test showed that the single-step operation failed to occur.
> Instead GDB (via gdbserver) stopped at the next breakpoint that was hit.
> 
> Zhiyong Yan had proposed a fix which resinserted software single-step breakpoints, albeit at a different location in linux-low.cc.  Testing revealed that, while running gdb.threads/pending-fork-event-detach,
> the executable associated with that test would die due to a SIGTRAP after the test program was detached.  Examination of the core file(s) showed that a breakpoint instruction had been left in program memory.
> Test results were otherwise very good, so Zhiyong was definitely on the right track!
> 
> This commit causes software single-step breakpoint(s) to be inserted before the call to maybe_hw_step in resume_stopped_resumed_lwps.  This will cause 'has_single_step_breakpoints (thread)' to be true, so that the assert in maybe_hw_step...
> 
>       /* GDBserver must insert single-step breakpoint for software
>          single step.  */
>       gdb_assert (has_single_step_breakpoints (thread));
> 
> ...will no longer fail.  And better still, the single-step breakpoints are reinstalled, so that stepping will actually work, even when interrupted.
> 
> The C code for the test case was loosely adapted from the reproducer provided in Zhiyong's bug report for this problem.  The .exp file was copied from next-fork-other-thread.exp and then tweaked slightly.  As noted in a comment in next-fork-exec-other-thread.exp, I had to remove "on" from the loop for non-stop as it was failing on all architectures (including x86-64) that I tested.  I have a feeling that it ought to work, but this can be investigated separately and (re)enabled once it works.  I also increased the number of iterations for the loop running the "next" commands.  I've had some test runs which don't show the bug until the loop counter exceeded 100 iterations.  The C file for the new test uses shorter delays than next-fork-other-thread.c though, so it doesn't take overly long (IMO) to run this new test.
> 
> Running the new test on a Raspberry Pi w/ a 32-bit (Arm) kernel and userland using a gdbserver build without the fix in this commit shows the following results:
> 
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=12: next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=auto: non-stop=off: displaced-stepping=on: i=9: next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=auto: non-stop=off: displaced-stepping=off: i=18: next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=off: non-stop=off: displaced-stepping=auto: i=3: next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=off: non-stop=off: displaced-stepping=on: i=11: next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=1: next to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=auto: non-stop=off: displaced-stepping=on: i=3: next to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=auto: non-stop=off: displaced-stepping=off: i=1: next to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=auto: i=47: next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=on: i=57: next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=off: non-stop=off: displaced-stepping=auto: i=1: next to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=off: non-stop=off: displaced-stepping=on: i=10: next to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next to break here
> 
>                 === gdb Summary ===
> 
>  # of unexpected core files     12
>  # of expected passes           3011
>  # of unexpected failures       14
> 
> Each of the 12 core files were caused by the failed assertion in maybe_hw_step in linux-low.c.  These correspond to 12 of the unexpected failures.
> 
> When the tests are run using a gdbserver build which includes the fix in this commit, the results are significantly better, but not perfect:
> 
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=auto: i=143: next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: target-non-stop=on: non-stop=off: displaced-stepping=on: i=25: next to other line
> 
>                 === gdb Summary ===
> 
>  # of expected passes           10178
>  # of unexpected failures       2
> 
> (I think that the two remaining failures are due to some different problem.  And, FWIW, I don't see these failures on the Pi when using
> --target_board=native-extended-gdbserver.)
> 
> Running the new test on x86-64 and aarch64, both native and native-gdbserver shows no failures.
> 
> Also, I see no regressions when running the entire test suite for armv7l-unknown-linux-gnueabihf (i.e.  the Raspberry Pi w/ 32-bit
> kernel+userland) with --target_board=native-gdbserver.  Additionally,
> using --target_board=native-gdbserver, I also see no regressions for the entire test suite for x86-64 and aarch64 running Fedora 38.
> 
> Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=30387
> Co-Authored-By: Zhiyong Yan <zhiyong.yan@windriver.com>
> ---
>  .../gdb.threads/next-fork-exec-other-thread.c |  82 +++++++++++
>  .../next-fork-exec-other-thread.exp           | 131 ++++++++++++++++++
>  gdbserver/linux-low.cc                        |   7 +-
>  3 files changed, 219 insertions(+), 1 deletion(-)  create mode 100644 gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
>  create mode 100644 gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
> 
> diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
> new file mode 100644
> index 00000000000..884706c6c3c
> --- /dev/null
> +++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
> @@ -0,0 +1,82 @@
> +/* This testcase is part of GDB, the GNU debugger.
> +
> +   Copyright 2023 Free Software Foundation, Inc.
> +
> +   This program is free software; you can redistribute it and/or modify
> +   it under the terms of the GNU General Public License as published by
> +   the Free Software Foundation; either version 3 of the License, or
> +   (at your option) any later version.
> +
> +   This program is distributed in the hope that it will be useful,
> +   but WITHOUT ANY WARRANTY; without even the implied warranty of
> +   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.         See the
> +   GNU General Public License for more details.
> +
> +   You should have received a copy of the GNU General Public License
> +   along with this program.  If not, see 
> + <http://www.gnu.org/licenses/>.  */
> +
> +#include <stdio.h>
> +#include <stdlib.h>
> +#include <string.h>
> +#include <fcntl.h>
> +#include <errno.h>
> +#include <unistd.h>
> +#include <pthread.h>
> +#include <sys/wait.h>
> +
> +#define MAX_LOOP_ITER 10000
> +
> +static char *argv0;
> +
> +static void*
> +worker_a (void *pArg)
> +{
> +  int iter = 0;
> +  char *args[] = {argv0, "self-call", NULL };
> +
> +  while (iter++ < MAX_LOOP_ITER)
> +    {
> +      pid_t pid = FORK_FUNC ();
> +      if (pid == 0)
> +       {
> +         /* child */
> +         if (execvp (args[0], args) == -1)
> +           {
> +             fprintf (stderr, "execvp error: %d\n", errno);
> +             exit (1);
> +           }
> +       }
> +
> +      waitpid (pid, NULL, 0);
> +      usleep (5);
> +    }
> +}
> +
> +static void*
> +worker_b (void *pArg)
> +{
> +  int iter = 0;
> +  while (iter++ < MAX_LOOP_ITER)  /* for loop */
> +    {
> +      usleep (5);  /* break here */
> +      usleep (5);  /* other line */
> +    }
> +}
> +
> +int
> +main (int argc, char **argv)
> +{
> +  pthread_t wa_pid;
> +  pthread_t wb_pid;
> +
> +  argv0 = argv[0];
> +
> +  if (argc > 1 && strcmp (argv[1], "self-call") == 0)
> +    exit (0);
> +
> +  pthread_create (&wa_pid, NULL, worker_a, NULL);  pthread_create 
> + (&wb_pid, NULL, worker_b, NULL);  pthread_join (wa_pid, NULL);
> +
> +  exit (0);
> +}
> diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
> new file mode 100644
> index 00000000000..e4d6ff53ded
> --- /dev/null
> +++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
> @@ -0,0 +1,131 @@
> +# Copyright 2022-2023 Free Software Foundation, Inc.
> +
> +# This program is free software; you can redistribute it and/or modify 
> +# it under the terms of the GNU General Public License as published by 
> +# the Free Software Foundation; either version 3 of the License, or # 
> +(at your option) any later version.
> +#
> +# This program is distributed in the hope that it will be useful, # but 
> +WITHOUT ANY WARRANTY; without even the implied warranty of # 
> +MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the # GNU 
> +General Public License for more details.
> +#
> +# You should have received a copy of the GNU General Public License # 
> +along with this program.  If not, see <http://www.gnu.org/licenses/>.
> +
> +# This test was adapted from next-fork-other-thread.exp. The .c file # 
> +was adapted from the reproducer for this bug:
> +#
> +# https://sourceware.org/bugzilla/show_bug.cgi?id=30387#
> +#
> +# That bug demonstrates a problem with software-singlestep in gdbserver.
> +# Prior to being fixed, this test also demonstrated that bug for a # 
> +32-bit ARM target.  (Use 
> +RUNTESTFLAGS="--target_board=native-gdbserver".)
> +# It has been reproduced on a Raspberry Pi running Ubunutu server # 
> +20.04.5 LTS with 32-bit kernel + 32-bit userland.  It was NOT 
> +reproducible # using a circa 2023 Raspberry Pi OS w/ 64-bit kernel and 32-bit userland.
> +
> +standard_testfile
> +
> +# Line where to stop the main thread.
> +set break_here_line [gdb_get_line_number "break here"]
> +
> +# Build executables, one for each fork flavor.
> +foreach_with_prefix fork_func {fork vfork} {
> +    set opts [list debug pthreads additional_flags=-DFORK_FUNC=${fork_func}]
> +    if { [build_executable "failed to prepare" \
> +           ${testfile}-${fork_func} ${srcfile} $opts] } {
> +       return
> +    }
> +}
> +
> +# If testing against GDBserver, consume all it its output.
> +
> +proc drain_gdbserver_output { } {
> +    if { [info exists ::server_spawn_id] } {
> +       gdb_test_multiple "" "" {
> +           -i "$::server_spawn_id"
> +           -timeout 0
> +           -re ".+" {
> +             exp_continue
> +           }
> +       }
> +    }
> +}
> +
> +# Run the test with the given parameters:
> +#
> +#   - FORK_FUNC: fork flavor, "fork" or "vfork".
> +#   - TARGET-NON-STOP: "maintenance set target-non-stop" value, "auto", "on" or
> +#     "off".
> +#   - NON-STOP: "set non-stop" value, "on" or "off".
> +#   - DISPLACED-STEPPING: "set displaced-stepping" value, "auto", "on" or "off".
> +
> +proc do_test { fork_func target-non-stop non-stop displaced-stepping } {
> +    save_vars { ::GDBFLAGS } {
> +       append ::GDBFLAGS " -ex \"maintenance set target-non-stop ${target-non-stop}\""
> +       append ::GDBFLAGS " -ex \"set non-stop ${non-stop}\""
> +       clean_restart ${::binfile}-${fork_func}
> +    }
> +
> +    gdb_test_no_output "set displaced-stepping ${displaced-stepping}"
> +
> +    if { ![runto_main] } {
> +       return
> +    }
> +
> +    # The "Detached after (v)fork" messages get in the way in non-stop, disable
> +    # them.
> +    gdb_test_no_output "set print inferior-events off"
> +
> +    # Advance the next-ing thread to the point where we'll execute the nexts.
> +    # Leave the breakpoint in: it will force GDB to step over it while next-ing,
> +    # which exercises some additional code paths.
> +    gdb_test "break $::break_here_line" "Breakpoint $::decimal at $::hex.*"
> +    gdb_test "continue" "hit Breakpoint $::decimal, worker_b.*"
> +
> +    # Next an arbitrary number of times over the lines of the loop.
> +    for { set i 0 } { $i < 200 } { incr i } {
> +       # If testing against GDBserver, the forking threads cause a lot of
> +       # "Detaching from process XYZ" messages to appear.  If we don't consume
> +       # that output, GDBserver eventually blocks on a full stderr.  Drain it
> +       # once every loop.  It may not be needed for 20 iterations, but it's
> +       # needed if you increase to 200 iterations.
> +       drain_gdbserver_output
> +
> +       with_test_prefix "i=$i" {
> +           if { [gdb_test "next" "other line.*" "next to other line"] != 0 } {
> +               return
> +           }
> +
> +           if { [gdb_test "next" "for loop.*" "next to for loop"] != 0 } {
> +               return
> +           }
> +
> +           if { [gdb_test "next" "break here.*" "next to break here"] != 0} {
> +               return
> +           }
> +       }
> +    }
> +}
> +
> +foreach_with_prefix fork_func {fork vfork} {
> +    foreach_with_prefix target-non-stop {auto on off} {
> +       # This file was copied from next-fork-other-thread.exp and
> +       # then adapted for the a case which also involves an exec in
> +       # addition to the fork.  Ideally, we should test non-stop "on"
> +       # in addition to "off", but, for this test, that results in a
> +       # number of failures occur preceded by the message:
> +       #
> +       # Cannot execute this command while the selected thread is running.
> +       #
> +       # That seems like correct behavior to me, but perhaps the
> +       # non-stop case can be made to work; if so, simply add "on"
> +       # after "off" on the line below...
> +       foreach_with_prefix non-stop {off} {
> +           foreach_with_prefix displaced-stepping {auto on off} {
> +               do_test ${fork_func} ${target-non-stop} ${non-stop} ${displaced-stepping}
> +           }
> +       }
> +    }
> +}
> diff --git a/gdbserver/linux-low.cc b/gdbserver/linux-low.cc index 651f219b738..e1806ade82f 100644
> --- a/gdbserver/linux-low.cc
> +++ b/gdbserver/linux-low.cc
> @@ -2463,7 +2463,12 @@ linux_process_target::resume_stopped_resumed_lwps (thread_info *thread)
>        int step = 0;
> 
>        if (thread->last_resume_kind == resume_step)
> -       step = maybe_hw_step (thread);
> +       {
> +         if (supports_software_single_step ())
> +           install_software_single_step_breakpoints (lp);
> +
> +         step = maybe_hw_step (thread);
> +       }
> 
>        threads_debug_printf ("resuming stopped-resumed LWP %s at %s: step=%d",
>                             target_pid_to_str (ptid_of (thread)).c_str (),
> --
> 2.41.0
> 


^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps
  2023-08-08  2:31     ` Kevin Buettner
@ 2023-08-08  2:39       ` Yan, Zhiyong
  2023-08-08  2:52         ` Kevin Buettner
  0 siblings, 1 reply; 7+ messages in thread
From: Yan, Zhiyong @ 2023-08-08  2:39 UTC (permalink / raw)
  To: Kevin Buettner; +Cc: gdb-patches, simon.marchi

Hi Kevin,
     It is pleasure for me to cooperate with you.
     When the release version for this issue is determined, please let me know. I will forward this information to my customer.
     Thanks!

Best Regards.
Zhiyong


-----Original Message-----
From: Kevin Buettner <kevinb@redhat.com> 
Sent: Tuesday, August 8, 2023 10:31 AM
To: Yan, Zhiyong <Zhiyong.Yan@windriver.com>
Cc: gdb-patches@sourceware.org; simon.marchi@polymtl.ca
Subject: Re: [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps

CAUTION: This email comes from a non Wind River email account!
Do not click links or open attachments unless you recognize the sender and know the content is safe.

Hi Zhiyong,

Thanks for testing.  In addition to the Co-Authored-By tag, I've added a Tested-By tag for you.

I'll wait a few more days for comments, after which, assuming there are no objections, I'll push it.

Thank you for all of your contributions (bug report, test case, initial patches, and testing) in resolving this problem!

Kevin

On Mon, 7 Aug 2023 10:54:16 +0000
"Yan, Zhiyong" <Zhiyong.Yan@windriver.com> wrote:

> Hi Kevin,
>      Today I backport below patch to gdb 13.0.5 which is used by our customer board.
>      I verify this patch by customer's app where the assert error was firstly reported,  and verify it by test app "osm".
>      This patch can fix the assert error in my tests.
>
> Best Regards.
> Zhiyong
>
>
> -----Original Message-----
> From: Kevin Buettner <kevinb@redhat.com>
> Sent: Thursday, August 3, 2023 8:56 AM
> To: gdb-patches@sourceware.org
> Cc: Yan, Zhiyong <Zhiyong.Yan@windriver.com>; simon.marchi@polymtl.ca; 
> Kevin Buettner <kevinb@redhat.com>
> Subject: [PATCH 1/1] gdbserver: Reinstall software single-step 
> breakpoints in resume_stopped_resumed_lwps
>
> CAUTION: This email comes from a non Wind River email account!
> Do not click links or open attachments unless you recognize the sender and know the content is safe.
>
> At the moment, while performing a software single-step, gdbserver fails to reinsert software single-step breakpoints for a LWP when interrupted by a signal in another thread.  This commit fixes this problem by reinstalling software single-step breakpoints in linux_process_target::resume_stopped_resumed_lwps in gdbserver/linux-low.cc.
>
> This bug was discovered due to a failing assert in maybe_hw_step() in gdbserver/linux-low.cc.  Looking at the backtrace revealed that the caller was linux_process_target::resume_stopped_resumed_lwps.
> I was uncertain whether the assert should still be valid when called from that method, so I tried hoisting the assert from maybe_hw_step to all callers except resume_stopped_resumed_lwps.  But running the new test case, described below, showed that merely eliminating the assert for this case was NOT a good fix - a study of the log file for the test showed that the single-step operation failed to occur.
> Instead GDB (via gdbserver) stopped at the next breakpoint that was hit.
>
> Zhiyong Yan had proposed a fix which resinserted software single-step 
> breakpoints, albeit at a different location in linux-low.cc.  Testing 
> revealed that, while running gdb.threads/pending-fork-event-detach,
> the executable associated with that test would die due to a SIGTRAP after the test program was detached.  Examination of the core file(s) showed that a breakpoint instruction had been left in program memory.
> Test results were otherwise very good, so Zhiyong was definitely on the right track!
>
> This commit causes software single-step breakpoint(s) to be inserted before the call to maybe_hw_step in resume_stopped_resumed_lwps.  This will cause 'has_single_step_breakpoints (thread)' to be true, so that the assert in maybe_hw_step...
>
>       /* GDBserver must insert single-step breakpoint for software
>          single step.  */
>       gdb_assert (has_single_step_breakpoints (thread));
>
> ...will no longer fail.  And better still, the single-step breakpoints are reinstalled, so that stepping will actually work, even when interrupted.
>
> The C code for the test case was loosely adapted from the reproducer provided in Zhiyong's bug report for this problem.  The .exp file was copied from next-fork-other-thread.exp and then tweaked slightly.  As noted in a comment in next-fork-exec-other-thread.exp, I had to remove "on" from the loop for non-stop as it was failing on all architectures (including x86-64) that I tested.  I have a feeling that it ought to work, but this can be investigated separately and (re)enabled once it works.  I also increased the number of iterations for the loop running the "next" commands.  I've had some test runs which don't show the bug until the loop counter exceeded 100 iterations.  The C file for the new test uses shorter delays than next-fork-other-thread.c though, so it doesn't take overly long (IMO) to run this new test.
>
> Running the new test on a Raspberry Pi w/ a 32-bit (Arm) kernel and userland using a gdbserver build without the fix in this commit shows the following results:
>
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=12: 
> next to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> target-non-stop=auto: non-stop=off: displaced-stepping=on: i=9: next 
> to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> target-non-stop=auto: non-stop=off: displaced-stepping=off: i=18: next 
> to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> target-non-stop=off: non-stop=off: displaced-stepping=auto: i=3: next 
> to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> target-non-stop=off: non-stop=off: displaced-stepping=on: i=11: next 
> to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next 
> to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=1: next 
> to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=auto: non-stop=off: displaced-stepping=on: i=3: next 
> to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=auto: non-stop=off: displaced-stepping=off: i=1: next 
> to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=on: non-stop=off: displaced-stepping=auto: i=47: next 
> to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=on: non-stop=off: displaced-stepping=on: i=57: next to 
> other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=off: non-stop=off: displaced-stepping=auto: i=1: next 
> to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=off: non-stop=off: displaced-stepping=on: i=10: next 
> to break here
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next 
> to break here
>
>                 === gdb Summary ===
>
>  # of unexpected core files     12
>  # of expected passes           3011
>  # of unexpected failures       14
>
> Each of the 12 core files were caused by the failed assertion in maybe_hw_step in linux-low.c.  These correspond to 12 of the unexpected failures.
>
> When the tests are run using a gdbserver build which includes the fix in this commit, the results are significantly better, but not perfect:
>
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=on: non-stop=off: displaced-stepping=auto: i=143: next 
> to other line
> FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> target-non-stop=on: non-stop=off: displaced-stepping=on: i=25: next to 
> other line
>
>                 === gdb Summary ===
>
>  # of expected passes           10178
>  # of unexpected failures       2
>
> (I think that the two remaining failures are due to some different 
> problem.  And, FWIW, I don't see these failures on the Pi when using
> --target_board=native-extended-gdbserver.)
>
> Running the new test on x86-64 and aarch64, both native and native-gdbserver shows no failures.
>
> Also, I see no regressions when running the entire test suite for 
> armv7l-unknown-linux-gnueabihf (i.e.  the Raspberry Pi w/ 32-bit
> kernel+userland) with --target_board=native-gdbserver.  Additionally,
> using --target_board=native-gdbserver, I also see no regressions for the entire test suite for x86-64 and aarch64 running Fedora 38.
>
> Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=30387
> Co-Authored-By: Zhiyong Yan <zhiyong.yan@windriver.com>
> ---
>  .../gdb.threads/next-fork-exec-other-thread.c |  82 +++++++++++
>  .../next-fork-exec-other-thread.exp           | 131 ++++++++++++++++++
>  gdbserver/linux-low.cc                        |   7 +-
>  3 files changed, 219 insertions(+), 1 deletion(-)  create mode 100644 
> gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
>  create mode 100644 
> gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
>
> diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c 
> b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
> new file mode 100644
> index 00000000000..884706c6c3c
> --- /dev/null
> +++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
> @@ -0,0 +1,82 @@
> +/* This testcase is part of GDB, the GNU debugger.
> +
> +   Copyright 2023 Free Software Foundation, Inc.
> +
> +   This program is free software; you can redistribute it and/or modify
> +   it under the terms of the GNU General Public License as published by
> +   the Free Software Foundation; either version 3 of the License, or
> +   (at your option) any later version.
> +
> +   This program is distributed in the hope that it will be useful,
> +   but WITHOUT ANY WARRANTY; without even the implied warranty of
> +   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.         See the
> +   GNU General Public License for more details.
> +
> +   You should have received a copy of the GNU General Public License
> +   along with this program.  If not, see 
> + <http://www.gnu.org/licenses/>.  */
> +
> +#include <stdio.h>
> +#include <stdlib.h>
> +#include <string.h>
> +#include <fcntl.h>
> +#include <errno.h>
> +#include <unistd.h>
> +#include <pthread.h>
> +#include <sys/wait.h>
> +
> +#define MAX_LOOP_ITER 10000
> +
> +static char *argv0;
> +
> +static void*
> +worker_a (void *pArg)
> +{
> +  int iter = 0;
> +  char *args[] = {argv0, "self-call", NULL };
> +
> +  while (iter++ < MAX_LOOP_ITER)
> +    {
> +      pid_t pid = FORK_FUNC ();
> +      if (pid == 0)
> +       {
> +         /* child */
> +         if (execvp (args[0], args) == -1)
> +           {
> +             fprintf (stderr, "execvp error: %d\n", errno);
> +             exit (1);
> +           }
> +       }
> +
> +      waitpid (pid, NULL, 0);
> +      usleep (5);
> +    }
> +}
> +
> +static void*
> +worker_b (void *pArg)
> +{
> +  int iter = 0;
> +  while (iter++ < MAX_LOOP_ITER)  /* for loop */
> +    {
> +      usleep (5);  /* break here */
> +      usleep (5);  /* other line */
> +    }
> +}
> +
> +int
> +main (int argc, char **argv)
> +{
> +  pthread_t wa_pid;
> +  pthread_t wb_pid;
> +
> +  argv0 = argv[0];
> +
> +  if (argc > 1 && strcmp (argv[1], "self-call") == 0)
> +    exit (0);
> +
> +  pthread_create (&wa_pid, NULL, worker_a, NULL);  pthread_create 
> + (&wb_pid, NULL, worker_b, NULL);  pthread_join (wa_pid, NULL);
> +
> +  exit (0);
> +}
> diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp 
> b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
> new file mode 100644
> index 00000000000..e4d6ff53ded
> --- /dev/null
> +++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
> @@ -0,0 +1,131 @@
> +# Copyright 2022-2023 Free Software Foundation, Inc.
> +
> +# This program is free software; you can redistribute it and/or 
> +modify # it under the terms of the GNU General Public License as 
> +published by # the Free Software Foundation; either version 3 of the 
> +License, or # (at your option) any later version.
> +#
> +# This program is distributed in the hope that it will be useful, # 
> +but WITHOUT ANY WARRANTY; without even the implied warranty of # 
> +MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the # GNU 
> +General Public License for more details.
> +#
> +# You should have received a copy of the GNU General Public License # 
> +along with this program.  If not, see <http://www.gnu.org/licenses/>.
> +
> +# This test was adapted from next-fork-other-thread.exp. The .c file 
> +# was adapted from the reproducer for this bug:
> +#
> +# https://sourceware.org/bugzilla/show_bug.cgi?id=30387#
> +#
> +# That bug demonstrates a problem with software-singlestep in gdbserver.
> +# Prior to being fixed, this test also demonstrated that bug for a # 
> +32-bit ARM target.  (Use
> +RUNTESTFLAGS="--target_board=native-gdbserver".)
> +# It has been reproduced on a Raspberry Pi running Ubunutu server #
> +20.04.5 LTS with 32-bit kernel + 32-bit userland.  It was NOT 
> +reproducible # using a circa 2023 Raspberry Pi OS w/ 64-bit kernel and 32-bit userland.
> +
> +standard_testfile
> +
> +# Line where to stop the main thread.
> +set break_here_line [gdb_get_line_number "break here"]
> +
> +# Build executables, one for each fork flavor.
> +foreach_with_prefix fork_func {fork vfork} {
> +    set opts [list debug pthreads additional_flags=-DFORK_FUNC=${fork_func}]
> +    if { [build_executable "failed to prepare" \
> +           ${testfile}-${fork_func} ${srcfile} $opts] } {
> +       return
> +    }
> +}
> +
> +# If testing against GDBserver, consume all it its output.
> +
> +proc drain_gdbserver_output { } {
> +    if { [info exists ::server_spawn_id] } {
> +       gdb_test_multiple "" "" {
> +           -i "$::server_spawn_id"
> +           -timeout 0
> +           -re ".+" {
> +             exp_continue
> +           }
> +       }
> +    }
> +}
> +
> +# Run the test with the given parameters:
> +#
> +#   - FORK_FUNC: fork flavor, "fork" or "vfork".
> +#   - TARGET-NON-STOP: "maintenance set target-non-stop" value, "auto", "on" or
> +#     "off".
> +#   - NON-STOP: "set non-stop" value, "on" or "off".
> +#   - DISPLACED-STEPPING: "set displaced-stepping" value, "auto", "on" or "off".
> +
> +proc do_test { fork_func target-non-stop non-stop displaced-stepping } {
> +    save_vars { ::GDBFLAGS } {
> +       append ::GDBFLAGS " -ex \"maintenance set target-non-stop ${target-non-stop}\""
> +       append ::GDBFLAGS " -ex \"set non-stop ${non-stop}\""
> +       clean_restart ${::binfile}-${fork_func}
> +    }
> +
> +    gdb_test_no_output "set displaced-stepping ${displaced-stepping}"
> +
> +    if { ![runto_main] } {
> +       return
> +    }
> +
> +    # The "Detached after (v)fork" messages get in the way in non-stop, disable
> +    # them.
> +    gdb_test_no_output "set print inferior-events off"
> +
> +    # Advance the next-ing thread to the point where we'll execute the nexts.
> +    # Leave the breakpoint in: it will force GDB to step over it while next-ing,
> +    # which exercises some additional code paths.
> +    gdb_test "break $::break_here_line" "Breakpoint $::decimal at $::hex.*"
> +    gdb_test "continue" "hit Breakpoint $::decimal, worker_b.*"
> +
> +    # Next an arbitrary number of times over the lines of the loop.
> +    for { set i 0 } { $i < 200 } { incr i } {
> +       # If testing against GDBserver, the forking threads cause a lot of
> +       # "Detaching from process XYZ" messages to appear.  If we don't consume
> +       # that output, GDBserver eventually blocks on a full stderr.  Drain it
> +       # once every loop.  It may not be needed for 20 iterations, but it's
> +       # needed if you increase to 200 iterations.
> +       drain_gdbserver_output
> +
> +       with_test_prefix "i=$i" {
> +           if { [gdb_test "next" "other line.*" "next to other line"] != 0 } {
> +               return
> +           }
> +
> +           if { [gdb_test "next" "for loop.*" "next to for loop"] != 0 } {
> +               return
> +           }
> +
> +           if { [gdb_test "next" "break here.*" "next to break here"] != 0} {
> +               return
> +           }
> +       }
> +    }
> +}
> +
> +foreach_with_prefix fork_func {fork vfork} {
> +    foreach_with_prefix target-non-stop {auto on off} {
> +       # This file was copied from next-fork-other-thread.exp and
> +       # then adapted for the a case which also involves an exec in
> +       # addition to the fork.  Ideally, we should test non-stop "on"
> +       # in addition to "off", but, for this test, that results in a
> +       # number of failures occur preceded by the message:
> +       #
> +       # Cannot execute this command while the selected thread is running.
> +       #
> +       # That seems like correct behavior to me, but perhaps the
> +       # non-stop case can be made to work; if so, simply add "on"
> +       # after "off" on the line below...
> +       foreach_with_prefix non-stop {off} {
> +           foreach_with_prefix displaced-stepping {auto on off} {
> +               do_test ${fork_func} ${target-non-stop} ${non-stop} ${displaced-stepping}
> +           }
> +       }
> +    }
> +}
> diff --git a/gdbserver/linux-low.cc b/gdbserver/linux-low.cc index 
> 651f219b738..e1806ade82f 100644
> --- a/gdbserver/linux-low.cc
> +++ b/gdbserver/linux-low.cc
> @@ -2463,7 +2463,12 @@ linux_process_target::resume_stopped_resumed_lwps (thread_info *thread)
>        int step = 0;
>
>        if (thread->last_resume_kind == resume_step)
> -       step = maybe_hw_step (thread);
> +       {
> +         if (supports_software_single_step ())
> +           install_software_single_step_breakpoints (lp);
> +
> +         step = maybe_hw_step (thread);
> +       }
>
>        threads_debug_printf ("resuming stopped-resumed LWP %s at %s: step=%d",
>                             target_pid_to_str (ptid_of (thread)).c_str 
> (),
> --
> 2.41.0
>


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps
  2023-08-08  2:39       ` Yan, Zhiyong
@ 2023-08-08  2:52         ` Kevin Buettner
  0 siblings, 0 replies; 7+ messages in thread
From: Kevin Buettner @ 2023-08-08  2:52 UTC (permalink / raw)
  To: Yan, Zhiyong; +Cc: gdb-patches, simon.marchi

Hi Zhiyong,

Assuming the commit gets pushed in a timely fashion, this fix should
be in GDB 14.1.

See this post from Joel:

https://sourceware.org/pipermail/gdb-patches/2023-July/201226.html

Joel has a link for the GDB PRs marked for GDB 14.1 - bug 30387
(gdbserver assert error on arm platform) is one of them.

Kevin

On Tue, 8 Aug 2023 02:39:06 +0000
"Yan, Zhiyong" <Zhiyong.Yan@windriver.com> wrote:

> Hi Kevin,
>      It is pleasure for me to cooperate with you.
>      When the release version for this issue is determined, please let me know. I will forward this information to my customer.
>      Thanks!
> 
> Best Regards.
> Zhiyong
> 
> 
> -----Original Message-----
> From: Kevin Buettner <kevinb@redhat.com> 
> Sent: Tuesday, August 8, 2023 10:31 AM
> To: Yan, Zhiyong <Zhiyong.Yan@windriver.com>
> Cc: gdb-patches@sourceware.org; simon.marchi@polymtl.ca
> Subject: Re: [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps
> 
> CAUTION: This email comes from a non Wind River email account!
> Do not click links or open attachments unless you recognize the sender and know the content is safe.
> 
> Hi Zhiyong,
> 
> Thanks for testing.  In addition to the Co-Authored-By tag, I've added a Tested-By tag for you.
> 
> I'll wait a few more days for comments, after which, assuming there are no objections, I'll push it.
> 
> Thank you for all of your contributions (bug report, test case, initial patches, and testing) in resolving this problem!
> 
> Kevin
> 
> On Mon, 7 Aug 2023 10:54:16 +0000
> "Yan, Zhiyong" <Zhiyong.Yan@windriver.com> wrote:
> 
> > Hi Kevin,
> >      Today I backport below patch to gdb 13.0.5 which is used by our customer board.
> >      I verify this patch by customer's app where the assert error was firstly reported,  and verify it by test app "osm".
> >      This patch can fix the assert error in my tests.
> >
> > Best Regards.
> > Zhiyong
> >
> >
> > -----Original Message-----
> > From: Kevin Buettner <kevinb@redhat.com>
> > Sent: Thursday, August 3, 2023 8:56 AM
> > To: gdb-patches@sourceware.org
> > Cc: Yan, Zhiyong <Zhiyong.Yan@windriver.com>; simon.marchi@polymtl.ca; 
> > Kevin Buettner <kevinb@redhat.com>
> > Subject: [PATCH 1/1] gdbserver: Reinstall software single-step 
> > breakpoints in resume_stopped_resumed_lwps
> >
> > CAUTION: This email comes from a non Wind River email account!
> > Do not click links or open attachments unless you recognize the sender and know the content is safe.
> >
> > At the moment, while performing a software single-step, gdbserver fails to reinsert software single-step breakpoints for a LWP when interrupted by a signal in another thread.  This commit fixes this problem by reinstalling software single-step breakpoints in linux_process_target::resume_stopped_resumed_lwps in gdbserver/linux-low.cc.
> >
> > This bug was discovered due to a failing assert in maybe_hw_step() in gdbserver/linux-low.cc.  Looking at the backtrace revealed that the caller was linux_process_target::resume_stopped_resumed_lwps.
> > I was uncertain whether the assert should still be valid when called from that method, so I tried hoisting the assert from maybe_hw_step to all callers except resume_stopped_resumed_lwps.  But running the new test case, described below, showed that merely eliminating the assert for this case was NOT a good fix - a study of the log file for the test showed that the single-step operation failed to occur.
> > Instead GDB (via gdbserver) stopped at the next breakpoint that was hit.
> >
> > Zhiyong Yan had proposed a fix which resinserted software single-step 
> > breakpoints, albeit at a different location in linux-low.cc.  Testing 
> > revealed that, while running gdb.threads/pending-fork-event-detach,
> > the executable associated with that test would die due to a SIGTRAP after the test program was detached.  Examination of the core file(s) showed that a breakpoint instruction had been left in program memory.
> > Test results were otherwise very good, so Zhiyong was definitely on the right track!
> >
> > This commit causes software single-step breakpoint(s) to be inserted before the call to maybe_hw_step in resume_stopped_resumed_lwps.  This will cause 'has_single_step_breakpoints (thread)' to be true, so that the assert in maybe_hw_step...
> >
> >       /* GDBserver must insert single-step breakpoint for software
> >          single step.  */
> >       gdb_assert (has_single_step_breakpoints (thread));
> >
> > ...will no longer fail.  And better still, the single-step breakpoints are reinstalled, so that stepping will actually work, even when interrupted.
> >
> > The C code for the test case was loosely adapted from the reproducer provided in Zhiyong's bug report for this problem.  The .exp file was copied from next-fork-other-thread.exp and then tweaked slightly.  As noted in a comment in next-fork-exec-other-thread.exp, I had to remove "on" from the loop for non-stop as it was failing on all architectures (including x86-64) that I tested.  I have a feeling that it ought to work, but this can be investigated separately and (re)enabled once it works.  I also increased the number of iterations for the loop running the "next" commands.  I've had some test runs which don't show the bug until the loop counter exceeded 100 iterations.  The C file for the new test uses shorter delays than next-fork-other-thread.c though, so it doesn't take overly long (IMO) to run this new test.
> >
> > Running the new test on a Raspberry Pi w/ a 32-bit (Arm) kernel and userland using a gdbserver build without the fix in this commit shows the following results:
> >
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> > target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=12: 
> > next to other line
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> > target-non-stop=auto: non-stop=off: displaced-stepping=on: i=9: next 
> > to other line
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> > target-non-stop=auto: non-stop=off: displaced-stepping=off: i=18: next 
> > to other line
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> > target-non-stop=off: non-stop=off: displaced-stepping=auto: i=3: next 
> > to other line
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> > target-non-stop=off: non-stop=off: displaced-stepping=on: i=11: next 
> > to other line
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=fork: 
> > target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next 
> > to other line
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=auto: non-stop=off: displaced-stepping=auto: i=1: next 
> > to break here
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=auto: non-stop=off: displaced-stepping=on: i=3: next 
> > to break here
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=auto: non-stop=off: displaced-stepping=off: i=1: next 
> > to break here
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=on: non-stop=off: displaced-stepping=auto: i=47: next 
> > to other line
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=on: non-stop=off: displaced-stepping=on: i=57: next to 
> > other line
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=off: non-stop=off: displaced-stepping=auto: i=1: next 
> > to break here
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=off: non-stop=off: displaced-stepping=on: i=10: next 
> > to break here
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=off: non-stop=off: displaced-stepping=off: i=1: next 
> > to break here
> >
> >                 === gdb Summary ===
> >
> >  # of unexpected core files     12
> >  # of expected passes           3011
> >  # of unexpected failures       14
> >
> > Each of the 12 core files were caused by the failed assertion in maybe_hw_step in linux-low.c.  These correspond to 12 of the unexpected failures.
> >
> > When the tests are run using a gdbserver build which includes the fix in this commit, the results are significantly better, but not perfect:
> >
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=on: non-stop=off: displaced-stepping=auto: i=143: next 
> > to other line
> > FAIL: gdb.threads/next-fork-exec-other-thread.exp: fork_func=vfork: 
> > target-non-stop=on: non-stop=off: displaced-stepping=on: i=25: next to 
> > other line
> >
> >                 === gdb Summary ===
> >
> >  # of expected passes           10178
> >  # of unexpected failures       2
> >
> > (I think that the two remaining failures are due to some different 
> > problem.  And, FWIW, I don't see these failures on the Pi when using
> > --target_board=native-extended-gdbserver.)
> >
> > Running the new test on x86-64 and aarch64, both native and native-gdbserver shows no failures.
> >
> > Also, I see no regressions when running the entire test suite for 
> > armv7l-unknown-linux-gnueabihf (i.e.  the Raspberry Pi w/ 32-bit
> > kernel+userland) with --target_board=native-gdbserver.  Additionally,
> > using --target_board=native-gdbserver, I also see no regressions for the entire test suite for x86-64 and aarch64 running Fedora 38.
> >
> > Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=30387
> > Co-Authored-By: Zhiyong Yan <zhiyong.yan@windriver.com>
> > ---
> >  .../gdb.threads/next-fork-exec-other-thread.c |  82 +++++++++++
> >  .../next-fork-exec-other-thread.exp           | 131 ++++++++++++++++++
> >  gdbserver/linux-low.cc                        |   7 +-
> >  3 files changed, 219 insertions(+), 1 deletion(-)  create mode 100644 
> > gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
> >  create mode 100644 
> > gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
> >
> > diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c 
> > b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
> > new file mode 100644
> > index 00000000000..884706c6c3c
> > --- /dev/null
> > +++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.c
> > @@ -0,0 +1,82 @@
> > +/* This testcase is part of GDB, the GNU debugger.
> > +
> > +   Copyright 2023 Free Software Foundation, Inc.
> > +
> > +   This program is free software; you can redistribute it and/or modify
> > +   it under the terms of the GNU General Public License as published by
> > +   the Free Software Foundation; either version 3 of the License, or
> > +   (at your option) any later version.
> > +
> > +   This program is distributed in the hope that it will be useful,
> > +   but WITHOUT ANY WARRANTY; without even the implied warranty of
> > +   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.         See the
> > +   GNU General Public License for more details.
> > +
> > +   You should have received a copy of the GNU General Public License
> > +   along with this program.  If not, see 
> > + <http://www.gnu.org/licenses/>.  */
> > +
> > +#include <stdio.h>
> > +#include <stdlib.h>
> > +#include <string.h>
> > +#include <fcntl.h>
> > +#include <errno.h>
> > +#include <unistd.h>
> > +#include <pthread.h>
> > +#include <sys/wait.h>
> > +
> > +#define MAX_LOOP_ITER 10000
> > +
> > +static char *argv0;
> > +
> > +static void*
> > +worker_a (void *pArg)
> > +{
> > +  int iter = 0;
> > +  char *args[] = {argv0, "self-call", NULL };
> > +
> > +  while (iter++ < MAX_LOOP_ITER)
> > +    {
> > +      pid_t pid = FORK_FUNC ();
> > +      if (pid == 0)
> > +       {
> > +         /* child */
> > +         if (execvp (args[0], args) == -1)
> > +           {
> > +             fprintf (stderr, "execvp error: %d\n", errno);
> > +             exit (1);
> > +           }
> > +       }
> > +
> > +      waitpid (pid, NULL, 0);
> > +      usleep (5);
> > +    }
> > +}
> > +
> > +static void*
> > +worker_b (void *pArg)
> > +{
> > +  int iter = 0;
> > +  while (iter++ < MAX_LOOP_ITER)  /* for loop */
> > +    {
> > +      usleep (5);  /* break here */
> > +      usleep (5);  /* other line */
> > +    }
> > +}
> > +
> > +int
> > +main (int argc, char **argv)
> > +{
> > +  pthread_t wa_pid;
> > +  pthread_t wb_pid;
> > +
> > +  argv0 = argv[0];
> > +
> > +  if (argc > 1 && strcmp (argv[1], "self-call") == 0)
> > +    exit (0);
> > +
> > +  pthread_create (&wa_pid, NULL, worker_a, NULL);  pthread_create 
> > + (&wb_pid, NULL, worker_b, NULL);  pthread_join (wa_pid, NULL);
> > +
> > +  exit (0);
> > +}
> > diff --git a/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp 
> > b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
> > new file mode 100644
> > index 00000000000..e4d6ff53ded
> > --- /dev/null
> > +++ b/gdb/testsuite/gdb.threads/next-fork-exec-other-thread.exp
> > @@ -0,0 +1,131 @@
> > +# Copyright 2022-2023 Free Software Foundation, Inc.
> > +
> > +# This program is free software; you can redistribute it and/or 
> > +modify # it under the terms of the GNU General Public License as 
> > +published by # the Free Software Foundation; either version 3 of the 
> > +License, or # (at your option) any later version.
> > +#
> > +# This program is distributed in the hope that it will be useful, # 
> > +but WITHOUT ANY WARRANTY; without even the implied warranty of # 
> > +MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the # GNU 
> > +General Public License for more details.
> > +#
> > +# You should have received a copy of the GNU General Public License # 
> > +along with this program.  If not, see <http://www.gnu.org/licenses/>.
> > +
> > +# This test was adapted from next-fork-other-thread.exp. The .c file 
> > +# was adapted from the reproducer for this bug:
> > +#
> > +# https://sourceware.org/bugzilla/show_bug.cgi?id=30387#
> > +#
> > +# That bug demonstrates a problem with software-singlestep in gdbserver.
> > +# Prior to being fixed, this test also demonstrated that bug for a # 
> > +32-bit ARM target.  (Use
> > +RUNTESTFLAGS="--target_board=native-gdbserver".)
> > +# It has been reproduced on a Raspberry Pi running Ubunutu server #
> > +20.04.5 LTS with 32-bit kernel + 32-bit userland.  It was NOT 
> > +reproducible # using a circa 2023 Raspberry Pi OS w/ 64-bit kernel and 32-bit userland.
> > +
> > +standard_testfile
> > +
> > +# Line where to stop the main thread.
> > +set break_here_line [gdb_get_line_number "break here"]
> > +
> > +# Build executables, one for each fork flavor.
> > +foreach_with_prefix fork_func {fork vfork} {
> > +    set opts [list debug pthreads additional_flags=-DFORK_FUNC=${fork_func}]
> > +    if { [build_executable "failed to prepare" \
> > +           ${testfile}-${fork_func} ${srcfile} $opts] } {
> > +       return
> > +    }
> > +}
> > +
> > +# If testing against GDBserver, consume all it its output.
> > +
> > +proc drain_gdbserver_output { } {
> > +    if { [info exists ::server_spawn_id] } {
> > +       gdb_test_multiple "" "" {
> > +           -i "$::server_spawn_id"
> > +           -timeout 0
> > +           -re ".+" {
> > +             exp_continue
> > +           }
> > +       }
> > +    }
> > +}
> > +
> > +# Run the test with the given parameters:
> > +#
> > +#   - FORK_FUNC: fork flavor, "fork" or "vfork".
> > +#   - TARGET-NON-STOP: "maintenance set target-non-stop" value, "auto", "on" or
> > +#     "off".
> > +#   - NON-STOP: "set non-stop" value, "on" or "off".
> > +#   - DISPLACED-STEPPING: "set displaced-stepping" value, "auto", "on" or "off".
> > +
> > +proc do_test { fork_func target-non-stop non-stop displaced-stepping } {
> > +    save_vars { ::GDBFLAGS } {
> > +       append ::GDBFLAGS " -ex \"maintenance set target-non-stop ${target-non-stop}\""
> > +       append ::GDBFLAGS " -ex \"set non-stop ${non-stop}\""
> > +       clean_restart ${::binfile}-${fork_func}
> > +    }
> > +
> > +    gdb_test_no_output "set displaced-stepping ${displaced-stepping}"
> > +
> > +    if { ![runto_main] } {
> > +       return
> > +    }
> > +
> > +    # The "Detached after (v)fork" messages get in the way in non-stop, disable
> > +    # them.
> > +    gdb_test_no_output "set print inferior-events off"
> > +
> > +    # Advance the next-ing thread to the point where we'll execute the nexts.
> > +    # Leave the breakpoint in: it will force GDB to step over it while next-ing,
> > +    # which exercises some additional code paths.
> > +    gdb_test "break $::break_here_line" "Breakpoint $::decimal at $::hex.*"
> > +    gdb_test "continue" "hit Breakpoint $::decimal, worker_b.*"
> > +
> > +    # Next an arbitrary number of times over the lines of the loop.
> > +    for { set i 0 } { $i < 200 } { incr i } {
> > +       # If testing against GDBserver, the forking threads cause a lot of
> > +       # "Detaching from process XYZ" messages to appear.  If we don't consume
> > +       # that output, GDBserver eventually blocks on a full stderr.  Drain it
> > +       # once every loop.  It may not be needed for 20 iterations, but it's
> > +       # needed if you increase to 200 iterations.
> > +       drain_gdbserver_output
> > +
> > +       with_test_prefix "i=$i" {
> > +           if { [gdb_test "next" "other line.*" "next to other line"] != 0 } {
> > +               return
> > +           }
> > +
> > +           if { [gdb_test "next" "for loop.*" "next to for loop"] != 0 } {
> > +               return
> > +           }
> > +
> > +           if { [gdb_test "next" "break here.*" "next to break here"] != 0} {
> > +               return
> > +           }
> > +       }
> > +    }
> > +}
> > +
> > +foreach_with_prefix fork_func {fork vfork} {
> > +    foreach_with_prefix target-non-stop {auto on off} {
> > +       # This file was copied from next-fork-other-thread.exp and
> > +       # then adapted for the a case which also involves an exec in
> > +       # addition to the fork.  Ideally, we should test non-stop "on"
> > +       # in addition to "off", but, for this test, that results in a
> > +       # number of failures occur preceded by the message:
> > +       #
> > +       # Cannot execute this command while the selected thread is running.
> > +       #
> > +       # That seems like correct behavior to me, but perhaps the
> > +       # non-stop case can be made to work; if so, simply add "on"
> > +       # after "off" on the line below...
> > +       foreach_with_prefix non-stop {off} {
> > +           foreach_with_prefix displaced-stepping {auto on off} {
> > +               do_test ${fork_func} ${target-non-stop} ${non-stop} ${displaced-stepping}
> > +           }
> > +       }
> > +    }
> > +}
> > diff --git a/gdbserver/linux-low.cc b/gdbserver/linux-low.cc index 
> > 651f219b738..e1806ade82f 100644
> > --- a/gdbserver/linux-low.cc
> > +++ b/gdbserver/linux-low.cc
> > @@ -2463,7 +2463,12 @@ linux_process_target::resume_stopped_resumed_lwps (thread_info *thread)
> >        int step = 0;
> >
> >        if (thread->last_resume_kind == resume_step)
> > -       step = maybe_hw_step (thread);
> > +       {
> > +         if (supports_software_single_step ())
> > +           install_software_single_step_breakpoints (lp);
> > +
> > +         step = maybe_hw_step (thread);
> > +       }
> >
> >        threads_debug_printf ("resuming stopped-resumed LWP %s at %s: step=%d",
> >                             target_pid_to_str (ptid_of (thread)).c_str 
> > (),
> > --
> > 2.41.0
> >  
> 


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH 1/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps
  2023-08-03  0:56 ` [PATCH 1/1] " Kevin Buettner
  2023-08-07 10:54   ` Yan, Zhiyong
@ 2023-08-12  4:00   ` Kevin Buettner
  1 sibling, 0 replies; 7+ messages in thread
From: Kevin Buettner @ 2023-08-12  4:00 UTC (permalink / raw)
  To: gdb-patches; +Cc: zhiyong.yan, Luis Machado

I've pushed this commit, after having made a slight change to the new
test case.  Luis found that it ran slowly on his hardware, so, after
some testing, I made some adjustments so that it'll run faster.

I also added a Tested-By tag for Luis.

Kevin


^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2023-08-12  4:00 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-08-03  0:56 [PATCH 0/1] gdbserver: Reinstall software single-step breakpoints in resume_stopped_resumed_lwps Kevin Buettner
2023-08-03  0:56 ` [PATCH 1/1] " Kevin Buettner
2023-08-07 10:54   ` Yan, Zhiyong
2023-08-08  2:31     ` Kevin Buettner
2023-08-08  2:39       ` Yan, Zhiyong
2023-08-08  2:52         ` Kevin Buettner
2023-08-12  4:00   ` Kevin Buettner

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).