From: "Aktemur, Tankut Baris" <tankut.baris.aktemur@intel.com>
To: Simon Marchi <simark@simark.ca>,
"gdb-patches@sourceware.org" <gdb-patches@sourceware.org>
Subject: RE: [PATCH] gdb/infrun: reset thread control's step info in end_stepping_range
Date: Tue, 16 May 2023 12:33:31 +0000 [thread overview]
Message-ID: <DM4PR11MB7303067B5714CC821C1996EAC4799@DM4PR11MB7303.namprd11.prod.outlook.com> (raw)
In-Reply-To: <9747469a-677c-3e31-7cd4-9e9ecc7b0680@simark.ca>
On Thursday, April 27, 2023 7:05 PM, Simon Marchi wrote:
> On 11/16/22 13:56, Tankut Baris Aktemur via Gdb-patches wrote:
> > Suppose we have two inferiors on an all-stop target with schedule-multi
> > set on:
> >
> > $ gdb -q
> > (gdb) target extended-remote | gdbserver --multi -
> > Remote debugging using | gdbserver --multi -
> > Remote debugging using stdio
> > (gdb) file /temp/test
> > Reading symbols from /temp/test...
> > (gdb) set remote exec-file /temp/test
> > (gdb) start
> > Temporary breakpoint 1 at 0x115c: file test.c, line 8.
> > Starting program: /temp/test
> > stdin/stdout redirected
> > Process /temp/test created; pid = 864027
> > ...
> >
> > Temporary breakpoint 1, main (argc=1, argv=0x7fffffffd218) at test.c:8
> > 8 foo();
> > (gdb) add-inferior
> > [New inferior 2]
> > Added inferior 2 on connection 1 (extended-remote | gdbserver --multi -)
> > (gdb) inferior 2
> > [Switching to inferior 2 [<null>] (<noexec>)]
> > (gdb) file /temp/test
> > Reading symbols from /temp/test...
> > (gdb) set remote exec-file /temp/test
> > (gdb) tbreak 2
> > Temporary breakpoint 2 at 0x555555555131: /temp/test.c:2. (2 locations)
> > (gdb) run
> > Starting program: /temp/test
> > stdin/stdout redirected
> > Process /temp/test created; pid = 864430
> > ...
> >
> > Thread 2.1 "test" hit Temporary breakpoint 2, foo () at test.c:2
> > 2 int a = 42;
> > (gdb) set schedule-multi on
> > (gdb)
> >
> > At this point, detaching the first inferior works fine:
> >
> > (gdb) detach inferiors 1
> > Detaching from program: /temp/test, process 858904
> > Detaching from process 858904
> > [Inferior 1 (process 858904) detached]
> > (gdb) info inferiors
> > Num Description Connection Executable
> > 1 <null> 1 (extended-remote | gdbserver --multi -) /temp/test
> > * 2 process 858925 1 (extended-remote | gdbserver --multi -) /temp/test
> > (gdb)
> >
> > Let us now repeat exactly the same scenario, but before detaching, we
> > make the current thread single-step an instruction:
> >
> > ...
> > Thread 2.1 "test" hit Temporary breakpoint 2, foo () at test.c:2
> > 2 int a = 42;
> > (gdb) stepi
> > 3 int b = 43;
> > (gdb) detach inferiors 1
> > Detaching from program: /temp/test, process 876580
> > Detaching from process 876580
> > gdbserver: Couldn't reap LWP 876580 while detaching: No child processes
> > [Inferior 1 (process 876580) detached]
> > (gdb) 3 int b = 43;
> >
> > There is a mysterious line info output. Running the scenario with
> > infrun debug logs reveals more information.
> >
> > ...
> > Thread 2.1 "test" hit Temporary breakpoint 2, foo () at test.c:2
> > 2 int a = 42;
> > (gdb) stepi
> > 3 int b = 43;
> > (gdb) set debug infrun on
> > (gdb) detach inferiors 1
> > [infrun] scoped_disable_commit_resumed: reason=detaching
> > Detaching from program: /temp/test, process 872445
> > Detaching from process 872445
> > gdbserver: Couldn't reap LWP 872445 while detaching: No child processes
> > [Inferior 1 (process 872445) detached]
> > [infrun] start_step_over: enter
> > [infrun] start_step_over: stealing global queue of threads to step, length = 0
> > [infrun] operator(): step-over queue now empty
> > [infrun] start_step_over: exit
> > [infrun] restart_stepped_thread: switching back to stepped thread (stepping)
> > [infrun] keep_going_stepped_thread: resuming previously stepped thread
> > [infrun] keep_going_stepped_thread: expected thread advanced also (0x555555555131 ->
> 0x555555555138)
> > [infrun] clear_step_over_info: clearing step over info
> > [infrun] do_target_resume: resume_ptid=-1.0.0, step=0, sig=GDB_SIGNAL_0
> > [infrun] infrun_async: enable=1
> > [infrun] reset: reason=detaching
> > [infrun] maybe_set_commit_resumed_all_targets: enabling commit-resumed for target
> extended-remote
> > [infrun] maybe_call_commit_resumed_all_targets: calling commit_resumed for target
> extended-remote
> > (gdb) [infrun] fetch_inferior_event: enter
> > [infrun] scoped_disable_commit_resumed: reason=handling event
> > [infrun] do_target_wait: Found 2 inferiors, starting at #0
> > [infrun] random_pending_event_thread: None found.
> > [infrun] print_target_wait_results: target_wait (-1.0.0 [process -1], status) =
> > [infrun] print_target_wait_results: 872464.872464.0 [Thread 872464.872464],
> > [infrun] print_target_wait_results: status->kind = STOPPED, sig = GDB_SIGNAL_TRAP
> > [infrun] handle_inferior_event: status->kind = STOPPED, sig = GDB_SIGNAL_TRAP
> > [infrun] context_switch: Switching context from 0.0.0 to 872464.872464.0
> > [infrun] handle_signal_stop: stop_pc=0x555555555138
> > [infrun] handle_signal_stop: [872464.872464.0] hit its single-step breakpoint
> > [infrun] handle_signal_stop: delayed software breakpoint trap, ignoring
> > [infrun] process_event_stop_test: stepi/nexti
> > [infrun] stop_waiting: stop_waiting
> > 3 int b = 43;
> > [infrun] infrun_async: enable=0
> > [infrun] reset: reason=handling event
> > [infrun] maybe_set_commit_resumed_all_targets: not requesting commit-resumed for
> target extended-remote, no resumed threads
> > [infrun] fetch_inferior_event: exit
> >
> > GDB attempted to do a step-over for the current thread. This takes us
> > to the commit that introduced restarting step-overs:
> >
> > commit 408f66864a1a823591b26420410c982174c239a2
> > Author: Pedro Alves <pedro@palves.net>
> > Date: Mon Jan 11 20:01:58 2021 +0000
> >
> > detach in all-stop with threads running
> >
> > A following patch will add a testcase that has a number of threads
> > constantly stepping over a breakpoint, and then has GDB detach the
> > process, while threads are running. If we have more than one inferior
> > running, and we detach from just one of the inferiors, we expect that
> > the remaining inferior continues running. However, in all-stop, if
> > GDB needs to pause the target for the detach, nothing is re-resuming
> > the other inferiors after the detach. "info threads" shows the
> > threads as running, but they really aren't. This fixes it.
> >
> > However, the thread that was resumed for step-over in our scenario did
> > not have an interrupted step-over; it had completed its stepi already.
> > More debugging reveals that the thread is resumed because of the
> > following two conditions in `restart_stepped_thread`:
> >
> > if (tp->control.trap_expected)
> > {
> > infrun_debug_printf ("switching back to stepped thread (step-over)");
> >
> > if (keep_going_stepped_thread (tp))
> > return true;
> > }
> >
> > and
> >
> > if (tp->control.step_range_end)
> > {
> > infrun_debug_printf ("switching back to stepped thread (stepping)");
> >
> > if (keep_going_stepped_thread (tp))
> > return true;
> > }
> >
> > The root cause of the problem is, the 'trap_expected' and the
> > 'step_range_end' fields of the thread's control remain set even after
> > the "stepi" command completes. We fix the problem by clearing the
> > control fields when stepping completes. We also add a regression test.
>
> Isn't the bug mainly that restart_stepped_thread tries to resume a
> thread that isn't meant to be resumed? It checks for:
>
> if (tp->state == THREAD_EXITED)
> continue;
>
> but the thread's state is THREAD_STOPPED. It seems to me like this
> condition should be changed to:
>
> if (tp->state != THREAD_RUNNING)
> continue;
>
> such that restart_stepped_thread only considers threads that are meant
> to be resumed.
This makes sense and would fix the bug. The code would also be aligned
with the condition in `restart_after_all_stop_detach`. I add this change
to v2.
> But I think I am missing some information. Let's say you have two
> inferiors running on an all-stop target, and you detach one of them (as
> what is described by Pedro's commit message, if I understand correctly)
> I'm not sure what stops the threads of the non-detached inferior. I
> don't see a call to target_stop or similar on the code path. Is it the
> detach target call that implicitly leaves them stopped? If so, it
> probably means that the m_resumed and m_executing become stale. Perhaps
> not relevant for this problem, but just something that crossed my mind.
The all-stop target (e.g. a gdbserver in default case) does not respond
to a detach command while something is running.
(gdb) continue &
Continuing.
(gdb) Reading /lib/x86_64-linux-gnu/libstdc++.so.6 from remote target...
...
detach
Cannot execute this command while the target is running.
Use the "interrupt" command to stop the target
and then try again.
(gdb)
So, the target must be explicitly stopped by the user. Pedro also did
so in the gdb.threads/detach-step-over.exp test:
-re "Cannot execute this command while the target is running.*$::gdb_prompt $" {
# Testing against a remote server that doesn't do
# non-stop mode. Explicitly interrupt. This
# doesn't test the same code paths in GDB, but
# it's still something.
I'm not sure, though, if this sufficiently answers your question.
> With that said, I'm not against the principle of your patch. There is
> some information in the thread_control_state structure that is valid
> from the point a step is started, until the thread stops, and becomes
> meaningless after that. By clearing this information as soon as it
> becomes irrelevant, it reduces the chances of something using it by
> mistake later.
Ack. I agree and will keep those changes in v2.
> While playing with GDB, trying to reproduce your bug, I think I found
> two more:
Thinking about this from a wider angle, does it make sense that commands
like 'start', 'run', and 'detach' are impacted by schedule-multi on?
Would it not make more sense that those commands apply to a single inferior
only, e.g. internally setting schedule-multi to off?
Thanks
-Baris
Intel Deutschland GmbH
Registered Address: Am Campeon 10, 85579 Neubiberg, Germany
Tel: +49 89 99 8853-0, www.intel.de <http://www.intel.de>
Managing Directors: Christin Eisenschmid, Sharon Heck, Tiffany Doon Silva
Chairperson of the Supervisory Board: Nicole Lau
Registered Office: Munich
Commercial Register: Amtsgericht Muenchen HRB 186928
prev parent reply other threads:[~2023-05-16 12:33 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-11-16 18:56 Tankut Baris Aktemur
2022-12-22 20:30 ` Aktemur, Tankut Baris
2023-01-10 21:00 ` Aktemur, Tankut Baris
2023-01-17 20:38 ` Aktemur, Tankut Baris
2023-01-24 10:36 ` Aktemur, Tankut Baris
2023-01-31 20:14 ` Aktemur, Tankut Baris
2023-02-20 13:08 ` Aktemur, Tankut Baris
2023-03-03 7:47 ` Aktemur, Tankut Baris
2023-03-28 13:39 ` Aktemur, Tankut Baris
2023-04-27 17:04 ` Simon Marchi
2023-05-16 12:33 ` Aktemur, Tankut Baris [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=DM4PR11MB7303067B5714CC821C1996EAC4799@DM4PR11MB7303.namprd11.prod.outlook.com \
--to=tankut.baris.aktemur@intel.com \
--cc=gdb-patches@sourceware.org \
--cc=simark@simark.ca \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).