From: Johnson Sun <j3.soon777@gmail.com>
To: SimonMarchi <simark@simark.ca>,
LancelotSIX <lsix@lancelotsix.com>,
gdb-patches@sourceware.org
Subject: [RFC] [PATCH v3] [PR python/29603] Disable out-of-scope watchpoints
Date: Sun, 26 Feb 2023 14:16:37 +0800 [thread overview]
Message-ID: <6f2d7693-7f29-c125-1b7a-44a8a4fbd76e@gmail.com> (raw)
In-Reply-To: <915dd1be-feb5-967d-2d32-af61c3fee9a7@gmail.com>
Request for comments:
<https://sourceware.org/pipermail/gdb-patches/2023-January/196068.html>.
Johnson
On 2/19/2023 12:26 AM, JohnsonSun wrote:
> Ping for:
> <https://sourceware.org/pipermail/gdb-patches/2023-January/196068.html>.
>
> I would like to ask for some feedback before submitting the v4 patch.
>
> Johnson
>
> On 1/23/2023 6:15 PM, Johnson Sun wrote:
>> Hi,
>>
>> Thanks for reviewing this!
>>
>> > the supplied test does not pass for me
>>
>> The current test doesn't seem to produce consistent results across
>> different
>> machines, I'll try to fix it in the next version.
>>
>> > Is there a reason to do it here in particular, and not, let's say as
>> > soon as we change the disposition to disp_del_at_next_stop?
>>
>> I have implemented this in the v1 patch, I called
>> `disable_breakpoint' as
>> soon as we change the disposition to `disp_del_at_next_stop'
>> (in `watchpoint_del_at_next_stop'). However,
>> LancelotSIX <lsix@lancelotsix.com> mentioned that the fix is in a
>> non-extension-related code path, and suggested disabling it in
>> `bpstat_check_breakpoint_conditions' instead (the v3 patch).
>> See:
>> https://sourceware.org/pipermail/gdb-patches/2022-September/192120.html
>>
>> Both the v1 and v3 patch fixes the issue. I personally prefer the v1
>> patch.
>> (i.e., modifying `watchpoint_del_at_next_stop')
>>
>> > shouldn't marking the watchpoint as
>> > disp_del_at_next_stop make it so it won't be inserted next time we
>> > resume? After all should_be_inserted returns false for breakpoint
>> > locations that are disp_del_at_next_stop. Perhaps it's because
>> since we
>> > don't do a "normal" stop, breakpoint locations stay inserted, and
>> > there's nothing that pulls this location out of the target? Therefore,
>> > maybe a solution, to keep things coherent, would be to pull the
>> > breakpoint locations out when marking the breakpoint as
>> > disp_del_at_next_stop? This way, we would respect the invariant
>> that a
>> > breakpoint disp_del_at_next_stop can't be inserted (I don't know if
>> this
>> > is really what is happening though, it's just speculation).
>>
>> For software watchpoints, they cannot be pulled out of the target, since
>> they may not be inserted into the target in the first place:
>>
>> /* Locations of type bp_loc_other and
>> bp_loc_software_watchpoint are only maintained at GDB side,
>> so there is no need to remove them. */
>>
>> -- gdb/breakpoint.c:3840
>>
>> Their expressions are re-evaluated by single-stepping through the
>> program:
>>
>> Software watchpoints are very slow, since GDB needs to single-step
>> the program being debugged and test the value of the watched
>> expression(s) after each instruction.
>>
>> -- https://sourceware.org/gdb/wiki/Internals%20Watchpoints
>>
>> Therefore, we must either disable or delete the software watchpoint.
>> We cannot pull it out of the target since it's only maintained on the
>> GDB side.
>>
>> > And, in a broader scope, why wait until the next normal stop to delete
>> > the watchpoint? A "normal" stop being a stop that we present to the
>> > user (the normal_stop function). We currently call
>> > breakpoint_auto_delete in normal_stop, which is why we don't reach it
>> > when your Breakpoint.stop method returns False. At the end of, let's
>> > say, handle_signal_stop, could we go over the bpstat chain and delete
>> > any breakpoint we've just hit that is marked for deletion?
>>
>> I believe this choice is due to performance issues.
>>
>> In an early attempt, I tried to call `breakpoint_auto_delete' on all
>> kinds
>> of stops. However, this implementation is very slow when we have a large
>> number of breakpoints, since we need to go over the entire bpstat
>> chain on
>> all stops. I recall that this implementation fails on certain
>> testcases with
>> a large number of breakpoints with many breakpoint stops.
>>
>> Furthermore, the reason for using the `disp_del_at_next_stop' state
>> remains
>> unclear, as mentioned in the comments:
>>
>> /* NOTE drow/2003-09-08: This state only exists for removing
>> watchpoints. It's not clear that it's necessary... */
>>
>> -- gdb/breakpoint.c:2914
>>
>> By tracing the git history, the `disp_del_at_next_stop' state is
>> introduced
>> in commit c906108c21474dfb4ed285bcc0ac6fe02cd400cc, which doesn't
>> provide
>> any proper explanation of this state.
>>
>> I think the best way to deal with this is to avoid going over the entire
>> bpstat chain when deleting breakpoints. Potentially by keeping track of
>> a chain of breakpoints that should be deleted, and changing the
>> bpstat chain
>> to a doubly linked list for the ease of deletion. Such changes deserve a
>> dedicated patch, though.
>>
>> To sum up, I prefer modifying `watchpoint_del_at_next_stop' (i.e.,
>> the v1 patch).
>> If you also think it's appropriate, I'll fix the failing test and
>> prepare the
>> v4 patch accordingly.
>>
>> Johnson
>>
>> On 1/13/2023 11:40 PM, SimonMarchi wrote:
>>>
>>> On 10/20/22 14:11, Johnson Sun via Gdb-patches wrote:
>>>> Currently, when a local software watchpoint goes out of scope, GDB
>>>> sets the
>>>> watchpoint's disposition to `delete at next stop' and then normal
>>>> stops
>>>> (i.e., stop and wait for the next GDB command). When GDB normal
>>>> stops, it
>>>> automatically deletes the breakpoints with their disposition set to
>>>> `delete at next stop'.
>>>>
>>>> Suppose a Python script decides not to normal stop when a local
>>>> software
>>>> watchpoint goes out of scope, the watchpoint will not be automatically
>>>> deleted even when its disposition is set to `delete at next stop'.
>>>>
>>>> Since GDB single-steps the program and tests the watched expression
>>>> after each
>>>> instruction, not deleting the watchpoint causes the watchpoint to
>>>> be hit many
>>>> more times than it should, as reported in PR python/29603.
>>>>
>>>> This was happening because the watchpoint is not deleted or
>>>> disabled when
>>>> going out of scope.
>>>>
>>>> This commit fixes this issue by disabling the watchpoint when going
>>>> out of
>>>> scope. It also adds a test to ensure this feature isn't regressed
>>>> in the
>>>> future.
>>>>
>>>> Two other solutions seem to solve this issue, but are in fact
>>>> inappropriate:
>>>> 1. Automatically delete breakpoints on all kinds of stops
>>>> (in `fetch_inferior_event'). This solution is very slow since
>>>> the deletion
>>>> requires O(N) time for N breakpoints.
>>>> 2. Disable the watchpoint after the watchpoint's disposition is set to
>>>> `delete at next stop' (in `watchpoint_del_at_next_stop'). This
>>>> solution
>>>> modifies a non-extension-related code path, and isn't preferred
>>>> since this
>>>> issue cannot occur without extension languages. (gdb scripts
>>>> always normal
>>>> stop before continuing execution)
>>> I have a bit of trouble reviewing this, because I'm not too familiar
>>> with the lifecycle of watchpoints (I know the principle, but not the
>>> specifically where things happen in GDB). So it's difficult to tell
>>> whether this is right or if there's a better way to handle it.
>>>
>>> However, the supplied test does not pass for me:
>>>
>>> source
>>> /home/simark/build/binutils-gdb/gdb/testsuite/outputs/gdb.python/py-watchpoint/py-watchpoint.py
>>> Watchpoint 2: i
>>> Python script imported
>>> (gdb) PASS: gdb.python/py-watchpoint.exp: import python scripts
>>> python print(len(gdb.breakpoints()))
>>> 2
>>> (gdb) PASS: gdb.python/py-watchpoint.exp: check modified BP count
>>> gdb_expect_list pattern: /Watchpoint Hit: ./
>>> continue
>>> Continuing.
>>> Watchpoint Hit: 1
>>> gdb_expect_list pattern: /[
>>> ]+Watchpoint . deleted because the program has left the block in/
>>> FAIL: gdb.python/py-watchpoint.exp: run until program stops
>>> (pattern 2) (timeout)
>>> gdb_expect_list pattern: /[
>>> ]+which its expression is valid./
>>> gdb_expect_list pattern: /Watchpoint Hit: ./
>>> gdb_expect_list pattern: /[
>>> ]+012\[Inferior 1 \(process .*\) exited normally\]/
>>> gdb_expect_list pattern: //
>>> python print(len(gdb.breakpoints()))
>>> Watchpoint Hit: 2
>>> Watchpoint Hit: 3
>>> Watchpoint Hit: 4
>>>
>>> Watchpoint 2 deleted because the program has left the block in
>>> which its expression is valid.
>>> Watchpoint Hit: 5
>>> 012[Inferior 1 (process 2381681) exited normally]
>>> (gdb) FAIL: gdb.python/py-watchpoint.exp: check BP count (the
>>> program exited)
>>>
>>>> Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=29603
>>>> ---
>>>> gdb/breakpoint.c | 2 +
>>>> gdb/testsuite/gdb.python/py-watchpoint.c | 27 ++++++++++++
>>>> gdb/testsuite/gdb.python/py-watchpoint.exp | 48
>>>> ++++++++++++++++++++++
>>>> gdb/testsuite/gdb.python/py-watchpoint.py | 30 ++++++++++++++
>>>> 4 files changed, 107 insertions(+)
>>>> create mode 100644 gdb/testsuite/gdb.python/py-watchpoint.c
>>>> create mode 100644 gdb/testsuite/gdb.python/py-watchpoint.exp
>>>> create mode 100644 gdb/testsuite/gdb.python/py-watchpoint.py
>>>>
>>>> diff --git a/gdb/breakpoint.c b/gdb/breakpoint.c
>>>> index bff3bac7d1a..15f4ae2131c 100644
>>>> --- a/gdb/breakpoint.c
>>>> +++ b/gdb/breakpoint.c
>>>> @@ -5340,6 +5340,8 @@ bpstat_check_breakpoint_conditions (bpstat
>>>> *bs, thread_info *thread)
>>>> /* Evaluate extension language breakpoints that have a "stop"
>>>> method
>>>> implemented. */
>>>> bs->stop = breakpoint_ext_lang_cond_says_stop (b);
>>>> + if (b->disposition == disp_del_at_next_stop)
>>>> + disable_breakpoint(b);
>>> Is there a reason to do it here in particular, and not, let's say as
>>> soon as we change the disposition to disp_del_at_next_stop?
>>>
>>> Other question: shouldn't marking the watchpoint as
>>> disp_del_at_next_stop make it so it won't be inserted next time we
>>> resume? After all should_be_inserted returns false for breakpoint
>>> locations that are disp_del_at_next_stop. Perhaps it's because
>>> since we
>>> don't do a "normal" stop, breakpoint locations stay inserted, and
>>> there's nothing that pulls this location out of the target? Therefore,
>>> maybe a solution, to keep things coherent, would be to pull the
>>> breakpoint locations out when marking the breakpoint as
>>> disp_del_at_next_stop? This way, we would respect the invariant that a
>>> breakpoint disp_del_at_next_stop can't be inserted (I don't know if
>>> this
>>> is really what is happening though, it's just speculation).
>>>
>>> And, in a broader scope, why wait until the next normal stop to delete
>>> the watchpoint? A "normal" stop being a stop that we present to the
>>> user (the normal_stop function). We currently call
>>> breakpoint_auto_delete in normal_stop, which is why we don't reach it
>>> when your Breakpoint.stop method returns False. At the end of, let's
>>> say, handle_signal_stop, could we go over the bpstat chain and delete
>>> any breakpoint we've just hit that is marked for deletion?
>>>
>>> Simon
>>>
>
next prev parent reply other threads:[~2023-02-26 6:16 UTC|newest]
Thread overview: 34+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-09-25 5:33 [PATCH] [PR python/29603] Fix deletion of Watchpoints Johnson Sun
2022-09-25 18:10 ` Lancelot SIX
2022-10-01 5:20 ` Johnson Sun
2022-10-01 5:27 ` [PATCH v2] [PR python/29603] Disable out-of-scope watchpoints Johnson Sun
2022-10-20 17:57 ` Johnson Sun
2022-10-20 18:11 ` [PATCH v3] " Johnson Sun
2022-11-18 12:17 ` [PING] " Johnson Sun
2022-11-25 15:11 ` [PING^2] " Johnson Sun
2022-12-04 16:45 ` [PING^3] " Johnson Sun
2022-12-12 21:44 ` [PING^4] " Johnson Sun
2022-12-20 22:08 ` [PING^5] " Johnson Sun
2022-12-27 16:40 ` [PING^6] " Johnson Sun
2023-01-12 18:34 ` [PING^7] " Johnson Sun
2023-01-13 15:40 ` Simon Marchi
2023-01-23 10:15 ` Johnson Sun
2023-02-18 16:26 ` [PING] " Johnson Sun
2023-02-26 6:16 ` Johnson Sun [this message]
2023-03-12 17:24 ` [PING] [RFC] " Johnson Sun
2023-03-13 16:00 ` Simon Marchi
2023-03-23 18:25 ` Johnson Sun
2023-03-23 18:31 ` [PATCH v4] " Johnson Sun
2023-04-09 20:47 ` Johnson Sun
2023-04-09 20:49 ` [PING] " Johnson Sun
2023-04-17 18:18 ` [PING^2] " Johnson Sun
2023-04-17 18:56 ` Simon Marchi
2023-04-23 9:46 ` Johnson Sun
2023-04-23 9:54 ` [PATCH v5] " Johnson Sun
2023-05-06 19:06 ` [PING] " Johnson Sun
2023-05-09 18:50 ` Simon Marchi
2023-05-10 17:22 ` Johnson Sun
2023-05-11 2:08 ` Simon Marchi
2023-05-11 15:46 ` [PATCH v6] " Johnson Sun
2023-05-11 16:09 ` Simon Marchi
2023-05-11 15:50 ` [PATCH v5] " Johnson Sun
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=6f2d7693-7f29-c125-1b7a-44a8a4fbd76e@gmail.com \
--to=j3.soon777@gmail.com \
--cc=gdb-patches@sourceware.org \
--cc=lsix@lancelotsix.com \
--cc=simark@simark.ca \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).