public inbox for gdb-prs@sourceware.org
help / color / mirror / Atom feed
From: "simark at simark dot ca" <sourceware-bugzilla@sourceware.org>
To: gdb-prs@sourceware.org
Subject: [Bug gdb/29762] New: FAIL: gdb.threads/access-mem-running-thread-exit.exp: non-stop: access mem (print global_var after writing again, inf=2, iter=1)
Date: Wed, 09 Nov 2022 02:15:53 +0000	[thread overview]
Message-ID: <bug-29762-4717@http.sourceware.org/bugzilla/> (raw)

https://sourceware.org/bugzilla/show_bug.cgi?id=29762

            Bug ID: 29762
           Summary: FAIL: gdb.threads/access-mem-running-thread-exit.exp:
                    non-stop: access mem (print global_var after writing
                    again, inf=2, iter=1)
           Product: gdb
           Version: HEAD
            Status: NEW
          Severity: normal
          Priority: P2
         Component: gdb
          Assignee: unassigned at sourceware dot org
          Reporter: simark at simark dot ca
  Target Milestone: ---

I get this failure very rarely on my CI.  I managed to reproduce it on my dev
machine by running:

$ while taskset -c 1,19 make check
TESTS="gdb.threads/access-mem-running-thread-exit.exp"
RUNTESTFLAGS="--target_board=native-extended-gdbserver";do done

It takes a few runs, maybe a few minutes, but it eventually fails.

I think running

$ stress -n $(nproc)

at the same time helped, but maybe it was just an illusion.

Here's an instance of the failure:

(gdb) print global_var = 555^M
$1 = 555^M
(gdb) print global_var^M
$2 = 555^M
(gdb) print global_var = 333^M
$3 = 333^M
(gdb) print global_var^M
$4 = 123^M
(gdb) FAIL: gdb.threads/access-mem-running-thread-exit.exp: non-stop: access
mem (print global_var after writing again, inf=2, iter=1)

In another case it looks like this:

(gdb) print global_var = 555^M
$1 = 555^M
(gdb) print global_var^M
$2 = 123^M
(gdb) FAIL: gdb.threads/access-mem-running-thread-exit.exp: non-stop: access
mem (print global_var after writing, inf=2, iter=1)

I don't know if the taskset is a red herring, but I never got a failure by
running it without the taskset, or by running with taskset on a single core.

Interestingly, all the failures I got were always on iter=1.

I don't really know what kind of racy problem it could be in GDB.  It sounds
like a "write memory on one core, get migrated to another CPU, then read the
old value on another core" kind of problem.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

             reply	other threads:[~2022-11-09  2:15 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-11-09  2:15 simark at simark dot ca [this message]
2022-11-11  1:16 ` [Bug gdb/29762] " simark at simark dot ca

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bug-29762-4717@http.sourceware.org/bugzilla/ \
    --to=sourceware-bugzilla@sourceware.org \
    --cc=gdb-prs@sourceware.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).