From: Takashi Yano <takashi.yano@nifty.ne.jp>
To: cygwin-developers@cygwin.com
Subject: Re: malloc crash
Date: Tue, 26 Oct 2021 17:52:29 +0900 [thread overview]
Message-ID: <20211026175229.1eda36caab1b03314a8cf165@nifty.ne.jp> (raw)
In-Reply-To: <cdd7fd9c-7424-5cda-3441-c07039051098@maxrnd.com>
On Tue, 26 Oct 2021 01:30:13 -0700
Mark Geisert wrote:
> Replying to myself to correct something I wrote...
>
> Mark Geisert wrote:
> > Takashi Yano wrote:
> >> On Mon, 25 Oct 2021 16:36:50 -0700
> >> Mark Geisert wrote:
> >>> Ken Brown wrote:
> >>>> On 10/25/2021 5:29 PM, Mark Geisert wrote:
> >>>>> Corinna Vinschen wrote:
> >>>>>> Er... huh? So both threads are in a malloc function? This shouldn't
> >>>>>> have happened, given the clunky muto guarding malloc calls. This is
> >>>>>> really strange. Why's the muto not working here?
> >>>>>
> >>>>> Is it possible both threads have executed malloc_init()?
> >>>>> If so, the second one would reinit the muto.
> >>>>
> >>>> Or does the fifo_reader thread call a malloc function before the main thread has
> >>>> called malloc_init()? This would presumably cause __malloc_lock() to fail, but
> >>>> there's no error check.
> >>>
> >>> If there's a global constructor involved, that is known to happen. Constructors
> >>> are run from dll_crt0_0(), before malloc_init() is called from dll_crt0_1(). See
> >>> dcrt0.cc for the details.
> >>
> >> So how about moving malloc_init() call from dll_crt0_1() to dll_crl0_0()
> >> so that malloc() can be called in fixup_after_fork/exec()?
> >
> > It appears simple, but this is a touchy area of code. The _0 and _1 are two
> > separate phases of process startup. I'd want to hear Corinna's thoughts on this.
> >
> > I'd also like to verify somehow that this is the scenario Ken is hitting.
> >
> > When I was researching different mallocs for Cygwin I hit the constructor snag
> > repeatedly. I did try delaying the constructor-running until after malloc_init().
> > More problems. I did not try moving malloc_init() to before the constructor run.
>
> Apologies; this was many months ago. What I did try was moving the malloc_init()
> to before running the constructor chain, as Takashi suggested. That is what gave
> me more problems. I don't recall what they were, but I reverted that attempt.
>
> The "future malloc" build of Cygwin I'm running doesn't exhibit Ken's issue, as
> far as I can tell. It has a specific fix to avoid the scenario I've been talking
> about here, but I don't want to take us down that path unless we're sure Ken's
> hitting that same scenario.
I tried the following patch, and confirmed that the issue has
been disappeared. I do not notice any other problems so far
with this patch.
diff --git a/winsup/cygwin/dcrt0.cc b/winsup/cygwin/dcrt0.cc
index 6f4723bb0..0d541ec14 100644
--- a/winsup/cygwin/dcrt0.cc
+++ b/winsup/cygwin/dcrt0.cc
@@ -773,6 +773,10 @@ dll_crt0_0 ()
do_global_ctors (&__CTOR_LIST__, 1);
cygthread::init ();
+ /* malloc_init() has been moved from dll_crt0_1() to here so that
+ malloc() can be called in fixup_after_exec(). */
+ malloc_init ();
+
if (!child_proc_info)
{
setup_cygheap ();
@@ -857,7 +861,7 @@ dll_crt0_1 (void *)
on a functioning malloc and it's possible that the user's program may
have overridden malloc. We only know about that at this stage,
unfortunately. */
- malloc_init ();
+ /* malloc_init() has been moved to dll_crt0_0(). */
user_shared->initialize ();
#ifdef CYGHEAP_DEBUG
Where is the "constructor chain" you mentioned?
--
Takashi Yano <takashi.yano@nifty.ne.jp>
next prev parent reply other threads:[~2021-10-26 8:52 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-10-24 21:46 Ken Brown
2021-10-25 8:56 ` Takashi Yano
2021-10-25 13:37 ` Ken Brown
2021-10-25 8:59 ` Corinna Vinschen
2021-10-25 12:35 ` Ken Brown
2021-10-25 15:39 ` Corinna Vinschen
2021-10-25 21:29 ` Mark Geisert
2021-10-25 22:02 ` Ken Brown
2021-10-25 23:36 ` Mark Geisert
2021-10-26 0:18 ` Takashi Yano
2021-10-26 0:54 ` Mark Geisert
2021-10-26 8:30 ` Mark Geisert
2021-10-26 8:52 ` Takashi Yano [this message]
2021-10-26 8:59 ` Mark Geisert
2021-10-26 9:26 ` Takashi Yano
2021-10-26 9:31 ` Corinna Vinschen
2021-10-26 9:28 ` Corinna Vinschen
2021-10-26 9:27 ` Corinna Vinschen
2021-10-26 9:24 ` Corinna Vinschen
2021-10-26 14:32 ` Ken Brown
2021-10-26 16:03 ` Corinna Vinschen
2021-10-26 16:36 ` Ken Brown
2021-10-26 16:49 ` Corinna Vinschen
2021-10-26 17:10 ` Ken Brown
2021-10-27 0:44 ` Takashi Yano
2021-10-27 9:01 ` Corinna Vinschen
2021-10-26 16:44 ` Takashi Yano
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20211026175229.1eda36caab1b03314a8cf165@nifty.ne.jp \
--to=takashi.yano@nifty.ne.jp \
--cc=cygwin-developers@cygwin.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).