[Bug malloc/13939] New: Malloc can deadlock in retry paths

public inbox for glibc-bugs@sourceware.org
help / color / mirror / Atom feed

* [Bug malloc/13939] New: Malloc can deadlock in retry paths
@ 2012-04-03  3:15 law at redhat dot com
  2012-04-03  3:46 ` [Bug malloc/13939] " carlos_odonell at mentor dot com
                   ` (8 more replies)
  0 siblings, 9 replies; 10+ messages in thread
From: law at redhat dot com @ 2012-04-03  3:15 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13939

             Bug #: 13939
           Summary: Malloc can deadlock in retry paths
           Product: glibc
           Version: 2.15
            Status: NEW
          Severity: normal
          Priority: P2
         Component: malloc
        AssignedTo: unassigned@sourceware.org
        ReportedBy: law@redhat.com
    Classification: Unclassified

Created attachment 6312
  --> http://sourceware.org/bugzilla/attachment.cgi?id=6312
Potential fix

Assume we're we're in libc_malloc and the call to _int_malloc has failed
because we're unable to sbrk more memory.

We (sensibly) have code which will attempt to use mmap to allocate the memory
in the inner else clause below:

  arena_lock(ar_ptr, bytes);
  if(!ar_ptr)
    return 0;
  victim = _int_malloc(ar_ptr, bytes);
  if(!victim) {
    /* Maybe the failure is due to running out of mmapped areas. */
    if(ar_ptr != &main_arena) {
      (void)mutex_unlock(&ar_ptr->mutex);
      ar_ptr = &main_arena;
      (void)mutex_lock(&ar_ptr->mutex);
      victim = _int_malloc(ar_ptr, bytes);
      (void)mutex_unlock(&ar_ptr->mutex);
    } else {
      /* ... or sbrk() has failed and there is still a chance to mmap() */
      ar_ptr = arena_get2(ar_ptr->next ? ar_ptr : 0, bytes);
      (void)mutex_unlock(&main_arena.mutex);
      if(ar_ptr) {
        victim = _int_malloc(ar_ptr, bytes);
        (void)mutex_unlock(&ar_ptr->mutex);
      }
    }
  } else
    (void)mutex_unlock(&ar_ptr->mutex);

Make note that the arena referenced by ar_ptr will still be locked when we call
arena_get2 in that else clause.  Furthermore, we know that ar_ptr must refer to
the main arena.

Now assume that there are no arenas on the free list and that we've already hit
the limit for the number of arenas we're willing to create.  In that case
arena_get2 will call reused_arena which looks like this:

static mstate
reused_arena (void)
{
  mstate result;
  static mstate next_to_use;
  if (next_to_use == NULL)
    next_to_use = &main_arena;

  result = next_to_use;
  do
    {
      if (!mutex_trylock(&result->mutex))
        goto out;

      result = result->next;
    }
  while (result != next_to_use);

  /* No arena available.  Wait for the next in line.  */
  (void)mutex_lock(&result->mutex);

So let's make a couple more assumptions.  First, assume that next_to_use refers
to the main arena.  Second assume that all the other arenas are currently
locked by other threads.  And remember that the main arena is locked by the
current thread.

In that case the do-while loop will look at the every arena on the list and
find them all locked.  When it hits result == main_arena, then the loop will
exit and we call mutex_lock to acquire the main arena's lock.  But since the
main arena is already locked by the current thread, we deadlock.

libc_calloc seems to have the same problem.

libc_memalign, libc_calloc, libc_pvalloc all release the lock before calling
arena_get2 in the case where sbrk failed, which seems to be the right thing to
do.

Closely related since fixing the deadlock provides us with a great opportunity
to unify the 5 implementations a little.  For example we have this from
libc_memalign:

      mstate prev = ar_ptr->next ? ar_ptr : 0;
      (void)mutex_unlock(&ar_ptr->mutex);
      ar_ptr = arena_get2(prev, bytes);

Which seems like the right way to go, so I'd like to unify the 5
implementations to have a similar structure.  ie, release the lock within the
conditional just prior to calling arena_get2 and being safe WRT modification of
ar_ptr->next.

So dropping the lock prior to calling arena_get2 avoids the deadlock.  However,
in this case all that's going to happen is the retrying allocation will fail. 
The original allocation was trying to allocate from the main arena and so will
the retrying allocation if it can't acquire a lock for any other arena.

To get this 100% correct would require retrying in every arena, potentially
blocking to acquire the lock at each step.  That seems rather excessive.  So
instead we can just pass in a state variable indicating we're in a retry
situation and if so, avoid retrying in the same arena that just failed.

Attached is the fix we're currently using internally at Red Hat.  It needed
updating slightly as Uli cleaned up malloc.c & arena.c.  However, the basic
structure is the same as what Red Hat is using internally.  I've verified glibc
still builds after adjusting for Uli's cleanups.

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug malloc/13939] Malloc can deadlock in retry paths
  2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
@ 2012-04-03  3:46 ` carlos_odonell at mentor dot com
  2012-04-03  7:46 ` aj at suse dot de
                   ` (7 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: carlos_odonell at mentor dot com @ 2012-04-03  3:46 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13939

Carlos O'Donell <carlos_odonell at mentor dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |UNCONFIRMED
                 CC|                            |carlos_odonell at mentor
                   |                            |dot com
   Target Milestone|---                         |2.16
     Ever Confirmed|1                           |0

--- Comment #1 from Carlos O'Donell <carlos_odonell at mentor dot com> 2012-04-03 03:45:38 UTC ---
Lets solve this for 2.16, setting milestone.

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug malloc/13939] Malloc can deadlock in retry paths
  2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
  2012-04-03  3:46 ` [Bug malloc/13939] " carlos_odonell at mentor dot com
@ 2012-04-03  7:46 ` aj at suse dot de
  2012-04-03 19:33 ` ppluzhnikov at google dot com
                   ` (6 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: aj at suse dot de @ 2012-04-03  7:46 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13939

Andreas Jaeger <aj at suse dot de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |wg at malloc dot de

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug malloc/13939] Malloc can deadlock in retry paths
  2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
  2012-04-03  3:46 ` [Bug malloc/13939] " carlos_odonell at mentor dot com
  2012-04-03  7:46 ` aj at suse dot de
@ 2012-04-03 19:33 ` ppluzhnikov at google dot com
  2012-06-22 21:16 ` roland at gnu dot org
                   ` (5 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: ppluzhnikov at google dot com @ 2012-04-03 19:33 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13939

Paul Pluzhnikov <ppluzhnikov at google dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |ppluzhnikov at google dot
                   |                            |com

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug malloc/13939] Malloc can deadlock in retry paths
  2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
                   ` (2 preceding siblings ...)
  2012-04-03 19:33 ` ppluzhnikov at google dot com
@ 2012-06-22 21:16 ` roland at gnu dot org
  2012-06-23 11:28 ` pluto at agmk dot net
                   ` (4 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: roland at gnu dot org @ 2012-06-22 21:16 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13939

Roland McGrath <roland at gnu dot org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Attachment #6312|0                           |1
           is patch|                            |
   Attachment #6312|application/octet-stream    |text/plain
          mime type|                            |

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug malloc/13939] Malloc can deadlock in retry paths
  2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
                   ` (3 preceding siblings ...)
  2012-06-22 21:16 ` roland at gnu dot org
@ 2012-06-23 11:28 ` pluto at agmk dot net
  2012-07-18  2:00 ` carlos_odonell at mentor dot com
                   ` (3 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: pluto at agmk dot net @ 2012-06-23 11:28 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13939

Pawel Sikora <pluto at agmk dot net> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |pluto at agmk dot net

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug malloc/13939] Malloc can deadlock in retry paths
  2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
                   ` (4 preceding siblings ...)
  2012-06-23 11:28 ` pluto at agmk dot net
@ 2012-07-18  2:00 ` carlos_odonell at mentor dot com
  2012-07-18 14:48 ` law at redhat dot com
                   ` (2 subsequent siblings)
  8 siblings, 0 replies; 10+ messages in thread
From: carlos_odonell at mentor dot com @ 2012-07-18  2:00 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13939

Carlos O'Donell <carlos_odonell at mentor dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|2.16                        |2.17

--- Comment #2 from Carlos O'Donell <carlos_odonell at mentor dot com> 2012-07-18 02:00:29 UTC ---
Maxim has provided review of this patch here:
http://sourceware.org/ml/libc-alpha/2012-07/msg00027.html

I think the next step is for Jeff to put out a new version of the patch with
the new changes included.

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug malloc/13939] Malloc can deadlock in retry paths
  2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
                   ` (5 preceding siblings ...)
  2012-07-18  2:00 ` carlos_odonell at mentor dot com
@ 2012-07-18 14:48 ` law at redhat dot com
  2012-11-29 17:38 ` law at redhat dot com
  2014-06-25 11:23 ` fweimer at redhat dot com
  8 siblings, 0 replies; 10+ messages in thread
From: law at redhat dot com @ 2012-07-18 14:48 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13939

--- Comment #3 from law at redhat dot com 2012-07-18 14:47:42 UTC ---
Carlos,

Yes there's some minor edits I need to make based on Maxim's review.  I'm just
a bit buried at the moment.

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug malloc/13939] Malloc can deadlock in retry paths
  2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
                   ` (6 preceding siblings ...)
  2012-07-18 14:48 ` law at redhat dot com
@ 2012-11-29 17:38 ` law at redhat dot com
  2014-06-25 11:23 ` fweimer at redhat dot com
  8 siblings, 0 replies; 10+ messages in thread
From: law at redhat dot com @ 2012-11-29 17:38 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13939

law at redhat dot com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |RESOLVED
         Resolution|                            |FIXED

--- Comment #4 from law at redhat dot com 2012-11-29 17:37:52 UTC ---
commit bf51f568f19bc063e62904d18b77d7e449a6a44f
Author: Jeff Law <law@redhat.com>
Date:   Fri Aug 10 09:37:04 2012 -0600

            [BZ #13939]
            * malloc.c/arena.c (reused_arena): New parameter, avoid_arena.
            When avoid_arena is set, don't retry in the that arena.  Pick the
            next one, whatever it might be.
            (arena_get2): New parameter avoid_arena, pass through to
reused_arena.
            (arena_lock): Pass in new parameter to arena_get2.
            * malloc/malloc.c (__libc_memalign): Pass in new parameter to
            arena_get2.
            (__libc_malloc): Unify retrying after main arena failure with
            __libc_memalign version.
            (__libc_valloc, __libc_pvalloc, __libc_calloc): Likewise.

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [Bug malloc/13939] Malloc can deadlock in retry paths
  2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
                   ` (7 preceding siblings ...)
  2012-11-29 17:38 ` law at redhat dot com
@ 2014-06-25 11:23 ` fweimer at redhat dot com
  8 siblings, 0 replies; 10+ messages in thread
From: fweimer at redhat dot com @ 2014-06-25 11:23 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=13939

Florian Weimer <fweimer at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
              Flags|                            |security-

-- 
You are receiving this mail because:
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2014-06-25 11:23 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2012-04-03  3:15 [Bug malloc/13939] New: Malloc can deadlock in retry paths law at redhat dot com
2012-04-03  3:46 ` [Bug malloc/13939] " carlos_odonell at mentor dot com
2012-04-03  7:46 ` aj at suse dot de
2012-04-03 19:33 ` ppluzhnikov at google dot com
2012-06-22 21:16 ` roland at gnu dot org
2012-06-23 11:28 ` pluto at agmk dot net
2012-07-18  2:00 ` carlos_odonell at mentor dot com
2012-07-18 14:48 ` law at redhat dot com
2012-11-29 17:38 ` law at redhat dot com
2014-06-25 11:23 ` fweimer at redhat dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).