Re: [PATCH] nptl: Add pthread_thread_number_np function

public inbox for libc-alpha@sourceware.org
 help / color / mirror / Atom feed

From: Carlos O'Donell <carlos@redhat.com>
To: Florian Weimer <fweimer@redhat.com>
Cc: libc-alpha@sourceware.org
Subject: Re: [PATCH] nptl: Add pthread_thread_number_np function
Date: Thu, 21 Dec 2017 19:19:00 -0000	[thread overview]
Message-ID: <33e60e1e-2f89-8908-b1ab-2d878f90b671@redhat.com> (raw)
In-Reply-To: <609c8815-7d79-c504-0a1f-3eb6f82ead9d@redhat.com>

On 12/21/2017 03:03 AM, Florian Weimer wrote:
> On 12/21/2017 10:26 AM, Carlos O'Donell wrote:
> 
>>> +The returned number is only unique with regards to the current process.
>>> +It may be shared by subprocesses and other processes in the system.
>>> +
>>> +The initial (main) thread has number 1.Â  Thread numbers are not
>>> +necessarily assigned in a consecutive fashion.Â  They bear no
>>> +relationship to POSIX thread IDs (@code{pthread_t} values), process IDs
>>> +or thread IDs assigned by the kernel.
>>
>> I would like us to add something like this:
>> ~~~
>> While the return type of this function is only 64-bits wide, the intent
>> is not to impose an artificial limit on the number of threads that can be
>> created by the runtime. In the future this interface may be extended
>> to 128-bits to support creating as many threads as a user may need
>> for the lifetime of the process.
>> ~~~
>>
>> That way the intent of the interface and future changes are clear.
> 
> So how would a programmer use this interface in a future-proof way?
> I think such a statement would raise more questions than it answers.

I went to bed thinking much the same thing and worried that perhaps this
text was not appropriate for the manual, but could serve as a comment in
the source code for future maintainers. Since this is really a question
about GNU ethos and avoiding artificial limits.

Would you be opposed to adding the comment to the new function sources?

>>> diff --git a/nptl/allocatestack.c b/nptl/allocatestack.c
>>> index 1cc7893195..454df7740b 100644
>>> --- a/nptl/allocatestack.c
>>> +++ b/nptl/allocatestack.c
>>> @@ -413,16 +413,28 @@ allocate_stack (const struct pthread_attr *attr, struct pthread **pdp,
>>> Â Â Â  assert (powerof2 (pagesize_m1 + 1));
>>> Â Â Â  assert (TCB_ALIGNMENT >= STACK_ALIGN);
>>> Â  -Â  /* Get the stack size from the attribute if it is set.Â  Otherwise we
>>> -Â Â Â Â  use the default we determined at start time.Â  */
>>> -Â  if (attr->stacksize != 0)
>>> -Â Â Â  size = attr->stacksize;
>>> -Â  else
>>> -Â Â Â  {
>>> -Â Â Â Â Â  lll_lock (__default_pthread_attr_lock, LLL_PRIVATE);
>>> +Â  uint64_t thread_number;
>>> +Â  lll_lock (__default_pthread_attr_lock, LLL_PRIVATE);
>>> +Â  {
>>> +Â Â Â  /* Number 1 is reserved for the initial thread.Â  Reuse
>>> +Â Â Â Â Â Â  __default_pthread_attr_lock to avoid concurrent updates of this
>>> +Â Â Â Â Â Â  counter.Â  */
>>
>> OK.
>>
>>> +Â Â Â  static uint64_t global_thread_number = 1;
>>> +Â Â Â  thread_number = ++global_thread_number;
>>
>> Alright, here comes serious worry #1.
>>
>> If we say "Thread numbers are not necessarily assigned in a consecutive fashion.",
>> and we assign them in a consecutive fashion, users will ignore this statement
>> and use what empirically appears to be true.
> 
> The above does not actually assign thread numbers in a consecutive
> fashion, from an application perspective because the implementation
> can create its own threads for its own internal use.  (librt and
> libanl do this.)

That's a good point.

I know that aio has a helper thraed, so does the notification, so does some
of the related timer routines.

All of those would help break this notion, but these are also rarer features.
 
>> People start relying on this counter incrementing from 1 upwards.
>>
>> People start using this monotonic property for indexing.
>>
>> Soon we can't change it because it's implied API behaviour.
>>
>> I think we should disabuse them from doing something low cost to roll the value:
>>
>> * Do nothing for thread 1, leaving it 1.
>> * Check global_thread_number for overflow instead.
>> * Pick a random number of bits to roll between 0-63 (picked at process startup)
>> * Roll the value by some that number of bits.
>> * Use the rolled value as the thread_number
> 
> Not sure if I understand this.  Do you want us to start at a random
> value?  Or assign IDs randomly?  The latter will have a collision
> much sooner.
> 
> I can switch the thread numbers to a fixed, but random-looking
> permutation of the integers in [0, 2**64), but this looks excessive.

I want a low cost solution that avoids abuses of the interface for
indexing into arrays, or other issues that would break when we change
this in the future. We do not want users to make assumptions about the
values we hand out.

The lowest cost option I can think of is to circular shift the global
counter value by some number of bits.

I don't care for it to be random.

> In my opinion, we need to assume at one point that programmers read
> the documentation.

The implementation should be robust against abuse. That's what makes
it high quality.

-- 
Cheers,
Carlos.

next prev parent reply	other threads:[~2017-12-21 19:19 UTC|newest]

Thread overview: 28+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-12-14 18:56 Florian Weimer
2017-12-14 20:24 ` Florian Weimer
2017-12-14 23:34 ` Nix
2017-12-15  6:41   ` Florian Weimer
2017-12-15  0:29 ` Andrew Pinski
2017-12-15  7:47   ` Florian Weimer
2017-12-15  7:54   ` Andrew Pinski
2017-12-15  4:08 ` Carlos O'Donell
2017-12-15  7:48   ` Florian Weimer
2017-12-20  8:06     ` Carlos O'Donell
2017-12-20 14:34       ` Florian Weimer
2017-12-20 17:58         ` Carlos O'Donell
2017-12-21  9:26     ` Carlos O'Donell
2017-12-21 11:03       ` Florian Weimer
2017-12-21 19:19         ` Carlos O'Donell [this message]
2017-12-22 16:25           ` Florian Weimer
2017-12-22 17:09             ` Carlos O'Donell
2017-12-22 17:43               ` Joseph Myers
2017-12-22 19:39                 ` Florian Weimer
2017-12-22 20:02                   ` Joseph Myers
2017-12-22 22:11                     ` Florian Weimer
2018-03-02 18:04     ` Rich Felker
2018-03-02 18:08       ` Rich Felker
2018-03-09 17:23         ` Florian Weimer
2018-03-09 23:29           ` Carlos O'Donell
2018-03-02 14:42 Florian Weimer
2018-03-02 17:16 ` Joseph Myers
2018-05-15 13:42   ` Florian Weimer

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=33e60e1e-2f89-8908-b1ab-2d878f90b671@redhat.com \
    --to=carlos@redhat.com \
    --cc=fweimer@redhat.com \
    --cc=libc-alpha@sourceware.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).