From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <SRS0=xKkh=4Y=gmail.com=goldstein.w.n@sourceware.org>
Received: from mail-ed1-x52e.google.com (mail-ed1-x52e.google.com [IPv6:2a00:1450:4864:20::52e])
	by sourceware.org (Postfix) with ESMTPS id B3C7D3858D32
	for <libc-alpha@sourceware.org>; Mon, 26 Dec 2022 00:33:00 +0000 (GMT)
DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org B3C7D3858D32
Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=gmail.com
Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=gmail.com
Received: by mail-ed1-x52e.google.com with SMTP id c17so13948826edj.13
        for <libc-alpha@sourceware.org>; Sun, 25 Dec 2022 16:33:00 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=gmail.com; s=20210112;
        h=cc:to:subject:message-id:date:from:in-reply-to:references
         :mime-version:from:to:cc:subject:date:message-id:reply-to;
        bh=/0BIvEUJ1TjS8yltYbKn2BuZfMUuaM++EPUwbf3Ci+U=;
        b=OYFEZyx8kh85+MZZqfdSVeyRLpfzpwKskKzscIh/6HgTHvILdLxBygKqstei8dFu0c
         a1L/1xejBDSz42U6n6ZsJSma10FLWmkYcGtvIVxrALOkky2u/Y7baDsQlM0RNK4SqPox
         iYMD/g2gM5uzqhPuqYadgL6ZKcPutYC3143V1qi/AEUG/ubge37TifETWKFmORuLytMF
         fnEja3LHjmxjFFnShmeRP73aprt2rbKP44qcLksxEo9cNkLJ4Yhfw98qcimYhOADdY5N
         PsR6APOKuLGlEUtcNgey95fDOjG2luHPUBsj67/VrB/wFVqsouOHXcS8Sl3sB5DId6/q
         7bZw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=1e100.net; s=20210112;
        h=cc:to:subject:message-id:date:from:in-reply-to:references
         :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id
         :reply-to;
        bh=/0BIvEUJ1TjS8yltYbKn2BuZfMUuaM++EPUwbf3Ci+U=;
        b=cz/cEilY4qTF/Ex0h+cKp6LrSoBARllFxI8vhsfMu7IgGJQNsVWQtAa64L5GAc59j0
         AeUkGU1nxjkc9MzmGhETgwcmrAjGHXrgic5f9/bdVyB1uPXUxCM26mAdUp50y8x2Q2XQ
         4To5n4VSa+IGfva38EgdDC9RV9muFcSr1PO29k2GQYi4cuSfJDFDUMxTeh3K6/MdBJmR
         UyW5FPr1aIrziFfZ4/WbPdgfmNNmU0sZ1Obw7uMjnGuu2yDr0MfaSVvdiPZcLHy/ElMo
         sBr0Ol0nh8l0YuwO96YSoTwVRuOCzowev/NnZx2SWPOeW1z2nyUqDvjINPMEbLirLXns
         POvg==
X-Gm-Message-State: AFqh2kra77bcPVxu+qwmudaI71xuFsmJRPGIVMQ//TkVPtcLHb2w09d/
	68qgKigW+KFnx2ZboTbRhooOD6PFGDQXHIRpAIKFN7FNBw0=
X-Google-Smtp-Source: AMrXdXvTUZskUoTwz8booCjvLihUtHFf9KnDDK6DWJEQKcIeomNqGqQ3hj8ukWnjiQDF2C6GGeoG2RWS70vPke45vzA=
X-Received: by 2002:a05:6402:2408:b0:47b:2514:f2e7 with SMTP id
 t8-20020a056402240800b0047b2514f2e7mr2007251eda.142.1672014779405; Sun, 25
 Dec 2022 16:32:59 -0800 (PST)
MIME-Version: 1.0
References: <PAWPR08MB8982EFB047372F6A54FCEE3783E99@PAWPR08MB8982.eurprd08.prod.outlook.com>
 <ddb94d90-3340-700c-76eb-7f06fba358bc@gmail.com> <91d159d1-d379-af67-6859-bf8e3fa14c72@gmail.com>
 <CAFUsyfJ9G_HjmZJJORv7ES02dmDF1Rc7tG=wVYwwBTbOmX5FUA@mail.gmail.com>
 <703b1fca-27ae-d1c7-863b-0573296fdb43@gmail.com> <CAFUsyfKzaFLr_xF_=8cL6P1Qtb6W1LKktkqXpvdrmDHA7pxOfw@mail.gmail.com>
 <6d7d708e-3255-ab54-0d11-e922c260189c@gmail.com>
In-Reply-To: <6d7d708e-3255-ab54-0d11-e922c260189c@gmail.com>
From: Noah Goldstein <goldstein.w.n@gmail.com>
Date: Sun, 25 Dec 2022 16:32:47 -0800
Message-ID: <CAFUsyfKujdC2_AFi+ZwCLe7+pkqJvAKZ5wqB+MBDEPuCn8EY8w@mail.gmail.com>
Subject: Re: [PATCH 1/1] string: Add stpecpy(3)
To: Alejandro Colomar <alx.manpages@gmail.com>
Cc: Wilco Dijkstra <Wilco.Dijkstra@arm.com>, GNU C Library <libc-alpha@sourceware.org>
Content-Type: text/plain; charset="UTF-8"
X-Spam-Status: No, score=-3.4 required=5.0 tests=BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,TXREP autolearn=ham autolearn_force=no version=3.4.6
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org
List-Id: <libc-alpha.sourceware.org>

On Sun, Dec 25, 2022 at 4:26 PM Alejandro Colomar
<alx.manpages@gmail.com> wrote:
>
>
>
> On 12/25/22 23:31, Noah Goldstein wrote:
> > On Sun, Dec 25, 2022 at 6:37 AM Alejandro Colomar
> > <alx.manpages@gmail.com> wrote:
> >>
> >> Hi Noah,
> >>
> >> On 12/25/22 02:52, Noah Goldstein wrote:
> >>
> >>>> char *
> >>>> stpecpy (char *dst, char *end, const char *restrict src)
> >>>> {
> >>>>      size_t dsize;
> >>>>      size_t dlen;
> >>>>      size_t slen = strlen (src);
> >>>
> >>> Imo move `dst == end` and `dst == NULL` check before strlen
> >>
> >>
> >> That's a valid optimization.  Having strlen(3) before has the advantage that you
> >> make sure that your strings are strings, as strlcpy(3) does.  But since we're
> >> inventing stpecpy(3), we can choose to be faster.  If anyone wants to instrument
> >> their code, they can add a temporary wrapper that does that.
> >>
> >>> and change strlen to `strnlen(src, dsize)`.
> >>
> >> About strnlen(3), I have doubts.  Isn't strlen(3) faster for the common case of
> >> no truncation or little truncation?  strnlen(3) would optimize for the case
> >> where you truncate by a large difference.
> >
> > It's faster if strlen(s) <= strnlen(s, N) (maybe up to N + 32).
> >
> > But generally I think of it like `qsort`. Most data gets n * log(n) behavior
> > but still it's worth preventing the worst case for minor constant cost.
> >
>
> I.  Maybe it's a good thing.  Since it's a truncating API, I guess optimizing
> for truncation is reasonable.  For common strings, which will be short (size <=
> 64), I guess the constant will really be negligible.
>
> >
> >>
> >>>>      bool   trunc = false;
> >>>>
> >>>>      if (dst == end)
> >>>
> >>> Out of curiosity what if `end < dst`?
> >>
> >> The behavior is undefined.  That's by design.  In the definition of stpecpy(3) I
> >> have currently in libstp, I even tell the compiler to optimize on that condition:
> >> <http://www.alejandro-colomar.es/src/alx/alx/libstp.git/tree/include/stp/stpe/stpecpy.h#n33>
> >>
> >
> > You could probably optimize out one of the branches along the line of:
> > if((dst - 1UL) >= (end - 1UL)) {
> >      // if dst == NULL, then dst - 1UL -> SIZE_MAX and must be >= any value.
>
> You would need a cast, wouldn't you?  Otherwise, you'll get pointer arithmetic.
> Pointer arithmetic with NULL is UB.
>
> >      // if dst == end, then (dst - 1UL) >= (end - 1UL) will be true.
> >      return NULL;
>
> Returning NULL on truncation would be a possibility, but then we'd need to use
> errno to tell the user if the error was truncation or an input NULL (which
> reports an error to a previous vsnprintf(3) call wrapped by [v]stpeprintf().

I'm not sure I see what you mean. Your current logic is:
```
   if (dst == end)
     return NULL;
   if (dst == NULL)
     return NULL;
```
Equivalent (since dst >= end || dst == NULL is required) is:
```
if((dst - 1UL) >= (end - 1UL)) {
    return NULL;
}
```
May need to be cast to a `uintptr` or something but don't see
what you mean about needing to check errno and such.

>
> Using errno would probably counter any optimization, since you'd still need one
> more branch for setting errno, so I guess it's simpler to just use end for
> truncation.
>
>
> Oooor, if we reimplement __vsnprintf_internal(3) to work on size_t and never
> fail, then we could add a [v]stpeprintf(3) that never fails, and then this
> function would only bail out on truncation.
>
> Would it be possible to make __vsnprintf_internal() never fail?  What are the
> current failing conditions; only a size greater than INT_MAX, or are there more
> errors?

Don't think its worth reimplementing    __vsnprintf_internal to save a single
branch here.
>
> > }
> >>
> >> alx@asus5775:~/src/alx/libstp$ grepc -tfd stpecpy
> >> ./include/stp/stpe/stpecpy.h:21:
> >> inline char *stp_nullable
> >> stpecpy(char *stp_nullable dst, char *end, const char *restrict src)
> >> {
> >>          bool    trunc;
> >>          size_t  dsize, dlen, slen;
> >>
> >>          slen = strlen(src);
> >>
> >>          if (dst == end)
> >>                  return end;
> >>          if (stp_unlikely(dst == NULL))  // Allow chaining with stpeprintf().
> >>                  return NULL;
> >>          stp_impossible(dst > end);
> >>
> >>          dsize = end - dst;
> >>          trunc = (slen >= dsize);
> >>          dlen = trunc ? dsize - 1 : slen;
> >>          dst[dlen] = '\0';
> >>
> >>          return mempcpy(dst, src, dlen) + trunc;
> >> }
> >> alx@asus5775:~/src/alx/libstp$ grepc -tm stp_impossible
> >> ./include/stp/_compiler.h:14:
> >> #define stp_impossible(e)   do                                                \
> >> {                                                                             \
> >>          if (e)                                                                \
> >>                  stp_unreachable();                                            \
> >> } while (0)
> >> alx@asus5775:~/src/alx/libstp$ grep -rnC1 define.stp_unreachable
> >> include/stp/_compiler.h-28-#if defined(unreachable)
> >> include/stp/_compiler.h:29:# define stp_unreachable()  unreachable()
> >> include/stp/_compiler.h-30-#else
> >> include/stp/_compiler.h:31:# define stp_unreachable()  __builtin_unreachable()
> >> include/stp/_compiler.h-32-#endif
> >>
> >>
> >> I'd do that for glibc, but I don't see any facility.  Maybe we should add an
> >> __impossible() macro to document UB, and help the compiler.
> >
> > Does it result in any improved codegen? If not seems like
> > making it fail more noisily is always a win.
>
> Both Clang and GCC generate the same code with or without the hint that it's
> impossible.  Anyway, I'll keep it in my source code because it also helps tell
> the programmer that dst>end was taken into consideration and explicitly outlawed.
>
> The 'end' pointer is expected to be always generated as 'buf + sizeof(buf)'.
> Doing something different is not what this API is designed for, and should be
> warned by compilers.  'end' should be a pointer to one after the last byte in an
> array.  Thus, no valid pointer can be greater than end.  If you use this API as
> expected, which is, only chain it with itself and with stpeprintf(3), then it is
> impossible to have dst>end.  But as always, GIGO.
>
> As for the expected result, it would be akin calling strlcpy(3) with a negative
> size.  It would wrap around size_t, and give something close to 2^64.  Both
> would result in a buffer overrun, so writing at random memory, and later a
> crash, but I don't expect that libc should try to detect if the input to
> strlcpy(3) (or actually, any mem*() function) is huge, and neither if input to
> stpecpy(3) is similarly broken.
>
> --
> <http://www.alejandro-colomar.es/>