public inbox for libc-help@sourceware.org
 help / color / mirror / Atom feed
From: Adhemerval Zanella <adhemerval.zanella@linaro.org>
To: Richard Purdie <richard.purdie@linuxfoundation.org>,
	Libc-help <libc-help@sourceware.org>
Subject: Re: glibc 2.34 and Yocto Project's "uninative"
Date: Thu, 26 Aug 2021 08:16:03 -0300	[thread overview]
Message-ID: <6d63802a-dc3c-e2de-8db4-a0ec92bbb9f6@linaro.org> (raw)
In-Reply-To: <55b147c8807abe56c41670b6c3e20fdbec85f291.camel@linuxfoundation.org>



On 25/08/2021 17:11, Richard Purdie wrote:
> On Wed, 2021-08-25 at 12:05 -0300, Adhemerval Zanella wrote:
>>
>> On 20/08/2021 11:37, Richard Purdie via Libc-help wrote:
>>> Hi,
>>>
>>> Yocto Project is a cross compiling build system used to build customised
>>> Linux, RTOSs, firmware and more. It has some interesting technology. One piece
>>> of it is "pseudo" (http://git.yoctoproject.org/cgit.cgi/pseudo/) which is an 
>>> LD_PRELOAD which intercepts calls to libc and fakes root privileges. One way or
>>> another we've managed to keep that working with multiple libc versions for the
>>> last decade.
>>>
>>> A second piece of technology is uninative. We build a lot binary artefacts, some
>>> for the target, some as tools running natively on the build system. We have a
>>> cache of these artefacts people can share and reuse. Uninative is our way of
>>> allowing one binary to run on any host distro. We do that by shipping a ld+libc
>>> binary shim and change the interpreter in the native binary to point at our own.
>>> As long as the shim is the same version or later than system version, it works.
>>>
>>> In glibc 2.34, the merge of libdl and libpthread into libc is causing a problem
>>> for us. Basically, I've been able to make pseudo work and I can make uninative
>>> work however I can't make them both work together with glibc 2.34.
>>>
>>> Pseudo uses minimal dl calls from libdl (dlsym, dlvsym and dlerror) and has a
>>> pthread mutex, therefore it links to -ldl and -lpthread.
>>>
>>> The issue is that pseudo being an LD_PRELOAD, if linked against glibc 2.34
>>> doesn't "see" symbols in libdl and just links to libc. I did try forcing older
>>> versions of the symbols along the lines of:
>>>
>>> __asm__(".symver dlerror,dlerror@GLIBC_" DLSYMVER);
>>> __asm__(".symver dlvsym,dlvsym@GLIBC_" DLVSYMVER);
>>> __asm__(".symver dlsym,dlsym@GLIBC_" DLSYMVER);
>>>
>>> with some arch specific version number defines however even if I do this, and
>>> put back a libdl.so symlink to the lib, the linker ignores the weak wildcard
>>> reference and links back to libc.so itself. When we then use this preloaded lib
>>> on a system with an older libc:
>>>
>>> relocation error: [...]/libpseudo.so: symbol dlerror, version GLIBC_2.2.5 not defined in file libc.so.6 with link time reference
>>
>> Unfortunately this was never officially supported, since it is basically
>> trying to run a binary built in a newer glibc on an older one.  The
>> issue is not that linker put back libc.so, but rather that with GLIBC
>> 2.34 there is a lot of more default symbols that are tied to GLIBC_2.34
>> version.
> 
> Thanks for the reply. I do appreciate this isn't officially supported. I had
> hoped that by using minimal calls we might have been able to figure something
> out but I hadn't realised there were the other symbols being added. That
> certainly complicates it and explains a few things.
> 
>> For instance:
>>
>> $ cat dl.c 
>> #include <dlfcn.h>
>> #include <assert.h>
>>
>> int main (int argc, char *argv[])
>> {
>>   void *h = dlopen (argv[1], RTLD_NOW);
>>   assert (h != 0);
>>   return 0;
>> }
>>
>> asm (".symver dlopen,dlopen@GLIBC_2.2.5");
>>
>> $ readelf -Ws dl | grep GLIBC_2.34
>>      1: 0000000000000000     0 FUNC    GLOBAL DEFAULT  UND __libc_start_main@GLIBC_2.34 (2)
>>     24: 0000000000000000     0 FUNC    GLOBAL DEFAULT  UND __libc_start_main@GLIBC_2.34
>>
>>
>> The __libc_start_main has the additional issue that it is provided by crt1.o,
>> so the symver in the main TU won't solve it.  You will need to hack it with
>> the static linker, even by adding some wrap symbol to redirect to the 
>> __libc_start_main@GLIBC_2.2.5.
> 
> Needing to hack these kinds of internals starts to make me worry a lot about
> whether there are details that will trip things up at runtime.
> 
>>> I did try ensuring libpseudo.so has a RUNPATH that includes our uninative libc
>>> however the loader ignores that for loading libc. I did then add our uninative
>>> libc path to LD_LIBRARY_PATH however that fails:
>>
>> And I don't think this would work, it would force two libc with potentially
>> two different versions.
> 
> It was very clear that the ld and libc need to be matched from the kinds of
> mismatches and I gave up on this route quickly.
> 
>>>
>>> libc.so.6: symbol _dl_exception_create, version GLIBC_PRIVATE not defined in file ld-linux-x86-64.so.2 with link time reference
>>>
>>> since the ld being used doesn't match the libc.
>>>
>>> Can anyone see a way we could make things work?
>> I don't have easy library based solution for the requisites you posed,
>> building a newer glibc and running on a older one.  I think what *might*
>> work is to provide a auditor library linked against the newer glibc and
>> it then intercepts and routes the required library calls.  You will need
>> to redistribute the libc which the auditor was linked against.
> 
> The other libc is already there so that bit is easy. I hadn't thought about
> using an auditor library and I'll have to explore that.

The auditor interface work on some examples, but unfortunately we also found
that they are fully correct on some more specific cases.  We are aiming to fix
for 2.35 [1].

> 
> I was going to go down the route of dummy libraries to link against however I
> realised it was easier for testing just to pull down 2.33 binaries and use those
> to link against. I did have to replace use of pthread_atfork with
> __register_atfork which is ugly but probably okish for our use as it has been
> there since 2.3.2.

Yes, this is what I would suggest you (use an older glibc to link this against).

> 
> That probably gets us out the immediate problem although it does highlight we're
> doing something that isn't supported and could break again in ways we may
> struggle to fix. We can probably replace the pthread linkage for the mutex with
> direct code/syscall usage. We're going to need the libdl usage regardless though
> so it may be worth us figuring out the dummy library to link against for that
> piece.

Why do you need to build it on a recent glibc and potentially use it on an older
one?  

> 
> Thanks for the info though, it is much appreciated and it helps a lot to
> understand what was breaking and some other avenues like the auditor lib to
> explore.
> 
> Cheers,
> 
> Richard
> 
> 

[1] https://patchwork.sourceware.org/project/glibc/list/?series=2577

  reply	other threads:[~2021-08-26 11:16 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-08-20 14:37 Richard Purdie
2021-08-25 15:05 ` Adhemerval Zanella
2021-08-25 20:11   ` Richard Purdie
2021-08-26 11:16     ` Adhemerval Zanella [this message]
2021-08-26 11:29       ` Richard Purdie
2021-08-30  6:55 ` Mike Frysinger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=6d63802a-dc3c-e2de-8db4-a0ec92bbb9f6@linaro.org \
    --to=adhemerval.zanella@linaro.org \
    --cc=libc-help@sourceware.org \
    --cc=richard.purdie@linuxfoundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).