From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-oa1-x35.google.com (mail-oa1-x35.google.com [IPv6:2001:4860:4864:20::35]) by sourceware.org (Postfix) with ESMTPS id 26D703858D35 for ; Fri, 6 Jan 2023 23:20:47 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 26D703858D35 Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=gmail.com Received: by mail-oa1-x35.google.com with SMTP id 586e51a60fabf-1322d768ba7so3241117fac.5 for ; Fri, 06 Jan 2023 15:20:47 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=euCyrAqJDwvirCs/quc96Q4v6iP4Fhfns6bSAbqgNWk=; b=N0J3ta+sHH5qNTz2uAmz3TbOCFqRcKi/cxxHNL0Jd+Y7Fp0WG28s4hym9jPztThbWQ X5EbbbxX8pKiWGRixax9jExHQeWl1YTmRbhEOq6jaP7FjVOlTtfb5QhrrshMMKrlvFBO /grVG3PpLbdINCaG3keuF1YeH1mtqhe3ZTzBKcG+IoEBj9Aw876kU5q/o9+KIIzan/+a iSkr/AyLn3n2pgtWjKxmqiJITraQtdccvNOYJMqtshJgO5knsIbB6OY7fluyhTJCXk1a 9MvFahN+ubhVPxIDPUeMk374UyaVPP+9DKCAbgESP2+r0G4H/FckO0KkertTAy+UWcdD /uqA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=euCyrAqJDwvirCs/quc96Q4v6iP4Fhfns6bSAbqgNWk=; b=lruBc/ie2aRMOAC1pHwo38eTBBRsj87Hj9eebA15DuYulRnZVus4atbVsvtmyzBcai ixUhYiHCpK4XUAbpaW5I2/BoodSJ1A699KTqh6dE7D7zSXlefS2xSFaDbN0Xyzh6Wxbm YyzyI5cSvlp4jTy9sTl+6NGYyuZf0SuEU0YfBm5Htfb82oTSI24kLsVriZBpoHJh5N+b 3Ab740/OceOLDs5TBUXqfsS7M1FHJsDrw24TRIAJk6UuXlCY0dOp6yI+4Ht1rN3E0rD8 tNJmwlj3NqgKET00eKrnlhdKWGHK7ObPzWC+DQzQrJX+bTflPLsrUDTx96dNg3KnATOL nXCQ== X-Gm-Message-State: AFqh2kqr0geS/nHPfALmUx4/dzHqw37EvJdGFsKEaM7AfhfUutVLNpys LVynTAHPDK6/+I1Na8BIfM1yT2uHB/yBaEwlefaPrVWbP+0= X-Google-Smtp-Source: AMrXdXuYaTiHqd5qMiRHBGPWk5VYCtHPeItZzBwExW2aSDZvDoLf+cT+dD2XZeETomDjLLbQoMqjnKKwoltcpkqjFtc= X-Received: by 2002:a05:6870:d202:b0:148:2c02:5323 with SMTP id g2-20020a056870d20200b001482c025323mr5703085oac.298.1673047246275; Fri, 06 Jan 2023 15:20:46 -0800 (PST) MIME-Version: 1.0 References: <20230105210542.3573076-1-maskray@google.com> In-Reply-To: From: "H.J. Lu" Date: Fri, 6 Jan 2023 15:20:10 -0800 Message-ID: Subject: Re: [PATCH] ld: Allow R_X86_64_GOTPCREL for call *__tls_get_addr@GOTPCREL(%rip) To: Fangrui Song Cc: binutils@sourceware.org Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Spam-Status: No, score=-3021.1 required=5.0 tests=BAYES_00,BODY_8BITS,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM,GIT_PATCH_0,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,TXREP autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org List-Id: On Fri, Jan 6, 2023 at 3:02 PM Fangrui Song wrote: > > On Fri, Jan 6, 2023 at 2:42 PM H.J. Lu wrote: > > > > On Fri, Jan 6, 2023 at 1:44 PM Fangrui Song wrote: > > > > > > On Fri, Jan 6, 2023 at 1:27 PM H.J. Lu wrote: > > > > > > > > On Fri, Jan 6, 2023 at 1:25 PM Fangrui Song wr= ote: > > > > > > > > > > On Fri, Jan 6, 2023 at 1:14 PM H.J. Lu wro= te: > > > > > > > > > > > > On Fri, Jan 6, 2023 at 10:48 AM Fangrui Song wrote: > > > > > > > > > > > > > > On Fri, Jan 6, 2023 at 9:04 AM H.J. Lu = wrote: > > > > > > > > > > > > > > > > On Thu, Jan 5, 2023 at 1:06 PM Fangrui Song via Binutils > > > > > > > > wrote: > > > > > > > > > > > > > > > > > > _Thread_local int a; > > > > > > > > > int main() { return a; } > > > > > > > > > > > > > > > > > > % gcc -fno-plt -fpic a.c -fuse-ld=3Dbfd -Wa,-mrelax-reloc= ations=3Dno > > > > > > > > > /usr/bin/ld.bfd: /tmp/ccSSBgrg.o: TLS transition from R_X= 86_64_TLSGD to R_X86_64_GOTTPOFF against `a' at 0xd in section `.text' fail= ed > > > > > > > > > /usr/bin/ld.bfd: failed to set dynamic section sizes: bad= value > > > > > > > > > collect2: error: ld returned 1 exit status > > > > > > > > > > > > > > > > > > This commit fixes the issue. > > > > > > > > > > > > > > > > > > PR ld/24784 > > > > > > > > > * bfd/elf64-x86-64.c (elf_x86_64_check_tls_transition= ): Allow > > > > > > > > > R_X86_64_GOTPCREL. > > > > > > > > > --- > > > > > > > > > bfd/elf64-x86-64.c | 2 +- > > > > > > > > > 1 file changed, 1 insertion(+), 1 deletion(-) > > > > > > > > > > > > > > > > > > diff --git a/bfd/elf64-x86-64.c b/bfd/elf64-x86-64.c > > > > > > > > > index 914f82d0151..095fe2e0fe6 100644 > > > > > > > > > --- a/bfd/elf64-x86-64.c > > > > > > > > > +++ b/bfd/elf64-x86-64.c > > > > > > > > > @@ -1241,7 +1241,7 @@ elf_x86_64_check_tls_transition (bf= d *abfd, > > > > > > > > > if (largepic) > > > > > > > > > return r_type =3D=3D R_X86_64_PLTOFF64; > > > > > > > > > else if (indirect_call) > > > > > > > > > - return r_type =3D=3D R_X86_64_GOTPCRELX; > > > > > > > > > + return (r_type =3D=3D R_X86_64_GOTPCRELX || r= _type =3D=3D R_X86_64_GOTPCREL); > > > > > > > > > else > > > > > > > > > return (r_type =3D=3D R_X86_64_PC32 || r_type= =3D=3D R_X86_64_PLT32); > > > > > > > > > } > > > > > > > > > -- > > > > > > > > > 2.39.0.314.g84b9a713c41-goog > > > > > > > > > > > > > > > > > > > > > > > > > Since the new TLS sequence was added after R_X86_64_GOTPCRE= LX was > > > > > > > > required for call, R_X86_64_GOTPCREL should be invalid in t= his TLS sequence. > > > > > > > > > > > > > > > > -- > > > > > > > > H.J. > > > > > > > > > > > > > > I have multiple arguments (albeit no single one is very stron= g) that > > > > > > > this 1-deletion-1-addition change provides benefits for users= (IMHO > > > > > > > with no burden to binutils at all). > > > > > > > > > > > > > > Some projects may add -Wa,-mrelax-relocations=3Dno to work ar= ound older > > > > > > > GNU ld. Then the project's toolchain requirement may increase= and no > > > > > > > longer need to work around older GNU ld. > > > > > > > But a distribution may for some reason use a global -fno-plt = (e.g. > > > > > > > Arch Linux) and then run into this TLS GD/LD->IE/LE optimizat= ion > > > > > > > issue. > > > > > > > > > > > > > > rust src/ci/docker/host-x86_64/*musl/Dockerfile > > > > > > > openjdk/jdk19u make/autoconf/flags-cflags.m4 (this file appea= rs to be > > > > > > > copied into quite a few projects) > > > > > > > Linux kernel arch/x86/boot/compressed/Makefile (not a good ex= ample as > > > > > > > it doesn't use TLS AFAICT) > > > > > > > > > > > > > > R_X86_64_GOTPCREL isn't purely usefull. It may help linker de= sign: for > > > > > > > R_X86_64_GOTPCRELX/R_X86_64_REX_GOTPCRELX, the linker can mak= e a > > > > > > > decision upfront whether a GOT entry is needed > > > > > > > (this affects the size of .got, which may affect section layo= ut and > > > > > > > whether other relocations may overflow). > > > > > > > This may increase risk of 32-bit relocation overflow. > > > > > > > R_X86_64_GOTPCREL can mitigate the risk while being aware to = the user. > > > > > > > > > > > > > > rustc somehow disables x86 relaxed relocations and defaults t= o `-Z > > > > > > > > > > > > Why is that? > > > > > > > > > > It's assuredly a rust's problem and I am trying to fix that in > > > > > https://github.com/rust-lang/rust/pull/106511 > > > > > > > > > > The -Wa,-mrelax-relocations=3Dno problem may affect more package= s. > > > > > > > > -mrelax-relocations=3Dno should be a workaround for the older linke= r. It > > > > shouldn't be used with the current linker. > > > > > > A project may choose to work with many linker versions. > > > For simplicity, before it decides to drop compatibility with GNU > > > ld<2.26 (AIUI GOTPCRELX was supported in 2.26), > > > it may unconditionally add -Wa,-mrelax-relocations=3Dno, instead of > > > > -mrelax-relocations=3Dno is only supported with the newer binutils. > > The relocatable object file producer and the consumer may be on > different machines and use different binutils versions. > https://github.com/rust-lang/rust/commit/305aca86f9d8d132650b495f610f9abe= 5239fec6 > added -Wa,-mrelax-relocations=3Dno so that the relocatable object files > can be used on a user machine with an old ld. But -fno-plt may not work with the old linker. For this matter, -Wa,-mrelax-relocations=3Dno doesn't work with ld today. When -Wa,-mrelax-relocations=3Dno is used, no features from newer linkers can be used. > https://github.com/IHaskell/IHaskell/issues/636 and > https://github.com/dcos/dcos/commit/facda25019e07051f501b39720b4e71049bd0= 030 > likely use the same argument. > > In other cases, the project may use -Wa,-mrelax-relocations=3Dno with > Clang (where they assume a not-too-old version), but need to work with > system ld (which may be old). This shouldn't happen with as and ld from the same binutils. > > > doing configure work to check linker support. > > > > The TLS sequence from -fno-plt doesn't work for the older linker. > > The older linker support should be dropped for -fno-plt. > > > > > Now a user may use -fno-plt (Arch Linux, rustc, maybe Alpine) and run > > > into the aforementioned TLS problem. > > > > > > This 1-deletion-1-addition change can address this issue with no > > > maintenance burden on binutils side in my opinion, > > > so I made this patch. > > > > > > The linker design I described is true as well. Whether GOTPCRELX lead= s > > > to a GOT entry can be decided at relocation scanning time, before the > > > section layout is decided. > > > Users may make a conscious decision to use GOTPCREL to avoid potentia= l > > > relocation overflow risk. > > > > > > GOTPCREL isn't really dead. It can be used with Intel LAM and tagged > > > global variables (with non-zero high address bits) > > > https://reviews.llvm.org/D111343 > > > GOTPCREL instead of GOTPCRELX makes it clear an instruction > > > referencing the variable isn't supposed to be relaxed. > > > > The address of the local symbol, foo, in > > > > movq foo@GOTPCREL(%rip), %rax > > > > is assigned by the linker. I am not sure how the tag is involved here. > > Besides, it is the call instruction here. > > This is an auxiliary argument. I wanted to emphasize that GOTPCREL > isn't dead and did not intend to use it with this call instruction. > If GOTPCRELX is used and the distance between the current location and > the symbol is larger than 2**31, this will trigger a relocation > overflow. > > This happens with tagged globals with non-zero high address bits. This sounds needing some linker changes to add tags to data variables. I am not sure if GOTPCREL alone is sufficient. > A linker can fix the problem by avoiding relaxation, increasing the > size of .got . This requires that it scans relocations more than once. > If GOTPCRELX is decided upfront whether it needs relaxation or not, on > an arch which doesn't use range extension thunks like x86, technically > relocations can just be scanned once. > > > > > > > > plt=3Dno` and now relies on llvm-project to work around the G= NU ld > > > > > > > compatibility issue. > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > > > > > > =E5=AE=8B=E6=96=B9=E7=9D=BF > > > > > > > > > > > > > > > > > > > > > > > > -- > > > > > > H.J. > > > > > > > > > > > > > > > > > > > > -- > > > > > =E5=AE=8B=E6=96=B9=E7=9D=BF > > > > > > > > > > > > > > > > -- > > > > H.J. > > > > > > > > > > > > -- > > > =E5=AE=8B=E6=96=B9=E7=9D=BF > > > > > > > > -- > > H.J. > > > > -- > =E5=AE=8B=E6=96=B9=E7=9D=BF --=20 H.J.