From: Rich Felker <dalias@libc.org>
To: Michael Matz <matz@suse.de>
Cc: Richard Henderson <rth@redhat.com>,
"H.J. Lu" <hjl.tools@gmail.com>, Jan Hubicka <hubicka@ucw.cz>,
Alexander Monakov <amonakov@ispras.ru>,
GCC Patches <gcc-patches@gcc.gnu.org>,
Uros Bizjak <ubizjak@gmail.com>
Subject: Re: [PATCH i386] Allow sibcalls in no-PLT PIC
Date: Wed, 20 May 2015 14:17:00 -0000 [thread overview]
Message-ID: <20150520140946.GU17573@brightrain.aerifal.cx> (raw)
In-Reply-To: <alpine.LSU.2.20.1505201400400.27315@wotan.suse.de>
On Wed, May 20, 2015 at 02:10:41PM +0200, Michael Matz wrote:
> Hi,
>
> On Tue, 19 May 2015, Richard Henderson wrote:
>
> > It is. The relaxation that HJ is working on requires that the reads
> > from the got not be hoisted. I'm not especially convinced that what
> > he's working on is a win.
> >
> > With LTO, the compiler can do the same job that he's attempting in the
> > linker, without an extra nop. Without LTO, leaving it to the linker
> > means that you can't hoist the load and hide the memory latency.
>
> Well, hoisting always needs a register, and if hoisted out of a loop
> (which you all seem to be after) that register is live through the whole
> loop body. You need a register for each different called function in such
> loop, trading the one GOT pointer with N other registers. For
> register-starved machines this is a real problem, even x86-64 doesn't have
> that many. I.e. I'm not convinced that this hoisting will really be much
> of a win that often, outside toy examples. Sure, the compiler can hoist
> function addresses trivially, but I think it will lead to spilling more
> often than not, or alternatively the hoisting will be undone by the
> register allocators rematerialization. Of course, this would have to be
> measured for real not hand-waved, but, well, I'd be surprised if it's not
> so.
The obvious example where it's useful on x86_64 is a major class:
anything where the majority of the callee's data is floating point and
thus kept in xmm registers. In that case register pressure is a lot
lower, and there's also an obvious class of cross-DSO functions calls
you'd be making over and over again: anything from libm.
Rich
next prev parent reply other threads:[~2015-05-20 14:10 UTC|newest]
Thread overview: 106+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-05-04 16:38 PIC calls without PLT, generic implementation Alexander Monakov
2015-05-04 16:38 ` [PATCH i386] Allow sibcalls in no-PLT PIC Alexander Monakov
2015-05-15 16:37 ` Alexander Monakov
2015-05-15 16:48 ` H.J. Lu
2015-05-15 20:08 ` Jan Hubicka
2015-05-15 20:23 ` H.J. Lu
2015-05-15 20:35 ` Rich Felker
2015-05-15 20:37 ` H.J. Lu
2015-05-15 20:45 ` Rich Felker
2015-05-15 22:16 ` H.J. Lu
2015-05-15 23:14 ` Jan Hubicka
2015-05-15 23:30 ` H.J. Lu
2015-05-15 23:35 ` H.J. Lu
2015-05-15 23:44 ` H.J. Lu
2015-05-16 0:18 ` Rich Felker
2015-05-16 14:33 ` H.J. Lu
2015-05-16 19:03 ` H.J. Lu
2015-05-16 19:32 ` Rich Felker
2015-05-16 23:23 ` H.J. Lu
2015-05-15 23:49 ` Rich Felker
2015-05-19 14:48 ` Michael Matz
2015-05-19 15:11 ` Jeff Law
2015-05-19 16:03 ` Michael Matz
2015-05-19 19:11 ` Rich Felker
2015-05-19 18:08 ` Rich Felker
2015-05-19 19:03 ` Richard Henderson
2015-05-19 19:10 ` H.J. Lu
2015-05-19 19:17 ` Richard Henderson
2015-05-19 19:20 ` H.J. Lu
2015-05-19 19:54 ` Richard Henderson
2015-05-19 20:27 ` Rich Felker
2015-05-19 20:44 ` H.J. Lu
2015-05-19 21:28 ` Rich Felker
2015-05-20 0:52 ` H.J. Lu
2015-05-20 1:09 ` Rich Felker
2015-05-22 19:32 ` Richard Henderson
2015-05-19 19:48 ` Rich Felker
2015-05-19 20:16 ` Richard Henderson
2015-05-20 12:13 ` Michael Matz
2015-05-20 12:40 ` H.J. Lu
2015-05-20 14:17 ` Rich Felker [this message]
2015-05-20 14:33 ` Michael Matz
2015-05-18 18:25 ` Alexander Monakov
2015-05-18 19:03 ` Jan Hubicka
2015-05-04 16:38 ` [PATCH] Expand PIC calls without PLT with -fno-plt Alexander Monakov
2015-05-04 17:34 ` Jeff Law
2015-05-04 17:40 ` Jakub Jelinek
2015-05-04 17:42 ` Jeff Law
2015-05-06 3:08 ` Rich Felker
2015-05-10 17:07 ` Jan Hubicka
2015-05-06 15:25 ` Alexander Monakov
2015-05-06 15:46 ` Jakub Jelinek
2015-05-06 15:55 ` Jeff Law
2015-05-06 16:44 ` Alexander Monakov
2015-05-06 17:35 ` Rich Felker
2015-05-06 18:26 ` H.J. Lu
2015-05-06 18:37 ` Rich Felker
2015-05-06 18:45 ` H.J. Lu
2015-05-06 19:01 ` Rich Felker
2015-05-06 19:05 ` H.J. Lu
2015-05-06 19:18 ` Rich Felker
2015-05-06 19:24 ` H.J. Lu
2015-05-11 11:48 ` Michael Matz
2015-05-11 14:20 ` Rich Felker
2015-05-07 18:22 ` Jeff Law
2015-05-07 19:13 ` H.J. Lu
2015-05-10 16:59 ` Jan Hubicka
2015-05-11 20:36 ` Jeff Law
2015-05-11 20:55 ` H.J. Lu
2015-05-11 22:13 ` Jan Hubicka
2015-06-22 15:52 ` Jiong Wang
2015-06-22 18:18 ` Alexander Monakov
2015-06-23 8:41 ` Ramana Radhakrishnan
2015-06-23 10:43 ` Alexander Monakov
2015-06-23 13:28 ` Jeff Law
2015-07-16 10:37 ` [AArch64] Tighten direct call pattern to repair -fno-plt Jiong Wang
2015-07-16 10:47 ` Alexander Monakov
2015-07-16 10:48 ` Jiong Wang
2015-07-21 12:52 ` [AArch64][sibcall]Tighten " Jiong Wang
2015-08-04 9:50 ` James Greenhalgh
2015-08-06 16:18 ` [COMMITTED][AArch64][sibcall]Tighten " Jiong Wang
2015-08-07 8:22 ` James Greenhalgh
2015-08-07 13:28 ` Jiong Wang
2015-08-04 9:50 ` [AArch64] Tighten " James Greenhalgh
2015-08-06 16:16 ` [COMMITTED][AArch64] " Jiong Wang
2015-05-04 16:38 ` [PATCH i386] Extend sibcall peepholes to allow source in %eax Alexander Monakov
2015-05-10 16:54 ` Jan Hubicka
2015-05-11 17:50 ` Alexander Monakov
2015-05-11 18:00 ` Jan Hubicka
2015-05-11 19:46 ` Uros Bizjak
2015-05-11 19:48 ` Jeff Law
2015-05-11 20:16 ` Jan Hubicka
2015-05-13 19:05 ` Alexander Monakov
2015-05-13 20:04 ` Jan Hubicka
2015-05-14 17:36 ` Alexander Monakov
2015-05-04 16:38 ` [PATCH i386] Move CLOBBERED_REGS earlier in register class list Alexander Monakov
2015-05-10 16:44 ` Jan Hubicka
2015-05-10 17:51 ` Uros Bizjak
2015-05-10 18:09 ` Uros Bizjak
2015-05-11 16:26 ` Alexander Monakov
2015-05-11 16:30 ` Uros Bizjak
2015-05-04 16:38 ` [PATCH i386] PR65753: allow PIC tail calls via function pointers Alexander Monakov
2015-05-10 16:37 ` Jan Hubicka
2015-05-11 16:11 ` Alexander Monakov
2015-05-04 16:38 ` [RFC PATCH] ira: accept loads via argp rtx in validate_equiv_mem Alexander Monakov
2015-05-04 17:37 ` Jeff Law
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150520140946.GU17573@brightrain.aerifal.cx \
--to=dalias@libc.org \
--cc=amonakov@ispras.ru \
--cc=gcc-patches@gcc.gnu.org \
--cc=hjl.tools@gmail.com \
--cc=hubicka@ucw.cz \
--cc=matz@suse.de \
--cc=rth@redhat.com \
--cc=ubizjak@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).