syscall tracing overheads: utrace vs. kprobes

public inbox for systemtap@sourceware.org
 help / color / mirror / Atom feed

* syscall tracing overheads: utrace vs. kprobes
@ 2009-04-28 17:01 Frank Ch. Eigler
  2009-04-28 18:10 ` Roland McGrath
  2009-04-28 18:19 ` David Smith
  0 siblings, 2 replies; 5+ messages in thread
From: Frank Ch. Eigler @ 2009-04-28 17:01 UTC (permalink / raw)
  To: systemtap, utrace-devel

Hi -

In a few contexts, it comes up as to whether it is faster to probe
process syscalls with kprobes or with something higher level such as
utrace.  (There are other hypothetical options too (per-syscall
tracepoints) that could be measured this way in the future.)  

It was time to check the intuitions about the overheads.  So, choosing
a syscall that won't get short-circuited via vdso:

% cat foo.c
#include <unistd.h>

int main ()
{
  unsigned c;
  for (c=0; c<10000000; c++)
     (void) close (1000);
}
% gcc foo.c

Now we compare these scenarios:

# stap -e 'probe never {}' -t --vp 00001 -c a.out

Here, no actual probing occurs so we get a measurement of the plain
uninstrumented run time of ten million close(2)s.

# stap -e 'probe process.syscall {}' -t --vp 00001 -c a.out

Here, we intercept sys_close with a kprobe.  If the system is not too
busy, we should pick up only the close(2)s coming from a.out, though a
few close(2)'s executed by other processes may show up.

# stap -e 'probe syscall.close {}' -t --vp 00001 -c a.out

Here, we intercept all a.out's syscalls with utrace.  Other processes
are not affected at all, but other syscalls by a.out would be --
though in our test, there are hardly any of those.

Some typical results on my 2.66GHz 2*Xeon5150 machine runnin Fedora 9 -
2.6.27.12:

never:  
Pass 5: run completed in 740usr/3310sys/4155real ms.

kprobe: 
probe syscall.close (<input>:1:1), hits: 10000028, cycles: 176min/202avg/3632max
Pass 5: run completed in 750usr/9320sys/10193real ms.

utrace: 
probe process.syscall (<input>:1:1), hits: 10000025, cycles: 176min/209avg/184392max
Pass 5: run completed in 1670usr/6860sys/8645real ms.

So utrace added 4.5 seconds, and kprobes added 6.0 seconds to the
uninstrumented 4.1 second run time.  But wait: we should subtract the
time taken by the probe handler itself: 200ish cycles at 2.66 GHz,
which is about 0.75 seconds.  So the overheads are approximately:

never: n/a
kprobe: 5.2 seconds => 0.52 us per hit
utrace: 3.6 seconds => 0.36 us per hit

Note that these are microbenchmarks that represent an ideal case
compared to a larger run, since they probably fit comfily inside
caches.  They probably also undercount the probe handler's run time.

- FChE

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: syscall tracing overheads: utrace vs. kprobes
  2009-04-28 17:01 syscall tracing overheads: utrace vs. kprobes Frank Ch. Eigler
@ 2009-04-28 18:10 ` Roland McGrath
  2009-04-28 18:16   ` Frank Ch. Eigler
  2009-04-28 18:19 ` David Smith
  1 sibling, 1 reply; 5+ messages in thread
From: Roland McGrath @ 2009-04-28 18:10 UTC (permalink / raw)
  To: Frank Ch. Eigler; +Cc: systemtap, utrace-devel

Btw, it is probably wise to use the syscall() function just so you are
always sure you are testing system call details rather than libc details.

The standard microbenchmark is syscall(__NR_getpid).  That is the minimal
system call, vs. close that takes locks and so forth (so it's getting more
issues into the test than the one you are looking at).

The microbenchmark makes that seem like more of a sensical comparison than
it really is.  They are really apples and oranges.  The TIF_SYSCALL_TRACE
types (process.syscall) add some overhead to every system call.  The probe
types (kprobe/tracepoint/marker) add overhead only to the probed call.

In real situations, there will be many different syscalls made.  In tracing
scenarios where you are only probing a few individual ones (especially if
they are not the cheapest or most frequent), the distribution of overheads
will be quite different.

Thanks,
Roland

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: syscall tracing overheads: utrace vs. kprobes
  2009-04-28 18:10 ` Roland McGrath
@ 2009-04-28 18:16   ` Frank Ch. Eigler
  2009-04-28 18:45     ` Roland McGrath
  0 siblings, 1 reply; 5+ messages in thread
From: Frank Ch. Eigler @ 2009-04-28 18:16 UTC (permalink / raw)
  To: Roland McGrath; +Cc: systemtap, utrace-devel

Hi -

On Tue, Apr 28, 2009 at 11:10:10AM -0700, Roland McGrath wrote:
> [...]
> The microbenchmark makes that seem like more of a sensical comparison than
> it really is.  They are really apples and oranges.  The TIF_SYSCALL_TRACE
> types (process.syscall) add some overhead to every system call.  The probe
> types (kprobe/tracepoint/marker) add overhead only to the probed call.

Certainly, in general.  But in this specific test, only the under-test
system calls occurred in essnetially the whole system, so the overhead
measurements were in a way the bare minimums imposed by the kprobes
vs. utrace callback infrastructure itself.

> In real situations [...]  the distribution of overheads will be
> quite different.

Or rather, the basic overhead quanta measured above may be multiplied
along several different axes.


- FChE

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: syscall tracing overheads: utrace vs. kprobes
  2009-04-28 17:01 syscall tracing overheads: utrace vs. kprobes Frank Ch. Eigler
  2009-04-28 18:10 ` Roland McGrath
@ 2009-04-28 18:19 ` David Smith
  1 sibling, 0 replies; 5+ messages in thread
From: David Smith @ 2009-04-28 18:19 UTC (permalink / raw)
  To: Frank Ch. Eigler; +Cc: systemtap, utrace-devel

Frank Ch. Eigler wrote:
> Hi -
> 
> In a few contexts, it comes up as to whether it is faster to probe
> process syscalls with kprobes or with something higher level such as
> utrace.  (There are other hypothetical options too (per-syscall
> tracepoints) that could be measured this way in the future.)  

These scenarios are a bit wrong:

> Now we compare these scenarios:
> 
> # stap -e 'probe never {}' -t --vp 00001 -c a.out
> 
> Here, no actual probing occurs so we get a measurement of the plain
> uninstrumented run time of ten million close(2)s.

The above one is fine.

> # stap -e 'probe process.syscall {}' -t --vp 00001 -c a.out
> 
> Here, we intercept sys_close with a kprobe.  If the system is not too
> busy, we should pick up only the close(2)s coming from a.out, though a
> few close(2)'s executed by other processes may show up.
> 
> # stap -e 'probe syscall.close {}' -t --vp 00001 -c a.out
> 
> Here, we intercept all a.out's syscalls with utrace.  Other processes
> are not affected at all, but other syscalls by a.out would be --
> though in our test, there are hardly any of those.

These 2 are swapped:  the 'process.syscall' probe is a utrace-based
probe and the 'syscall.close' probe is a kprobe-based probe.

Note that in the results, the description and probe types matched correctly.

> Some typical results on my 2.66GHz 2*Xeon5150 machine runnin Fedora 9 -
> 2.6.27.12:
> 
> never:  
> Pass 5: run completed in 740usr/3310sys/4155real ms.
> 
> kprobe: 
> probe syscall.close (<input>:1:1), hits: 10000028, cycles: 176min/202avg/3632max
> Pass 5: run completed in 750usr/9320sys/10193real ms.
> 
> utrace: 
> probe process.syscall (<input>:1:1), hits: 10000025, cycles: 176min/209avg/184392max
> Pass 5: run completed in 1670usr/6860sys/8645real ms.
> 
> So utrace added 4.5 seconds, and kprobes added 6.0 seconds to the
> uninstrumented 4.1 second run time.  But wait: we should subtract the
> time taken by the probe handler itself: 200ish cycles at 2.66 GHz,
> which is about 0.75 seconds.  So the overheads are approximately:
> 
> never: n/a
> kprobe: 5.2 seconds => 0.52 us per hit
> utrace: 3.6 seconds => 0.36 us per hit
> 
> 
> Note that these are microbenchmarks that represent an ideal case
> compared to a larger run, since they probably fit comfily inside
> caches.  They probably also undercount the probe handler's run time.

-- 
David Smith
dsmith@redhat.com
Red Hat
http://www.redhat.com
256.217.0141 (direct)
256.837.0057 (fax)

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: syscall tracing overheads: utrace vs. kprobes
  2009-04-28 18:16   ` Frank Ch. Eigler
@ 2009-04-28 18:45     ` Roland McGrath
  0 siblings, 0 replies; 5+ messages in thread
From: Roland McGrath @ 2009-04-28 18:45 UTC (permalink / raw)
  To: Frank Ch. Eigler; +Cc: systemtap, utrace-devel

> Certainly, in general.  But in this specific test, only the under-test
> system calls occurred in essnetially the whole system, so the overhead
> measurements were in a way the bare minimums imposed by the kprobes
> vs. utrace callback infrastructure itself.

Yes.  That's why I meant to explain how these numbers are true but not
necessarily the numbers that matter.

> > In real situations [...]  the distribution of overheads will be
> > quite different.
> 
> Or rather, the basic overhead quanta measured above may be multiplied
> along several different axes.

Yes.

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2009-04-28 18:45 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2009-04-28 17:01 syscall tracing overheads: utrace vs. kprobes Frank Ch. Eigler
2009-04-28 18:10 ` Roland McGrath
2009-04-28 18:16   ` Frank Ch. Eigler
2009-04-28 18:45     ` Roland McGrath
2009-04-28 18:19 ` David Smith

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).