I'm looking for some real information about the effect of systemtap on performance. I know that static kernel probes are documented somewhere on their website (with stats), but what about userspace probes?
I saw this post on their mailing list (http://sourceware.org/ml/systemtap/2009-q1/msg00518.html) which points at a possibility of single-steping in some cases. Can anyone explain / point at some resources that list which techniques are used for which probes?
What overhead will process().function()
introduce? What about static marks?