ansaurus

Question

What is your favorite low-level profiling tool?

Answer 1

+1 A:

I don't do low-level programming now, but if I did, I would definitely look into dtrace; from what I've read it looks extremely interesting. For OS X users there's also shark.

Aeon 2008-09-16 18:46:25

Answer 2

+1 A:

valgrind is my tool of choice on Unix based systems.

JeffFoster 2008-09-16 19:05:11

Answer 3

A:

The main problem is that when you 'compile in' your benchmarking you potentially modify your results (depending on the how and when of the implementation). And with those low level things you're probably very influenced by your compiler's optimiizations.

But personally on Linux I have a soft spot for oprofile ( oprofile ) this is a system wide profiler which is embedded as a kernel module and periodically traces where you application is spending time. So this profiles your entire system, not only one application. But it could be that it isn't giving you enough granularity.

amo-ej1 2008-09-17 14:26:00

Answer 4

A:

I would advise not instrumenting your code to profile it. The best answer i can give would be to use PTU (Performance Tuning Utility) from Intel, it can be found here:

This utility is the direct descendant of VTune and provide the best available sampling profiler available. You'll be able to track where the CPU is spending or wasting time (with the help of the available hardware events), and this with no slowdown of your application or perturbation of the profile.

Fabien Hure 2008-09-21 12:13:19

Answer 5

+2 A:

valgrind has already been mentioned, but it's especially useful with the callgrind tool:

$ valgrind --tool=callgrind your_program

Then you can use KCacheGrind to visualize the data.

Torsten Marek 2008-09-21 12:29:01

Answer 6

A:

For Linux: Google Perftools

Faster than valgrind (yet, not so fine grained)
Does not need code instrumentation
Nice graphical output (--> kcachegrind)
Does memory-profiling, cpu-profiling, leak-checking

Weidenrinde 2008-09-23 07:55:54

Answer 7

A:

OK, you're describing a hot-spot situation - a tight loop that occupies a large fraction of time and does not contain function calls.

You want to know if changes you make are having any effect.

Here's what I would do:

To see what to change to make it faster, two methods, bone-simple:

1) Single-step through the inner loop, to see exactly what it's doing and why. Chances are pretty good I will see some things that might be done better.

and / or

2) Get it running in a big outer loop, and then manually interrupt it. Do this several times. The instructions / statements that account for the most time will appear in those samples roughly in proportion to their cost.

To tell if I've made any difference, another bone-simple technique:

Run it a billion times in an outer loop and count the seconds. That tells how many nanoseconds the inner loop takes.

Mike Dunlavey 2009-03-19 11:33:19

ansaurus

tags:

views:

answers:

What is your favorite low-level profiling tool?

related questions