ansaurus

Question

What exactly does C++ profiling (google cpu perf tools) measure?

Answer 1

+1 A:

_write$UNIX2003 is probably referring to the write POSIX system call, which outputs to the terminal. I/O is very slow compared to almost anything else, so it makes sense that your program is spending a lot of time there if you are writing a fair bit of output.

I'm not sure why your program wouldn't speed up when you remove the output, but I can't really make a guess on only the information you've given. It would be nice to see some of the code, or even the perftools output when the cout statement is removed.

Tyler McHenry 2009-06-16 19:04:00

Answer 2

+2 A:

From my comments:

The numbers you get from your profiler say, that the program should be around 40% faster without the print statements.

The runtime, however, stays nearly the same.

Obviously one of the measurements must be wrong. That means you have to do more and better measurements.

First I suggest starting with another easy tool: the time command. This should get you a rough idea where your time is spend.

If the results are still not conclusive you need a better testcase:

Use a larger problem
Do a warmup before measuring. Do some loops and start any measurement afterwards (in the same process).

Tiristan: It's all in user. What I'm doing is pretty simple, I think... Does the fact that the file is open the whole time mean anything?

That means the profiler is wrong.

Printing 100000 lines to the console using python results in something like:

for i in xrange(100000):
    print i

To console:

time python print.py
[...]
real    0m2.370s
user    0m0.156s
sys     0m0.232s

Versus:

time python test.py > /dev/null

real    0m0.133s
user    0m0.116s
sys     0m0.008s

My point is: Your internal measurements and time show you do not gain anything from disabling output. Google Perf Tools says you should. Who's wrong?

ebo 2009-06-16 20:06:13

The operation definitely takes 10 seconds, all basically near the Slice::DoStep function. I don't understand how the _write can go away, after being 40%, and not have any impact...

Tristan 2009-06-16 20:14:24

That's the point. One number must be wrong. Find out which one. The numbers the profiler generates may not be statistically significant or plain wrong.

ebo 2009-06-16 20:20:00

Answer 3

A:

Google perftools collects samples of the call stack, so what you need is to get some visibility into those.

According to the doc, you can display the call graph at statement or address granularity. That should tell you what you need to know.

Mike Dunlavey 2009-06-18 18:14:54

ansaurus

tags:

views:

answers:

What exactly does C++ profiling (google cpu perf tools) measure?

related questions