views: 288
answers: 9

I want to measure the performance of a function with microsecond precision on the Windows platform.

Now, Windows itself only has millisecond granularity, so how can I achieve this?

I tried the following sample, but I am not getting correct results.

#include <windows.h>
#include <stdio.h>

int main()
{
    LARGE_INTEGER ticksPerSecond = {0};
    LARGE_INTEGER tick_1 = {0};
    LARGE_INTEGER tick_2 = {0};
    double uSec = 1000000;

    // Get the counter frequency (ticks per second)
    QueryPerformanceFrequency(&ticksPerSecond);

    // Calculate the ticks per microsecond
    double uFreq = ticksPerSecond.QuadPart / uSec;

    // Get the counter before the start of the operation
    QueryPerformanceCounter(&tick_1);

    // The operation itself
    Sleep(10);

    // Get the counter after the operation finished
    QueryPerformanceCounter(&tick_2);

    // And now the operation time in microseconds
    double diff = (tick_2.QuadPart / uFreq) - (tick_1.QuadPart / uFreq);
    printf("Elapsed: %f microseconds\n", diff);
    return 0;
}
+19  A: 

Run the operation in a loop a million times or so and divide the result by that number. That way you'll get the average execution time over that many executions. Timing one (or even a hundred) executions of a very fast operation is very unreliable, due to multitasking and whatnot.
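
A minimal sketch of this approach, reusing the QueryPerformanceCounter API from the question (DoWork() is a hypothetical stand-in for the operation under test):

#include <windows.h>
#include <stdio.h>

volatile int sink = 0;
void DoWork() { ++sink; } // stand-in for the operation under test

int main()
{
    const int iterations = 1000000;
    LARGE_INTEGER freq, start, end;

    QueryPerformanceFrequency(&freq);
    QueryPerformanceCounter(&start);

    // Run the operation many times so per-call noise averages out
    for (int i = 0; i < iterations; ++i)
        DoWork();

    QueryPerformanceCounter(&end);

    // Total elapsed microseconds, divided by the iteration count
    double totalUs = (end.QuadPart - start.QuadPart) * 1000000.0 / freq.QuadPart;
    printf("Average: %f microseconds per call\n", totalUs / iterations);
    return 0;
}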

Matti Virkkunen
Note: If the operation is fast enough, the loop itself might take up a measurable portion of the time you are trying to measure. Keep that overhead in mind.
Brian
Caching will take effect and speed up the operation if you loop over it.
PiNoYBoY82
Looping 1M times is nothing compared to the seconds it's going to run. Different story if you want to measure nanoseconds. Oh and just use a stopwatch ;-)
phkahler
A: 

If you're doing this for offline profiling, a very simple way is to run the function 1000 times, measure to the closest millisecond and divide by 1000.

Wim
+7  A: 
  • compile it
  • look at the assembler output
  • count the number of each instruction in your function
  • apply the cycles per instruction on your target processor
  • end up with a cycle count
  • multiply by the clock speed you are running at
  • apply arbitrary scaling factors to account for cache misses and branch mis-predictions lol

(man I am so going to get down-voted for this answer)

vicatcu
Not downvoting, I'll just note that the last line (cache misses and branch mis-predictions) pretty much screws the very careful count of cpu cycles you had obtained so far :p
Matthieu M.
+1 funny. However, if you're serious, this is terrible advice. As sarcasm it's a very good example of why to choose Matti's answer.
caspin
Downvoting? Not by me. I used to do that, actually. However, with caching now it doesn't really work. So I recommend the run-it-a-million-times method.
Mike Dunlavey
Depending on your architecture, this is a perfectly valid and accurate method. Not all processors have caches, branch prediction or multitasking. Although, I'd note that the number of cycles per instruction may be variable, even depending on the arguments...
Nathan Ernst
+3  A: 

No, you are probably getting an accurate result; QueryPerformanceCounter() works well for timing short intervals. What's wrong is your expectation of the accuracy of Sleep(). It has a resolution of 1 millisecond, but its accuracy is far worse: no better than about 15.625 milliseconds on most Windows machines.

To get it anywhere close to 1 millisecond, you'll have to call timeBeginPeriod(1) first. That probably will improve the match, ignoring the jitter you'll get from Windows being a multi-tasking operating system.
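
A sketch of how that call fits around the question's code (timeBeginPeriod/timeEndPeriod live in winmm.lib; the Sleep(10) is the operation from the question):

#include <windows.h>
#include <stdio.h>
#pragma comment(lib, "winmm.lib") // timeBeginPeriod/timeEndPeriod

int main()
{
    // Ask the scheduler for 1 ms timer resolution before timing Sleep()
    timeBeginPeriod(1);

    LARGE_INTEGER freq, t1, t2;
    QueryPerformanceFrequency(&freq);

    QueryPerformanceCounter(&t1);
    Sleep(10);
    QueryPerformanceCounter(&t2);

    double us = (t2.QuadPart - t1.QuadPart) * 1000000.0 / freq.QuadPart;
    printf("Sleep(10) took %f microseconds\n", us);

    // Restore the previous timer resolution when done
    timeEndPeriod(1);
    return 0;
}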

Hans Passant
Or use select() with a bogus fd to get a more accurate "sleep".
PiNoYBoY82
A: 

To get finer resolution than 1 ms, you will have to consult your OS documentation. There may be APIs that provide timers with microsecond resolution. If so, run your application many times and take the average.

Thomas Matthews
There is. It's called `QueryPerformanceCounter`. Exactly as mentioned by the OP.
Alan
A: 

Get a proper profiler.

DeadMG
A: 

Or you can use gettimeofday(), which gives you a timeval struct containing a timestamp with microsecond (µs) resolution.
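
A minimal sketch of that approach. Note that gettimeofday() is a POSIX call rather than part of the Win32 API, so on Windows it would only be available through a compatibility layer:

#include <sys/time.h>
#include <stdio.h>

int main()
{
    struct timeval t1, t2;

    gettimeofday(&t1, NULL);
    // ... the operation under test ...
    gettimeofday(&t2, NULL);

    // Elapsed time in microseconds
    long us = (t2.tv_sec - t1.tv_sec) * 1000000L + (t2.tv_usec - t1.tv_usec);
    printf("Elapsed: %ld microseconds\n", us);
    return 0;
}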

Moons
http://blogs.msdn.com/b/oldnewthing/archive/2005/09/02/459952.aspx
dan04
A: 

I like Matti Virkkunen's answer. Check the time, call the function a large number of times, check the time when you finish, and divide by the number of times you called the function. He did mention you might be off due to OS interrupts. You might vary the number of times you make the call and see a difference. Can you raise the priority of the process? Can you get it so that all the calls happen within a single OS time slice?

Since you don't know when the OS might swap you out, you can put this all inside a larger loop to do the whole measurement a large number of times, and save the smallest number, as that is the one that had the fewest OS interrupts. This still may be greater than the actual time for the function to execute, because it may still contain some OS interrupts.
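
A sketch of that outer-loop idea under the same assumptions as above (DoWork() is a hypothetical stand-in; the loop counts are arbitrary):

#include <windows.h>
#include <stdio.h>

volatile int sink = 0;
void DoWork() { ++sink; } // stand-in for the function being measured

int main()
{
    const int innerLoops = 100000; // calls timed per measurement
    const int outerRuns  = 50;     // measurements taken; keep the smallest
    LARGE_INTEGER freq, t1, t2;
    QueryPerformanceFrequency(&freq);

    double best = 1e30;
    for (int run = 0; run < outerRuns; ++run)
    {
        QueryPerformanceCounter(&t1);
        for (int i = 0; i < innerLoops; ++i)
            DoWork();
        QueryPerformanceCounter(&t2);

        double us = (t2.QuadPart - t1.QuadPart) * 1000000.0 / freq.QuadPart;
        if (us < best) best = us; // smallest time had the fewest interrupts
    }
    printf("Best average: %f microseconds per call\n", best / innerLoops);
    return 0;
}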

Jim Tshr
A: 

Sanjeet,

It looks (to me) like you're doing this exactly right. QueryPerformanceCounter is a perfectly good way to measure short periods of time with a high degree of precision. If you're not seeing the result you expected, it's most likely because the sleep isn't sleeping for the amount of time you expected it to! However, it is likely being measured correctly.

I want to go back to your original question about how to measure the time on windows with microsecond precision. As you already know, the high performance counter (i.e. QueryPerformanceCounter) "ticks" at the frequency reported by QueryPerformanceFrequency. That means that you can measure time with precision equal to:

1/frequency seconds

On my machine, QueryPerformanceFrequency reports 2337910 counts/sec. That means my computer's QPC can measure with a precision of 1/2337910 ≈ 4.277e-7 seconds, or 0.427732 microseconds; that is the smallest bit of time I can measure. This, of course, gives you the precision that you originally asked for :) Your machine's frequency should be similar, but you can always do the math and check it.
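
A quick sketch to check this on your own machine (the 2337910 figure above is just what this particular machine reported):

#include <windows.h>
#include <stdio.h>

int main()
{
    LARGE_INTEGER freq;
    QueryPerformanceFrequency(&freq);

    // The smallest measurable interval is one counter tick: 1/frequency seconds
    double precisionUs = 1000000.0 / (double)freq.QuadPart;
    printf("QPC frequency: %lld counts/sec\n", (long long)freq.QuadPart);
    printf("Precision: %f microseconds per tick\n", precisionUs);
    return 0;
}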

Matt