ansaurus

Question

How to measure x86 and x86-64 assembly commands execution time in processor cycles?

Answer 1

+1 A:

http://stackoverflow.com/questions/3253387/can-i-read-the-virtual-clocksystem-clock-value

Quonux 2010-07-15 12:04:10

Answer 2

+1 A:

With modern CPU's, there are no simple tables to look up how long an instruction will take to complete (although such tables exist for some old processors, e.g. 486). Your best information on what each instruction does and how long it might take comes from the chip manufacturer. E.g. Intel's documentation manuals are quite good (there's also an optimisation manual on that page).

On pretty much all modern CPU's there's also the RDTSC instruction that reads the time stamp counter for the processor on which the code is running into EDX:EAX. There are pitfalls with this also, but essentially if the code you are profiling is representative of a real use situation, its execution doesn't get interrupted or shifted to another CPU core, then you can use this instruction to get the timings you want. I.e. surround the code you are optimising with two RDTSC instructions and take the difference in TSC as the timing. (Variances on timings in different tests/situations can be great; statistics is your friend.)

PhiS 2010-07-21 10:41:10

Answer 3

+1 A:

You can instrument your code using assembly (rdtsc and friends) or using a instrumentation API like PAPI. Accurately measuring clock cycles that were spent during the execution of one instruction is not possible, however - you can refer to your architecture developer manuals for the best estimates.

In both cases, you should be careful when taking into account effects from running on a SMP environment.

Michael Foukarakis 2010-07-21 10:47:06

ansaurus

tags:

views:

answers:

How to measure x86 and x86-64 assembly commands execution time in processor cycles?

related questions