Performance Characteristics of Fundamental Operations for Computational Estimation of Algorithmic Complexity

+2 A:

Intel has some information about their assembly implementation in their articles database. The good ones are pretty dense (like this 600-page PDF file), but they've got a lot of interesting information, including some tables with approximate latency times. There's also a table with some latency times for their 64-bit architecture, so you might be able to search for a similar 32-bit one if you want it.

I personally have no idea about any information for AMD's processors. Google might turn up some results, but I haven't used an AMD machine since the Athlon 3000 days, so I haven't had the need to look for this kind of information.

eldarerathis 2010-08-08 23:29:23

Ugh. That first one looks very close to what I was looking for. Now if I can just extract the information that I need from the forest of what I don't...

Jon Purdy 2010-08-08 23:36:15

The tables are *fairly* noticeable if you skim through...but, like I said, definitely dense...

eldarerathis 2010-08-08 23:37:19

A:

From what I know:
inc: min O(1) max O(log n)
add, sub: O(log n)
mul, div: O(n)

malloc: O(n*m) n is size allocated, m is number of previous allocations.
free: O(1) (sometimes O(log m)).

Dani 2010-08-08 23:39:59

As I said, algorithmic complexity is comparatively simple. I'm looking for specific details on which operations of a given algorithmic complexity are faster than which others, in terms of processor cycles.

Jon Purdy 2010-08-08 23:44:39

+2 A:

On most modern CPUs, the concept of "cycle time for a particular instruction" is not especially helpful. The pipeline will be handling multiple instructions at once, and they will be competing for various resources inside the CPU - so the performance of a given instruction can only be understood in the context of the surrounding instructions. And the details will vary significantly, within even the different models in a processor family.

Furthermore, if you're doing anything that is touching data, then cache behaviour is likely to be just as important as instruction execution times.

For x86: have a look at Agner Fog's "Software optimization resources".

Matthew Slattery 2010-08-09 00:06:27

+1 for the useful link.

Jon Purdy 2010-08-09 00:34:54

A:

The group of Reinhard Wilhelm in Saarbrücken does research on timing analysis, including cache behavior.

starblue 2010-08-09 09:30:32

ansaurus

tags:

views:

answers:

Performance Characteristics of Fundamental Operations for Computational Estimation of Algorithmic Complexity

related questions