views: 590
answers: 10
I'm writing some optimized C code that basically runs through an array and does something to each element. What it does depends on the current value of the element so something like:

for (i=0; i < a_len; i++) {
    if (a[i] == 0) {
        a[i] = f1(a[i]);
    } else if (a[i] % 2 == 0) {
        a[i] = f2(a[i]);
    } else {
        a[i] = 0;
    }
}

I'm returning to C after many years working in dynamic languages, where my practice has been to try to write straightforward code and not create lots of local variables for things that I can just refer to directly, like a[i] above. I am very much aware that best practices are to write readable code and trust that the compiler is smarter than you and will do good optimizations.

If I were writing the code above in assembler, I would load a[i] into a register once and then just use that value each time because I know that a[] is private memory and won't change between references. However, even a smart compiler might do a load every time because it can't be sure that the memory hasn't changed. (Or do I have to explicitly declare "a" volatile for the compiler to not make this optimization?).

So, my question is: should I expect better performance by rewriting with a local variable like so:

for (i=0; i < a_len; i++) {
    int val = a[i];
    if (val == 0) {
        a[i] = f1(val);
    } else if (val % 2 == 0) {
        a[i] = f2(val);
    } else {
        a[i] = 0;
    }
}

Or does stuff like -O3 take care of this automatically for me? The code I'm optimizing takes days to run, so even modest improvements will make a difference.

+13  A: 

The obvious answer is of course to first write it in the most readable/plain/understandable way, then compile it with as much optimization as you can get, and then benchmark and profile that.

There is no point in optimizing things before you even know if they are bottlenecks. If the compiler does that transform automatically, you're just making the code worse, spending time, and getting absolutely nothing in return. Except perhaps the feeling of being cool, but that fades with time. :)

unwind
A: 

Try and time it.

I guess that the compiler is smart enough to optimize it.

Georg
a normal profiler would struggle with this - we're talking individual instruction levels of profiling, not function level
Alnitak
A: 

An array in C is essentially a pointer.

Local variables are cheap.

I find the first example a tad simpler to read because I'm not questioning what "val" is for. If "val" and "a" had better names, I would venture to say the second example would improve readability.

An array is not 'essentially a pointer' it IS one.
Brock Woolf
brock, thanks for the pointer on pointers!
A: 
  1. Don't optimise unless you know you have to
  2. Your compiler will probably do the right thing
  3. I find the latter version easier to read
  4. (marginal) it's easier to isolate the latter version from side effects in a multi-threaded program
Alnitak
why the downvotes?
Alnitak
I guess someone doesn't agree with you.
Georg
I guess, but I'd like to understand why...
Alnitak
+7  A: 

Write it for readability first. Personally, I find that all the subscripting hurts my eyes, so I'd probably write it more like:

for (i=0; i < a_len; i++) {

    int val = a[i];  /* or whatever type */
    int result = 0;  /* default result */

    if (val == 0) {
        result = f1(val);
    } else if (val % 2 == 0) {
        result = f2(val);
    } 

    a[i] = result;
}

I'm guessing the compiler will generate similar code with optimizations cranked up. But I wouldn't be shocked if one or the other was slightly (only very slightly) better. And I'd bet that if one were, it would be the one using the locals.

Also, you might get a very slight improvement by changing from walking through the array with an index to walking through it with a pointer. Again, that's very compiler- and situation-dependent.

int *p;  /* pointer to the element type */
for (p = &a[0]; p < &a[a_len]; ++p) {

    int val = *p;    /* or whatever type */
    int result = 0;  /* default result */

    if (val == 0) {
        result = f1(val);
    } else if (val % 2 == 0) {
        result = f2(val);
    } 

    *p = result;
}

And, yes, I'm aware that these are micro-optimizations and generally should not even be worried about (please code for readability and correctness first) - I'm just pointing out some options for when the micro-optimization might be warranted (these suggestions have to be backed up with analysis of the particular situation).

As far as whether the compiler will repeatedly reload from something like a[i] or not, that depends on the flow of control and whether the object being accessed is a global or has had its address taken and passed to something else.

If the object is global or has had its address taken and you call a function, the compiler generally has to assume that the object could have been modified by the function and will have to reload it. Similar issues arise when pointers are used to pass information to functions. Using locals can help mitigate this issue, since a compiler can very easily determine that a local is not modified by a called function unless the address of the local is taken. Compilers can also try to solve this problem with some sort of global optimization (such as what MSVC does at link time).

Your example code probably isn't really hitting this problem even if array a is global, because you don't re-read the value from the array after you've called either of those functions (you only write to it).


I wonder why markdown is removing blank lines from the code-formatted blocks?

Michael Burr
Great answer. It's a shame you have to pander to the "no premature optimization" crowd instead of just answering the guy's question and assuming he knows what he's talking about, but I've seen good answers get downvoted into oblivion for not doing so.
Matt J
@Matt J - thanks, but be aware that I'm often in the "no premature optimization" crowd. Guess it depends on my mood. Here, I actually think that the more readable (to me) code is probably easier for the compiler to optimize too. But I doubt that this example code would actually see a difference.
Michael Burr
I certainly try not to optimize my own code prematurely, but when answering someone else's question, sometimes it is more appropriate to just answer the question, if there is no indication that the asker is incompetent.Often, the "do not optimize" answer is voted way above the helpful answer is all.
Matt J
+3  A: 

The functions f1 and f2 seem to share the same signature. How differently do they behave? Do you really need the check outside? Or can you embed the logic in one function?

If you have a if-else ladder instead of only two such functions, try to use an array of function pointers instead. Use the value of a[ i ] to index in to that array and call the correct function.

Hand-optimization often turns out to be error prone micro-optimization. It's best to leave this task to the compiler. If you really need to optimize, look at the big picture, think of algorithms, the design, layers etc.

As for your question: Yes, most compilers are likely to optimize out the memory read should a[ i ] be not declared volatile.

dirkgently
That was just an example. The question was really whether the memory read and offset arithmetic would be optimized out.
Nick
accepted for being the only answer so far to actually answer the question.
Nick
Sometimes the compiler will not optimize the code when it is dealing with a pointer that "may be aliased". In your case, if the compiler is getting function(int * a) then the compiler might assume the ptr to a is aliased and therefore won't optimize.
Trevor Boyd Smith
If you qualify the pointer as "int * restrict a" then the compiler will know that "a" is not being aliased and it will optimize.
Trevor Boyd Smith
restrict is a C99 addition. There are only a few C99-compatible implementations.
dirkgently
+4  A: 

Both versions generate exactly the same code in GCC, as long as -O or higher is turned on. So my suggestion is to do whichever way you like better (I prefer without the local variable).

Greg Rogers
A: 

Optimization hint

I would probably first look at using a pointer instead of a local variable copy; that way you also avoid storing the value twice.

int *val;  /* int or whatever type a[] is */
for (i=0; i < a_len; i++) {
    val = &a[i];
    if (*val == 0) {
        f1(val);  /* set the value inside f1 */
    } else if (*val % 2 == 0) {
        f2(val);  /* set the value inside f2 */
    } else {
        *val = 0;
    }
}

A hint for optimizing your code could be to avoid using '%' operator when you are not interested in the result. It depends on your compiler but this has proven way faster for me (using a macro for readability):

#define is_divisible(dividend, divisor) ((((dividend)/(divisor)) * (divisor))==(dividend))

use:

else if (is_divisible(val,2)) {

This is faster, at least in most cases I have tested.

Edit: it's true that the gain is not so large for modulo calculations using only '% 2'. But if you are ever looking at a value larger than 2 for the modulo operation and are only interested in whether the remainder is zero, then my macro has been faster with all the compilers I have used.

eaanon01
Matt J
Well, as I said, the ones I have worked on also calculate/return the remainder from the calculation. The macro does not look at anything other than zero, but I see your point when just using '% 2'.
eaanon01
A: 

You say "If I were writing the code above in assembler..." so I assume you know assembly language.

My advice: Have a look at the compiler output on critical sections of code, see what's really going on.

Jimmy J
+1  A: 

dirkgently's answer:

Yes, most compilers are likely to optimize out the memory read should a[ i ] be not declared volatile.

Sometimes the compiler will not optimize the code when it is dealing with a pointer that "may be aliased". In your case Nick, if you are passing "a" as a function parameter, function(int * a), then the compiler might assume the pointer "a" is aliased and therefore won't optimize.

If you qualify the pointer as "int * restrict a" then the compiler will know that "a" is not being aliased and it will optimize.

The only way to know 100% whether the compiler is optimizing is to check the assembly!

Trevor Boyd Smith