ansaurus

Question

Is there any performance difference between for() and while()?

Answer 1

+13 A:

If your for and while loops do the same things, the machine code generated by the compiler should be (nearly) the same.

For instance in some testing I did a few years ago,

for (int i = 0; i < 10; i++)
{
...
}

and

int i = 0;
do
{
  ...
  i++;
}
while (i < 10);

would generate exactly the same code, or (and Neil pointed out in the comments) with one extra jmp, which won't make a big enough difference in performance to worry about.

Paul Tomblin 2009-05-11 12:31:34

"should be": Have you any hard evidence to support your claim?

Binary Worrier 2009-05-11 12:50:04

Actually, on my VC++ 6.0 compiler, they do not generate exactly the same code. The for() verrsion generates an extra jmp instruction.

anon 2009-05-11 12:50:08

@Neil: How about posting your experiment as an answer which can be reproduced.

sharkin 2009-05-11 13:16:10

Before people start getting excited, I should point out the jmp is only taken once, no matter how many iteration of the loop. My point was the code is not identical and that one should check these things empirically.

anon 2009-05-11 13:16:32

anon 2009-05-11 13:17:31

Why did you change the while to a do-while? It might be infinitestimally faster in some cases, but it will burn you in others.

aib 2009-05-11 15:08:44

I changed the while to a do-while because remembering back to the tests I made a few years ago, the for loop does a jmp to the test at the end. I wish I was at home so I could generate the code and compare it directly rather than trying to remember it.

Paul Tomblin 2009-05-11 15:28:42

Answer 2

A:

most likely: not at all

ammoQ 2009-05-11 12:32:05

Answer 3

+3 A:

No. If they're doing equivalent things, they'll compile to the same code - as you say, it's about semantics. Choose the one that best represents what you're trying to express.

RichieHindle 2009-05-11 12:32:16

Answer 4

+5 A:

Both are equivalent. It's a matter of semantics.

The only difference may lie in the do... while construct, where you postpone the evaluation of the condition until after the body, and thus may save 1 evaluation.

i = 1; do { ... i--; } while( i > 0 );

as opposed to

for(  i = 1; i > 0; --i )
{ ....
}

xtofl 2009-05-11 12:32:26

Answer 5

A:

Probably only coding style.

for if you know the number of iterations.
while if you do not know the number of iterations.

Max 2009-05-11 12:32:37

Answer 6

+62 A:

Short answer: no, they are exactly the same.

Guess it could in theory depend on the compiler; a really broken one might do something slightly different but I'd be surprised.

Just for fun here are two variants that compile down to exactly the same assembly code for me using x86 gcc version 4.3.3 as shipped with Ubuntu. You can check the assembly produced on the final binary with objdump on linux.

int main()
{
#if 1
    int i = 10;
    do { printf("%d\n", i); } while(--i);
#else
    int i = 10;
    for (; i; --i) printf("%d\n", i);
#endif
}

EDIT: Here is an "oranges with oranges" while loop example that also compiles down to the same thing:

    while(i) { printf("%d\n", i); --i; }

akent 2009-05-11 12:38:39

+1 for performing an experiment, and not just "expressing an opinion"

Binary Worrier 2009-05-11 12:50:57

Another +1 for due diligence by performing an experiment. Great answer!

Adam McKee 2009-05-11 13:27:37

-1 for comparing apples (for) to oranges (do-while) which in general are not equivalent. You should use the functionally equivalent while loop instead.

starblue 2009-05-11 14:43:35

+1 for saying it really depends on the compiler. it compiles to exactly the same code with my eco32 gcc backend too (risc, very simple architecture) (gcc 4.4)

Johannes Schaub - litb 2009-05-11 15:24:09

It should be "while", not "do-while". Only the #else case tests "i" before the first printf.

Shmoopty 2009-05-11 15:55:54

@akent: This would be a great answer to front this question if you could update with the assembly output of both the gcc and vc6 compilers.

sharkin 2009-05-11 19:49:15

Even clang compiles down to the same IR code after optimization: http://codepad.org/4raok1fN , using this command: clang-cc -emit-llvm main.c -o - | llvm-as | opt -std-compile-opts | llvm-dis

Johannes Schaub - litb 2009-05-11 20:00:39

@Shmoopty Hmm, possibly, but seems like GCC optimises that compare away. Another option is writing the while loop as: while(i) { printf("%d\n", i); --i; } --- this too is identical to the other two cases.

akent 2009-05-11 21:58:02

@Binary Worrier: Many times, opinions expressed are results of past experiments and not just blind guesswork.

Mehrdad Afshari 2009-05-12 13:32:17

@Mehrdad: while that is true, it is not helpful on a site like this. Providing an answer without evidence (or at least a description of how the asker can gather such evidence themselves) is of minimal value, and not much better than an unfounded opinion.

Dolphin 2009-06-19 15:55:23

Answer 7

A:

They are the same as far as performance goes. I tend to use while when waiting for a state change (such as waiting for a buffer to be filled) and for when processing a number of discrete objects (such as going through each item in a collection).

Ferruccio 2009-05-11 12:46:02

Answer 8

+3 A:

Ideally it should be the same, but eventually it depends on your compiler/interpreter. To be sure, you must measure or examine the generated assembly code.

Proof that there may be a difference: These lines produce different assembly code using cc65.

for (; i < 1000; ++i);   
while (i < 1000) ++i;

kotlinski 2009-05-11 13:02:49

Answer 9

+6 A:

The scope of the variable in the test of the while loop is wider than the scope of variables declared in the header of the for loop.

Therefore, if there are performance implications as a side-effect of keeping a variable alive longer, then there will be performance implications in choosing between a while and a for loop ( and not wrapping the while up in {} to reduce the scope of its variables ).

An example might be a concurrent collection which counts the number of iterators referring to it, and if more than one iterator exists, it applies locking to prevent concurrent modification, but as an optimisation elides the locking if only one iterator refers to it. If you then had two for loops in a function using differently named iterators on the same container, the fast path would be taken, but with two while loops the slow path would be taken. Similarly there may be performance implications if the objects are large (more cache traffic), or use system resources. But I can't think of a real example that I've ever seen where it would make a difference.

Pete Kirkham 2009-05-11 13:11:07

I like "wrapping the while up in {} to reduce the scope of its variables."

Thomas L Holaday 2009-05-11 14:06:20

Answer 10

+5 A:

Compilers that optimize using loop unrolling will probably only do so in the for-loop case.

codebolt 2009-05-11 13:42:54

Great point! Good insight that I think a lot of people forget about. Optimization is an aspect that shouldn't be ignored.

Kekoa 2009-05-11 16:35:32

Answer 11

+1 A:

On Atmel ATMega while() is faster than for(). Why is this is explained in AVR035: Efficient C Coding for AVR.

P.S. Original platform was not mentioned in question.

jpou 2009-05-11 13:48:07

Answer 12

+8 A:

There is no semantic difference, there need not be any compiled difference. But it depends on the compiler. So I tried with with g++ 4.3.2, CC 5.5, and xlc6.

g++, CC were identical, xlc WAS NOT

The difference in xlc was in the initial loop entry.

extern int doit( int );
void loop1( ) {
  for ( int ii = 0; ii < 10; ii++ ) {
    doit( ii );
  }
}

void loop2() {
  int ii = 0; 
  while ( ii < 10 ) {
    doit( ii );
    ii++;
  }
}

XLC OUTPUT

.loop2:                                 # 0x00000000 (H.10.NO_SYMBOL)
    mfspr   r0,LR
    stu     SP,-80(SP)
    st      r0,88(SP)
    cal     r3,0(r0)
    st      r3,64(SP)
    l       r3,64(SP)  ### DIFFERENCE ###
    cmpi    0,r3,10
    bc      BO_IF_NOT,CR0_LT,__L40
...
enter code here
.loop1:                                 # 0x0000006c (H.10.NO_SYMBOL+0x6c)
    mfspr   r0,LR
    stu     SP,-80(SP)
    st      r0,88(SP)
    cal     r3,0(r0)
    cmpi    0,r3,10    ### DIFFERENCE ###
    st      r3,64(SP)
    bc      BO_IF_NOT,CR0_LT,__La8
...

Sanjaya R 2009-05-11 13:49:30

Answer 13

+1 A:

continue behaves differently in for and while: in for, it alters the counter, in while, it usually doesn't

dmityugov 2009-05-12 13:33:19

which counter are you talking about?

sharkin 2009-05-13 07:50:59

in the code below, i is a counter:for ( i = 1; i < 10; ++i) { ... }i = 1; while ( i < 10) { ... ; ++i }

dmityugov 2009-05-13 09:58:31

Answer 14

A:

To add another answer: In my experience, optimizing software is like a big, bushy beard being shaved off a man.

First you lop it off in big chunks with scissors (prune whole limbs off the call tree).
Then you make it short with an electric clipper (tweak algorithms).
Finally you shave it with a razor to get rid of the last little bit (low-level optimization).

The last is where the difference between for() and while() might, but probably won't, make a difference.

P.S. The programmers I know (who are all very good, and I suspect are a representative sample) basically go at it from the other direction.

Mike Dunlavey 2009-05-12 16:49:03

Answer 15

A:

There is a difference in some cases.

If you are at the point where that difference matters, you either need to pick a better algorithm or begin coding in assembly language. Trust me, coding in assembly is preferable to fixing your compiler version.

Joshua 2009-05-12 18:08:29

Answer 16

+3 A:

I write compilers. We compile all "structured" control flow (if, while, for, switch, do...while) into conditional and unconditional branches. Then we analyze the control-flow graph. Since a C compiler has to deal with general goto anyway, it is easiest to reduce everything to branch and conditional-branch instructions, then be sure to handle that case well. (A C compiler has to do a good job not just on handwritten code but also on automatically generated code, which may have many, many goto statements.)

Norman Ramsey 2009-05-13 03:37:07

+ I also have written compilers and other code generators. Low-level optimization is a great subject for people who write compilers. For people who USE compilers, there are far bigger ways to influence the speed of code, mainly by avoiding over-design.

Mike Dunlavey 2009-05-13 17:57:37

@Mike: hear, hear. And profiling!

Norman Ramsey 2009-05-14 00:18:29

Yeah, but I do have a bit of a gripe with profiling. Only some of them actually sample the whole call stack, and I don't think any of them tell what percent of samples contain any given call site, which is exactly what the call site costs. They are more interested in functions than function calls, so you gotta rummage around in the function to find what needs fixing.

Mike Dunlavey 2009-05-14 14:25:22

... Some newer ones give stats at the statement level, but they're still hung up on the notions of counting execution and/or timing duration. You gotta multiply those and then divide by some total time, within an interval of interest, to start getting close to what matters.

Mike Dunlavey 2009-05-14 14:53:55

... They are also hung up on the ideas that 1) the stats have to have high precision, and 2) heaven forbid we should slow down the program.

Mike Dunlavey 2009-05-14 14:57:11

@Mike: I am still waiting for a really good profiler. The PC-sampling ones, like gprof, are OK, but what's fielded deals badly with recursion and virtual method calls. There are things to like about cost-center profiling and about valgrind, but overall this is an area where I would love to see better tools and better minimum standards.

Norman Ramsey 2009-05-14 23:56:44

Right. This was my attempt: http://stackoverflow.com/questions/368831/how-to-write-a-profiler/738647#738647

Mike Dunlavey 2009-05-15 00:10:09

... and another link, inspired by looking at LukeStackwalker: http://stackoverflow.com/questions/368831/how-to-write-a-profiler/874632#874632

Mike Dunlavey 2009-05-17 16:49:50

Answer 17

A:

Is while() faster/slower than for()? Let's review a few things about optimization:

Compiler-writers work very hard to shave cycles by having fewer calls to jump, compare, increment, and the other kinds of instructions that they generate.
Call instructions, on the other hand, consume many magnitudes more cycles, but the compiler is nearly powerless to do anything to remove those.
As programmers, we write lots of function calls, some because we mean to, some because we're lazy, and some because the compiler slips them in without being obvious.
Most of the time, it doesn't matter, because the hardware is so fast, and our jobs are so small, that the computer is like a beagle dog who wolfes her food and begs for more.
Sometimes, however, the job is big enough that performance is an issue.
What do we do then? Where's the bigger payoff?
- Getting the compiler to shave a few cycles off loops & such?
- Finding function calls that don't -really- need to be done so much?
The compiler can't do the latter. Only we the programmers can.
We need to learn or be taught how to do this. It doesn't come naturally. We are congenitally inclined to make wrong guesses and then bet on them. Getting better algorithms is a start, but only a start. Our teachers need to teach this, if indeed they know how.
Profilers are a start. I do this.

The apocryphal quote of Willie Sutton when asked Why do you rob banks?:
Because that's where the money is.

If you want to save cycles, find out where they are.

Mike Dunlavey 2009-05-13 18:44:13

ansaurus

tags:

views:

answers:

Is there any performance difference between for() and while()?

related questions