Say you see a loop like this one:

for(int i=0;
    i<thing.getParent().getObjectModel().getElements(SOME_TYPE).count();
    ++i)
{
  thing.getData().insert(
    thing.getData().count(),
    thing.getParent().getObjectModel().getElements(SOME_TYPE)[i].getName()
    );
}

If this were Java I probably wouldn't think twice. But in performance-critical sections of C++, it makes me want to tinker with it... though I don't know whether the compiler is smart enough to make that futile. This is a made-up example, and all it does is insert strings into a container. Please don't assume any of these are STL types; think in general terms about the following:

  • Is the messy condition in the for loop going to be evaluated on every iteration, or only once?
  • If those get methods are simply returning references to member variables on the objects, will they be inlined away?
  • Would you expect custom [] operators to get optimized at all?

In other words, is it worth the time (in terms of performance only, not readability) to convert it to something like:

ElementContainer &source = 
   thing.getParent().getObjectModel().getElements(SOME_TYPE);
int num = source.count();
Store &destination = thing.getData();
for(int i=0;i<num;++i)
{
  destination.insert(destination.count(), source[i].getName());
}

Remember, this is a tight loop, called millions of times a second. What I wonder is whether all this will shave just a couple of cycles per iteration, or something more substantial.


Yes, I know the quote about "premature optimisation", and I know that profiling is important. But this is a more general question about modern compilers, Visual Studio in particular.

+1  A: 

If the loop is that critical, I can only suggest that you look at the generated code. If the compiler is allowed to aggressively optimise the calls away, then perhaps it will not be an issue. Sorry to say this, but modern compilers can optimise incredibly well, and I really would suggest profiling to find the best solution in your particular case.

Preet Sangha
+4  A: 

The general way to answer such questions is to look at the produced assembly. With gcc, this involves replacing the -c flag with -S.
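
For instance, a minimal sketch (the types here are hypothetical stand-ins, not the real classes from the question) that lets you diff the generated loops for the two versions:

// hoist_compare.cpp -- hypothetical repro for inspecting code generation.
// Produce assembly with, e.g.:  g++ -O2 -S hoist_compare.cpp
// (MSVC's rough equivalent is the /FA listing option.)
struct Source {
    int count() const;               // bodies deliberately not visible here
    const char* name(int i) const;
};
struct Dest {
    void append(const char* s);
};

void copy_unhoisted(Source& src, Dest& dst) {
    for (int i = 0; i < src.count(); ++i)   // is count() re-called each pass? check the .s
        dst.append(src.name(i));
}

void copy_hoisted(Source& src, Dest& dst) {
    const int n = src.count();              // evaluated exactly once, by construction
    for (int i = 0; i < n; ++i)
        dst.append(src.name(i));
}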

My own rule is not to fight the compiler. If something is to be inlined, I make sure the compiler has all the information it needs to perform that inlining, and (possibly) I try to urge it to do so with an explicit inline keyword.
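
For example (a sketch with a hypothetical Element class, not the asker's real one): an accessor defined inside the class body is implicitly inline, so every translation unit that includes the header has the body available and the compiler is free to inline the call away.

// element.h -- hypothetical
class Element {
public:
    // Defined in the class body: implicitly inline, visible to every caller.
    const char* getName() const { return name_; }
private:
    const char* name_ = nullptr;
};

If the body lived only in element.cpp, callers in other translation units could not inline it (short of link-time optimisation), however trivial it is.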

Also, inlining saves a few opcodes but makes the code grow, which, as far as L1 cache is concerned, can be very bad for performance.

Thomas Pornin
+2  A: 

All the questions you are asking are compiler-specific, so the only sensible answer is "it depends". If it is important to you, you should (as always) look at the code the compiler is emitting and do some timing experiments. Make sure your code is compiled with all optimisations turned on - this can make a big difference for things like operator[](), which is often implemented as an inline function, but which won't be inlined (in GCC at least) unless you turn on optimisation.
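
As a concrete illustration of that last point (made-up types, and behaviour that is typical rather than guaranteed):

struct Element { const char* name; };

// Hypothetical container: operator[] is defined in the class body, so it is
// implicitly inline and its body is visible to every caller.
class ElementContainer {
public:
    const Element& operator[](int i) const { return elements_[i]; }
private:
    const Element* elements_ = nullptr;
};

// Built without optimisation (g++ -O0, or an MSVC debug build), that
// operator[] typically remains a genuine function call inside your loop;
// with -O2 (or /O2) it is usually folded down to a plain indexed load.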

anon
+1  A: 

If the methods are small and can and will be inlined, then the compiler may do the same optimizations that you have done. So, look at the generated code and compare.

Edit: It is also important to mark const methods as const, e.g. in your example count() and getName() should be const to let the compiler know that these methods do not alter the contents of the given object.
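
A sketch of what that would look like on hypothetical stand-ins for the interfaces in the question:

// Hypothetical declarations; only the const qualifiers are the point here.
class Element {
public:
    const char* getName() const;   // promises not to modify the Element
};

class ElementContainer {
public:
    int count() const;             // promises not to modify the container
    const Element& operator[](int i) const;
};

Note that, as the comments below point out, const by itself does not let the compiler assume the results are loop-invariant; it mainly documents intent and catches accidental modification at compile time.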

frunsi
The note about const is not correct: const makes no such guarantee. A const object might contain mutable members, or the constness might be cast away.
Suma
@Suma: Yes, true, but at least it is a hint for the compiler. Maybe in practice the compiler ignores const for optimization purposes.
frunsi
A: 

I think in this case you are asking the compiler to do more than it legitimately can, given the scope of compile-time information it has access to. In particular cases the messy condition may be optimized away, but really, the compiler has no particularly good way to know what kind of side effects that long chain of function calls might have. I would assume that breaking the test out is faster unless I had benchmarking (or disassembly) showing otherwise.
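
A minimal sketch of why (hypothetical types again): if the bodies of the calls are not visible in the translation unit, the compiler generally has to assume the loop body might change what the condition returns.

// The method bodies live in some other .cpp file, so they are opaque here.
struct Store {
    int count() const;
    void insert(int pos, const char* name);
};
struct Source {
    int count() const;
    const char* nameAt(int i) const;
};

void copyNames(Source& src, Store& dst) {
    // Without seeing the bodies, the compiler cannot prove that insert()
    // leaves src.count() unchanged, so the condition gets re-evaluated on
    // every pass unless you hoist it yourself.
    for (int i = 0; i < src.count(); ++i)
        dst.insert(dst.count(), src.nameAt(i));
}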

This is one of the cases where a JIT compiler has a big advantage over a C++ compiler. It can, in principle, optimize for the most common case seen at runtime and emit specialized machine code for it (plus checks to make sure execution actually falls into that case). This sort of thing is done all the time for polymorphic method calls that turn out not to actually be used polymorphically; whether it could catch something as complex as your example, though, I'm not certain.

For what it's worth, if speed really mattered, I'd split it up in Java too.

Rex Kerr
Polymorphic inline caching of any self-respecting JIT will optimize this example easily. While in principle static C++ compilers could do PICs with whole program optimization and profile feedback, I'm not aware of any production compilers that would do it.
Ants Aasma
+1  A: 

As a rule, you should not have all that garbage in your for condition unless the result is actually going to change during loop execution.

Use another variable, set outside the loop. This will eliminate the WTF when reading the code, it will not negatively impact performance, and it will sidestep the question of how well those calls get optimized. If they are not optimized away, it will also result in a performance increase.

phkahler
++ My sentiments exactly. I would tend to do that even before I know it's actually a performance problem. BTW your self-description sounds really interesting.
Mike Dunlavey