ansaurus

Question

How to synchronize C & C++ libraries with minimal performance penalty?

Answer 1

+1 A:

I don't think you'll notice much perf difference. Assuming your target platform support all your data types,

I'm coding for the DS and a few other ARM devices and floating points are evil...I had to typedef float to FixedPoint<16,8>

CVertex 2008-11-13 02:47:54

I actually use a typedef instead of float, but left it out here for simplicity.

Judge Maygarden 2008-11-13 02:51:58

Answer 2

+4 A:

If you're just wrapping the C library calls in C++ class functions (in other words, the C++ functions do nothing but call C functions), then the compiler will optimize these calls so that it's not a performance penalty.

Bill the Lizard 2008-11-13 02:48:54

Answer 3

A:

There's not going to be any performance penalty wrapping the C functions the way you describe, which is effectively inlining the wrapper functions anyway.

You might also want to take a look at the Boost.uBLAS and Boost.Math libraries.

Ferruccio 2008-11-13 02:51:59

I'm referring to the extra indirection of a function call when inlining the code would save the time of storing the function context (pushing/popping registers to the stack, etc). Also, boost is not the most friendly library to typically bad embedded target compilers!

Judge Maygarden 2008-11-13 17:22:16

Answer 4

+2 A:

As usual with everything related to optimization, the answer is that you have to measure the performance itself before you know if the optimization is worthwhile.

Benchmark two different functions, one calling the C-style functions directly and another calling through the wrapper. See which one runs faster, or if the difference is within the margin of error of your measurement (which would mean there is no difference you can measure).
Look at the assembly code generated by the two functions in the previous step (on gcc, use -S or -save-temps). See if the compiler did something stupid, or if your wrappers have any performance bug.

Unless the performance difference is too big in favor of not using the wrapper, reimplementing is not a good idea, since you risk introducing bugs (which could even cause results which look sane but are wrong). Even if the difference is big, it would be simpler and less risky to just remember C++ is very compatible with C and use your library in the C style even within C++ code.

CesarB 2008-11-13 03:02:47

Answer 5

+1 A:

If you are worried that the overhead of calling functions is slowing you down, why not test inlining the C code or turning it into macros?

Also, why not improve the const correctness of the C code while you are at it - const_cast should really be used sparingly, especially on interfaces you control.

Greg Rogers 2008-11-13 05:43:08

I was avoiding const poisoning with an embedded system library that did not make proper use of const. Adding const would have caused a lot of inconvenience, but the point is well taken. I have been considering correcting this since I don't currently have such issues.

Judge Maygarden 2008-11-13 15:01:11

Maybe you could use MYCONST instead of const in the header, then #define it to nothing when used from embedded code, and const when used from the const-correct C++ app. This assumes you compile vector.c twice, of course, once in each project, otherwise it won't link.

Steve Jessop 2008-11-13 15:17:02

Actually, maybe it will link, since it'll presumably be extern "C" in the C++ project anyway. But I still recommend compiling vector.c with const switched on in the C++ project, if only so you know it works...

Steve Jessop 2008-11-13 15:19:22

Again, using a macro is a pretty good solution.

Judge Maygarden 2008-11-13 17:20:15

Adding const correctness to library functions doesn't affect the users of the library except for forcing recompilation. However, not using const correctly does force the users to also not use const correctly. And that is bad.

Greg Rogers 2008-11-13 23:03:37

I agree and will be correcting the library in question.

Judge Maygarden 2008-11-14 16:03:45

Answer 6

+2 A:

As with any question about performance, you'll be told to measure to get your answer (and that's the strictly correct answer).

But as a rule of thumb, for simple inline methods that can actually be inlined, you'll see no performance penalty. In general, an inline method that does nothing but pass the call onto another function is a great candidate for inlining.

However, even if your wrapper methods were not inlined, I suspect you'd notice no performance penalty - not even a measurable one - unless the wrapper method was being called in some critical loop. Even then it would likely only be measurable if the wrapped function itself didn't do much work.

This type of thing is about the last thing to be concerned about. First worry about making your code correct, maintainable, and that you're using appropriate algorithms.

Michael Burr 2008-11-13 06:41:28

Answer 7

+2 A:

Your wrapper itself will be inlined, however, your method calls to the C library typically will not. (This would require link-time-optimizations which are technically possible, but to AFAIK rudimentary at best in todays tools)

Generally, a function call as such is not very expensive. The cycle cost has decreased considerably over the last years, and it can be predicted easily, so the the call penalty as such is negligible.

However, inlining opens the door to more optimizations: if you have v = a + b + c, your wrapper class forces the generation of stack variables, whereas for inlined calls, the majority of the data can be kept in the FPU stack. Also, inlined code allows simplifying instructions, considering constant values, and more.

So while the measure before you invest rule holds true, I would expect some room for improvements here.

A typical solution is to bring the C implementaiton into a format that it can be used either as inline functions or as "C" body:

// V3impl.inl
void V3DECL v3_add(VECTOR3 *out, VECTOR3 lhs, VECTOR3 rhs)
{
    // here you maintain the actual implementations
    // ...
}

// C header
#define V3DECL 
void V3DECL v3_add(VECTOR3 *out, VECTOR3 lhs, VECTOR3 rhs);

// C body
#include "V3impl.inl"


// CPP Header
#define V3DECL inline
namespace v3core {
  #include "V3impl.inl"
} // namespace

class Vector3D { ... }

This likely makes sense only for selected methods with comparedly simple bodies. I'd move the methods to a separate namespace for the C++ implementation, as you will usually not need them directly.

(Note that the inline is just a compiler hint, it doesn't force the method to be inlined. But that's good: if the code size of an inner loop exceeds the instruction cache, inlining easily hurts performance)

Whether the pass/return-by-reference can be resolved depends on the strength of your compiler, I've seen many where foo(X * out) forces stack variables, whereas X foo() does keep values in registers.

peterchen 2008-11-13 09:26:02

The usage of inline C++ or C function bodies with macros is good idea that I hadn't considered.

Judge Maygarden 2008-11-13 14:56:54

ansaurus

tags:

views:

answers:

How to synchronize C & C++ libraries with minimal performance penalty?

related questions