ansaurus

Question

Answer 1

A:

Careless use of templates CAN cause bloat. But you're totally missing the point here.

Templates cause bloat when used carelessly, not carefully.
The quantity of runtime errors avoided by templates is massive.
The speed of templated code is far greater than non-templated code.
The size of the executable is absolutely trivial unless you run on an embedded system.
The STL provides a map container (which is a binary search tree) for your use.

You just haven't thought this through properly at all. The advantages templates offer far outweigh a few kb in executable size.

It's also worth noting that the code works as expected on Visual Studio 2010.

DeadMG 2010-09-09 08:35:08

As I said in the question, my reaction on spotting the problem was to chuck the binary tree code in a junk folder - in the rare cases when I need this kind of thing I should probably be using boost intrusive lists (but not std::map which cannot support it). Note - the point of the library isn't just to have unbalanced trees with a balance operation, but rather (e.g.) to convert a tree to a list, perform some operation that works better on lists, and then convert back to a tree when finished.

Steve314 2010-09-09 08:47:03

The much more important code I want to fix is an in-memory multiway (B+tree variant) library - more efficient (in many but not all cases) that std::map etc, plus handles some things that the standard RB tree containers cannot. Note - I wasn't criticising the use of templates, only looking for advice on keeping my templates as portable as possible. And the technique I describe for managing bloat is a well known technique, heavily used in the past if less so these days, which I've seen make three orders of magnitude difference to executable code size.

Steve314 2010-09-09 08:54:25

Answer 2

+2 A:

Did you switch off warnings? You should have got some "dereferencing type punned pointers violates strict aliasing", because thats exactly what you do at (void**) Ptr_Add(...

The compiler is free to assume that pointers to different types do not alias (with a few execpitions), and will produce optimized code which caches the targets of pointers in registers. To avoid that, you have to use unions to convert between different pointers types. Quoting from http://gcc.gnu.org/onlinedocs/gcc/Optimize-Options.html#Optimize-Options:

In particular, an object of one type is assumed never to reside at the same address as an object of a different type, unless the types are almost the same. For example, an unsigned int can alias an int, but not a void* or a double. A character type may alias any other type.

Pay special attention to code like this:
     union a_union {
        int i;
        double d;
      };

     int f() {
        union a_union t;
        t.d = 3.0;
        return t.i;
      }
The practice of reading from a different union member than the one most recently written to (called “type-punning”) is common. Even with -fstrict-aliasing, type-punning is allowed, provided the memory is accessed through the union type. So, the code above will work as expected. See Structures unions enumerations and bit-fields implementation. However, this code might not:
     int f() {
        union a_union t;
        int* ip;
        t.d = 3.0;
        ip = &t.i;
        return *ip;
      }
Similarly, access by taking the address, casting the resulting pointer and dereferencing the result has undefined behavior, even if the cast uses a union type, e.g.:
     int f() {
        double d = 3.0;
        return ((union a_union *) &d)->i;
      }

The -fstrict-aliasing option is enabled at levels -O2, -O3, -Os.

In your case you could use something like

union {
    void** ret_ptr;
    ptrdiff_t in_ptr;
}

but the code in ptr_add just looks horrible ;-)

Or just disable this specific optimization with "fno-strict-aliasing". Better fix your code though ;-)

drhirsch 2010-09-09 08:35:37

I specified -Wno-invalid-offsetof for obvious reasons. Didn't seen the punned pointers warning - maybe it gets turned off along with the offsetof warning?

Steve314 2010-09-09 08:42:51

Neither I nor the compiler may always be able to detect cases of pointer abuse :-) Try -Wstrict-aliasing. There is an __attribute__((__may_alias__)) which can be attached to variable definitions for a quick fix (hack). Maybe you should risk some nloat and just use a templated, nice version of Ptr_Add().

drhirsch 2010-09-09 08:56:12

That is a very very interesting section of the MinGW manual. Thanks. Will probably be accepting, but I'll see what other answers I get first.

Steve314 2010-09-09 09:02:35

@drhirsch - I don't think a template Ptr_Add will cause bloat - it'll be a big surprise if it doesn't inline as something smaller than a call anyway - but I think there are more problems (of the same nature) than just the Ptr_Add. The attribute is interesting, but I assume GCC-specific, so feels like saving up problems for later. I think you're right about unions - finding the right places to apply that should be a relatively quick fix.

Steve314 2010-09-09 09:07:20

@drhisrsch - Applying the union-based punning in Ptr_Add, c_Tool::Insert and c_Tool::Balance seems to be enough to fix that code - I'm pretty sure it'll work as an easy fix. Of course I may still want to get rid of all the offsetof and pointer arithmetic anyway, but - well, there's the force of laziness to consider.

Steve314 2010-09-09 09:24:19

You are right, a templated version will most likely cause no bloat, I should have put that in quotes, because it was meant to be ironic. Which of course is hard to see in written text ;-)

drhirsch 2010-09-09 10:43:56

Instead to compiler will probably inline such a simple function and completly optimize it away. Maybe some non inlined template versions of the functions are stored somewhere in the executable, but never actually executed, which may make the executable somewhat bigger, but this is proably not harmful ;-)

drhirsch 2010-09-09 10:47:44

@drhirsch - That's what function level linking is for - though for a template function defined without a prior prototype and (belt and braces) with an explicit "inline" keyword, I don't think the function will be included in object files. There's no need for a just-in-case thing because the compiler should know that everywhere it's referenced it'll be able to see the full code. Of course that's a guess, but I think a reasonable one - and you'd need a lot of unused Ptr_Add instantiations to cause noticable bloat anyway. And if all else fails, I could always rewrite as a macro.

Steve314 2010-09-10 02:42:59

ansaurus

tags:

views:

answers:

How to resolve pointer alias issues?

related questions