ansaurus

Question

When should linkers generate multiply defined X warnings?

Answer 1

+3 A:

A simple way to restrict each class to the current translation unit is to enclose it in an anonymous namespace:

// a.cpp
namespace {
  class A {
    // ...
  };
}

// b.cpp
namespace {
  class A {
    // ...
  };
}

is perfecetly legal. Because the two classes are in separate translation units, and are inside anonymous namespaces, they won't conflict.

jalf 2009-09-16 23:14:18

That's a better pattern for my regression tests. Thanks. But what about the question -- why doesn't the linker generate an error?

2009-09-16 23:24:39

Answer 2

+5 A:

The compiler and linker relies on both classes to be exactly the same. In your case, they are different and so strange things happen. The one definition rule says that the result is undefined behavior - so behavior is not at all required to be consistent among compilers. . I suspect that in runFirstUnit, in the delete line, it puts a call to the first virtual table entry (because in its translation unit, the destructor may occupy the first entry).

In the second translation unit, this entry happens to point to A::getName, but in the first translation unit (where you execute the delete), the entry points to A::~A. Since these two are differently named (A::~A vs A::getName) you don't get a name clash (you will have code emitted for both the destructor and getName). But since their class name is the same, their v-tables will clash on purpose, because since both classes have the same name, the linker will think they are the same class and assume same contents.

Notice that all member functions were defined in-class, which means they are all inline functions. These functions can be defined multiple times in a program. In the case of in-class definitions, the rationale is that you may include the same class definition into different translation units from their header files. Your test function, however, isn't an inline function and thus including it into different translation units will triggers a linker error.

If you enable namespaces, there will be no clash what-so ever, because ::A and ::mySpace::A are different classes, and of course will get different v-tables.

Johannes Schaub - litb 2009-09-16 23:21:52

But the constructors have the same name. Why no name clash for them? Seems perfectly possible and the bugs avoidable.Also, adding namespaces (except for the anonymous one) is no guarantee of avoiding name clashes if both files use the same namespace.

2009-09-16 23:29:17

There will be a name clash too. But since the constructors are in-line, the linker will assume they are all defined the same way and will emit only one definition of them into the binary (the address of an inline function is required to be the same across translation units), discarding the other definitions. Try to define them outside the class definition and you will get linker errors too, likely

Johannes Schaub - litb 2009-09-16 23:34:54

It's somewhat like template instantiations. If you do swap(a, b); with both a and b being integers from two translation units, you wouldn't expect a linker error either, although the compiler may have generated a function both times for doing the swap. The magic is, the linker will merge all instantiations so they end up as only one function instance in the end. In practice, it will usually throw away all but one of them, as far as i know.

Johannes Schaub - litb 2009-09-16 23:39:22

In your test, the first translation unit didn't use the same namespace. So using `mySpace::A` in the other one, it would effectively solve the issue, of course.

Johannes Schaub - litb 2009-09-16 23:41:22

Are the constructor's really inlined? I don't think so. Why? Because object receives the vtable pointer to the other. Doesn't the constructor set the vtable pointer? How is that possible without code from the other translation unit? Hmmm... let me try...

2009-09-17 00:05:21

So... that's interesting. Moving the definition outside of the scope of the class *does* generate a linker error -- suggesting inlining. However, when left as is, the second unit calls the first unit's constructor -- this can only happen because of linking. Therefore both sets of constructor names must be available to linker. Which again leaves us with, why is it happy with that?

2009-09-17 00:13:06

Hmm... the template analogy is plausible for an explanation. However, for non template code, it seems clear to me that multiple definitions of symbols should be disallowed. For template generated code, it is also suspect of different units contain different implementations (although linkers don't mind that either).

2009-09-17 00:20:47

The constructors and all functions that are defined in-class are inline functions. But that doesn't mean that calls to them are actually inlined. It merely means that you can have multiple definitions of them in the program, provided all of them have the same content.

Johannes Schaub - litb 2009-09-17 00:33:59

In GCC, inline functions are achieved by weak linking: If multiple symbols have weak linkage, the linker will pick one symbol of them. If there are multiple weak symbols, but *one* strong symbol, then that strong symbol is used. That's why the second translation unit will use the non-inline constructor definition provided in the first translation unit. But this behavior is not guaranteed by the standard: It's a result of undefined behavior. If a function is inline in one translation unit, it must be inline in every other translation unit too.

Johannes Schaub - litb 2009-09-17 00:35:43

Ah wait - the second-calls-first happens with the inline constructors? If so, then this is not because the first TU contains strong constructor symbols, but because the linker just decided to drop the second'd TUs constructor definition instead of the first. There are many things the linker won't check, which includes many things that concern cross-TU definitions.

Johannes Schaub - litb 2009-09-17 00:46:56

I gave a more theoretical description on it here: http://stackoverflow.com/questions/908830/isnt-cs-inline-totally-optional/910686#910686 . It doesn't concern linkers, but rather the language itself. An experiment on `weak` can be found here: http://stackoverflow.com/questions/617554/override-a-function-call-in-c/617588#617588

Johannes Schaub - litb 2009-09-17 00:51:34

So I guess the answer is that linkers can only reasonably generate warnings for strong references. I'll add a summary of what I understood at the bottom of the question. Thanks litb.

2009-09-17 17:42:13

Answer 3

A:

The functions are defined as inline. inline functions can be defined multiple times in the program. See point 3 in the summary here:

http://en.wikipedia.org/wiki/One_Definition_Rule

The important point is:

For a given entity, each definition must be the same.

Try not defining the functions as inline. The linker should start to give duplicate symbol errors then.

brone 2009-09-16 23:46:48

The functions are provably not inlined. Why? Because there are interactions between the code in each compile unit. The first unit may call the second destructor. The second unit calls the first's constructor. Only the linker can cause this behavior.

2009-09-16 23:58:12

Also. The problem is easily solvable in many ways. This question is why it is a problem at all?

2009-09-17 00:00:01

It doesn't matter if the functions are actually inlined; provided they are defined (or declared) inline, they count as inline, and multiple definitions are permitted. This is why there is no error message generated: it's not an error to have multiple definitions in this case.

brone 2009-09-17 20:14:24

ansaurus

tags:

views:

answers:

When should linkers generate multiply defined X warnings?

related questions