views:

936

answers:

4

I am writing a fairly large C++ shared-object library, and have run into a small issue that makes debugging a pain:

If I define a function/method in a header file, and forget to create a stub for it (during development), since I am building as a shared object library rather than an executable, no errors appear at compile-time telling me I have forgotten to implement that function. The only way I find out something is wrong is at runtime, when eventually an application linking against this library falls over with an 'undefined symbol' error.

I am looking for an easy way to check if I have all the symbols I need at compile time, perhaps something I can add to my Makefile.

One solution I did come up with is to run the compiled library through 'nm -C -U' to get a demangled list of all undefined references. The problem is this also comes up with the list of all references that are in other libraries, such as GLIBC, which of course will be linked against along with this library when the final application is put together. It would be possible to use the output of nm to grep through all my header files and see if any of the names corresponding.. but this seems insane. Surely this is not an uncommon issue and there is a better way of solving it?

+4  A: 

What about a testsuite ? You create mock executables that link to the symbols you need. If the linking fails, it means that your library interface is incomplete.

Stefano Borini
+2  A: 

I had the same problem once. I was developing a component model in C++, and, of course, components should load at runtime dynamically. Three solutions come to mind, that were the ones I applied:

  1. Take some time to define a build system that is able to compile statically. You'll lose some time engineering it, but it will save you much time catching these annoying runtime errors.
  2. Group your functions in well-known and well-understood sections, so that you can group of functions/stubs to be sure that each corresponding function has its stub. If you take the time on documenting it well, you can write perhaps a script that checks the definitions (via, for example, its doxygen comments) and check the corresponding .cpp file for it.
  3. Do several test executables that load the same set of libraries and specify the RTLD_NOW flag to dlopen (if you're under *NIX). They will signal the missing symbols.

Hope that helps.

Diego Sevilla
on the lines of RTLD_NOW, does an env var exist to force immediate (non lazy) binding ?
Stefano Borini
yup. here it is LD_BIND_NOW (libc5; glibc since 2.1.1) If set to non-empty string, causes the dynamic linker to resolve all symbols at program startup instead of deferring function call resolval to the point when they are first referenced. This is useful when using a debugger.
Stefano Borini
This can also be useful for the purpose of the OP: LD_WARN (ELF only)(glibc since 2.1.3) If set to non-empty string, warn about unresolved symbols.
Stefano Borini
Stefano: Yes, that's right. Those variables exist. However, if you are deferring the dynamic loading till later (say when the user loads a module) these variables are not useful unless you actually write the testing program that loads the intended (future) libraries.
Diego Sevilla
+1  A: 

On Linux (which you appear to be using) ldd -r a.out should give you exactly the answer you are looking for.

UPDATE: a trivial way to create a.out against which to check:

 echo "int main() { return 0; }" | g++ -xc++ - ./libMySharedLib.so
 ldd -r ./a.out
Employed Russian
That does almost exactly the same thing as nm -C -U, which I suggested in the original post. The problem is there's no 'a.out' application to speak of, it's a shared library, so ldd -r mylibrary.so gives a whole heap of output, because it uses symbols from various other dynamic libraries.. I'm particularly interested in symbols missing that are defined in *my* header files, rather than external libraries.
David Claridge
+4  A: 

Check out the linker option "-z defs" / "--no-undefined". When creating a shared object, it will cause the link to fail if there are unresolved symbols.

If you are using gcc to invoke the linker, you'll use the compiler -Wl option to pass the option to the linker:

gcc -shared ... -Wl,-z,defs

As an example, consider the following file:

#include <stdio.h>

void forgot_to_define(FILE *fp);

void doit(const char *filename)
{
    FILE *fp = fopen(filename, "r");
    if (fp != NULL)
    {
        forgot_to_define(fp);
        fclose(fp);
    }
}

Now, if you build that into a shared object, it will succeed:

> gcc -shared -fPIC -o libsilly.so silly.c && echo succeeded || echo failed
succeeded

But if you add -z defs, the link will fail and tell you about your missing symbol:

> gcc -shared -fPIC -o libsilly.so silly.c -Wl,-z,defs && echo succeeded || echo failed
/tmp/cccIwwbn.o: In function `doit':
silly.c:(.text+0x2c): undefined reference to `forgot_to_define'
collect2: ld returned 1 exit status
failed
R Samuel Klatchko