ansaurus

Question

C++ internal code reuse: compile everything or share the library / dynamic library?

Answer 1

+6 A:

People seem to think that C specifies an ABI. It doesn't, and I'm not aware of any standardised compiled language that does. To answer your main question, use of libraries is of course the way to go - I can't imagine doing anything else.

anon 2009-12-11 22:21:03

C doesn't specify an ABI, but in practice C binaries from different compiler versions tend to be more inter-compatible than C++ binaries, probably because C is a much simpler language.

Charles Salvia 2009-12-11 22:41:55

C *implementations* specify ABIs. In general, ABIs have more to do with the computer's architecture than the language; to do otherwise would have the tail wagging the dog.

Mike DeSimone 2009-12-11 22:55:19

"use of libraries is of course the way to go - I can't imagine doing anything else." --> http://en.wikipedia.org/wiki/Standard_Template_Library

Jason S 2009-12-12 00:52:14

Answer 2

+1 A:

We do the same. Trying to use binaries can be a real problem if you need to use shared code on different platforms, build environments, or even if you need different build options such as static vs. dynamic linking to the C runtime, different structure packing settings, etc..

I typically set projects up to build as much from source on-demand as possible, even with third-party code such as zlib and libpng. For those things that must be built separately, e.g. Boost, I typically have to build 4 or 8 different sets of binaries for the various combinations of settings needed (debug/release, VS7.1/VS9, static/dynamic), and manage the binaries along with the debugging information files in source control.

Of course, if everyone sharing your code is using the same tools on the same platform with the same options, then it's a different story.

Tim Sylvester 2009-12-11 22:52:44

Answer 3

+1 A:

I never saw shared libraries as a way to reuse code from an old project into a new one. I always thought it was more about sharing a library between different applications that you're developing at about the same time, to minimize bloat.

As far as copy-paste syndrome goes, if I copy and paste it in more than a couple places, it needs to be its own function. That's independent of whether the library is shared or not.

When we reuse code from an old project, we always bring it in as source. There's always something that needs tweaking, and its usually safer to tweak a project-specific version than to tweak a shared version that can wind up breaking the previous project. Going back and fixing the previous project is out of the question because 1) it worked (and shipped) already, 2) it's no longer funded, and 3) the test hardware needed may no longer be available.

For example, we had a communication library that had an API for sending a "message", a block of data with a message ID, over a socket, pipe, whatever:

void Foo:Send(unsigned messageID, const void* buffer, size_t bufSize);

But in a later project, we needed an optimization: the message needed to consist of several blocks of data in different parts of memory concatenated together, and we couldn't (and didn't want to, anyway) do the pointer math to create the data in its "assembled" form in the first place, and the process of copying the parts together into a unified buffer was taking too long. So we added a new API:

void Foo:SendMultiple(unsigned messageID, const void** buffer, size_t* bufSize);

Which would assemble the buffers into a message and send it. (The base class's method allocated a temporary buffer, copied the parts together, and called Foo::Send(); subclasses could use this as a default or override it with their own, e.g. the class that sent the message on a socket would just call send() for each buffer, eliminating a lot of copies.)

Now, by doing this, we have the option of backporting (copying, really) the changes to the older version, but we're not required to backport. This gives the managers flexibility, based on the time and funding constraints they have.

EDIT: After reading Neil's comment, I thought of something that we do that I need to clarify.

In our code, we do lots of "libraries". LOTS of them. One big program I wrote had something like 50 of them. Because, for us and with our build setup, they're easy.

We use a tool that auto-generates makefiles on the fly, taking care of dependencies and almost everything. If there's anything strange that needs to be done, we write a file with the exceptions, usually just a few lines.

It works like this: The tool finds everything in the directory that looks like a source file, generates dependencies if the file changed, and spits out the needed rules. Then it makes a rule to take eveything and ar/ranlib it into a libxxx.a file, named after the directory. All the objects and library are put in a subdirectory that is named after the target platform (this makes cross-compilation easy to support). This process is then repeated for every subdirectory (except the object file subdirs). Then the top-level directory gets linked with all the subdirs' libraries into the executable, and a symlink is created, again, naked after the top-level directory.

So directories are libraries. To use a library in a program, make a symbolic link to it. Painless. Ergo, everything's partitioned into libraries from the outset. If you want a shared lib, you put a ".so" suffix on the directory name.

To pull in a library from another project, I just use a Subversion external to fetch the needed directories. The symlinks are relative, so as long as I don't leave something behind it still works. When we ship, we lock the external reference to a specific revision of the parent.

If we need to add functionality to a library, we can do one of several things. We can revise the parent (if it's still an active project and thus testable), tell Subversion to use the newer revision and fix any bugs that pop up. Or we can just clone the code, replacing the external link, if messing with the parent is too risky. Either way, it still looks like a "library" to us, but I'm not sure that it matches the spirit of a library.

We're in the process of moving to Mercurial, which has no "externals" mechanism so we have to either clone the libraries in the first place, use rsync to keep the code synced between the different repositories, or force a common directory structure so you can have hg pull from multiple parents. The last option seems to be working pretty well.

Mike DeSimone 2009-12-11 22:53:44

To quote from your answer: "1) it worked" - surely this is the primary reason for re-using code from previous projects. And the safest and simplest way of doing such re-use is to implement the code in a library.

anon 2009-12-11 23:01:22

@Mike: When you pull a library from another project, how do you handle the case when that library depends on another project?

ShaChris23 2009-12-12 01:06:35

You have to pull its dependencies as well. Say I have something in `repo/foo` that needs the library at `https://bar.com/baz/bob` which itself needs `https://bar.com/baz/bats`. (Convoluted enough?) Then I'd set `svn:externals` on `repo` to `baz -r123 https://bar.com/baz` and update, then create a symlink from `repo/foo/bob` to `../baz/bob`. `bob` still works because it has a symlink from `bob/bats` to `../bats`. Hope that made sense.

Mike DeSimone 2009-12-14 16:34:29

Answer 4

+3 A:

One good reason to share the source code: Templates are one of C++'s best features because they are an elegant way around the rigidity of static typing, but by their nature are a source-level construct. If you focus on binary-level interfaces instead of source-level interfaces, your use of templates will be limited.

dsimcha 2009-12-11 22:59:37

I think there is some-cross-purpose talk going on here. In my book, a C++ library is a collection of header files and binaries - the standard library that ships with your compiler is a good example.

anon 2009-12-11 23:05:12

@dsimcha, oh wow, never thought of it that way. thanks for sharing your thoughts.

ShaChris23 2009-12-12 00:58:45

ansaurus

tags:

views:

answers:

C++ internal code reuse: compile everything or share the library / dynamic library?

General question:

Specific example:

related questions