views:

1637

answers:

8

I was wondering what would make a programmer to choose either Pimpl idiom or pure virtual class and inheritance.

I understand that pimpl idiom comes with one explicit extra indirection for each public method and the object creation overhead.

The Pure virtual class in the other hand comes with implicit indirection(vtable) for the inheriting implementation and I understand that no object creation overhead.
EDIT: But you'd need a factory if you create the object from the outside

What makes the pure virtual class less desirable than the pimpl idiom?

+12  A: 

Pointer to implementation is usually about hiding structural implementation details. Interfaces are about instancing different implementations. They really serve two different purposes.

D.Shawley
not necessarily, I've seen classes that store multiple pimpls depending on the implementation desired. Often this is say a win32 impl vs a linux impl of something that needs to be implemented differently per platform.
Doug T.
But you can use an Interface to decouple the implementation details and hide them
Arkaitz Jimenez
While you can implement pimpl using an interface, there's often no reason to decouple the implementation details. So there's no reason to go polymorphic. The *reason* for pimpl is to keep implementation details away from the client (in C++ to keep them out of the header). You might do this using an abstract base/interface, but generally that's unnecessary overkill.
Michael Burr
Why is it overkill? I mean, is it slower the interface method than the pimpl one? There might be logical reasons, but from the practical point of view I'd say its easier to do it with an abstract interface
Arkaitz Jimenez
+9  A: 

The pimpl idiom helps you reduce build dependencies and times especially in large applications, and minimizes header exposure of the implementation details of your class to one compilation unit. The users of your class should not even need to be aware of the existence of a pimple (except as a cryptic pointer to which they are not privy!).

Abstract classes (pure virtuals) is something of which your clients must be aware: if you try to use them to reduce coupling and circular references, you need to add some way of allowing them to create your objects (e.g. through factory methods or classes, dependency injection or other mechanisms).

Pontus Gagge
+1  A: 

In my understanding these two things serve completely different purposes. The purpose of the pimple idiom is basically give you a handle to your implementation so you can do things like fast swaps for a sort.

The purpose of virtual classes is more along the line of allowing polymorphism, i.e. you have a unknown pointer to an object of a derived type and when you call function x you always get the right function for whatever class the base pointer actually points to.

Apples and oranges really.

Robert S. Barnes
I agree with the apples/oranges. But it appears that you use pImpl for a functional. My goal is mostly build-technical and information hiding.
xtofl
+5  A: 

There's a very real problem with shared libraries that the pimpl idiom circumvents neatly that pure virtuals can't: you cannot safely modify/remove data members of a class without forcing users of the class to recompile their code. That may be acceptable under some circumstances, but not e.g. for system libraries.

To explain the problem in detail, consider the following code in your shared library/header:

// header
struct A
{
public:
  A();
  // more public interface, some of which uses the int below
private:
  int a;
};

// library 
A::A()
  : a(0)
{}

The compiler emits code in the shared library that calculates the address of the integer to be initialized to be a certain offset (probably zero in this case, because it's the only member) from the pointer to the A object it knows to be this.

On the user side of the code, a new A will first allocate sizeof(A) bytes of memory, then hand a pointer to that memory to the A::A() constructor as this.

If in a later revision of your library you decide to drop the integer, make it larger, smaller, or add members, there'll be a mismatch between the amount of memory user's code allocates, and the offsets the constructor code expects. The likely result is a crash, if you're lucky - if you're less lucky, your software behaves oddly.

By pimpl'ing, you can safely add and remove data members to the inner class, as the memory allocation and constructor call happen in the shared library:

// header
struct A
{
public:
  A();
  // more public interface, all of which delegates to the impl
private:
  void * impl;
};

// library 
A::A()
  : impl(new A_impl())
{}

All you need to do now is keep your public interface free of data members other than the pointer to the implementation object, and you're safe from this class of errors.

Edit: I should maybe add that the only reason I'm talking about the constructor here is that I didn't want to provide more code - the same argumentation applies to all functions that access data members.

Instead of void *, I think it's more traditional to forward declare the implementing class: `class A_impl *impl_;`
Frank Krueger
I don't understand, you shouldn't declare private members in a virtual pure class you intend to use as an interface, the idea is to keep the class pruely abstract, no size, only pure virtual methods, I don't see anything you can't do through shared libraries
Arkaitz Jimenez
@Frank Krueger: You're right, I was being lazy.@Arkaitz Jimenez: Slight misunderstanding; if you've got a class that only contains pure virtual functions, then there's not much point in talking about shared libraries. On the other hand, if you are dealing with shared libraries, pimpl'ing your public classes can be prudent for the reason outlined above.
This is just incorrect. Both methods let you hide the implementation state of your classes, if you make your other class a "pure abstract base" class.
Paul Hollingsworth
Paul, I fail to see what you think is incorrect about this - care to explain?
The first sentence in your anser implies that pure virtuals with an associated factory method somehow don't let you hide the internal state of the class. That's not true. Both techniques let you hide the internal state of the class. The difference is how it looks to the user. pImpl allows you to still represent a class with value semantics yet also hide the internal state. Pure Abstract Base Class + factory method allows you to represent entity types, and also lets you hide the internal state. The latter is exactly how COM works. Chapter 1 of "Essential COM" has a great disucssion on this.
Paul Hollingsworth
+5  A: 

When writing a C++ class, it's appropriate to think about whether it's going to be

  1. A Value Type

    Copy by value, identity is never important. It's appropriate for it to be a key in a std::map. Example, a "string" class, or a "date" class, or a "complex number" class. To "copy" instances of such a class makes sense.

  2. An Entity type

    Identity is important. Always passed by reference, never by "value". Often, doesn't make sense to "copy" instances of the class at all. When it does make sense, a polymorphic "Clone" method is usually more appropriate. Examples: A Socket class, a Database class, a "policy" class, anything that would be a "closure" in a functional language.

Both pImpl and pure abstract base class are techniques to reduce compile time dependencies.

However, I only ever use pImpl to implement Value types (type 1), and only sometimes when I really want to minimize coupling and compile-time dependencies. Often, it's not worth the bother. As you rightly point out, there's more syntactic overhead because you have to write forwarding methods for all of the public methods. For type 2 classes, I always use a pure abstract base class with associated factory method(s).

Paul Hollingsworth
+3  A: 

I was searching an answer for the same question. After reading some articles and some practice I prefer using "Pure virtual class interfaces".

  1. They are more straight forward (this is a subjective opinion). Pimpl idiom makes me feel I'm writing code "for the compiler", not for the "next developer" that will read my code.
  2. Some testing frameworks have direct support for Mocking pure virtual classes
  3. It's true that you need a factory to be accessible from the outside. But if you want to leverage polymorphism: that's also "pro", not a "con". ...and a simple factory method does not really hurts so much

The only drawback (I'm trying to investigate on this) is that pimpl idiom could be faster

  1. when the proxy-calls are inlined, while inheriting necessarily need an extra access to the object VTABLE at runtime
  2. the memory footprint the pimpl public-proxy-class is smaller (you can do easily optimizations for faster swaps and other similar optimizations)
Ilias Bartolini
+1  A: 

We must not forget that inheritance is a stronger, closer coupling than delegation. I would also take into account all the issues raised in the answers given when deciding what design idioms to employ in solving a particular problem.

Sam
+2  A: 

I hate pimples! They do the class ugly and not readable. All methods are redirected to pimple. You never see in headers, what functionalities has the class, so you can not refactor it (e. g. simply change the visibility of a method). The class feels like "pregnant". I think using iterfaces is better and really enough to hide the implementation from the client. You can event let one class implement several interfaces to hold them thin. One should prefer interfaces! Note: You do not necessary need the factory class. Relevant is that the class clients communicate with it's instances via the appropriate interface. The hiding of private methods I find as a strange paranoia and do not see reason for this since we hav interfaces.

Masha Ananieva