ansaurus

Question

Derived classes in C - What is your favorite method?

Answer 1

A:

The code I worked with used the first method.

The only two reasons I can think of for using the first method is:

Saves you some cycles because you will be doing one less de-referencing
When you cast the derived class pointer to the parent class, i.e. (struct parent_class *)ptr_my_derived_class , you know 100% what the expected result will be.

I prefer the first method because you can cast the derived class pointer to the parent class without any worries.

Can you do the same thing with the second method? ( It seems like you would have to jump through a hoop or two to get the same end result )

If you can do the same with method 2, then I think both methods would be equal.

Trevor Boyd Smith 2009-03-18 21:04:41

yes, cast any struct to the type of its first member, and you get a pointer to its first member. that's guaranteed behavior.

Johannes Schaub - litb 2009-03-18 21:10:10

I suspect that "Saves you come cycles" is not really a win because almost every compiler will take the extra arithmetic out during constant folding.

dmckee 2009-03-18 21:12:10

There's no extra dereferencing with method 2. The base type's fields are stored inline, not by reference.

P Daddy 2009-03-18 21:53:44

Answer 2

+1 A:

I know that GNOME uses the 2nd method, and casting pointers was a known thing as well. I don't remember there being any real hoops to jump through to do so. In fact, From a C memory model standpoint, there can't be any semantic difference between the two, since AFAIK the only possible difference would be compiler differences in structure padding, but since the code all runs through the same compiler, that would be a moot point.

Harper Shelby 2009-03-18 21:18:51

Answer 3

A:

I have used method #2 before and found it works quite ok:

you can upcast to the base type anytime if it is the first member in derived type
instead of dereferencing all the time to get at base members, just keep two pointers: one for the base interface, one for the derived interface

free() on the pointer to the base structure will of course also free up the derived fields, so that isn't an issue...

Also, I find accessing base fields something I tend to do in a polymorphic situation: I only care about those fields in methods that care about the base type. Fields in the derived type are used in methods only interested in the derived type.

Daren Thomas 2009-03-18 21:20:01

Answer 4

+2 A:

I'm one of the maintainers of a library that uses method 2. Works just as well as method 1, but without any preprocessor trickery. Or it actually works better, since you can have functions that take the base class as argument and you can just cast to the base struct, C guarantees that this works for the first member.

The more interesting question is, how do you do virtual functions? In our case, the struct has pointers to all the functions, and the initialization set them up. It's slightly simpler, but has more space overhead than the "proper way" with a pointer to a shared vtable.

Anyway, I'd prefer to use C++ rather than kludge it with plain C, but politics..

janneb 2009-03-18 21:20:56

"I'd prefer to use C++ rather than kludge it with plain C, but politics" and that is why it is tagged as c only. not with c and c++ tags.

Trevor Boyd Smith 2009-03-19 04:13:23

Answer 5

A:

The second way has the advantage of typesafety with inherited methods. If you want to have a method foo(struct parent_class bar) and call it with foo((struct parentclass) derived_class), this will work correctly. The C-Standard defines this. Thus, I'd generally prefer method #2. In general, it is guaranteed that casting a structure to its first member will result in a struct containing the data of the first member of the struct, no matter how memory is laid out.

Tetha 2009-03-18 21:22:10

Answer 6

A:

At a former job, we used a preprocessor to handle this. We declared classes using a simple C++-style syntax, and the preprocessor generated C headers that were basically equivalent to the First Method, but without the #includes. It also did cool things like generating vtables and macros for upcasting and downcasting.

Note that this was in the days before good C++ compilers existed for all the platforms we targeted. Now, doing this would be stupid.

Kristopher Johnson 2009-03-18 21:25:13

Answer 7

+2 A:

The first method is hideous and it hides important information. I'd never use it or allow it being used. Even using a macro would be better:

#define BODY int member1; \
             int member2; 

struct base_class
{
   BODY
};

But method 2 is much better, for reasons others have pointed out.

TrayMan 2009-03-18 21:26:21

Answer 8

A:

I prefer the first method because you can cast the derived class pointer to the parent class without any worries.

It's the other way round.

The C standard guarantees that the address of a struct is the address of the first member, so in the second case it is safe to cast a pointer to derived to parent, as the first member of derived is the parent struct, and a the struct as a member as the same layout as the same struct when not a member, so casting a pointer to a derived to parent will always work.

The same is not true for the second case. Two structs with some members defined as the same type may have different padding between those members.

It would be reasonable for a 64 bit bigendian compiler to compile

struct A { a uint64_t; b uint32_t; };

such that sizeof(A) is a whole multiple of 8 and b is 64 bit aligned, but compile

struct B { a uint64_t; b uint32_t; c uint32_t; };

so that sizeof(B) is a whole multiple of 8, but b is only 32 bit aligned so that it doesn't waste space.

Pete Kirkham 2009-03-18 22:17:00

Answer 9

+1 A:

Second option forces you to write very long names like myobj.parent.grandparent.attribute, which is ugly. First option is better from syntax point of view, but it is a bit risky to cast child to parent - I'm not sure whether is is guaranteed by standard that different structs will have same offsets for similar members. I guess compiler may use different padding for such structs.

There is another option, if you are using GCC - anonymous struct members, which is part of MS extension, so I guess it was originated by some MS compiler and still may be supported by MS.

Declarations look like

struct shape {
    double  (*area)(struct shape *);
    const char    *name;
};

struct square {
    struct shape;           // anonymous member - MS extension
    double          side;
};

struct circle {
    struct shape;           // anonymous member - MS extension
    double          radius;
};

In your "constructor" function you need to specify correct function for calculating area and the enjoy the inheritance and polymorphism. The only problem that you always need to pass explicit this - you cannot just call shape[i]->area().

shape[0] = (struct shape *)new_square(5);
shape[1] = (struct shape *)new_circle(5);
shape[2] = (struct shape *)new_square(3);
shape[3] = (struct shape *)new_circle(3);

for (i = 0; i < 4; i++)
    printf("Shape %d (%s), area %f\n", i, shape[i]->name,
            shape[i]->area(shape[i]));    // have to pass explicit 'this'

Compile with gcc -fms-extensions. I never used it in real-life project but I tested it some time ago and it worked.

qrdl 2009-03-19 07:25:19

Accessing nested structs does not involve any additional address indirection, so the compiler can compute the address of a field in a nested struct directly, without incurring any performance penalty.

TrayMan 2009-03-19 07:47:09

@TrayMan - agree, post corrected

qrdl 2009-03-19 08:42:49

ansaurus

tags:

views:

answers:

Derived classes in C - What is your favorite method?

related questions