views:

524

answers:

9

I am currently working on a big project and maintaining all those include guards makes me crazy! Writing it by hand is frustrating waste of time. Although many editors can generate include guards this doesn't help much:

  1. Editor generates guard symbol based on a filename. The problem occurs when you have headers with the same filename in different directories. Both of them will get the same include guard. Including directory structure into the guard symbol would require some fancy approach from the editor, since slashes and backslashes in the macro are not the best thing.

  2. When I have to rename a file I should rename all the include guards as well (in the ifndef, define and ideally endif's comment). Annoying.

  3. Preprocessor is flooded with tons of symbols without a clue what they mean.

  4. Nevertheless definition is included once, compiler still opens header every time it meets header inclusion.

  5. Include guards don't fit into namespaces nor templates. In fact they are subverting namespaces!

  6. You have a chance that your guard symbol won't be unique.

Maybe they were acceptable solution in times when programs contained less than 1000 headers in single directory. But nowadays? It is ancient, it has nothing to do with modern coding habits. What bothers me the most is that this issues could be almost compeletly solved by #pragma once directive. Why is it not a standard?

+6  A: 

Include guards are definitely an annoyance, and C should have originally been designed such that headers would be included once by default - requiring some special option to include a header multiple times.

However, it wasn't ,and you're mostly stuck with having to use include guards. That said, #pragma once is pretty widely supported so you might be able to get away with using it.

Personally, I solve your 1st problem (similarly named include files) by adding a GUID to the include guard. It's ugly, and most people hate it (so I'm often forced to not use it at work), but your experience shows that the idea has some value - even if it is hideously ugly (but then again the whole include guard thing is kind of a hack - why not go whole hog?):

#ifndef C_ASSERT_H_3803b949_b422_4377_8713_ce606f29d546
#define C_ASSERT_H_3803b949_b422_4377_8713_ce606f29d546

// blah blah blah...

#endif

I've heard that compilers don't actually reopen header files that have include guards (they've learned to recognize the idiom). I'm not sure if that's true (or to what extent it's true); I've never measured it. I also don't worry about it, but my projects aren't so huge that it's a problem.

My GUID hack pretty much solves items 1, 5, and 6. I just live with items 2, 3, and 4. Actually, for item 2 you could live without renaming the include guard macro when you rename the file as the GUID will ensure it remains unique. In fact, there's no reason to incorporate the filename at all with the GUID. But I do - tradition, I suppose.

Michael Burr
I agree that this is the best solution, unfortunately I'm using Eclipse which is a great tool but has terrible suport for include guards :/
doc
I don't agree that this solved namespaces issue. As one may guessnamespaces were invented to avoid names collision, and set natural names free again. This is achieved by wrapping all potentialy colliding code into namespace brackets. And of course include guards are exceptional to this rule. So it ends up in some kind of missconception. While structures are going to have more human names, include guards are going into exactly opposite direction. Resolving their name conflicts is left to IDE/scripts/whatever. This isn't pretty philosophy in my opinion.
doc
I agree that it's not pretty, but it it wasn't meant to be an ideal solution, but a pragmatic one. It does pollute the global namespace, but does so with names that are designed to be universally unique, and the names are used in exactly one place (on 2 lines, but right next to each other). Actually, it's overkill for most people (most projects get by with much simpler guard names). Personally, I see little downside to using them, other than the ugliness - but I don't have to interact with them much, so it's not a bad tradeoff. It would have been better if C didn't need guards, but it does.
Michael Burr
+1  A: 

You can probably avoid name collisions without resorting to random strings by setting the include guard to contain the name of the class and namespace that's in the file.

Besides, #pragma once is supported by both MS compilers and GCC for quite some time now so why does it bother you that it's not on the ISO standard?

shoosh
I for one am actively using a compiler that doesn't support `#pragma once`. The OP may need to support such things.
Michael Burr
Because it is not an ISO standard and so far it is not portable. Moreover GCC had been marking it as obsolete feature for quite a long time. They still don't recommend to use it...
doc
@Michael Burr, which?
shoosh
+10  A: 
  1. How often do you have to add an include file to this project? Is it really so hard to add DIRNAME_FILENAME to the guard? And there is always GUIDs.
  2. Do you really rename files that often? Ever? Also, putting the GUARD in the #endif is just as annoying as any other useless comment.
  3. I doubt your 1000 header files guard defines are even a small percentage of the number of defines generated by your system libraries (especially on Windows).
  4. I think MSC 10 for DOS (20+ years ago) kept track of what headers were included and if they contained guards would skip them if included again. This is old tech.
  5. namespaces and templates should not span headers. Dear me, don't tell me you do this:

    template <typename foo>
    class bar {
    #include "bar_impl.h"
    };
    
  6. You said that already.

jmucchiello
OMG! MY EYES! .
shoosh
If the headers are as badly laced with conditionals as the ones I work with, having a comment after #endif can be valuable for seeing more easily which condition you are seeing the end of (especially when the start of the conditional is several hundred lines up the file).
Jonathan Leffler
I can think of use cases like those above (look at boost source code to see namespaced includes in action) but then there won't be any include guards at play, probably.
MP24
If it's that bad a problem, move the #define to just before the #endif. Then you don't need the comment.
jmucchiello
Another way to avoid having to edit the comment at the guard's `#endif` when you change the macro name is to use a generic comment: `#endif /* close include guard */` It's not going to be very confusing, since there's only ever one include guard section and the `endif` for an include guard is pretty much always the last line in the header anyway (so a comment is largely redundant anyway, but I can understand wanting one).
Michael Burr
+1  A: 

It's not in C (not C++) because updates since 1989 are all by committees. The members have different goals, but nobody is going to campaign for standardizing a feature that only saves two lines per file. If you want elegance, choose a modern language which was designed for it from the start; C was designed to minimize compiler complexity.

Devin Bayer
C++ is modern. Syntactically it is most complex and powerful imperative language.
doc
Modern it may be... elegant, not so much.
Jeremy Friesner
+2  A: 

The problem occurs when you have headers with the same filename in different directories.

So you have two headers which are both called ice_cream_maker.h in your project, both of which have a class called ice_cream_maker defined in them which performs the same function? Or are you calling every class in your system foo?

Nevertheless definition is included once, compiler still opens header every time it meets header inclusion.

Edit the code so you don't include headers multiple times.

For dependent headers (rather than the main header for a library), I often use header guards which are like this:

#ifdef FOO_BAR_BAZ_H
#error foo_bar_baz.h multiply included
#else
#define FOO_BAR_BAZ_H

// header body

#endif
Pete Kirkham
Yeah I know, preprocessor is powerful. However this method brings even more passive code. Why not just #pragma once and you have all this in a single line, which may be blindly pasted even by very primitive editor. In modularity aspects C++ is in early middle ages. Nor Java, nor PHP, nor Perl, nor Python, nor Smalltalk, nor Rebol, nor probably any modern language do inclusions in the way C/C++ does. Even Turbo Pascal got nicer system of units.
doc
I don't like my engineers blindly pasting code.
Pete Kirkham
+1  A: 

IIRC, #pragma anything is not part of the language. And it shows up in practice, a lot.

(edit:completely agree that the inclusion and linking system should have been more of a focus for the new standard as it is one of the most obvious weaknesses, in 'le this day and age' )

rama-jka toti
`#pragma STDC FP_CONTRACT ON` ??
Charles Bailey
Don't know Charles, as I can't look it up easily right now.. I'd say it is a compiler directive, rather than language thing but could be way wrong here..
rama-jka toti
I was giving an example of a `#pragma` _something_ that is defined in the language. ISO/IEC 9899:1999 6.10.6/2 is where the standard C `#pragma`s are listed (if you wanted to look them up).
Charles Bailey
of course you could be right.. perhaps others can chime in as well. anyway, it seems pretty chaotic with:"1996 Any such pragma that is not recognized by the implementation is ignored ""1995 The behavior might cause translation to fail or cause the translator or the resulting program to behave in a non-conforming manner. ""2001 an implementation is permitted to behave as if it were the standard pragma, but is not required to."
rama-jka toti
perhaps those are special, perhaps not. But reading the above just doesn't give me confidence to say it is a not compiler/implementation-dependent interpretation, thus possibly disqualifying it from being defined in the language. all of theoretical value only of course, as the practice has obviously taken on the opposite direction with each vendor shooting ever-increasing pragma-related warnings at each other.
rama-jka toti
+1  A: 

A pragmatic solution:

1) choose some consistent guard-naming policy (e.g. path relative to project root + file name, or whatever else you choose). Include possible exceptions for third-party code.

2) write a program (a simple python script would do) to walk recursively the source tree and verify that the guards all conformal to the policy. And whenever the guards are wrong, output a diff (or sed script, or whatever else) that the user can easily apply to fix. Or just ask for a confirmation and make changes from the same program.

3) make everyone on the project use it (say, before submitting to source control).

anonymous
A: 

Why don't you ask the C standards people and the C++ standards people? And when you have found out please let us all know!

Sam
+6  A: 

A directive like #pragma once is not trivial to define in a fully portable way that has clear an unambiguous benefits. Some of the concepts for which it raises questions are not well defined on all systems that support C, and defining it in a simple way might provide no benefit over conventional include guards.

When the compile encounters #pragma once, how should it identify this file so that it doesn't include its contents again?

The obvious answer is the unique location of the file on the system. This is fine if the system has unique locations for all files but many systems provide links (symlinks and hardlinks) that mean that a 'file' doesn't have a unique location. Should the file be re-included just because it was found via a different name? Probably not.

But now there is a problem, how is it possible to define the behaviour of #pragma once in a way that has an exact meaning on all platforms - even those that don't even have directories, let alone symlinks - and still get the desirable behaviour on systems that do have them?

You could say that a files identity is determined by its contents, so if an included file has a #pragma once and a file is included that has exactly the same contents, then the second and subsequent #includes shall have no effect.

This is easy to define and has well defined semantics. It also has good properties such that if a project is moved from a system that supports and uses filesystem links to one that doesn't, it still behaves the same.

On the downside, every time an include file is encountered containing a #pragma once its contents must be checked against every other file using #pragma once that has already been included so far. This implies a performance hit similar to using #include guards in any case and adds a not insignificant burden to compiler writers. Obviously, the results of this could be cached, but the same is true for conventional include guards.

Conventional include guards force the programmer to choose a macro that is the unique identifier for an include file, but at least the behaviour is well-defined and simple to implement.

Given the potential pitfalls and costs, and the fact the conventional include guards do work, it is not surprising to me that the standards committee didn't feel the need to standardize #pragma once.

Charles Bailey
I'm upvoting this for the effort but want to present the other side too. if they are touting portability and finally including constructs that are widely used for over 20 years.. and most importantly, if they are on a payroll, and if standards are supposed to be obeyed, it naturally would piss me off because it is a cost to everyone else but vendors to support non-standard pragmas and by implication, even allow them to exist. that would be a job bad done in my opinion.
rama-jka toti
and sure, while policing is hard to enforce TM Ltd, someone at the committee wonderland should have picked up the idea that inclusion and linking mechanism is really outdaded, error-prone and requires urgent and backward-compatible modernisation to migrate to. they had roughly 15 years of good examples out there and nothing comes out on it yet again..
rama-jka toti
Furthermore, if I remember correctly, #pragma is a preprocessor directive to allow for vendor-specific preprocessor commands, so it's kind of an oxymoron to standardize a pragma.
blwy10
"The obvious answer is the unique location of the file on the system. This is fine if the system has unique locations for all files but many systems provide links (symlinks and hardlinks) that mean that a 'file' doesn't have a unique location. Should the file be re-included just because it was found via a different name? Probably not."..................................................................Just follow links, they point to explicit file name! Really, this shouldn't be a tremendous task for the compiler building team.
doc
And for hardlinks?
Charles Bailey
Compiler interacts with the file system on quite a low level. It must consider different directory separators in different OSes, usually it calls sys functions to determine modification date of source files, and when you include file this also isn't operation with well defined semantics. One one OS file name may have maximaly 8 characters, on other "B" letter may be unallowed character etc. Paradoxically Unix born C can't handle hard/sym -links, which are a POSIX standard.
doc
Hard link also should point to explicit file name. If for some reason it doesn't then this is user/OS problem and not the compiler problem.
doc
Going back to the original question, it should be obvious that we are getting system specific so getting a wording that is acceptable for *all* systems where C is implemented is going to be hard. Compilers don't interact with the filesystem at a fundamentally low level, they are just clients of the service like any other application.
Charles Bailey
On a typical Unix filesystem, you can tell from a filename whether there are other hardlinks that link to a given file, there is usually no reverse mapping to tell you what other filenames also link to this file, indeed a link might be just an open file handle with no name. Even if there is another name that links to a file there is no 'primary' name, so either name is just as good... unless someone unlinks one of the names because it was only a temporary filename.
Charles Bailey
It seems to me a bit nit-picking. I doubt anyone have ever need hard links functionality during development. But fine, what about defining that pragma once just can not be used within hard links? In this case compiler should trigger an error (but it is of course not obliged).
doc
And BTW what if programmer includes files from /proc, /dev or other pseudofile system? Or if he includes a pipe or a socket? Of course results of such operations are compeletely undefined, yet you are allowed to use #include.
doc