ansaurus

Question

Can a conforming C implementation #define NULL to be something wacky?

Answer 1

+1 A:

Well, I've found a way to prove that

#define NULL ((void*)-1)

is not a legal definition of NULL.

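#include <stddef.h>   /* provides NULL */

int main(void)
{
   void (*fp)() = NULL;   /* a null pointer constant converts to any pointer type */
   return 0;
}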

Initializing a function pointer with NULL is legal and correct, whereas...

int main(void) 
{ 
   void (*fp)() = (void*)-1;   
}

...is a constraint violation that requires a diagnostic. So that's out.

But the __builtin_magic_null_pointer definition of NULL wouldn't suffer that problem. I'd still like to know if anybody can come up with a reason why it can't be.

janks 2010-04-08 11:11:13

Why is your second initialization a constraint violation that requires a diagnostic? *If* a conforming compiler is allowed to announce that `(void*)-1` is a null pointer constant (which I doubt), then my suspicion (without knowing what text you're looking at) is that it would be a legal initialization, because null pointer constants by definition convert to any pointer type, including pointer-to-function.

Steve Jessop 2010-04-08 11:20:51

You could very well be right about that, heh. I had in mind the first clause of `3.2.2.3 Pointers`, which says `A pointer to void may be converted to or from a pointer to any incomplete or object type...`. A function type is neither an incomplete nor an object type. But now I see the final clause of `3.3.16.1 Simple assignment ... Constraints ... One of the following shall hold ... the left operand is a pointer and the right is a null pointer constant`. If the implementation defined `((void*)-1)` as a valid null pointer constant, then that would seem to permit it. Reading the standard isn't easy.

janks 2010-04-08 11:52:53
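
To make the simple-assignment clause discussed in these comments concrete, here is a minimal sketch. The helper name `function_pointers_are_null` is made up for illustration; the point is only that each recognised null pointer constant may initialise a function pointer, even though a general `void *` value may not.

```c
#include <stddef.h>

/* Hypothetical helper: returns 1 if every function pointer below is null. */
int function_pointers_are_null(void)
{
    void (*from_zero)(void)  = 0;          /* integer constant expression with value 0 */
    void (*from_cast)(void)  = (void *)0;  /* the same expression cast to void * */
    void (*from_macro)(void) = NULL;       /* whichever form the implementation chose */

    /* All three compare equal to a null pointer constant. */
    return from_zero == 0 && from_cast == 0 && from_macro == 0;
}
```

Calling `function_pointers_are_null()` from `main` should yield 1 on a conforming implementation, whichever form `NULL` expands to.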

Answer 2

A:

An integral constant expression with the value 0, or such an expression cast to type void * , is called a null pointer constant.

NULL which expands to an implementation-defined null pointer constant;

therefore either

NULL == 0

or

NULL == (void *)0

ammoQ 2010-04-08 11:27:06

But does the first sentence preclude other forms of the null pointer constant? Or is it the minimum set of null pointer constants that must be recognized by the implementation?

janks 2010-04-08 11:55:38

@ammoQ: you haven't listed all possibilities. The following are also integral constant expressions with the value 0: `0x0`, `0L`, `(1-1)`, `(12^12)`, and depending on implementation possibly `(2*INT_MIN)`.

Steve Jessop 2010-04-08 12:11:57
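
A small sketch of the spellings listed in the comment above, each used as a null pointer constant. The function name is hypothetical, and the `2*INT_MIN` case is left out because it depends on the implementation. Some compilers may warn about the less obvious spellings, but all of them are integer constant expressions with the value 0.

```c
#include <stddef.h>

/* Hypothetical helper: returns 1 if every spelling of zero yields a null pointer. */
int zero_spellings_give_null(void)
{
    char *a = 0x0;               /* hexadecimal zero */
    char *b = 0L;                /* long zero */
    char *c = (1 - 1);           /* arithmetic that folds to zero */
    char *d = (12 ^ 12);         /* bitwise XOR of equal values is zero */
    char *e = (void *)(1 - 1);   /* such an expression cast to void * */

    return a == NULL && b == NULL && c == NULL && d == NULL && e == NULL;
}
```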

@janks: You are confusing two different things. In the *source code* the only valid value of `NULL` is zero (or zero cast to `void *`). However, this code may *compile* to a different representation, without any need for the machine to actually support zero as an equivalent.

Arkku 2010-04-08 12:14:18

@Arkku: I'm well aware of the distinction between value and representation. Can you quote any standardese to support the claim that 0 or 0 cast to void* are the ONLY legal forms for the null pointer constant to take in the source code? `3.2.2.3` quoted above says nothing about "only", and doesn't otherwise imply to me that it is an exhaustive list. Is there another part of the standard that you know of that clarifies this?

janks 2010-04-08 12:20:46

@janks: See my answer for references to the standard.

Arkku 2010-04-08 12:39:22

Steve: that's why I used the == operator, instead of assuming a #define

ammoQ 2010-04-08 16:14:34

janks: IMO the first sentence precludes other forms of null pointer constants.

ammoQ 2010-04-08 16:19:01

Answer 3

+5 A:

In the C99 standard, 7.17.3 states that NULL “expands to an implementation-defined null pointer constant”. Meanwhile 6.3.2.3.3 defines a null pointer constant as “an integer constant expression with the value 0, or such an expression cast to type void *”. As there is no other definition of a null pointer constant, a conforming definition of NULL must expand to an integer constant expression with the value zero (or such an expression cast to void *).

Further quoting from the C FAQ question 5.5 (emphasis added):

Section 4.1.5 of the C Standard states that NULL “expands to an implementation-defined null pointer constant,” which means that the implementation gets to choose which form of 0 to use and whether to use a `void *` cast; see questions 5.6 and 5.7. “Implementation-defined” here does not mean that NULL might be #defined to match some implementation-specific nonzero internal null pointer value.

It makes perfect sense; since the standard requires a zero integer constant in pointer contexts to compile into a null pointer (regardless of whether or not the machine's internal representation of that has a value of zero), the case where NULL is defined as zero must be handled anyhow. The programmer is not required to type NULL to obtain null pointers; it's just a stylistic convention (and may help catch errors e.g. when a NULL defined as (void *)0 is used in a non-pointer context).
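
As a rough illustration of the difference between the source-level zero and the stored representation, the sketch below uses two hypothetical helpers. The first checks what the language guarantees: a literal 0 in a pointer context compares equal to NULL. The second merely reports how this particular implementation happens to store a null `int *`; its result is not guaranteed by the standard.

```c
#include <stddef.h>
#include <string.h>

/* Guaranteed: a source-level 0 in a pointer context is a null pointer. */
int source_zero_is_null(void)
{
    int *p = 0;
    return p == NULL && p == (void *)0;
}

/* Not guaranteed either way: reports whether a null int * is stored as
   all-zero bytes on this implementation (1 if so, 0 otherwise). */
int null_is_all_zero_bytes(void)
{
    int *p = 0;
    unsigned char bytes[sizeof p];
    size_t i;

    memcpy(bytes, &p, sizeof p);   /* copy the object representation */
    for (i = 0; i < sizeof p; i++) {
        if (bytes[i] != 0) {
            return 0;
        }
    }
    return 1;
}
```

On most current platforms the second function will probably return 1, but portable code should rely only on the first property.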

Edit: One source of confusion here seems to be the concise language used by the standard, i.e. it does not explicitly say that there is no other value that might be considered a null pointer constant. However, when the standard says “…is called a null pointer constant”, it means that exactly the given definitions are called null pointer constants. It does not need to explicitly follow every definition by stating what is non-conforming when (by definition) the standard defines what is conforming.

Arkku 2010-04-08 12:29:30

The C99 text you've quoted is the same as the C89 text, and the FAQ isn't normative. You might be onto something with the argumentation regarding the absence of other definitions. I'll have to look further into that.

janks 2010-04-08 13:09:51

Edited the answer to address the absence of other definitions. One way to think about it would be to look at other parts of the standard; when there are implementation-defined possibilities involved, it's always explicitly stated. The language in the standard aims to be exact; there's no room for speculating about things left unsaid.

Arkku 2010-04-08 13:30:37

One may also consider how the definition of *null pointer constant* would look if other possibilities were allowed. It would not say “X is called…” and then give no mention of other possibilities if there were any, because that would allow arbitrary things (like your neighbour's cat) to be called null pointer constants. If there were other options, it would define what exactly *can* be a null pointer constant (e.g. “any implementation-defined integer constant expression or such an expression cast to void *”).

Arkku 2010-04-08 13:39:16

But the standard has plenty of examples of completely restricted implementation-defined behaviour (whether char is signed or unsigned, two choices), as well as completely unrestricted implementation-defined behaviour (maximum number of case labels in a switch statement, additional forms of `main` beyond `main()` and `main(int, char**)`, representation of floats, etc.). Why is there a problem with unbounded lists of arbitrary things? Implementation-defined means the implementation must define them somewhere, so they'll be exhaustively documented by the implementation at the end of the day, no matter what it chooses.

janks 2010-04-08 14:37:32

That's my point; in each of these implementation-defined cases the standard specifies that they are up to the implementation. With null pointer constants, only the two possibilities are given. The definition of NULL says that it *is* a null pointer constant, but this time explicitly states that the implementor can decide which null pointer constant to use. The definition of null pointer constant does not leave any room to assume that there might be other possibilities.

Arkku 2010-04-08 14:55:27

To borrow terms from OOP: NULL is defined as having to conform to a NullPointerConstant interface, but the definition of the NullPointerConstant interface only allows these specific types of implementations.

Arkku 2010-04-08 14:56:38

Answer 4

A:

The null pointer constant must evaluate to 0, otherwise expressions like !ptr would not work as expected.

The NULL macro expands to a 0-valued expression; AFAIK, it always has.

John Bode 2010-04-08 14:10:51

What would prevent the compiler from re-writing `!ptr` as `ptr == __magic_null` if that is what it used? All that is required of the compiler is that it make !ptr work somehow. It doesn't have to work as-if by treating ptr as an integer.

janks 2010-04-08 14:31:26
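
A minimal sketch of the equivalence being discussed, assuming nothing beyond standard C: the language defines `!E` as `(0 == E)`, so `!ptr` is a comparison against a null pointer, however that pointer is represented internally. The helper name is made up for illustration.

```c
#include <stddef.h>

/* Hypothetical helper: returns 1 if !ptr agrees with both explicit
   comparisons against a null pointer constant. */
int negation_matches_null_comparison(const void *ptr)
{
    int negated   = !ptr;           /* defined by the language as (0 == ptr) */
    int vs_macro  = (ptr == NULL);
    int vs_zero   = (ptr == 0);

    return negated == vs_macro && negated == vs_zero;
}
```

The function returns 1 for both null and non-null arguments, since the three tests are the same comparison written three ways.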
