ansaurus

Question

how do i explain that if (xyz == null) checks are not "protective".

Answer 1

+6 A:

It really depends on the precise situation. It's rarely advisable to give general suggestions like "don't put null checks in your code", as you seem to be indicating. The contract of the class should define what's legit and what isn't. But if the contract makes it clear that passing in null is not acceptable, then an exception is indeed an appropriate response.

John Feminella 2009-10-30 02:26:49

Answer 2

+2 A:

I haven't watched through it, but eiffel.com has 2 presentations (slides+audio) on the topic of design by contract. These guys have invented the concept, so if anyone can explain it it's them :-)

Wim Coenen 2009-10-30 02:28:53

Answer 3

+11 A:

If your class cannot accept null arguments, then the best thing to do is this:

if (arg == null)
    throw new ArgumentNullException();

This is vastly preferable to getting a NullPointerException deeper down the stack. In the worst case scenario, you'll cache that null somewhere and won't actually trigger the exception until much later, and see how much fun you'll have debugging the problem then.

And as others have stated, sometimes the contract says that null is okay. In that case, having a guard clause around some parts of the code is correct--although even then I'd say that the best design would be to add an overload without the optionally-null arguments.

JSBangs 2009-10-30 02:30:57

if the contract says null is ok, then you should probably be using a nullable to more explicitly indicate this.

RCIX 2009-10-30 08:36:59

Answer 4

+2 A:

Code contracts in .net 4.0 will hopefully make this behavior much more consistent. Any articles which talk about code contracts will help get the idea across, and in the future, this kind of syntax will provide the method.

http://blogs.msdn.com/bclteam/archive/2008/11/11/introduction-to-code-contracts-melitta-andersen.aspx

jsimmons 2009-10-30 02:36:27

Unfortunately it seems that Code Contracts will not be available in the Professional edition of visual studio 2010. This will prevent if from really taking off.

Wim Coenen 2009-10-30 02:41:53

The actual contracts are a CLR feature, which will be common across all versions ( Mono included, eventually ). What is limited to team studio seems to be an application which examines code and checks for contract compliance. Runtime checking will be available everywhere.

jsimmons 2009-10-30 02:45:40

The Contract classes are indeed in the BCL (System.Diagnostics.Contracts) but they don't actually do anything without the tools; I've tried it in VS2010 Premium beta2: http://stackoverflow.com/questions/1608406/does-visual-studio-2010-premium-include-the-contract-tools

Wim Coenen 2009-10-30 23:46:29

Answer 5

+3 A:

As everyone else has said, it is vastly preferable to fail early than to get mysterious problems in production because the function didn't do anything when it was expected to. if the function returns for null arguments, as in your example).

Even if the function doesn't return and just throws a NullReferenceException, it's eaiser to solve a bug when you know that an argument was null. If a function throws a NullReferenceException, you have no idea what was null or whose fault it was.

I'd like to add that ArgumentNullException takes a parameter for a reason.

It is better to write

if(myArg == null) throw new ArgumentNullException("myArg");

than to throw an ArgumentNullException without a paramName.

This way, if you have an exception from a function that takes five parameters, you'll know which of the parameters caused the problem. This is especially important if you cannot attach a debugger. (For example, on a production web server or an end-user machine)

If you're writing many functions, this can be a lot of overhead, especially since there's no IntelliSense for the strings. I wrote a code snippet to generate these checks:

<?xml version="1.0" encoding="utf-8" ?>
<CodeSnippets  xmlns="http://schemas.microsoft.com/VisualStudio/2005/CodeSnippet"&gt;
    <CodeSnippet Format="1.0.0">
     <Header>
      <Title>Check for null arguments</Title>
      <Shortcut>tna</Shortcut>
      <Description>Code snippet for throw new ArgumentNullException</Description>
      <Author>SLaks</Author>
      <SnippetTypes>
       <SnippetType>Expansion</SnippetType>
       <SnippetType>SurroundsWith</SnippetType>
      </SnippetTypes>
     </Header>
     <Snippet>
      <Declarations>
       <Literal>
        <ID>Parameter</ID>
        <ToolTip>Paremeter to check for null</ToolTip>
        <Default>value</Default>
       </Literal>
      </Declarations>
      <Code Language="csharp"><![CDATA[if ($Parameter$ == null) throw new ArgumentNullException("$Parameter$");
     $end$]]>
      </Code>
     </Snippet>
    </CodeSnippet>
</CodeSnippets>

SLaks 2009-10-30 02:50:17

Answer 6

+1 A:

Sometimes you can't tell people why a practice like this is wrong - they have to figure it out for themselves. But you could help them get there by coming up with some unit test that causes some nasty failure due to this problem, and make them debug the error.

Spike Williams 2009-10-30 02:53:22

You are assuming that they write unit tests. The kind of developers who write unit tests will usually throw exceptions already. (I hope)

SLaks 2009-10-30 03:51:35

Answer 7

+1 A:

If the method's contract specifies that its arguments should not be null, then the right thing to do is to make it explicit, by using an Assert, like this:

Debug.Assert( item != null, "Null items are not supported );

This will fail fast when the executable is built using a debug configuration, but will present zero performance degradation when built using a release configuration.

Phillip Ngan 2009-10-30 02:58:40

If you're so worried about performance that you're unwilling to compare a reference to `null`, you should be writing assembler. Besides, are you saying that bugs do not happen in production?

SLaks 2009-10-30 03:14:11

Not to mention the implicit null checks on every member access that are there either way...

Pavel Minaev 2009-10-30 04:17:39

Answer 8

A:

Well first of all, you are unequivocally incorrect. You are embracing a very serious logical fallacy. You want your code to be correct by virtue of the code assuming that everything happening around it is correct. As if correctness was some kind of magical pixie dust that you just need to spray everywhere.

All bugs are or seem stupid once they are exposed. But its checks like this that tease them out to expose themselves. Until then, bugs are invisible. And for large and complex enough projects, you don't know who will find the bug or under what conditions they will be found. Code which is architected for resilience typically has checks like this all over the place and also check the return values for every function which have to include error values. So you end up encoding a "I can't do that because the sub-functions I rely on are not working" semantic that is actually handled properly. The great value of this is that you can usually implement work arounds or self-aware debug instrumentation quite easily. Why you want to do things like this is because the most difficult bugs usually rely on both of those properties to debug properly.

Learn some lessons from your developers. They put checks like that in there, because they don't know why sometimes they get strange results from functions. You call them naive or overly cautious because of one narrow piece of knowledge that you have that they don't. But when you are debugging something nasty you are going to wonder why you don't have such checks in your code, and you are going to end up looking just as naive for not being able to spot the bug in first place.

In short: No code is made robust by assuming robustness about the environment around them.

Paul Hsieh 2009-10-30 03:01:27

You are wrong. His point is that fail-fast code makes it much easier to find bugs in the code that calls it, and he is very very right.

SLaks 2009-10-30 03:10:42

Wait; which side are you defending?

SLaks 2009-10-30 03:12:43

Failing fast is only useful for a developer with the source code and debugger right in front of him/her, and if they are making a really really simple kind of error. Testers, beta-program people, and developers who are working in other parts of the code will also run into this. And if they fail fast, what are they going to do about it?If you are getting bad parameters, then what you want to know is *WHY* you are getting bad parameters. The *CALL SITE* will know more about that then the function which tries to consume the bad parameter.

Paul Hsieh 2009-10-30 03:32:16

Unfortunately, if you look at the example the poster provided, that almost certainly just swallows up the error into a black hole from which no information will escape. That allows the error to last much longer. On the other hand, if the protection code at least logs a message saying what happened ("null argument in XXX, all bets are off! investigate!!!") in a log that someone will actually check, then I might agree with you.

khedron 2009-10-30 03:46:43

Well keep in mind that the original poster has predeclared his prejudice on this matter. So the fact that he showed to useless example code should not be taken as evidence of anything.Of course you should have very descriptive failures. Returning an enum like: APP_ERR_NULL_FOO_OBJECT which the call site then notices, potentially cleans up and logs as errLogf("Error: %s foo(%d)", errString (APP_ERR_NULL_FOO_OBJECT), parameterToInitFoo); or something like that, then that is ideal.You can't learn or appreciate the value of this with the questioner's attitude/point of view.

Paul Hsieh 2009-10-30 04:02:56

The point he's trying to make is tha `Run` should never have been called in the first place. The `if (... == null)` check, returning `APP_ERR_NULL_FOO_OBJECT`, etc. is all code that simply shouldn't exist in `Run` to begin with.

Dan 2009-10-30 04:16:01

@Dan: That's extremely poor design. You don't want to replicate each parameter check in the call sites. That would be relying on all call sites. Being NULL is not the only way such a parameter or the function itself could go wrong. I didn't literally mean you should call errString(APP_ERR...). Obviously it would be something like errString(ret) which would correspond to whichever error was returned by the called function.

Paul Hsieh 2009-10-30 04:38:29

Answer 9

A:

My technique for avoiding if (pOrder == NULL) in C++ was to make heavy use of references

void Run(Order&);
void Run(const Order&);

no pointers, no temptation to check for NULL. But C++-style references don't exist for references in C#.

Perhaps one way of (somewhat) mimicing the C++ technique in C# is to use struct Order instead of class Order (see use MORE structs?). You could then have

void Run(Order order) { ... work on a COPY of 'order' ... }
void Run(ref Order order) { ... }

if you really wanted to pass null to Run(), you could do that too:

void Run(Order? order) { ... }

Dan 2009-10-30 03:22:30

There are many reasons not to do this. Inexperienced programmers will not realize that structs are passed by value and may try to mutate the struct in the method. Also, all f the copying may slow down the program. Finally, many methods take parameters of types you didn't define (eg, `Control` or `DataTable`).

SLaks 2009-10-30 03:33:39

The point is that a `struct` CAN'T be `null` so there is no more `if (order == null)` thus "solving" the problem.

Dan 2009-10-30 03:43:31

But creating other problems.

SLaks 2009-10-30 03:48:14

Engineering is about tradeoffs.

Dan 2009-10-30 03:51:29

Yes, but I don't think this one's worth it.

SLaks 2009-10-30 03:57:49

Answer 10

+1 A:

This seems to be a question about how best write code that is manageable. It is my new belief that you must assume ignorance of all consumers of your code. I have gotten myself into trouble by assuming I or someone with deep knowledge would be consuming my code. The only thing I would add to throwing an exception is creating custom exceptions as well as leaving breadcrumbs in the inner exception. I believe strongly in giving your developers a chance to run down the issue especially if it is due to data. I spend most of my time looking for the data that breaks my code and if you can leave hints you will save weeks in a year.

2009-10-30 03:55:05

ansaurus

tags:

views:

answers:

how do i explain that if (xyz == null) checks are not "protective".

related questions