ansaurus

Question

Should programmers use boolean variables to "document" their code?

Answer 1

+47 A:

Splitting an expression that's too nested and complicated into simpler sub-expressions assigned to local variables, then put together again, is quite a common and popular technique -- quite independently of whether the sub-expressions and/or the overall expression are boolean or of just about any other type. With well-chosen names, a tasteful decomposition of this kind can increase readability, and a good compiler should have no trouble generating code that's equivalent to the original, complicated expression.

Some languages that don't have the concept of "assignment" per se, such as Haskell, even introduce specialized constructs to let you use the "give a name to a subexpression" technique (the where clause in Haskell) -- seems to bespeak some popularity for the technique in question!-)

Alex Martelli 2010-03-19 14:49:28

if it's simpler AND easier to read, I say that's a pretty clear case of win-win :)

Steve the Plant 2010-03-19 14:53:40

I agree. In a lot of cases you could probably get away with a short comment, but I don't see how this could actually *hurt.*

Max E. 2010-03-19 22:44:57

Answer 2

+9 A:

I try to do it wherever possible. Sure, you are using an "extra line" of code, but at the same time, you are describing why you are making a comparison of two values.

In your example, I look at the code, and ask myself "okay why is the person seeing the value is less than 0?" In the second you are clearly telling me that some processes have finished when this occurs. No guessing in the second one what your intent was.

The big one for me is when I see a method like: DoSomeMethod(true); Why is it automatically set to true? It's much more readable like

bool deleteOnCompletion = true;

DoSomeMethod(deleteOnCompletion);

Kevin 2010-03-19 14:50:17

I dislike boolean parameters for precisely this reason. You end up getting calls like "createOrder(true, false, true, true, false)" and what does that mean? I prefer to use enum's, so you say something like "createOrder(Source.MAIL_ORDER, BackOrder.NO, CustomOrder.CUSTOM, PayType.CREDIT)".

Jay 2010-03-19 17:31:37

But if you follow Kevin's example, it's equivalent to yours. What difference does it make whether the variable can take on 2 values or more than 2?

Mark Ruzon 2010-03-19 19:15:09

With Jay's you can have the advantage of definitely being clearer in certain instances. For example, in use of the PayType. If it was a boolean the parameter would probable be named, isPayTypeCredit. You don't know what the altertnative is. With an enum you can clearly see what options the PayType is: Credit, Check, Cash and select the correct one.

Kevin 2010-03-19 19:47:28

@Kevin++ also an enumeration doesn't allow null assignment so value control is quite literally complete, and the auto-documentation of the enum is gravy on top

Hardryv 2010-03-24 17:06:29

Objective-C and Smalltalk really solve this problem, in Objective-C: `[Object createOrderWithSource:YES backOrder:NO custom:YES type:kCreditCard];`

chpwn 2010-03-28 19:37:01

Answer 3

+13 A:

I have used it, though normally wrapping boolean logic into a reusable method (if called from multiple locations).

It helps readability and when the logic changes, it only needs changing in one place.

Other will understand it and will not find it strange (except for those who only ever write thousand line functions, that is).

Oded 2010-03-19 14:50:30

Thanks for the answer! Regarding the reusable methods, that has another (unrelated) valid reason to be factored out... So I suppose my question really is if you should factor out one-time boolean expressions, when there's no reason other than readability. (Which of course is a big enough reason on its own :) ) Thanks for pointing that out.

froadie 2010-03-19 15:03:19

+1 for reducing the code changes necessary when the logic changes, and making fun of thousand-line-function programmers.

Jeffrey L Whitledge 2010-03-19 17:46:41

Answer 4

A:

I think, it depends on what style you / your team prefer. "Introduce variable" refactoring could be useful, but sometimes not :)

And I should disagree with Kevin in his previous post. His example, I suppose, usable in case, when introduced variable can be changed, but introducing it only for one static boolean is useless, because we have parameter name in a method declaration, so why duplicate it in code?

for example:

void DoSomeMethod(boolean needDelete) { ... }

// useful
boolean deleteOnCompletion = true;
if ( someCondition ) {
    deleteOnCompletion = false;
}
DoSomeMethod(deleteOnCompletion);

// useless
boolean shouldNotDelete = false;
DoSomeMethod(shouldNotDelete);

dchekmarev 2010-03-19 14:54:02

Answer 5

+3 A:

The only way I could see this going wrong is if the boolean fragment doesn't have a name that makes sense and a name is picked anyhow.

//No clue what the parts might mean.
if(price>0 && (customer.IsAlive || IsDay(Thursday)))

=>

first_condition = price>0
second_condition =customer.IsAlive || IsDay(Thursday)

//I'm still not enlightened.
if(first_condition && second_condition)

I point this out because it is commonplace to make rules like "comment all your code", "use named booleans for all if-criteria with more than 3 parts" only to get comments that are semantically empty of the following sort

i++; //increment i by adding 1 to i's previous value

MatthewMartin 2010-03-19 14:59:31

Jonathan Leffler 2010-03-20 06:11:54

Parens added. So this would be another strike against splitting to named variables, it introduces an opportunity to change the grouping in an non-obvious fashion.

MatthewMartin 2010-03-23 13:46:06

Answer 6

+2 A:

By doing this

finished = ((elementIndex < 0) || (MAX_ELEMENTS < elementIndex));
repeatedEntry = (elementIndex == lastElementIndex);
if(finished || repeatedEntry){
   ...
}

you remove the logic from your brain and put it in the code. Now the program knows what you meant.
Whenever you name something you give it physical representation. It exists.
You can manipulate and reuse it.

You can even define the whole block as a predicate:

bool ElementBlahBlah? (elementIndex, lastElementIndex);

and do more stuff (later) in that function.

Nick D 2010-03-19 15:09:03

And more importantly, the next dev to look at the code will know what you meant too! This is great practice, and I do it all the time.

Chris Thornton 2010-03-19 18:54:57

Answer 7

+2 A:

If the expression is complex then I either move it to another function which returns a bool eg, isAnEveningInThePubAGoodIdea(dayOfWeek, sizeOfWorkLoad, amountOfSpareCash) or reconsider the code so that such a complex expression isn't required.

Benedict Cohen 2010-03-19 15:11:10

Answer 8

+1 A:

Remember that this way you compute more than necessary. Due to taking conditions out from the code, you always compute both of them (no short circuit).

So that:

if((elementIndex < 0) || (MAX_ELEMENTS < elementIndex) || 
   (elementIndex == lastElementIndex)){
   ...
}

After the transformation becomes:

if((elementIndex < 0) || (MAX_ELEMENTS < elementIndex) |
   (elementIndex == lastElementIndex)){
   ...
}

Not an issue in most cases, but still in some it may mean worse performance or other issues, e.g. when in the 2nd expression you assume the 1st one has failed.

Konrad Garus 2010-03-19 15:15:57

A compiler will resolve that during optimization.

Jennifer Zouak 2010-03-19 15:22:46

froadie 2010-03-19 15:24:41

I actually doubt that the compiler will resolve that. If the second call is not effective, it's usually due to call of some function, and AFAIK no compiler is trying to determine if a called function is without side-effects.

J S 2010-03-19 15:58:49

You can nest the IFs, and don't do the later calculations unless the first test is insufficient to decide whether to proceed.

Jay 2010-03-19 17:34:03

Answer 9

+5 A:

The provided sample:

finished = ((elementIndex < 0) || (MAX_ELEMENTS < elementIndex));
repeatedEntry = (elementIndex == lastElementIndex);
if(finished || repeatedEntry){
   ...
}

Can also be rewritten to use methods, which improve readability and preserve the boolean logic (as Konrad pointed out):

if (IsFinished(elementIndex) || IsRepeatedEntry(elementIndex, lastElementIndex)){
   ...
}

...

private bool IsFinished(int elementIndex) {
    return ((elementIndex < 0) || (MAX_ELEMENTS < elementIndex));
}

private bool IsRepeatedEntry(int elementIndex, int lastElementIndex) {
    return (elementIndex == lastElementIndex);
}

It comes at a price, of course, which is two extra methods. If you do this a lot, it may make your code more readable, but your classes less transparent. But then again, you could also move the extra methods into helper classes.

Prutswonder 2010-03-19 15:37:33

If you got into alot of code noise with C# you can also take advantage of partial classes and move the noise to the partial and if people are interested in what IsFinished is checking it's easy to jump to.

Chris Marisic 2010-03-19 17:45:49

Answer 10

+2 A:

I think it's better to create functions/methods instead of temporary variables. This way readability is increased also because methods get shorter. Martin Fowler's book Refactoring has good advice for improving code quality. Refactorings related to your particular example are called "Replace Temp with Query" and "Extract Method".

mkj 2010-03-19 16:00:01

Are you saying that by cluttering the class-space with a lot of one-off functions, you're increasing readability? Please explain.

Zano 2010-03-19 16:50:57

It's always a trade off. The readability of the original function will improve. If the original function is short it might not be worth it.

mkj 2010-03-19 16:55:31

Also "cluttering the class-space" is something I think depends on the language used and how you partition your code.

mkj 2010-03-19 17:01:27

Answer 11

+2 A:

Personally, I think that this is a great practice. It's impact on the execution of the code is minimal, but the clarity that it can provide, if used properly, is invaluable when it comes to maintaining the code later.

GrowlingDog 2010-03-19 16:04:12

Answer 12

A:

In my experience, I've often returned to some old scripts and wondered 'what the hell was I thinking back then?'. For example:

Math.p = function Math_p(a) {
    var r = 1, b = [], m = Math;
    a = m.js.copy(arguments);
    while (a.length) {
        b = b.concat(a.shift());
    }
    while (b.length) {
        r *= b.shift();
    }
    return r;
};

which isn't as intuitive as:

/**
 * An extension to the Math object that accepts Arrays or Numbers
 * as an argument and returns the product of all numbers.
 * @param(Array) a A Number or an Array of numbers.
 * @return(Number) Returns the product of all numbers.
 */
Math.product = function Math_product(a) {
    var product = 1, numbers = [];
    a = argumentsToArray(arguments);
    while (a.length) {
        numbers = numbers.concat(a.shift());
    }
    while (numbers.length) {
        product *= numbers.shift();
    }
    return product;
};

rolandog 2010-03-19 17:30:22

-1. This has a little to do with the original question, to be honest. The question is about a different, very specific thing, not about how to write good code in general.

Pavel Shved 2010-03-19 18:45:27

And yes, welcome to StackOverflow.com!

Pavel Shved 2010-03-19 18:45:55

Answer 13

A:

I rarely create separate variables. What I do when the tests get complicated is nest the IFs and add comments. Like

boolean processElement=false;
if (elementIndex < 0) // Do we have a valid element?
{
  processElement=true;
}
else if (elementIndex==lastElementIndex) // Is it the one we want?
{
  processElement=true;
}
if (processElement)
...

The admitted flaw to this technique is that the next programmer who comes along may change the logic but not bother to update the comments. I guess that's a general problem, but I've had plenty of times I've seen a comment that says "validate customer id" and the next line is examining the part number or some such and I'm left to wonder where the customer id comes in.

Jay 2010-03-19 17:38:50

Answer 14

+1 A:

if the method requires a notification of success: (examples in c#) I like to use the

bool success = false;

to start out. the code's a falure until I change it to:

success = true;

then at the end:

return success;

Chris 2010-03-23 00:18:09

ansaurus

tags:

views:

answers:

Should programmers use boolean variables to "document" their code?

related questions