When is it good (if ever) to scrap production code and start over?

views:

1907

answers:

+40 Q:

When is it good (if ever) to scrap production code and start over?

I was asked to do a code review and report on the feasibility of adding a new feature to one of our new products, one that I haven't personally worked on until now. I know it's easy to nitpick someone else's code, but I'd say it's in bad shape (while trying to be as objective as possible). Some highlights from my code review:

Abuse of threads: QueueUserWorkItem and threads in general are used a lot, and Thread-pool delegates have uninformative names such as PoolStart and PoolStart2. There is also a lack of proper synchronization between threads, in particular accessing UI objects on threads other than the UI thread.
Magic numbers and magic strings: Some Const's and Enum's are defined in the code, but much of the code relies on literal values.
Global variables: Many variables are declared global and may or may not be initialized depending on what code paths get followed and what order things occur in. This gets very confusing when the code is also jumping around between threads.
Compiler warnings: The main solution file contains 500+ warnings, and the total number is unknown to me. I got a warning from Visual Studio that it couldn't display any more warnings.
Half-finished classes: The code was worked on and added to here and there, and I think this led to people forgetting what they had done before, so there are a few seemingly half-finished classes and empty stubs.
Not Invented Here: The product duplicates functionality that already exists in common libraries used by other products, such as data access helpers, error logging helpers, and user interface helpers.
Separation of concerns: I think someone was holding the book upside down when they read about the typical "UI -> business layer -> data access layer" 3-tier architecture. In this codebase, the UI layer directly accesses the database, because the business layer is partially implemented but mostly ignored due to not being fleshed out fully enough, and the data access layer controls the UI layer. Most of the low-level database and network methods operate on a global reference to the main form, and directly show, hide, and modify the form. Where the rather thin business layer is actually used, it also tends to control the UI directly. Most of this lower-level code also uses MessageBox.Show to display error messages when an exception occurs, and most swallow the original exception. This of course makes it a bit more complicated to start writing units tests to verify the functionality of the program before attempting to refactor it.

I'm just scratching the surface here, but my question is simple enough: Would it make more sense to take the time to refactor the existing codebase, focusing on one issue at a time, or would you consider rewriting the entire thing from scratch?

EDIT: To clarify a bit, we do have the original requirements for the project, which is why starting over could be an option. Another way to phrase my question is: Can code ever reach a point where the cost of maintaining it would become greater than the cost of dumping it and starting over?

+6 A:

Two threads of thought on this one: Do you have the original requirements? Do you have confidence that the original requirements are accurate? What about test plans or unit tests? If you have those things in place it might be easier.

Putting on my customer hat, does the system work or is it unstable? If you've got something that's unstable you've got an argument to change; otherwise you're best of refactoring it bit by bit.

Martin Clarke 2008-09-27 23:37:19

+25 A:

To actually scrap and start over?

When the current code doesn't do what you would like it to do, and would be cost prohibitive to change.

I'm sure someone will now link Joel's article about Netscape throwing their code away and how it's oh-so-terrible and a huge mistake. I don't want to talk about it in detail, but if you do link that article, before you do so, consider this: the IE engine, the engine that allowed MS to release IE 4, 5, 5.5, and 6 in quick succession, the IE engine that totally destroyed Netscape... it was new. Trident was a new engine after they threw away the IE 3 engine because it didn't provide a suitable basis for their future development work. MS did that which Joel says you must never do, and it is because MS did so that they had a browser that allowed them to completely eclipse Netscape. So please... just meditate on that thought for a moment before you link Joel and say "oh you should never do it, it's a terrible idea".

DrPizza 2008-09-27 23:39:12

Of course, it could also be said that companies like Microsoft generate their own gravity. If you have the resources of Microsoft, then by all means, rewrite. Otherwise, be very cautious.

Kyralessa 2008-09-28 22:02:28

Kyralessa, remember that IE 3 was not a juggernaut; Netscape was at the time.

Bernard 2008-09-29 03:40:24

The problem with this answer is that it ignores the difficulty of estimating the effort involved in re-writing the code. If you knew that it would be possible to re-write the code to achieve your objectives faster than changing the existing code, it's a no-brainier. But estimating is the hard part.

PeterAllenWebb 2008-10-03 19:15:32

Perhaps. But I don't think it's necessary all that difficult, because I think that any kind of radical alteration to a program will tend to be difficult and time-consuming. Far too often I think we incorrectly veer towards "modify", when actually, software isn't as malleable as is assumed.

DrPizza 2008-10-04 10:12:10

Kyralessa's point is that Microsoft had a few *billion* in the bank from their other products, so they could pour money into a rewrite because they considered the browser very strategic. Sorry, DrPizza, but I've experienced some failed overoptimistic rewrites. It's a dangerous road to go down.

MarkJ 2009-02-15 21:10:08

Martin Fowler disagrees with you too. http://martinfowler.com/bliki/StranglerApplication.html

MarkJ 2009-03-16 11:32:40

+6 A:

I think the line in the sand is when basic maintenance is taking 25% - 50% longer than it should. There comes a time when maintaining legacy code becomes too costly. A number of factors contribute to the final decision. Time and cost being the most important factors I think.

Shaun 2008-09-27 23:40:31

+3 A:

I agree with Martin. You really need to weigh the effort that will be involved in writing the app from scratch against the current state of the app and how many people use it, do they like it, etc. Often we may want to completely start from scratch, but the cost far outweighs the benefit. I come across bits of ugly looking code all the time, but I soon realize that some of these 'ugly' areas are really bug fixes and make the program work correctly.

Ed Swangren 2008-09-27 23:42:24

+7 A:

A rule of thumb I've found useful is that if given a code base, if I have to re-write more than 25% of the code to make it work or modify it based upon new requirements, you may as well re-write it from scratch.

The reasoning is that you can only patch a body of code so far; beyond a certain point, it's quicker to do over.

There's an underlying assumption that you have a mechanism (such as thorough unit and/or system tests) that will tell you whether your re-written version is functionally equivalent (where it needs to be) as the original.

Jason Etheridge 2008-09-27 23:43:15

+3 A:

You can only give a definite yes to rewriting in case if you know completely how your application works (and by completely I mean it, not just having a general idea of how it should work) and you know more or less exactly how to make it better. Any other cases and it's a shot in the dark, it depends on too much things. Perhaps gradual refactoring would be safer if it is possible.

Nouveau 2008-09-27 23:44:44

If it requires more time to read and understand the code (if that is even possible) than it would to rewrite the entire application, I say scrap it and start over.

Micah 2008-09-28 00:12:41

+3 A:

I would try to consider the architecture of the system and see whether it is possible to scrap and rewrite specific well defined components without starting everything from scratch.

What would usually happen is that you can either do that (and then sell that to the customer/management), or that you find out that the code is such a horrible and tangled mess that you become even more convinced that you need a rewrite and have more convincing arguments for it (including: "if we engineer it right, we would never need to scrap the whole thing and do a third rewrite).

Slow maintenance would eventually cause that architectural drift that would make a rewrite more expensive later.

Uri 2008-09-28 00:25:24

+8 A:

If it requires more time to read and understand the code (if that is even possible) than it would to rewrite the entire application, I say scrap it and start over.

Be very carefull with this:

Are you sure you aren't just being lazy and not bothering to read the code
Are you being arrogant about the great code you will write compared to the rubbish anyone else produced.
Remember tested-working code is worth a lot more than imaginary yet-to-be-written code

In the words of our estemed host and overlord, Joel - things you should never do,
it's not always wrong to abandon working code - but you have to be sure about the reason.

Martin Beckett 2008-09-28 00:29:01

I've read the code, that's how I did the code review ;-)In this case, I think it's obvious the code could be cleaned up, but is it worth breaking the code by trying to slowly refactor, since the code is so fragile, or better to rewrite, since we can start small and build it back up again.

Mike Spross 2008-09-28 00:54:26

As for ego/arrogance, I've written my share of ugly code and then inevitably was the one who had to fix it later. At some point, you have to look at the code objectively (whether its yours or not) and ask yourself "Does any of this make sense?" Tolerating bad code because it works seems dangerous.

Mike Spross 2008-09-28 00:59:23

Agreed, but there is a natural tendancy to decide to reinvent everything yourself rather than get to grips with whats already there, you have to be certain of your reasons.

Martin Beckett 2008-09-28 02:46:08

Indeed. There need to be clear reasons (and incentive) to toss everything out. Even if we do break away and start fresh, there may still be algorithms or concepts that can be borrowed from the existing codebase, so in the end, there might be no such thing as "throwing it all away."

Mike Spross 2008-09-28 04:11:52

MarkJ 2009-03-16 11:45:04

+5 A:

If there are clean interfaces and you can cleanly delineate module boundaries, then it might be worth refactoring it module by module or layer by layer in order to allow you to migrate existing customers forward into cleaner more stable codebases, and over time, after you've refactored every module, you will have rewritten everything.

But, based on the codereview, doesn't sound like there would be any clean boundaries.

Richard 2008-09-28 00:29:37

Poor code often has more cleanly delimited module boundaries than good code, because there's little reuse of low level routines. Have a look at Michael Feathers Working with legacy code.

MarkJ 2009-02-15 21:11:53

+2 A:

Scrap old code early and often. When in doubt, throw it out. The hard part is convincing non-technical folks of the cost-to-maintain.

So long as the value derived appears to be greater than the cost to operate and maintain, there's still positive value flowing from the software. The question surrounding a rewrite this: "will we get even more value from a rewrite?" Or alternatively "How much more value will we get from a rewrite?" How many person-hours of maintenance will you save?

Remember, the rewrite investment is once only. The return on the rewrite investment lasts forever. Forever.

Focus the value question down to specific issues. You listed a bunch of them above. Stick with that.

"Will we get more value by reducing cost through dropping the junk that we don't use but still have to wade through?"
"Will we get more value from dropping the junk that's unreliable and breaks?"
"Will we get more value if we understand it -- not by documenting, but by replacing with something we built as a team?"

Do you homework. You'll have to confront the following show-stoppers. These will originate somewhere in your executive foodchain from someone who'll respond as follows:

"Is it broken?" And when you say "It's not crashed as such," They'll say "It's not broke - don't fix it."
"You've done the code analysis, you understand it, you no longer need to fix it."

What's your answer to them?

That's only the first hurdle. Here's the worst possible situation. This doesn't always happen, but it does happen with alarming frequency.

Someone in your executive foodchain will have this thought:

"A rewrite doesn't create enough value. Rather than simply rewrite, let's expand it." The justification is that by creating enough value, users are more likely to buy in to the rewrite.

A project where scope is expanded -- artificially -- to add value is usually doomed.

Instead, do the smallest rewrite you can to replace the darn thing. Then expand to fit real needs and add value.

S.Lott 2008-09-28 00:33:37

-1 Martin Fowler says exactly the opposite to this. He advises gradually replacing (by expanding and improving). I know you've got a massive rep but I think you're wrong this time. http://martinfowler.com/bliki/StranglerApplication.html

MarkJ 2009-03-16 11:31:40

I'm talking about precisely the same approach Fowler is talking about -- incremental replacement. I'm suggesting that you replace pieces early and often. Eventually, the old system will be strangled out of existence. Do NOT create a huge project to replace it all at once.

S.Lott 2009-03-16 13:23:56

I have never completely thrown out code. Even when going from a foxpro system to a c# system.

If the old system worked then why just throw it out?

I have come across a few really bad system. Threads being used where not needed. Horrible inheritance and abuse of interfaces.

It is best to understand what the old code is doing and why it is doing it. Then change it so that it is not confusing.

Of course if the old code doesn't work. I mean can't even compile. Then you might be justified in just starting over. But how often does that actually happen?

ElGringoGrande 2008-09-28 00:43:44

Yes, it totally can happen. I've seen money be saved by doing it.

This is not a tech decision, it's a business decision. Code rewrites are long term gains, while "if it ain't totally broke..." is a short term gain. If you are in a first year startup that is focused on getting a product out the door, the answer is usually to just live with it. If you're in an established company, or the errors with the current systems are causing more workload, therefor more company money.. then they might go for it.

Present the problem as best as you can to your GM, use dollar values where you can. "I don't like dealing with it" means nothing. "It'll take twice the time to do everything until this is fixed" means a lot.

UltimateBrent 2008-09-28 01:15:49

I think there are a number of issues here that depend largely on where you are at.

Is the software working well from a customer perspective? (If yes be very careful about changes). I would think there would be little point re-witting unless you were expanding the feature set if the system was working. And are you planning to expand the features and customer base of the software? If so then you have much more reason to change.

As much as anything just trying to understand some else's code even if well written can be difficult, when badly written I would imagine almost impossible. What you describe sounds like something that would be very difficult to expand.

David L Morris 2008-09-28 01:21:41

+2 A:

If possible, I typically would prefer to rewrite smaller portions of the code over time when I need to refactor a baseline. There are typically many smaller issues such as magic number, poor commenting, etc. that tend to make the code look worse than it actually is. So, unless the baseline is just awful, keep the code and just make improvements at the same time you are maintaining the code.

If refactoring requires a lot of work, I recommend laying out a small re-design plan/todo list that gives you a list of things to work on in order so that you can bring the baseline to a better state. Starting from scratch is always a risky move and you are not guaranteed that the code will be better when you are finished. Using this technique, you will always have a working system that improves over time.

dr_pepper 2008-09-28 01:25:24

good advice... its nice to not have to learn this the hard way

Shawn Simon 2008-09-28 03:30:04

I wonder if a concurrent approach could work. Maintain the current code (bug fixes only), but work on a new version in a completely separate project, starting from scratch, but borrowing from the current codebase where appropriate/possible.

Mike Spross 2008-09-28 03:49:53

You might have to do that if the baseline is really bad, but I would strongly consider modifying it incrementally. One reason is because it will get less mgmt visibility, and it will force you to understand what is wrong with the current baseline.

dr_pepper 2008-09-28 04:13:06

+2 A:

Code with excessively high cyclomatic complexity (like over 100 in a large number of modules) is a good clue. Also, how many bugs does it have / KLOC? How critical are the bugs? How often are bugs introduced when bug fixes are made. If your answer is a lot (I cant remember norms right now), then a rewrite is warranted.

torial 2008-09-28 01:28:00

I would take into consideration if the application does what it is intended to do, is required for you to ever make modifications, and are you confident that the app has been thoroughly tested in all scenarios that it will be used in.

Do not invest the time if the app does not need alterations. However, if it doesn't function as you need and you need to control the hours and time invested to make corrections, scrap it and re-write to the standards that your team can support. There's nothing worse than terrible code that you have to support / decipher but still have to live with. Remember, Murphy's Law says it will 10 at night when you'll have to make things work, and that is never productive.

David Robbins 2008-09-28 01:36:03

+7 A:

I saw an application re-architected within 2 years of its introduction into production, and others rewritten in different technologies (one was C++ - now Java). Both efforts were were not, to my mind, successful.

I prefer a more evolutionary approach to bad software. If you can "componentize" your old app such that you can introduce your new requirements and interface with the old code, you can ease yourself into the new environment without having to "sell" the zero-value (from a biz perspective) investment in rewriting.

Suggested approach - write unit tests for the functionality with which you wish to interface to 1) ensure the code behaves as you expect and 2) provide a safety net for any refactoring that you may wish to do on the old base.

Bad code is the norm. I think IT gets a bad rap from business for favoring rewrites/rearchitecting/etc. They pay the money and "trust" us (as an industry) to deliver solid, extensible code. Sadly, business pressures frequently result in shortcuts that make the code unmaintainable. Sometimes it's bad programmers... sometimes bad situations.

To answer your rephrased question... can code maintenance costs ever exceed rewriting costs... the answer is clearly yes. I don't see anything in your examples, however, that lead me to believe this is your case. I think those issues can be addressed with tests and refactoring.

Adrian Wible 2008-09-28 01:50:05

+1 A:

I do not have any experience with using metrics for this myself, but the article "Software Maintainability Metrics Models in Practice" discusses more or less the same question asked here for two case studies they did. It starts with the following editor's note:

In the past, when a maintainer received new code to maintain, the rule-of-thumb was "If you have to change more than 40 percent of someone else's code, you throw it out and start over." The Maintainability Index [MI] addressed here gives a much more quantifiable method to determine when to "throw it out and start over." This work was sponsored by the U.S. Air Force Information Warfare Center and the U.S. Department of Energy [DOE], Idaho Field Office, DOE Contract No. DE-AC07-94ID13223.)

hlovdal 2008-09-28 02:06:00

Production code always has some value. The only case where I would truly throw it all out and start again is if we determine the intellectual property is irrevocably contaminated. For example if someone brought large amounts of code from a previous employer, or a large percentage of the code was ripped from a GPLd codebase.

DGentry 2008-09-28 02:21:29

+1 A:

I think the rule was...

The first version is always a throw away

So, if you learned your lesson(s), or his/her lessons, then you can go ahead and write it fresh now that you understand your problem domain better.

Not that there aren't parts that can/should be kept. Tested code is the most valuable code, so if it isn't deficient in any real way other than style, no reason to toss it all out.

Andrew Backer 2008-09-28 02:55:33

-1 That rule comes from Fred Brooks famous book "The Mythical Man Month". In the most recent edition, he says it was the biggest mistake he made in the book.

MarkJ 2009-03-16 11:28:06

+19 A:

Without any offense intended, the decision to rewrite a codebase from scratch is a common, and serious management mistake newbie software developers make.

There are many disadvantages to be wary of.

Rewrites stop new features from being developed cold for months/years. Few, if any companies can afford to stand-still for this long.
Most development schedules are difficult to nail. This rewrite will be no exception. Amplify the previous point by, now, a delay in development.
Bugs that were fixed in the existing codebase through painful experience will be re-introduced. Joel Spolsky has more examples in this article.
Danger of falling victim to the Second-system effect -- in summary, ``People who have designed something only once before try to do all the things they "didn't get to do last time", loading the project up with all the things they put off while making version one, even if most of them should be put off in version two as well.''
Once this expensive, burdensome rewrite is completed, the very next team to inherit the new codebase is likely to use the same excuses for doing another rewrite. Programmers hate learning someone else's code. No one writes perfect code because perfection is so subjective. Find me any real-world application and I can give you a damning indictment and rationale for doing a from-scratch rewrite.

Whether you ultimately rewrite from scratch or not, beginning a refactoring phase now is a good way to both really sit down and understand the problem so that the rewrite will go more smoothly if truly called for, as well as giving the existing codebase an honest look to really see if a rewrite's needed.

mbac32768 2008-09-28 03:14:19

I'd upvote this again if it would let me. I definitely agree with your points. OTOH, I wonder how many hard-earned "bug fixes" were added solely to fix a symptom of poor design -- if a cleaner design (with less code) can be created, then there is less chance of encountering the same bugs again.

Mike Spross 2008-09-28 03:47:50

Hmmm - sounds like Mike Spross is about to invent refactoring!

MarkJ 2009-02-26 11:11:26

Martin Fowler agrees with this too. http://martinfowler.com/bliki/StranglerApplication.html

MarkJ 2009-03-16 11:33:33

+2 A:

As early as possible. Whenever you get a premonition that your code is slowly turning into an ugly beast that is very likely to consume your soul and give you headaches, and you know the problem is in the underlying structure of the code (so any fix would be a hack, e.g. introduce a global variable), then it's time to start over.

For some reasons people don't like throwing away precious code, but if you feel your better off starting over, you are probably right. Trust your instinct and remember that it wasn't a waste of time, it taught you one more way of NOT approaching the problem. You could (should) always use a version control system so your baby is never really lost.

Firas Assaad 2008-09-28 08:57:52

I'm going to post this book every time I see a discussion on Refactoring. Everyone should read "Working Effectively with Legacy Code" by Michael Feathers. I found it to be an excellent book - if nothing else, it's a fun read, and motivational.

Matt Cruikshank 2008-09-30 03:07:47

+3 A:

I wonder if the people who vote for scrapping and starting over have ever successfully refactored a large project, or at least seen a large project in poor condition that they think could use a refactoring?

If anything, I err on the opposite side: I've seen 4 large projects that were a mess, that I advocated refactoring as opposed to rewriting. On a couple, there was barely a single line of original code that remained, and major interfaces changed in significant ways, but the process never involved the entire project failing to function as well as it originally did, for any more than a week. (And top-of-trunk was never broken).

Perhaps a project exists that is so severely broken that to attempt to refactor it would be doomed to failure, or perhaps one of the previous projects I refactored would have been better served by a "clean re-write", but I'm not sure I'd know how to recognize it.

KeyserSoze 2008-09-30 22:09:33

When the code has reached a point that is not maintainable or extensible anymore. Is full of short-term hacky fixes. It has lots of coupling. It has long (100+lines) methods. It has database access in the UI. It generates a lot of random, impossible to debug errors.

Bottom line: When maintaining it is more expensive (i.e. takes longer) than rewriting it.

Ricardo Villamil 2008-10-03 19:02:55

+2 A:

In terms of business value, I would think it's extremely rare that a real case can be made for a rewrite due solely to the internal state of the code. If the product's customer-facing and is currently live and bringing in money (i.e. is not a mothballed or unreleased product), then consider that:

You already have customers using it. They're familiar with it, and might have built some of their own assets around it. (Other systems that interface to it; products based on it; processes they'd have to change; staff they'd maybe have to retrain). All of this costs the customer money.
Re-writing it might cost less in the long term than making difficult changes and fixes. But you can't quantify that yet, unless your app is no more complex than Hello World. And a re-write means a re-test and a redeploy, and probably an upgrade path for your customers.
Who says the re-write will be any better? Can you honestly say your firm is writing sparkly code now? Have the practices that turned the original code to spaghetti been corrected? (Even if the main culprit was a single developer, where were his peers and management, ensuring quality through reviews, testing, etc.?)

In terms of technical reasons, I'd suggest it could be time for a major rewrite if the original has some technical dependencies that have become problematic. e.g. a third party dependency that's now out of support, etc.

In general though, I think the most sensible move is to refactor piece by piece (very small pieces if it's really that bad), and improve the internal architecture incrementally rather than in one big drop.