ansaurus

Question

What Pros & Cons would there be to saving code more granularly at the file level?

Answer 1

A:

Personally I would find that that type of separation, although possible in some languages, would be a nightmare to maintain, and would make following the code really hard. I personally find that with .NET languages regions in code are far more helpful.

I would be concerned on large projects with the sheer number of files.

Mitchel Sellers 2008-10-13 15:16:52

As stated in the question, we assume> we've got the perfect implementation that would make this idea to *work well* for us.This means that when we work in our code editor, there's strictly no difference for us. If you meant something else, please clarify.

Daniel Jomphe 2008-10-13 15:27:51

Answer 2

A:

Pro Commit changesets should carry much more information about what really happened for this commit to come to life without having to show a diff:

Many files listed for the same class suggest it's been refactored;
On the opposite, only one file change listed for a class would suggest its behavior has only been slightly modified.

Daniel Jomphe 2008-10-13 15:17:40

Answer 3

A:

Con On a performance stand-point, all these file I/Os could cost much in big working trees.

Daniel Jomphe 2008-10-13 15:19:43

Answer 4

+4 A:

Perhaps the better question to ask in response is: What is the problem you would hope to solve with this approach?

The .NET languages support some level of this idea with partial classes, although I've never seen it carried to that extreme and the editing experience is not as seamless as you describe in your "perfect implementation".

I can't really see the usefulness of such a scenario. Yes, having multiple files means less potential risk of a change affecting other parts of the code but I don't think that outweighs the performance impact of parsing the files to show it as one editable file.

Scott Dorman 2008-10-13 15:48:53

My Pro answer is what I hoped to solve with this approach.

Daniel Jomphe 2008-10-13 17:04:09

In that case, your changesets should include better checkin comments. These can be used for any number of secondary purposes, including a running list of changes for each build. The comments need to be more than just "changed xxx line of code" as a diff can tell you that much.

Scott Dorman 2008-10-13 18:50:30

Thinking of it, my Pro answer isn't a full answer to your question. The real point of my idea would bring automatic listing of changes per class+method. This could much more easily be achieved by small scripts run before commits, to update the message template. Thanks for your question, Scott!

Daniel Jomphe 2008-10-13 19:30:53

You're welcome. I think using the ability of a source control system to run scripts before/after commits to do this type of analysis is a good way to do it.

Scott Dorman 2008-10-13 19:41:14

Answer 5

+1 A:

I think like everything it has to be a balance - and it seems like Frameworks are often on either side of the spectrum.

aka Camping vs. Rails et al.

The level of granularity you mapped out above seems a little over the top to me. I foresee refactoring being a nightmare. Even in frameworks like ASP.net and Ruby on Rails, I find myself constantly cleaning up my workspace because I have too many files open and its causing productivity issues.

con: lots of open files when developing
con: refactoring would be complicated and disorienting
con: more prone to files breaking naming conventions and interference with actual syntax errors
con: for interpreted languages, test harnesses would have to include many more files - there would have to be some smart inclusion methods to get everything that was needed to interpret a class.
con: the filestystem is less representative of an object orientation - sometimes it's nice being able to easily ascertain the classes in a given folder.

Sorry - I want to offer a pro here but it's just not coming to me

danpickett 2008-10-13 16:22:54

On one hand, I'd say that my last implementation example already eliminates your cons #1, 2, 4, 5, as it makes the working copy exactly like we're used to see it (I understand you say it's important for it to remain this way). On the other hand, I understand it might be hard to impl. it perfectly.

Daniel Jomphe 2008-10-13 17:12:01

Answer 6

+1 A:

Hard disk space.

On a default windows installation every file takes up at least 4k. Other file systems can take more or less but there's almost always a minimum size. I could see a large software project all of the sudden taking up a large amount of disk space with this system. This might not be a huge problem on the developer's machines but I'd be concerned about the source control server since it seems like servers never have enough disk space.

Bryan Anderson 2008-10-13 16:39:26

Answer 7

+1 A:

I agree, this is way too granular.

However, I have considered separating members by scope:

ClassA-interface.cs      (public members)
ClassA-implementation.cs (non-public members)

Of course, you could go further, but even this bifurcation doesn't "solve a problem" that I have. While it would make me more conscious of changes likely to affect client code, I'm better off learning that through tests.

harpo 2008-10-13 19:50:13

Answer 8

+2 A:

CON

Class-per-file lets you wrap your brain around the class as a whole. If your classes are large to the point of being many multi-page monstrosities, you might need to split the methods to separate files, but you might as well refactor and save yourself some maintenance headache. This is one of the arguments against #region as well.

Jimmy 2008-10-13 19:59:05

Answer 9

+1 A:

PRO:

I've noticed that many GNU C libraries split up their source into 1 function per translation-unit files. From what I gather this is to assist when statically linking the library into a binary. Apparently, when searching for a symbol, the linker will include the entire translation unit in which the symbol resides (which makes sense, since there might be static functions and data in the unit which the symbol depends on). Therefore if you split your code into granular files, its possible that you could keep your static-linked file sizes to a minimum.

I would imagine that in C++ there are fewer benefits, especially for classes with virtual members, as the virtual table would introduce multiple symbol dependencies.

CON:

Unless you had built-in IDE support for this mapping it seems that maintainence of anything at the granularity level of methods would be problematic as people make wholesale changes to classes. In projects I work on, functions get will get occasionally renamed in order to more accurately reflect their changing behavior. Adding a file rename on top of the function rename is just one more step for people to screw up.

pk 2008-10-13 20:07:28

Answer 10

+1 A:

Some version control systems, notably Git (and probably BitKeeper), work on this level by design, instead of on a file level.

Pro: Especially when refactoring (on branches), this might come in very handy (being able to specify that a single method has moved from one class to another, or that a part of a class has been moved). A system that works on a lower level of granularity than files could resolve merge conflicts easier (because it can track where a method has moved, and can apply the changes accordingly to multiple files).

Con: I think you really need a VCS that supports it by design, instead of trying to duct-tape it on a file based VCS (Git doesn't work too well on Windows, and BitKeeper is commercial software). Another drawback is that you probably can't do it in a language-agnostic way. AFAIK, Git employs some advanced heuristics to track the individual pieces of content in a file, but it's not infallible, and there might be some languages with nonstandard (not C-like) function/method syntax where conventional heuristics might fail (Prolog predicates).

Kim Sullivan 2008-10-13 20:28:52

ansaurus

tags:

views:

answers:

What Pros & Cons would there be to saving code more granularly at the file level?

CON

related questions