views:

4307

answers:

16

I am in charge of about 100+ documents (word document, not source code) that needs revision by different people in my department. Currently all the documents are in a shared folder where they will retrieve, revise and save back into the folder.

What I am doing now is looking up the "date modified" in the shared folder, opened up recent modified documents and use the "Track Change" function in MS Word to apply the changes. I find this a bit tedious.

So will it be better and easier if I commit this in a version control database?

Basically I want to keep different version of a file.


What have I learn from answers:

  • Use Time Machine to save different version (or Shadow copy in Vista)

  • There is a difference between text and binary documents when you use version control app. (I didn't know that)

  • Diff won't work on binary files

  • A notification system (ie email) for revision is great

  • Google Docs revision feature.

Update :

I played around with Google Docs revision feature and feel that it is almost right for me. Just a bit annoyed with the too frequent versioning (autosaving).

But what feels right for me doesn't mean it feels right for my dept. Will they be okay with saving all these documents with Google?

A: 

You could do that, but if that files are binary you should always put a lock on it before editing. You won't get a conflict (which would be unresolvable).

rafek
+1  A: 

You can, but you will allways compare the document versions with Word itself.

I haven't heard a version control database which can track changes in Word documents.

However there are some tools which can compare Word documents, so if you set up your version control client to use these tools for comparison, you can have some fun.

Biri
+1  A: 

Not necessarily. It depends on how often the new files are committed to the repo. If the files are edited several times before a commit, then you're precisely where you are now. The biggest benefit is if the file becomes corrupted.

You can version any file; this is how Time Machine in Mac OS X Leopard works, for example, and there is an interesting article by someone who committed his entire computing environment into CVS and then just maintained working copies on his home and work machines.

But "better" and "easier" are specific to your situation, and I'm not sure I completely understand your problem as things stand.

Flubba
+7  A: 

G'day,

I guess one thing that nobody seems to have asked is if you have a legal requirement to store history of changes to the doc's?

Whether you do or don't is going to have an impact on what solutions you can consider.

Also a notification mechanism for out of date copies is also a bundle of fun. If engineer A has a copy of a document and engineer B then edits it and commits the changes you want engineer A to be notified that his copy is out of date.

Document control can become a real can of worms quite easily.

Maybe keep the doc's under CVS or SVN and set it up so that emails are generated to whoever has checked out a copy when updates for the same doc. are checked in to the repository?

HTH

cheers, Rob

Edit: I forgot to add don't forget to use the binary switch, e.g. -kb for CVS, when adding the new doc. Otherwise, you will get any sequences of data that happen to match the ascii for keyword strings having the relevant config management data appended thereby corrupting your doc. data.

Rob Wells
SVN doesn't perform keyword expansion by default - you need to set the property to enable it. As a result, you can safely store any document without setting anything special.
gbjbaanb
+1  A: 

Subversion, CVS and all other source control systems are not good for Word documents and other office files (such as Excel spread sheets), since the files themselves are stored in a binary format. That means that you can never go back and annotate (or blame, or whatever you want to call it), or do diffs between documents.

There are revision control systems for Word documents out there, unfortunately I do not know any good ones. We use such control systems for Excel at my work, and unfortunately they all cost money.

The good thing is that they make life a lot easier, especially if you ever have to do an audit or due diligence.

Mats Fredriksson
+6  A: 

Thinking out of the box, would migrating to a Wiki be out of the question?

Since you consider it feasible to force your users into Subversion (or something similar), a larger change seem acceptable.

Another migration target could be to use some kind of structured XML document format (DocBook comes to mind). This would enable you to indeed use diffs and source control, while getting all sorts of document formats for free.

Henrik Paul
+16  A: 

I've worked with Word documents in SVN. With TortoiseSVN, you can easily diff Word documents (between working copy and repository, or between two repository revisions). It's really slick and definitely recommended.

The other thing to do if you're using Word documents in SVN is to add the svn:needs-lock property to the Word documents. This will prevent two people from trying to edit the same document at the same time, since unfortunately there's no good way to merge Word documents.

With the above two things, handling revision controlled Word documents is at least tolerable. It certainly beats the alternative of using a shared folder and track-changes.

Greg Hewgill
Can you really diff the *content* of Word documents using TortoiseSVN? Not just "binary files differ" kind of diff. (SVN itself certainly doesn't provide more than that.)
Jonik
Yes, Tortoise has some VBScript helper scripts that load both old and new documents into Word and use Word's document diff features to show the differences. It works pretty well, actually.
Greg Hewgill
It sounds handy; thanks for clarifying. (This came up on a Super User question: http://superuser.com/questions/14894/diff-software-for-word-files)
Jonik
+1  A: 

If you use WinMerge it has added support for merging Word and Excel binary files.

Keith
+3  A: 

Sharepoint also does a good (ok decent) job of versioning MS-specific documents.

hometoast
+13  A: 

What on Earth are you all Word-is-binary-so-no-diff people talking about? TortoiseSVN, for example, integrates right out of the box with Word and enables you to use Word's built-in diff and merge functionality. It works just fine.

I have worked on projects that store documents in version control. It has worked out pretty well, although if people are unfamiliar with version control, they are probably going to have conceptual difficulties with things like "working copy" and "merge" and "conflict". Don't overestimate the users' capabilities when you plan your document management system.

I believe there exist big and powerful commercial solutions for all of this, as well. I'm sure if you have enough kilodollars, you can get something that fits your needs perfectly. Document management systems are a big business for big enterprise.

Sander
+1 I did not knew it was possible, but you're definitely right. TortoiseSVN can diff et merge word documents using words functionality.
Mathieu Pagé
A: 

Have a look at Sharepoint. If cost is an issue, Sharepoint portal sevices can also work for you. Read this for more info

Conrad
+1  A: 

For what it's worth, there is also Google Docs. I guess it's not a perfect fit, but it's versioning is very convenient.

grapefrukt
+1  A: 

G'day,

Just wanted to clarify an answer someone gave but I don't have enough points yet.

diff will work on binary files but it is only going to say something not really useful like "toto1 and toto2 binary files differ".

cheers, Rob

Rob Wells
A: 

Many of the new version control projects are better suited to entire directories, and not so much for single files.

Convincing someone that they need to get an entire project, when they only want to update an individual file can be a "fun" way to spend an afternoon.

Brad Bruce
A: 

Another option you have is a piece of software and cloud computing magic called dropbox. Or, you could ditch the word documents and make a locally shared mediawiki instead.

DropBox: getdropbox DOT com

MediaWiki: mediawiki DOT org

Nathan Lawrence
+1  A: 

Clearcase integrates with Word for revision tracking. I believe Telelogic DOORs does as well.

Paul Nathan