Git: Should I ignore the Index or is there a killer application for it?

views:

204

answers:

+8 Q:

Git: Should I ignore the Index or is there a killer application for it?

As a subversion user, git's index is the most challenging new concept I'm facing as I consider using it for new projects. I read many people's comments saying that they don't use the Index (always commit -a) but I'm thinking there might be a killer reason out there as to why I would want to make use of it. (I'm sharing code with around 5 other developers, working in a mature development environment where we merge code to test and stable branches and use branching for experimental or significant new features.)

+5 A:

The reason I appreciate Git's index is for staging of local changes. One thing you can do with the index is roughly the same as Subversion's "changelist" support, except it's more convenient. I often stage just one or two files out of several possibly modified ones, to construct a single commit containing just those files. With Subversion, I would have to think of a name for that changelist (even if it's just "work" or "temp"), and repeat typing that name several times during construction and committing of the changelist.

The index also supports the git add -p feature which I think is one of Git's killer features. See Ryan Tomayko's The Thing About Git which describes how Git solves the "tangled working copy problem". You can stage just portions of modified files without having to mess around with temporary copies or playing tricks with Undo in your editor.

The index doesn't really participate much in your interaction with other developers. However, it can have a significant impact on how you interact with Git.

Greg Hewgill 2009-12-03 00:14:18

+8 A:

You know that the index lets you only commit parts of the files that you want to add to the repository, of course. In general, I find it useful for this reason. I can make changes to files that sort of work, check in the parts that work, and then complete and check in the rest.

For a really killer demonstration; try using interactive add, or patch add (using git add -i, or git add -p). This runs through all your changes and lets you selectively add them to the index. This lets you make a whole load of changes to your files and yet split the commits. Useful for those 'aha' fixes that we all make from time to time.

Have a look at this screencast to see how it's done. Not till you try it yourself will you see how useful it is.

Abizern 2009-12-03 00:16:07

Cool screencast. I prefer `add -p` to `add -i` — the interface is much less confusing.

jleedev 2009-12-03 00:25:03

I agree that the interface with -i is confusing, but it wraps up all the add features in one place. Once you get used to that menu, it works the same as -p.

Abizern 2009-12-03 11:41:18

+1 A:

Aside from interactive staging, the other important usage of the index is during a merge conflict: Git stages the three versions of the file so it knows the file isn't ready, so there's a version on hand that isn't littered with conflict markers. Third-party tools could use the index here to provide a nice merging interface.

That's not to say this feature fundamentally requires the index — I'm sure Mercurial handles merge conflict without having an index — but the way git approaches this seems nice to me.

jleedev 2009-12-03 00:19:29

+1 A:

I prefer to ignore the index as much as possible.

Peaker 2009-12-03 00:20:29

Not downvoting, but also care to explain why?

ChristopheD 2009-12-03 00:28:19

Its not that often that I want to commit some of my changes and then I usually want to commit those changes on a different rev. The index doesn't help much there. When I do I can just pass more arguments to commit.The staging area also annoys the hell out of me when I want to automate [un]committing. For example, I want to attach a git-hash to my builds so that I can identify builds exactly. I can't do something like this in a build script:git commit --allow-empty -am"Auto commit ..."buildgit reset --soft HEAD^Because of the staging area, this has to be much more complicated.

Peaker 2009-12-03 09:56:53

This comment suggests to me that maybe you could read a little more about git; for example, i would suggest git-tag to uniquely identify builds.

Robert Massaioli 2009-12-07 03:17:59

Shhnap: You missed my point. The problem is that the working tree only has a hash *if you committed*. You cannot simply commit/uncommit in a script, because of the staging area. You can only "git-tag" if you have a hash. Following now?

Peaker 2009-12-13 10:00:43

+3 A:

I find the index really useful, and very rarely commit -a.

Since you're not always pushing to a remote repository when you commit, git users typically make smaller, more frequent commits, and push to a shared repo when a 'group' of changes are complete. This gives the flexibility of being able to revert or cherry-pick individual commits later on. Say I make 3 changes, and using subversion commit them all at once, then want to revert one of those changes.. or apply just one of those changes to another branch.. it's a very fiddly process. With git, you might add each file you've changed to the staging area then commit, separately. Obviously you need to make sure a commit is internally consistent and ensure each change set is 'atomic'.

You may also have local changes to a file under version control that you do not want to commit, such as a customised configuration file (or something). The staging area allows you to exclude that file from the set changes that are committed.

David Claridge 2009-12-03 00:20:35

+2 A:

Several people have already mentioned git add -p, but if you've never used it you may not appreciate its utility. Suppose you have the following line of source code which contains 3 errors:

  distance= rate * deltaT;  /* compute tax rate */

(The three errors are: misnamed variable deltaT, whitespace error before the '=', and an invalid comment.)

You've already edited the file, but you want to make 3 distinct commits with an appropriate log message for each. With git, it's fairly trivial, since add --patch actually allows you to drop into an editor and edit the patch directly.

William Pursell 2009-12-03 02:35:10

+3 A:

I find staging changes extremely useful for three reasons.

I don't accidentally commit changes as much, since there is that extra step to stage the file.
After making changes to a bunch of files at once via code generation or pattern substituation I like to step through the diff of each file before committing. Being able to stage files one by one is a nice way of bookmarking my progress.
I might be working through a feature and find an outdated comment or bad formatting along the way in some unrelated section. I can easily stage and commit a tiny change like that, keeping my feature commit pure and focused.

Ben Marini 2009-12-03 03:30:37

If you want to make sure every commit will build and pass your test suite(1), then ignore the index as much as possible.

When you use the index (in the non-trivial way where you're checking in some changes but not others) you're checking in a state of the code that you probably haven't built or run the test suite on.

Sure, for some things (a change to some documentation, for example) this probably doesn't matter and it's perfectly safe to use the index. But it's good to get out of the habit of doing it the error-prone way and into the habit of doing it the right way:

Use git stash to stash away everything you don't want to commit.
Build what's left.
Run the test suite on what's left.
Commit (all of) what's left.
Unstash the other changes, repeat if necessary.

(1): Not everyone cares about each commit being a buildable, working state of code.

Some people do because it means any version someone checks out will at least build and run. This is important for open source projects (where someone might clone your project at any time), and helps when bisecting to find where a bug was introduced (you don't need to waste time skipping over non-working, test-case-failing states).

If you don't care about each commit being a whole, working state of the code, then this doesn't really matter.

Steve Losh 2009-12-03 04:05:25

That's what `git stash save --keep-index` is for: to be able to test (partially) staged changes.

Jakub Narębski 2009-12-03 10:04:07

Another reason for not making each commit work perfectly is when working in a local repository you want to commit small changes, but then you can use git rebase to bundle up all the changes as one atomic commit which you publish to the public facing repositories.

Abizern 2009-12-03 11:45:50

ansaurus

tags:

views:

answers:

Git: Should I ignore the Index or is there a killer application for it?

related questions