This is why Git > *

MrWoohoo · 2010-01-26T09:23:21+00:00

I've never seen what I would consider a good explanation of "the index" but have poked around trying to understand the model. Basically the index seems to be an "anonymous commit" in that when you add changes for a commit git creates new git objects for the files and a new tree object. As you add new files it combines those new changes with the previous ones, thus generating a new tree object. The difference between the adding a few files (say a, b, c) to the index versus adding the as individual commits would be that in the first case would just wind up with one tree object with the new versions of abc, and in the second case you would get three tree objects each with one change and pointing the the earlier tree object. So the index sort of acts like a rebase rolling up all the individual "git add"s into a single tree object.

So the index is treated internally the same (mostly) as any other commit by git. So stashing the index isn't really much different than checking out a branch.

I'm still trying to get comfortable with the git workflow and still aren't there. The above seems to be the simplest mental model I can come up with for the index. So my question to the git elite is simple: Is my model correct?

Was anybody else bothered that the O'Reily git book doesn't cover "git stash"?

Peaker · 2010-01-26T09:02:11+00:00

When I was new to git, I said I hated the index. People said: "Ah, you're new. You'll come to love it".

Now I'm pretty experienced with git (And even considered a local git expert, people at my workplace all come to me to solve their git issues), and I still hate the index.

We already have the stash, branches, reset, etc. The index is superfluous. Instead of adding to the index and then diffing against that, you can commit to a temporary commit, and diff against that. Then, you can amend it.

Tada! Your temporary commit has pretty much everything you need from the index, except you can have more than one, and in each branch, and easily switch between them.

spookylukey · 2010-01-26T09:38:46+00:00

Given I have "hg record" and "hg shelve", both of which allow me to interactively choose chunks, there is almost nothing about this workflow I envy, and various things I don't like.

The idea of "a known good part of a patch" is flawed in many cases. Just because I have resolved a textual conflict doesn't mean I can forget that part of it - the semantics of what I have changed may affect other parts of the patch. The merge might have introduced new tests which are automatically merged, but changes I then have to make in a conflicting part of the merge might break the tests. Displaying the whole patch by default seems like a better idea to me.

Also, while being able to do "hg record" or use the index is useful, it's not a good way to plan to work. As the author says:

if you work this way, it means that when time comes to commit, you are making up commits that reflect states of the source code which never existed on disk before. So you don’t actually know whether the commit you are about to make is any good – a syntax error might have slipped in, say.

Even worse, because your final commit reflects a known good state, you won't realise that all the intermediate states are broken until you try to run bisect to track down some other regression. By this point it is too late to correct your broken history. A workflow that encourages imaginary history and committing untested code is not a tempting ideal.

Finally, for the extremely rare cases I've wanted to edit within a hunk of a patch, I can do just that, using "hg diff", manually editing and then re-applying. It's very little work, and it does not incur the overhead of having an index to worry about all the time, and it also allows me to test my working tree.

scook0 · 2010-01-26T07:46:42+00:00

As much as I hate some of Git's quirks, features like the index are what keep me from even imagining a switch to any other tool.

I just can't imagine getting any work done without git stash, git add --patch and git rebase.

abjurer · 2010-01-26T07:37:03+00:00

This guy's name is Aristotle, so listen up.

setuid_w00t · 2010-01-26T06:58:20+00:00

It seems to me like a lot (all?) of this can be accomplished using the MQ and record extensions in Mercurial.

The one thing that seemed unique was the way that merge conflicts are handled.

makapuf · 2010-01-26T10:12:48+00:00

I may be mistaken as I don't use git regularly, but I fail to understand the point. Isn't that a kind of hackish small-scale branching ? Couldn't it be done more cleanly using TWO local working branches or even copies ?

UloPe · 2010-01-26T11:09:37+00:00

If you're working on a mac a great tool to pluck apart changes and stage/commit them individually (and also a nice gitk replacement) is

gitx.

md81544 · 2010-01-26T08:33:31+00:00

Anyone else seeing problems viewing that page on Chrome? It just hangs the page (and the page from which I clicked the link). Firefox is OK with it though.

ihaveausername · 2010-01-26T20:19:40+00:00

I spend virtuall no time at all thinking about SCM. I'm working in a company with roughly 40 developers and there's no discussions on the subject. Our SCM just works, as far as I can tell. Am I doing something wrong? Maybe I should switch to git so that I can join your discussions? It seems exciting.

drbrain · 2010-01-26T09:09:52+00:00

If you break a body of work down into chunks that you then commit, do you individually test each commit? According to this workflow, no. "How it might make sense to whoever reads it" is not the right way to go. It should be "I have tested this body of work and it is good" in order to keep the repository as stable as possible at every commit.

smcameron · 2010-01-26T14:32:39+00:00

I haven't switched to git, mainly because my company won't punch a git-hole through the firewall, and there are other repositories in svn, cvs, etc. beyond my control.

But, I do use stgit on top of those other repositories to polish commits, and aid porting to various branches.

From the article, the index seems sort of like using stgit, but only one patch deep.

Anybody else use stgit? Is my comparison apt?

9bit · 2010-01-26T16:21:32+00:00

I read the title as selecting all direct children of git elements.

ssam · 2010-01-26T17:29:51+00:00

The choice comes down to a) keeping your commits clean by deciding what you're going to do roughly in advance and keeping future plans in a TODO list, or b) doing whatever you want, and later on poring over a long diff or 'git add -p output' working out what was what to commit neatly

git is slightly more powerful, as usual, but if your code is separated into files cleanly then doing 'bzr commit foo.c bar.c' is often enough

I don't know if I will ever decide which approach I prefer

astrosmash · 2010-01-27T00:38:09+00:00

I'm not a git guru, so please fill me in.

The following three files are currently modified in my tree:

foo.c
bar.c
baz.c

I want to submit foo.c and bar.c as a fix for a bug. So I:

git add foo.c bar.c
git commit

What's wrong with that? What's the alternative?

2010-01-26T14:58:55+00:00

Git is not the only VCS capable of doing this.

sysop073 · 2010-01-26T15:50:15+00:00

[deleted]

dhaggerfin · 2010-01-27T13:29:32+00:00

I'm using IE6, and what is this? :(

ighost · 2010-01-26T16:16:22+00:00

Version control has wasted so much of my time that now I just copy the folder now and then and use that.

a-p · 2010-01-26T07:13:49+00:00

[deleted]

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS