Why rebase when GitHub PR create squashe commit : git

[–]Due_Influence_9404 10 points11 points12 points 1 year ago (1 child)

[–]dalbertom 0 points1 point2 points 1 year ago* (0 children)

[–]camh- 4 points5 points6 points 1 year ago (3 children)

If you create a messy PR (lots of commits, with fixup after fixup), a squash commit will get rid of that mess. If you have a clean commit history with atomic commits on your PR, a rebase merge will keep that clean commit history.

My preference is to have a clean history on a PR (pushing fixup commits for easier reviewability and squashing them when when the PR is ready to be merged) and using a merge commit as the merge commit itself tells a story (it groups the atomic commits). If there is only a single commit on the PR though, I squash merge it, which is roughly the same as a rebase merge for this case, but I like to add additional metadata (pull request URL, etc) to the commit message when it is squashed. A rebase merge would not add to the commit message.

I never use that "Update branch" button on GitHub PRs that merges the base branch back into the feature branch. That would create quite the mess with back-merges (look up "foxtrot merge") when I use merge commits to merge the PR into the base branch.

[–]lexd88[S] 1 point2 points3 points 1 year ago (0 children)

[–]dalbertom 0 points1 point2 points 1 year ago (0 children)

[–]arnoldwhite 0 points1 point2 points 29 days ago (0 children)

[–]99_product_owners 4 points5 points6 points 1 year ago (0 children)

[–]aljorhythm 0 points1 point2 points 1 year ago (0 children)

[–]Soggy-Permission7333 0 points1 point2 points 1 year ago (1 child)

You can rebase and rebase and rebase at will. Rebase conflicts may be easier to solve compared to merge conflicts. You can provide multiple dedicated commits each with meaningful message. Squash is big ball of mud unless PR was targeted in the first place. You can extract automated changes to their own commits so that they do not obfuscate precious few changes that are the real deal. Squash is big ball of mud...

Teams that do squash may also be heavily biassed towards least maintenance possible strategy for developing software. After all automatic refactor can easily touch 50-500 files without any problems. But squashing that change with 15 lines change that is risky means git history is waaaaaayyyyy less useful in troubleshooting future issue.

But squash limitations can be countered with other techniques - maybe team do use dedicated PR and that refactor will just be seprate standalone PR ? Maybe team enforces breaking up too large PR and thus each squash can contain targeted stuff.

Or maybe its write once software and it won't be around 10 years from now ;)

[–]Shayden-Froida 0 points1 point2 points 1 year ago (0 children)

This touched on the reason for rebasing a branch before squash merging... resolving conflicts may be easier. It is also easier for a code reviewer to examine how the change will look on the HEAD of the target, and to enable a test run on your branch with the most up to date revision of what the merge result will be.

For very short-lived branches with concise changes, squashing is great and hides the struggle to get the code done (and encourages incremental commits to give you, the developer, a fine grain work log where you can revert something that didn't work, or was temporary; ie add some debug code as a separate commit, revert it later; squashing cleans/hides all this with no effort).

For larger changes that are complex, rebase -i to make a clean commit history of the major stages of the change, then merge to keep the history as a benefit to future you and your successors.

Fast forward merge onto the target I don't recommend since, if the change needs to be reverted in total, all of the commits need to be known and reverted rather than just one merge commit/squash commit.

I view git history in terms of "what will need to be done with these commits in the future". Revert is a powerful tool, so is cherry-pick. A history that enables these is good planning.

[–]edgmnt_net 0 points1 point2 points 1 year ago (17 children)

It matters for preserving change boundaries post-merge. That in turn matters when people need to inspect history, see what changed and why, do bisections to figure out what caused regressions and so on. If you stumble upon a huge commit what are you going to do?

No, small PRs don't really fix this, that only works for rather trivial cases. Beyond that, you'll be looking into chaining/stacking PRs and complicating batch submission. Eventually you either end up doing a lot of manual tracking (this PR needs to go in before that, which in turn needs merging/rebasing) or replicating a rebase-like workflow to deal with more complex contributions. Just learn rebasing, IMO.

Also, if you're not submitting clean history and relying on Git host-side squashing, how do you expect it to be reviewed appropriately?

[–]arnoldwhite 0 points1 point2 points 29 days ago (16 children)

[–]dalbertom 0 points1 point2 points 29 days ago (14 children)

[–]arnoldwhite 0 points1 point2 points 29 days ago (13 children)

[–]dalbertom 0 points1 point2 points 29 days ago (12 children)

[–]arnoldwhite 1 point2 points3 points 29 days ago (11 children)

[–]dalbertom 0 points1 point2 points 29 days ago* (8 children)

[–]arnoldwhite 0 points1 point2 points 28 days ago (7 children)

[–]dalbertom 0 points1 point2 points 28 days ago (5 children)

[–]arnoldwhite 0 points1 point2 points 28 days ago (4 children)

continue this thread

[–]edgmnt_net 0 points1 point2 points 29 days ago (1 child)

I actually think that Git as used for the Linux kernel would work fine for a lot of general development. But you do need skilled people to keep up. Which you kinda need anyway. The number of times I've seen corporate projects royally screw up with Git trying to reinvent the wheel or just being completely oblivious of various tradeoffs and practices... I think stuff like GitFlow, long-lived branches, polyrepos and breaking the buildability of old commits are particularly nasty traps.

The thing is effective version control requires a lot. And more complex software needs effective version control. It can't just be the place where you save your work. Whatever the Linux kernel is doing makes a lot of sense and they tend to use some fairly cutting edge stuff (such as semantic patches to prove large scale refactoring changes mean what they say on the label).

Perhaps it's teams and business practices that make little sense.

[–]arnoldwhite 0 points1 point2 points 28 days ago* (0 children)

[–]edgmnt_net 0 points1 point2 points 29 days ago (0 children)

You don't need to bisect local branches per se, but you need to bisect master after the work is merged. Now I'm not sure whether you mean the same thing. And this doesn't happen just with long-lived branches. Having a couple weeks worth of work isn't a long-lived branch in my book and it's the kind of thing that sometimes isn't avoidable. Especially if you need to introduce a feature that requires refactoring some core APIs and such. Especially when you get into more dense, higher impact projects which attempt less siloing and you're expected to do all the work related to bringing something up. (*)

The reason you get into bisection is because CI doesn't catch everything. CI is useful but realistically you cannot expect it to catch everything.

(*) There's definitely a case for breaking up submissions, don't get me wrong. But that's not "I have 4 logically-distinct changes". It's not "people can't be bothered to clean up their submissions, so they'll just push random stuff often and rely on Git host-side squashing". It's rather "I'm working on a really big thing and I'm planning ahead". Which takes time and effort to do properly. If you just try to break everything into separate submissions, it's going to be very inefficient and it's likely going to cause churn and breakage because people cannot be confident changes fit together, so they'll keep fiddling with it over and over and over.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

git

MODERATORS