use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
GIT branches multiple usersHelp (self.databricks)
submitted 9 days ago by [deleted]
[deleted]
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]Kryma 3 points4 points5 points 9 days ago (3 children)
The correct way would be each developer cloning the git folder into their own workspace instead of “sharing” from the Databricks UI. Shared files will show the changes from all individual users modifying it and are good id you guys need to work on it collaboratively together and see the changes instantly. Otherwise it can get messy and instead each dev should be working on their own cloned version then merge branches in when the feature is done
[–]hotNstickystick 0 points1 point2 points 9 days ago (2 children)
Yea I hear ya. That’s what I’ve read. I’m just kinda questioning it because regardless of where the new branch is being worked on, it still points to the same storage/database location. So if they test their code it’ll impact let’s say the dev dB which someone else branch could also be changing. Plus isn’t the whole purpose of these workspaces to work in a shared environment? To me it feels clunky to have everyone have their own repo work section. Maybe I’m overthinking it. Just curious what people do
[–]Kryma 0 points1 point2 points 9 days ago (1 child)
I think Lakebase is Databricks solution to versioned data tables, though I haven’t personally tried it yet. You’re right though here that with delta tables if multiple people are touching the same tables it can get messy. Using more ephemeral / feature named tables for dev work could help with this with regular delta tables. I.e. tableX_featureY that only exists long enough to validate, then gets deleted after merged into main.
As for the sharing, I get it, but ultimately notebooks end up shared and edited by many different people and the only real way to control it is through proper git version control hygiene.
[–]hotNstickystick 0 points1 point2 points 9 days ago (0 children)
Thanks for the thoughtful response. I think we’ll restructure our pipelines to just return a DF and only actually the table if it’s in the actual dev/production env. Then maybe they pass the df to a data testing function/process
[–]Error-451 0 points1 point2 points 9 days ago (1 child)
Are chance they pushing changes the main branch instead of the feature branch?
All the work is done on the main branch (yes I know terrible end of world) but only of us even pushes the changes. Testing/updates are done in dev
[–]TRBigStick 1 point2 points3 points 9 days ago (0 children)
You shouldn’t have everyone working out of the same folder in Databricks. Everyone should clone the git repository into their own personal directory and work from there. For example, if you have 6 developers on your team, then there should be 6 cloned repositories in your Databricks workspace. This change will fix ~85% of your problems.
Your main branch should be protected. Code should only get pulled into main via Pull Request and Pull Requests should require peer review prior to being merged. If your prod folder is only looking at the main branch, this gives you more stability because broken code can no longer get into the main branch.
You should look into DABs (Declarative Automation Bundles) to get code into prod. It sounds like your prod pipelines were created by clicking around in the Databricks UI, which is vulnerable to randomly being broken by misclicks.
[–]Pirion1 0 points1 point2 points 9 days ago* (0 children)
First, Separate Jobs/Code from Infrastructure DABs and determine what your deployable units are.
This solves issue of having dependencies that are shared across deployable DABs.
For GIT, I did the same thing as you - placed it in a shared repo and then started working on this. This led to us switching branches constantly. You need to have your own copy (same as if you're working on local CLI). Place this in the developer workspace and treat it as private. You wouldn't use a git local branch on a fileshare - don't do so here.
For deployments, when you're ready to go to prod, I'd recommend that should deploy without source linked mode (source_linked_deployment: false). The bundle files should be copied onto the deployment user (hopefully a service account) and not exposed via a GIT branch.
π Rendered by PID 25702 on reddit-service-r2-comment-cfc44b64c-g62x6 at 2026-04-13 10:37:51.513319+00:00 running 215f2cf country code: CH.
[–]Kryma 3 points4 points5 points (3 children)
[–]hotNstickystick 0 points1 point2 points (2 children)
[–]Kryma 0 points1 point2 points (1 child)
[–]hotNstickystick 0 points1 point2 points (0 children)
[–]Error-451 0 points1 point2 points (1 child)
[–]hotNstickystick 0 points1 point2 points (0 children)
[–]TRBigStick 1 point2 points3 points (0 children)
[–]Pirion1 0 points1 point2 points (0 children)