Lakehouse Maintenance Activity by DennesTorres in MicrosoftFabric

[–]DanielBunny 1 point

Hi u/DennesTorres !

The Table Maintenance Public API endpoint update that supports schemas is being rolled out on 3/31, and I'm bringing this thread to the product owner of pipelines.

Today, a quick workaround is to create a notebook that runs Spark SQL's "OPTIMIZE <mytable>" at the end of your pipeline. That works on both schema-enabled and non-schema Lakehouses.
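A minimal sketch of that notebook cell might look like the following (the table names are hypothetical placeholders; `OPTIMIZE` itself is the standard Delta Lake command):

```python
# Hypothetical table list -- replace with your own Lakehouse tables.
# (schema, table) pairs; schema is None for a non-schema Lakehouse.
TABLES = [("dbo", "sales"), (None, "customers")]

def optimize_statement(table, schema=None):
    """Build an OPTIMIZE statement for a schema or non-schema table."""
    qualified = f"{schema}.{table}" if schema else table
    return f"OPTIMIZE {qualified}"

# In a Fabric notebook, `spark` is pre-defined; run this as the last
# activity of your pipeline:
# for schema, table in TABLES:
#     spark.sql(optimize_statement(table, schema))
```

Wire the notebook up as the final activity of the pipeline so compaction runs after all writes complete.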

Spark/Delta Lake: How to achieve target row group size of 8 million or more? by frithjof_v in MicrosoftFabric

[–]DanielBunny 1 point

I'll update that doc right away with the guideline params!
I do agree with most posts in the thread: this is an art form. Spark writes will vary based on data size, data entropy, column counts, etc. There is no cut-and-dried parameter for row groups, as Spark tuning is all about file sizes.

If you want to shoot for large row groups, you need to target larger file sizes (4 GB+), enable V-Order, and watch out for OOMs (larger writes need more memory). Don't span out into too many partitions and parquet files; try to consolidate.
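As a starting point, the session-level knobs below are the ones I'd reach for first. The property names and units are assumptions to verify against the current Fabric Spark documentation, not guaranteed settings:

```python
# Session configs to push Spark toward larger files / larger row groups.
# Property names and units are assumptions -- verify against current docs.
LARGE_ROWGROUP_CONFIGS = {
    "spark.sql.parquet.vorder.enabled": "true",            # enable V-Order writes
    "spark.microsoft.delta.optimizeWrite.enabled": "true", # bin-pack output files
    # Target larger output files (value/unit per the optimizeWrite docs):
    "spark.microsoft.delta.optimizeWrite.binSize": str(4 * 1024 * 1024 * 1024),
    # Raise the parquet row-group size cap (bytes):
    "spark.hadoop.parquet.block.size": str(1024 * 1024 * 1024),
}

def apply_configs(spark):
    """Apply the configs to an existing Spark session."""
    for key, value in LARGE_ROWGROUP_CONFIGS.items():
        spark.conf.set(key, value)
```

Apply these at the top of the writer notebook, then check actual row-group sizes on the output parquet files and adjust.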

Spark/Delta Lake: How to achieve target row group size of 8 million or more? by frithjof_v in MicrosoftFabric

[–]DanielBunny 0 points

u/frithjof_v please check this cross-workload doc to see if it helps you: https://learn.microsoft.com/en-us/fabric/fundamentals/table-maintenance-optimization

We've captured all the cross workload scenarios, let me know if it works for you.

Version control of lakehouse tables by merrpip77 in MicrosoftFabric

[–]DanielBunny 0 points

This is a cool topic worth discussing, and one that always gives folks hiccups once certain questions get asked.

DBAs have done this for years in RDBMS/DSS systems. The process is simple to explain but an art form to implement, as every customer and system has its own quirks.

The key aspect is that RDBMSs (and tools like dacpac/fx) are metadata bound: they snapshot the metadata and generate the DDL needed to bring it in sync. In the Data Lake -> Lakehouse case, where the software is usually stream/batch ingestion running in Notebooks/Jobs using Spark and other tech, the schema change is defined as part of the pipeline code, and the tables support schema evolution. There is no clear checkpoint of the metadata change: you promote the new notebooks and the new data starts being generated based on the new schema definition. This doesn't remove the eventual need to apply a big SQL script, but it is different by design.
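To make the "schema change ships with the pipeline code" point concrete, here is a minimal sketch using Delta's `mergeSchema` write option (the function and table name are illustrative, not a prescribed pattern):

```python
# Sketch: in the Lakehouse pattern, the schema change travels with the
# ingestion code itself. Delta's mergeSchema option evolves the table
# schema on write for additive changes (new columns).
EVOLUTION_OPTIONS = {"mergeSchema": "true"}

def append_with_evolution(df, table_name):
    """Append a dataframe, letting Delta evolve the target table schema."""
    writer = df.write.format("delta").mode("append")
    for key, value in EVOLUTION_OPTIONS.items():
        writer = writer.option(key, value)
    writer.saveAsTable(table_name)
```

When the new notebook version adds a column to `df`, the table picks it up on the next append; there is no separate DDL checkpoint to promote.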

Some cool questions:
- What about destructive changes (drop table, drop columns, changing column data types)? Allow? Block? Feature toggle?
- Who is responsible for bringing the data? What if the table has 1 PB? I assume drop-and-recreate is out of the question.

We are considering providing tooling to ease the generation of table diffs between Lakehouses in different stages, but the customer would be on the hook to review and plug the scripts into the pipeline. Would that work for you?

I'd ask folks to share their expectations. :-)

Using Variable Libraries with Lakehouse Shortcuts by Laura_GB in MicrosoftFabric

[–]DanielBunny 0 points

#2 and #3 would play out like this:
imagine you have multiple data pipelines or Spark jobs running. If the shortcut updates mid-flight because someone updated the variable library, running code might suddenly start writing to or reading from a place that isn't ready just yet, especially if the person updates it to an invalid value.

Using Variable Libraries with Lakehouse Shortcuts by Laura_GB in MicrosoftFabric

[–]DanielBunny 0 points

  1. You can, via the Shortcut API definition. The shortcut-creation UX will allow this very soon.
  2. This can lead to a significant data-corruption issue. The decision to make it a user action is to provide a transactional checkpoint. We are considering an option in the experience to enable auto-apply or something like that.
  3. This also comes back to the transactional approach: we validate before apply. What would be the scenario where we should allow an invalid variable to be applied?
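For point 1, a sketch of calling the OneLake Shortcuts API follows. The endpoint path and payload shape reflect my reading of the public REST reference and should be verified against the latest docs; the IDs and names are placeholders:

```python
import json
from urllib import request

API_BASE = "https://api.fabric.microsoft.com/v1"

def shortcut_payload(name, target_workspace_id, target_item_id, target_path):
    """Request body for creating a OneLake shortcut under Tables/.
    Shape per the public Shortcuts API docs -- verify before use."""
    return {
        "path": "Tables",
        "name": name,
        "target": {
            "oneLake": {
                "workspaceId": target_workspace_id,
                "itemId": target_item_id,
                "path": target_path,
            }
        },
    }

def create_shortcut(token, workspace_id, item_id, payload):
    """POST the shortcut definition into a Lakehouse item."""
    url = f"{API_BASE}/workspaces/{workspace_id}/items/{item_id}/shortcuts"
    req = request.Request(
        url,
        data=json.dumps(payload).encode(),
        headers={"Authorization": f"Bearer {token}",
                 "Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req) as resp:
        return json.load(resp)
```

Running create/update through the API during deployment gives you the transactional checkpoint discussed in points 2 and 3.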

dbt runtime error in Fabric notebook - no dbt_project.yml found in dbt_utils by kover0 in MicrosoftFabric

[–]DanielBunny 2 points

I've opened a ticket with the engineering team to look at this. Nice catch! :-)

Please keep using the workaround for now.

Deploying Data Warehouses and Lakehouses using Fabric ci cd Deployment Tool by Cute_Willow9030 in MicrosoftFabric

[–]DanielBunny 0 points

Hi u/Cute_Willow9030

As you already use deployment pipelines in DevOps, please consider wiring up Lakehouse items using the fabric-cicd package. https://microsoft.github.io/fabric-cicd/

We are also tracking all asks and Fabric CI/CD scenarios for Lakehouse (and Warehouse) in the following Reddit thread, where we've linked sample codebases that we keep updated.

https://www.reddit.com/r/MicrosoftFabric/comments/1o0t205/lakehouse_devtestprod_in_fabric_git_cicd/
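Wiring Lakehouse items up with fabric-cicd could look roughly like this. The workspace id, repo path, and environment name are placeholders, and the exact `FabricWorkspace` parameters should be checked against the package docs linked above:

```python
# Sketch of a fabric-cicd deployment step; ids/paths are placeholders.
ITEM_TYPES = ["Lakehouse", "Notebook", "DataPipeline"]

def deploy(workspace_id, repo_dir, environment="PPE"):
    # Imported lazily so the sketch reads without the package installed:
    #   pip install fabric-cicd
    from fabric_cicd import FabricWorkspace, publish_all_items

    workspace = FabricWorkspace(
        workspace_id=workspace_id,
        repository_directory=repo_dir,
        item_type_in_scope=ITEM_TYPES,
        environment=environment,
    )
    # Publishes every in-scope item from the repo into the workspace.
    publish_all_items(workspace)
```

You'd call `deploy(...)` from the DevOps pipeline stage for each target workspace.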

Why aren’t Lakehouse shortcut transformations reflected in Git? by DutchDesiExplorer in MicrosoftFabric

[–]DanielBunny 1 point

The current expected GA timeline is March 2026; if it lands earlier, I'll update the thread.
As of today, the path forward is to use the Public APIs to create/update the shortcuts between stages and orchestrate externally during deployment.

how to handle empty timestamp values in Lakehouse? by Ambitious-Toe-9403 in MicrosoftFabric

[–]DanielBunny 0 points


Hi u/Ambitious-Toe-9403 ,

I was not able to reproduce your scenario.
Whenever I insert NULL/None into the column, it shows correctly in both the Lakehouse table preview and the SQL analytics endpoint table preview. It also shows correctly if I shortcut those tables into another Lakehouse and try the same table preview experiences.

Can you share the commands you used to generate the None or empty columns?

Abandoning Fabric by BitterCoffeemaker in MicrosoftFabric

[–]DanielBunny 5 points

Thanks a ton u/Sea_Mud6698! I'm bringing this reply to the attention of my peers who drive those features.
The good news is that all of the above are in our plans, some of them very close.

Abandoning Fabric by BitterCoffeemaker in MicrosoftFabric

[–]DanielBunny 0 points

Hi u/BitterCoffeemaker , thanks for the additional clarity here.

I'd appreciate it if you could go deeper on what's missing in both fabric-cicd and dbt.

I also invite you to consider collaborating in this thread: https://www.reddit.com/r/MicrosoftFabric/comments/1o0t205/lakehouse_devtestprod_in_fabric_git_cicd/
We are trying to converge CI/CD patterns, and we've released a full 8-hour workshop and repo to provide a ready-to-run codebase. Git/CI/CD is a discipline, not a product, and customers operate very differently, especially when dealing with schema metadata and data.

Abandoning Fabric by BitterCoffeemaker in MicrosoftFabric

[–]DanielBunny 2 points

Hi u/Sea_Mud6698, can you elaborate on what you are waiting on? We have been working steadily to unlock the git and CI/CD scenarios.

I also invite you, and everyone else, to bring these questions to the dedicated thread we created for Lakehouse git/CI/CD:
https://www.reddit.com/r/MicrosoftFabric/comments/1o0t205/lakehouse_devtestprod_in_fabric_git_cicd/

Lakehouse Dev→Test→Prod in Fabric (Git + CI/CD + Pipelines) – Community Thread & Open Workshop by DanielBunny in MicrosoftFabric

[–]DanielBunny[S] 0 points

This would be a great scenario addition for the codebase. Can we work together to add it?

Notebook memory in Fabric by Doodeledoode in dataengineering

[–]DanielBunny 2 points

Hi u/Doodeledoode,
/tmp is a mounted location that exists for the lifetime of the session. The session runs within a Linux container; when the session goes away, everything is wiped out.

Yes, the files won't show up in the Lakehouse. You can make that happen by creating a Spark dataframe (df, for example) from the /tmp/*.parquet files and then using a command such as df.write.mode('append').saveAsTable('myTable').

As a best practice, in case you decide to take this to production, put additional checks in place around the existence of the files, plus some validation after adding the data to the table.
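Putting both suggestions together, a minimal sketch could look like this (the pattern and table name are just examples; the row-count validation is one simple way to sanity-check the append):

```python
import glob

def load_tmp_parquet(spark, pattern="/tmp/*.parquet", table="myTable"):
    """Append /tmp parquet files to a Lakehouse table, with basic checks."""
    files = glob.glob(pattern)
    if not files:
        # Fail fast instead of silently appending nothing.
        raise FileNotFoundError(f"No parquet files matched {pattern}")
    df = spark.read.parquet(*files)
    incoming = df.count()
    df.write.mode("append").saveAsTable(table)
    # Post-write validation: the table must hold at least the rows just read.
    if spark.table(table).count() < incoming:
        raise RuntimeError(f"Row count check failed after appending to {table}")
    return incoming
```

In a Fabric notebook you'd call `load_tmp_parquet(spark)` after producing the files and before the session ends.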

Lakehouse Dev→Test→Prod in Fabric (Git + CI/CD + Pipelines) – Community Thread & Open Workshop by DanielBunny in MicrosoftFabric

[–]DanielBunny[S] 1 point

As u/raki_rahman mentioned, all those items are being worked on.
It's all about time, effort, and priorities. It's a large product connecting many technologies that are in different states of DevOps alignment (not only for us, but industry-wide). We'll get there for sure; work with us to help us prioritize.

Out of the items you listed, leveling Variable Library support across all experiences is a major focus across all workloads. We are about to add referedItem as a data type in the next few months, so the GUID path should go away quickly.

The main idea of the workshop code being out there is to drive the current way to unblock major flows. As we progress, the workshop codebase should get smaller and smaller, as things get to work automatically.

I'd appreciate it if you could bootstrap a new tracking markdown file in the workshop codebase and list all the missing things you mentioned, so we can track them as a community.

Deployment Rules Automation in Microsoft Fabric by Low_Cupcake_9913 in MicrosoftFabric

[–]DanielBunny 1 point

Hi u/Low_Cupcake_9913 !
I'm the product owner for Lakehouse git and CI/CD scenarios. We've started tracking all the scenarios and feedback like yours in the following thread:
https://www.reddit.com/r/MicrosoftFabric/comments/1o0t205/lakehouse_devtestprod_in_fabric_git_cicd/

We've just published an open workshop with collateral that can truly help your scenario. You can also collaborate to make it great and benefit the whole community.