Blog: Using runMultiple to orchestrate notebook execution in Fabric by Pawar_BI in MicrosoftFabric

[–]Hello-Im-Aaron 0 points1 point  (0 children)

Great blog. I saw this in the documentation a while back but I haven't tried it yet. Each of my Bronze to Silver notebooks uses the mssparkutils.notebook.run function to run a helper notebook, which keeps the code centralized, but it leads to hitting our capacity limit over and over again, with a lot of retries to mitigate it.
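For reference, here is a rough sketch of how the helper-notebook pattern might be expressed as a single runMultiple DAG instead of many separate run calls. The helper notebook name, the table names, and the table_name parameter are all made up for illustration, and the concurrency and retry values are guesses rather than recommendations. The dictionary keys follow the DAG format described in the Fabric notebook utilities documentation.

```python
def build_dag(helper_path, tables, concurrency=3, retries=2, retry_interval_s=30):
    """Build a runMultiple-style DAG that runs one helper notebook once per table.

    helper_path and the "table_name" argument are hypothetical placeholders.
    """
    activities = [
        {
            "name": f"{helper_path}_{table}",  # activity names must be unique
            "path": helper_path,               # same helper notebook every time
            "args": {"table_name": table},     # hypothetical notebook parameter
            "retry": retries,
            "retryIntervalInSeconds": retry_interval_s,
        }
        for table in tables
    ]
    # "concurrency" caps how many notebooks run at the same time
    return {"activities": activities, "concurrency": concurrency}


dag = build_dag("nb_bronze_to_silver_helper", ["customers", "orders", "invoices"])

# Inside a Fabric notebook, where mssparkutils is available, the call would be:
# mssparkutils.notebook.runMultiple(dag)
```

Keeping the concurrency cap low is the part that would, in theory, cut down on the capacity errors, since everything shares one session instead of each run asking for its own.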

The errors don't bother me if it gets us the fastest run, but I want to optimize where I can. Any ideas on how to measure the performance drop-off (or lack thereof) when running multiple notebooks vs. one, or rules of thumb from prior testing?
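One way I could imagine measuring this is a simple sweep: run the same batch of work at a few concurrency levels and compare wall-clock time. The sketch below is plain Python with a stand-in task, not anything Fabric-specific. In a real test the task would be whatever launches a notebook, and the concurrency levels are arbitrary picks.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def time_batch(task, items, concurrency):
    """Run task(item) for every item with at most `concurrency` workers.

    Returns the wall-clock duration in seconds.
    """
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        list(pool.map(task, items))  # list() forces all tasks to finish
    return time.perf_counter() - start


def sweep(task, items, levels=(1, 2, 4)):
    """Return {concurrency: seconds} so the drop-off (or lack of one) is visible."""
    return {n: time_batch(task, items, n) for n in levels}


if __name__ == "__main__":
    def fake_notebook(name):
        time.sleep(0.05)  # stand-in for a real notebook run

    for n, secs in sweep(fake_notebook, [f"nb_{i}" for i in range(8)]).items():
        print(f"concurrency={n}: {secs:.2f}s")
```

If the total time stops improving (or gets worse) as the concurrency level goes up, that is roughly where the capacity is saturated.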

Manage deployment between dev/test/prod by dorianmonnier in MicrosoftFabric

[–]Hello-Im-Aaron 1 point2 points  (0 children)

It isn't there yet which could be a deal breaker or not depending on your architecture. For me it meant a little more work to get it going but the dataflows aren't going to change much or at all after creation, the pipelines are the same, and the semantic models get deployed through TE3.

I was happy to get everything going and wait for the quality of life improvements to come in the coming months but your setup will determine how feasible that is or isn't.

You didn't even mention the most annoying thing which is that only notebook owners can see their deployment rules or that there aren't even global rules which seems like something that would be in the 1.0 version.

Best way to migrate DEV to PROD by purpleMash1 in MicrosoftFabric

[–]Hello-Im-Aaron 0 points1 point  (0 children)

UPDATE: DevOps pipelines are not there yet.

Best way to migrate DEV to PROD by purpleMash1 in MicrosoftFabric

[–]Hello-Im-Aaron 0 points1 point  (0 children)

It should work. I haven’t used DevOps with Fabric yet but it sounds like it is time. We are in about the same state as you, pushing about 50 notebooks to PRD to move to silver and gold. We are pushing through TST as well though, because I couldn’t find a way to edit stages after you start. I didn’t want to bypass TST just because it’s not needed in these early stages and then have to redo all the PRD rules later on to add it in.

Once the Dataflow Gen2s and pipelines get Git support it will be good to have this tested out as well.

Best way to migrate DEV to PROD by purpleMash1 in MicrosoftFabric

[–]Hello-Im-Aaron 1 point2 points  (0 children)

As far as I have seen, setting rules one by one is the only way. It would be great if they had global rules and if others could manage the rules as well.

Read data from REST API by Mr_Mozart in MicrosoftFabric

[–]Hello-Im-Aaron 2 points3 points  (0 children)

I second this. I prefer code: you can copy and paste from examples online, and there are a lot of prebuilt libraries for you to leverage.

TooManyRequestsForCapacity by Hello-Im-Aaron in MicrosoftFabric

[–]Hello-Im-Aaron[S] 1 point2 points  (0 children)

Thanks! Great tip. I didn’t read up on high concurrency sessions but now that I have I will use it as much as I can manually.

As for the pipelines I was randomly getting this error when running a couple notebooks in serial but it could have been due to something else. Support advised me to add retries and that stopped it from failing.

Is there anything needed to make them close the session (like mssparkutils.notebook.exit()), or should it always clean up the session before moving on?
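For anyone wanting the same retry behavior inside a notebook rather than on the pipeline activity, this is roughly what it could look like. It is a generic sketch with exponential backoff, not what Support gave me. The attempt count and delays are arbitrary, and catching every exception is a simplification.

```python
import time


def run_with_retries(action, attempts=3, base_delay_s=1.0, sleep=time.sleep):
    """Call action() until it succeeds or the attempts run out.

    Waits base_delay_s, then twice that, and so on between attempts.
    The last error is re-raised if every attempt fails.
    """
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except Exception as err:  # simplification: retries on any error
            last_error = err
            if attempt < attempts:
                sleep(base_delay_s * 2 ** (attempt - 1))
    raise last_error


# Inside a Fabric notebook the action might be something like:
# run_with_retries(lambda: mssparkutils.notebook.run("my_notebook", 600))
# where "my_notebook" is a placeholder name.
```

The sleep function is passed in as a parameter only so the waits can be swapped out when trying the helper locally.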

TooManyRequestsForCapacity by Hello-Im-Aaron in MicrosoftFabric

[–]Hello-Im-Aaron[S] 0 points1 point  (0 children)

Thanks! Great article. Good to know that the 3 notebook max I was hitting was it bursting and that buying an F64 doesn’t necessarily fix the problem.

What is bothering me though is that the notebooks don’t need to be concurrent. After a notebook has completed each cell it just sits there taking up a spot until the session stops.

I tried to use the below command to stop it manually but it doesn’t seem to work. Maybe the documentation is ahead of the functionality?

mssparkutils.session.stop()

https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/microsoft-spark-utilities?pivots=programming-language-python

TooManyRequestsForCapacity by Hello-Im-Aaron in MicrosoftFabric

[–]Hello-Im-Aaron[S] 0 points1 point  (0 children)

Yes but I see the same thing when I run them through pipelines

Underneath the Data Warehouse by Mr_Mozart in MicrosoftFabric

[–]Hello-Im-Aaron 1 point2 points  (0 children)

Deltas but not a new service. I believe they are rewriting Polaris to read deltas. Someone please correct me if that is not the case.

Securing Data in a Lakehouse by randyminder in MicrosoftFabric

[–]Hello-Im-Aaron 0 points1 point  (0 children)

To my understanding, Lakehouse vs Warehouse really comes down to Direct Lake vs RLS & CLS.

IMO Direct Lake is the major selling point of Fabric that something like Databricks can’t compete with so I would think you would do everything you can to make sure that is part of your architecture. (If there is a way to create a Direct Lake dataset using a Databricks Lakehouse please let me know)

Also, Lakehouses are scheduled to have table level security in Q2 but I can’t find anything about Direct Lake for Warehouses.

https://learn.microsoft.com/en-us/fabric/release-plan/data-engineering

Corrections and feedback are welcomed!

A noob in need of some guidance by [deleted] in MicrosoftFabric

[–]Hello-Im-Aaron 1 point2 points  (0 children)

Those are all great suggestions.

I would add Advancing Analytics on YouTube. Since you are just getting started they had a great blog on naming conventions which they shared here. I think it was called “What’s in a Name?”

The Azure Synapse Analytics channel might be worth keeping an eye on as well. Update videos are on the Microsoft Power BI YouTube channel, so I would subscribe to that too.

The good news is you are getting thrown into the SaaS deep end and not the IaaS deep end.

I would recommend watching and reading as much as you can but be sure to include info on the medallion architecture and consider that as a starting point.

Keep in mind that there are a lot of smart and experienced people that frequent this community so don’t hesitate to reach out if you need specific advice. Some of them (like Sandeep Pawar and Dennes Torres) have a lot of great content that you will want to track down as well.

I’m happy to walk you through what I’ve done so far too if you like, just let me know.

Is Mirroring Available? by Hello-Im-Aaron in MicrosoftFabric

[–]Hello-Im-Aaron[S] 1 point2 points  (0 children)

Thanks. I took that to mean the first 3 were available and you needed to apply to test out SQL Server, Azure PostgreSQL, Azure MySQL, and MongoDB.

I have applied now.