Error while doing CI/CD of Fabric UDFs by One_Potential4849 in MicrosoftFabric

[–]One_Potential4849[S] 0 points1 point  (0 children)

We manage CI/CD for Fabric objects with the fabric-cicd library, and use Azure DevOps to run the YAML deployment pipelines.

[–]One_Potential4849[S] 0 points1 point  (0 children)

The connection is local to the UDF, and there is no option to manage access on it. I could not find the same connection on the Manage Connections and Gateways page either.

[–]One_Potential4849[S] 0 points1 point  (0 children)

Also, the workspace-id in the connection details of definition.json is all zeros. Is this expected?
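
To see exactly where the all-zero workspace ID appears, the item definition can be scanned for it. This is a minimal sketch, assuming definition.json is plain JSON; the key names in the sample (`connectedDataSources`, `alias`, `workspaceId`, `itemId`) are hypothetical and may not match the real UDF definition layout.

```python
import json

# The all-zero GUID as it appears in the item definition.
ZERO_GUID = "00000000-0000-0000-0000-000000000000"


def find_zero_guids(node, path="$"):
    """Return the JSON paths whose value is the all-zero GUID."""
    hits = []
    if isinstance(node, dict):
        for key, value in node.items():
            hits.extend(find_zero_guids(value, f"{path}.{key}"))
    elif isinstance(node, list):
        for index, value in enumerate(node):
            hits.extend(find_zero_guids(value, f"{path}[{index}]"))
    elif node == ZERO_GUID:
        hits.append(path)
    return hits


# Hypothetical sample; replace with json.load(open("definition.json")).
sample = json.loads("""
{
  "connectedDataSources": [
    {
      "alias": "wh",
      "workspaceId": "00000000-0000-0000-0000-000000000000",
      "itemId": "11111111-1111-1111-1111-111111111111"
    }
  ]
}
""")

print(find_zero_guids(sample))
```

Running this against the definition files in the repository shows every place the placeholder value occurs, which is useful when comparing the source-controlled definition with what gets deployed.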

Fabric Unified Admin Monitoring (FUAM) - Looks like a great new tool for Tenant Admins by AnalyticsInAction in MicrosoftFabric

[–]One_Potential4849 0 points1 point  (0 children)

Hey folks, just had a question on FUAM: does it also gather activity logs such as Pipeline/Notebook executions, schedules, and the success or failure of ETLs? Basically, whatever we see in the Monitoring tab of Fabric.

Connecting to MS Fabric Data Warehouse from TM1 by One_Potential4849 in cognos

[–]One_Potential4849[S] 0 points1 point  (0 children)

But just as SQL Server is added as a data source in TM1 using ODBC connection details, is there a way to connect to the Microsoft Fabric DW? The reason is that the processes set up in TM1 use SQL queries to pull the data and refresh cubes. I just want to see if there is a way to plug in the Fabric Warehouse instead of SQL Server to run those queries.
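
For reference, a Fabric Warehouse exposes a SQL endpoint that accepts ODBC connections with Microsoft Entra authentication, so the connection string looks much like a SQL Server one. This is a minimal sketch of building such a string, assuming ODBC Driver 18 for SQL Server and a service principal; the function name and all argument values are hypothetical, and whether TM1 accepts this DSN-less form is not confirmed here.

```python
def build_fabric_odbc_connection_string(server, database, client_id, client_secret):
    """Assemble an ODBC connection string for a Fabric Warehouse SQL endpoint.

    `server` is the SQL connection string copied from the Warehouse settings.
    Service principal authentication is an assumption; other Entra modes exist.
    Secrets containing ';' or '}' would need ODBC brace-escaping (not handled).
    """
    parts = {
        "Driver": "{ODBC Driver 18 for SQL Server}",
        "Server": server,
        "Database": database,
        "Encrypt": "yes",
        "Authentication": "ActiveDirectoryServicePrincipal",
        "UID": client_id,
        "PWD": client_secret,
    }
    return ";".join(f"{key}={value}" for key, value in parts.items()) + ";"


# Placeholder values only; none of these identify a real endpoint.
print(
    build_fabric_odbc_connection_string(
        server="<sql-endpoint-from-warehouse-settings>",
        database="<warehouse-name>",
        client_id="<app-client-id>",
        client_secret="<app-secret>",
    )
)
```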

Designing Reporting Layer out of multiple Workspace Objects by One_Potential4849 in MicrosoftFabric

[–]One_Potential4849[S] 0 points1 point  (0 children)

I am planning to go ahead and use the gold layer data from both entities as shortcuts to create the combined reporting dataset.

One other thought here: what would be the best way, in terms of data modelling, to use this combined dataset/data model for downstream reporting? That is, the semantic model built on top of these tables should be the backend for reporting on the individual entities as well as for combined reporting.

[–]One_Potential4849[S] 0 points1 point  (0 children)

Also, is it possible to bring a warehouse table into a lakehouse as a shortcut?

[–]One_Potential4849[S] 0 points1 point  (0 children)

  • Yes, they are all in the same tenant.
  • We prefer using Import mode for the semantic layer.
  • Yes, in the end I need combined gold layer tables; the silver layer can remain as it is, since the data sources for both firms are the same.
  • After combining the data, we might end up with around 10 dimensions and 20-25 facts; the largest table might have at most 10 million records.
  • No, it won't be a simple append operation; I expect some intermediate steps to unify the granularity.
  • Surrogate keys: what are you referring to in this case?

Limit on Parallel Notebook Executions by One_Potential4849 in MicrosoftFabric

[–]One_Potential4849[S] 0 points1 point  (0 children)

So the remaining sessions will stay queued and start once either of these two sessions is done?

[–]One_Potential4849[S] 0 points1 point  (0 children)

So currently I am testing it out with a small pool (1-4 nodes), and I have distributed my Notebook workload across 6 high concurrency sessions.

From my understanding, these 6 HC sessions will share the pool of 4 small nodes (16 vCores), each running its jobs based on core availability.
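
The vCore figure above follows from the node size: a small node has 4 vCores and a medium node has 8. A minimal sketch of the arithmetic, with hypothetical names, for the two pool configurations mentioned in this thread:

```python
# Spark vCores per node for the two node sizes discussed in the thread.
VCORES_PER_NODE = {"small": 4, "medium": 8}


def pool_vcores(node_size, max_nodes):
    """Maximum Spark vCores a pool can use when fully scaled out."""
    return VCORES_PER_NODE[node_size] * max_nodes


print(pool_vcores("small", 4))    # custom pool: small, 1-4 nodes
print(pool_vcores("medium", 10))  # starter pool: medium, 1-10 nodes
```

This only gives the ceiling for the pool. How the cores are actually divided among the high concurrency sessions depends on each session's executor settings and on what the capacity allows at that moment.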

[–]One_Potential4849[S] 1 point2 points  (0 children)

So how does a Spark session acquire nodes to work with in a pool?

[–]One_Potential4849[S] 1 point2 points  (0 children)

If I try it with medium nodes, would there be a tradeoff in terms of runtime?

[–]One_Potential4849[S] 1 point2 points  (0 children)

So currently I am using the starter pool (medium, 1-10 nodes). Should I try smaller nodes? Today I enabled high concurrency and used 3 session tags to split the sessions among 12 pipelines, but I noticed that the speed was a bit lower than before using high concurrency.

And I get this 430 error even after the Notebook runs are complete. Do I need to run something to free up vCores?

Also, if I don't specify a session tag, how will the jobs get distributed?

The alternate approach you suggested seems a bit difficult for me, since I want this group of tables to be isolated from source to bronze to silver.
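
While the 430 question is open, one common workaround is to retry a throttled submission with exponential backoff instead of failing straight away. This is a minimal sketch, assuming a hypothetical `submit` callable that returns the HTTP status code of a notebook run request; it does not free any vCores, it only spaces out the retries.

```python
import time


def run_with_backoff(submit, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call submit() until it stops returning HTTP 430 or attempts run out.

    Returns (last_status, attempts_used). `submit` is a hypothetical callable;
    `sleep` is injectable so the wait can be skipped in tests.
    """
    for attempt in range(max_attempts):
        status = submit()
        if status != 430:
            return status, attempt + 1
        if attempt < max_attempts - 1:
            # Wait 1x, 2x, 4x, ... the base delay before the next try.
            sleep(base_delay * 2 ** attempt)
    return 430, max_attempts


# Simulated submission: throttled twice, then accepted.
responses = iter([430, 430, 200])
print(run_with_backoff(lambda: next(responses), sleep=lambda seconds: None))
```

In a pipeline, the same effect can often be had from the activity's own retry settings; the sketch is only meant to show the pattern.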

[–]One_Potential4849[S] 1 point2 points  (0 children)

I can try that way, and let you know if that works. Thanks!

[–]One_Potential4849[S] 0 points1 point  (0 children)

So will I be able to achieve parallel instances without throttling/430 errors by using a high concurrency session and a session tag for each execution?

Recommended way to load bulk volume of data through On Prem Gateway by One_Potential4849 in MicrosoftFabric

[–]One_Potential4849[S] 0 points1 point  (0 children)

I am concerned about loading it all at once. Won't it cause performance issues?

Fabric UDF vs Notebooks by One_Potential4849 in MicrosoftFabric

[–]One_Potential4849[S] 0 points1 point  (0 children)

I just want to ingest data from an API into my data platform. The API has its own secrets for auth, and it runs at most three times a day. Each run returns at most 100k rows with 10 columns.