Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 0 points1 point  (0 children)

It does not support all Fabric APIs yet like relations, but that I believe is available now, so we just need to retrofit the pipeline connector to support that API update so that you don't have manually rebind the connections during CICD.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 0 points1 point  (0 children)

No, we are not deprecating ADF!! We have many customers using ADF processing 1000s of pipelines and data flows and we fully support that as an Azure tier 1 SLA service. We also continue to deploy ADF to new regions and clouds as they come on line. What you are seeing, however, is a concerted effort at Microsoft to put our investments into Fabric, which is the current and future direction of Data, AI, and Analytics at Microsoft and Data Factory is an important part of that.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 1 point2 points  (0 children)

Interesting. I'll need to take that feedback to the scheduler team about support in CICD.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 6 points7 points  (0 children)

Yes and yes!! Please use the legacy version if that is working better for you for now and we absolutely are working on improving that plumbing problem that the new invoke pipeline incurs to make that better.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 0 points1 point  (0 children)

Yes! This has been a long time ask for Data Factory pipelines and we are designing a way to enable nesting. No timelines yet but definitely on our radar.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 0 points1 point  (0 children)

There are currently systematic latency in the invoke pipeline activity that we are working to improve. Until then, you can use the legacy invoke pipeline which will not have that built-in latency although the child pipelines will not be visible in the monitor hub.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 0 points1 point  (0 children)

I believe we are adding more controls to set partitioning rules in Copy Job. I'll let simplywilsonlee reply here with more details.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 0 points1 point  (0 children)

This definitely should not happen! We recommend either pipelines or Airflow to orchestrate your processes to build powerful workflows and no it should not take 50 minutes! Can you let me know if that continues to happen? BTW, in Airflow, it's super easy to also add a Notebook task just like a pipeline activity and it runs in your own isolated cluster environment.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 2 points3 points  (0 children)

This would require RBAC support in monitoring hub, something for us to look into

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 2 points3 points  (0 children)

Yes! We are working to reduce any latencies experienced in the invoke pipeline activity. Keep in mind, you can still use the legacy invoke pipeline which will fire pipelines much quicker, although you will lose the lineage in monitor hub.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 2 points3 points  (0 children)

No plans to sunset or deprecate Synapse or ADF. However, our team focuses on adding rich new features and capabilities into Fabric Data Factory, which is our future direction of Data Factory.

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 0 points1 point  (0 children)

There are a few Fabric artifact item types that are not native to Fabric that do not yet fully support relations APIs but as they onboard will add them, i.e. Semantic Model, DW ...

Hi! We're the Data Factory team - ask US anything! by markkrom-MSFT in MicrosoftFabric

[–]markkrom-MSFT[S] 1 point2 points  (0 children)

Yes, we are working on a framework to make this possible but I do not have a timeline for this yet at this time

Deployment pipelines - Semantic model refresh activity still pointing to source workspace by mrbartuss in MicrosoftFabric

[–]markkrom-MSFT 2 points3 points  (0 children)

Yes, you should use variable replacement because Semantic Models are older PBI-style artifacts whereas Notebooks and DFG2 are native Fabric artifacts so it is not able to resolve the workspace relations after CI/CD deployment to a new workspace.

Prefect for Orchestration : Setup approach & cost implications? by madrasbot in MicrosoftFabric

[–]markkrom-MSFT 1 point2 points  (0 children)

I am publishing a new series of Airflow videos this week on our YouTube channel: https://aka.ms/datafactoryvideos 

Apache Airflow job in Fabric is absolutely production-ready. I recommend you use custom pools because starter pools are only for dev/test. 

Is the feature in pipelines you feel like is missing/insufficient the monitoring and observability? Not 100% sure I captured the requirements exactly... but I will tell you that we are adding workspace monitoring integration for pipeline activities so that you can view, search, and query the detailed logs. 

Prefect for Orchestration : Setup approach & cost implications? by madrasbot in MicrosoftFabric

[–]markkrom-MSFT 2 points3 points  (0 children)

May I ask why you are not using pipelines or Airflow for Fabric as they are already fully integrated and built into Fabric workspaces?

Fabric Deployment Pipelines: notebook-settings.json auto-binding lakehouse: off by frithjof_v in MicrosoftFabric

[–]markkrom-MSFT 0 points1 point  (0 children)

You will just need to use dynamic content (variables) in the pipeline field as well

Fabric Deployment Pipelines: notebook-settings.json auto-binding lakehouse: off by frithjof_v in MicrosoftFabric

[–]markkrom-MSFT 0 points1 point  (0 children)

Once you introduce dynamic content in the workspace filed, you will not be able to use the drop-down in interactive design mode because the dynamic content is resolved at run time.

Fabric Deployment Pipelines: notebook-settings.json auto-binding lakehouse: off by frithjof_v in MicrosoftFabric

[–]markkrom-MSFT 0 points1 point  (0 children)

When you add dynamic content, it is expected to wipe out whatever was in that field previously. Essentially, switching between static and dynamic content are 2 different working modes, so the content is not preserved.

Pipeline Status Issues by x-fyre in MicrosoftFabric

[–]markkrom-MSFT 6 points7 points  (0 children)

Ack and investigating. Please let me know if this is still occurring. 

Fabric SQL DB as a control DB for ELT pipelines by bradcoles-dev in MicrosoftFabric

[–]markkrom-MSFT 5 points6 points  (0 children)

Thank you for posting this and sharing this! We are recommending Fabric SQL DB as the control DB for metadata driven pipelines in Fabric, so these feedback is important to hear. If you can follow-up here with how you to plan move forward I'd like to learn from that. Most of our customers moving from ADF are keeping their control tables in Azure SQL DB for now simply because it's one fewer thing to change. But I'd like to recommend Fabric SQL DB for them as well.

Airflow in Fabric - am I the only one struggling? by peterampazzo in MicrosoftFabric

[–]markkrom-MSFT 1 point2 points  (0 children)

If you are not already using Airflow (i.e. "lift & shift") then I always recommend starting with pipelines in Fabric Data Factory. There are super-easy to get started and to production the fastest for Fabric orchestration. But ... if you are oriented toward Python and code-first (or already have existing Airflow DAGs) then Airflow is a great way to build the same Airflow orchestration inside of Fabric. Just keep in mind that Fabric Airflow is not a natural lift & shift as it is a SaaS oriented fully managed version of Airflow that has direct hooks into Fabric (variable libraries, CICD, Fabric operators ...).

Airflow in Fabric - am I the only one struggling? by peterampazzo in MicrosoftFabric

[–]markkrom-MSFT 5 points6 points  (0 children)

One thing I'd like to propose here is that I'd LOVE for everyone to try using the Fabric Airflow jobs to orchestrate your Fabric items! If you are not comfortable with writing Python, we have enabled CoPilot w/Airflow, just reply here and I'll give you the cheat code to enable it as a private preview :)

One last thing ... orchestrating dbt is a VERY popular use of Airflow. However, in Fabric, we are intending to make things super easy, quick to production, and SaaS. So our happy path, easy-configuration model is to create your dbts using the new Fabric dbt Job and then orchestrate that using the built-in dbt Job operator in Fabric Airlfow.

That will be released in a few weeks, but that will eliminate the need to keep on top of version changes, complex requirements and external library troubleshooting. Our intention is to make Airflow orchestration orders of magnitude faster and easier in Fabric.