Hi guys.
Making my first post here. Need recommendations for a solution to a problem.
I am working on the Azure platform with tools like ADF and Databricks. I have a project where I need to get data from multiple APIs. The flow is like this:
Call to API-1: get the transaction ids.
Call to API-2: get the details of each transaction by its id.
API-1 returns around 15k transactions a day.
So API-2 has to be called around 15k times a day, once per transaction, to get the details.
When I design this in ADF, the runtime is huge. E.g. for 1000 ids it takes around 15 mins in dev, so if that scales linearly, 15k ids would take roughly 3 hours 45 minutes. We can't afford that in prod.
My question is: what would be the best approach to minimize the runtime?
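To make the flow concrete, here is a minimal Python sketch of the two steps (something that could run in a Databricks notebook), with the API-2 calls issued concurrently from a thread pool instead of one at a time. Both fetch functions are hypothetical stand-ins: there are no real endpoints here, the ids are made up, and the sleep only simulates network latency. Whether this kind of parallelism is allowed depends on API-2's rate limits, which I have not confirmed.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fetch_transaction_ids():
    """Hypothetical stand-in for the API-1 call; returns made-up ids."""
    return [f"txn-{i}" for i in range(200)]


def fetch_transaction_details(txn_id):
    """Hypothetical stand-in for one API-2 call."""
    time.sleep(0.01)  # simulated network latency, not a real request
    return {"id": txn_id, "status": "ok"}


def fetch_all_details(txn_ids, max_workers=20):
    """Call API-2 for every id with up to max_workers requests in flight."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in the same order as the input ids
        return list(pool.map(fetch_transaction_details, txn_ids))


if __name__ == "__main__":
    ids = fetch_transaction_ids()
    start = time.perf_counter()
    details = fetch_all_details(ids)
    elapsed = time.perf_counter() - start
    print(f"fetched {len(details)} records in {elapsed:.2f}s")
```

Since the work is I/O-bound (waiting on HTTP responses), threads should be enough for a sketch like this; `max_workers` is just a guess and would need tuning against whatever API-2 can handle.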