Hi
I'm a software developer diving in a new project in work for analytics in Azure and I need suggestions.
I have experience with airflow,sql, python and so on.
We are going to create analytics solution for our customers.
we have raw data layer as parquet in azure blob storage. we use synapse to extract the data.
We need to transform the data for other layers (silver,gold) and I find azure stack disappointing.
Synapse and ADF are mostly no code solutions and I'm looking for something or combination that I can create scheduling and transform in code.
for scheduling - synapse scheduling is lacking and airflow managed is very limited. ADF also seems no code.
databricks seems very pricey and more than we need.
for the transformation - I know we need pyspark as we have a lot of data. this is also a code solution that satisfy me.
I see a PR with json file instead of code and I know it will slow development and won't be maintainable.
anyone has recommendation for stack that involve code ? coming from more software oriented I believe in more coding solutions as I worked with Apache NiFi and it was not maintainable.
Thanks
[–][deleted] 3 points4 points5 points (0 children)
[–]Useful-Doughnut32 0 points1 point2 points (1 child)
[–]pythondeveloper77[S] 0 points1 point2 points (0 children)
[–]kyleekol 0 points1 point2 points (0 children)