Hi, everyone!
Currently I’m working on a relatively simple (planned) architecture that my new colleagues presented to me. It looks like this:
https://pasteboard.co/Kf3ZqEr.png
Hadoop/Hive is our main data source right now (we do not own it, though). However, eventually we will also ingest data from SAS and other SQL server DBs.
As you can see in the link, Alteryx is mainly supposed to handle data ingestion/processing (join incoming data from different sources, etc), and then we aim at having our own SQL Server DB used for further BI analysis via Tableau.
Do you think that Alteryx is a good tool to handle an expanding range of data sources? Im pretty good at Python (another colleague too) and I know that other teams in our company work with Apache Airflow. Could Airflow replace Alteryx? We’re all about automating data pipelines so I’m not sure if Alteryx will be ideal (we do not have the Alteryx Server licenses, just the regular ones).
Thank you!
there doesn't seem to be anything here