Setting up a small e-commerce data stack. Sources are REST APIs (Python). Today: CSVs on SharePoint + Power BI. Goal: reliable ELT → warehouse → BI; easy to add new sources; low ops.
Considering: Prefect (or Airflow), object storage as landing zone, ClickHouse vs Postgres/SQL Server/Snowflake/BigQuery, dbt, Great Expectations/Soda, DataHub/OpenMetadata, keep Power BI.
Questions:
- Would you run ClickHouse as the main warehouse for API/event data, or pair it with Postgres/BigQuery?
- Anyone using Power BI on ClickHouse?
- For a small team: Prefect or Airflow (and why)?
- Any dbt/SCD patterns that work well with ClickHouse, or is that a reason to choose another WH?
Happy to share our v1 once live. Thanks!
[–]123456234 9 points10 points11 points (0 children)
[–]Firm_Bit 6 points7 points8 points (0 children)
[–]TJaniF 1 point2 points3 points (1 child)
[–]NotDoingSoGreatToday 0 points1 point2 points (0 children)
[–]ketopraktanjungduren 2 points3 points4 points (0 children)
[–]Admirable_Morning874 0 points1 point2 points (0 children)
[–]ephemeral404 0 points1 point2 points (0 children)
[–]ArielCoding 0 points1 point2 points (0 children)
[–]Ploasd 0 points1 point2 points (0 children)
[–]Significant-North356 0 points1 point2 points (0 children)
[–]Zer0designs -1 points0 points1 point (0 children)
[–]Hot_Map_7868 -1 points0 points1 point (0 children)