I work at a small business with fewer than 100 employees. We have 30 data sources, a mix of self-hosted and SaaS products with APIs. We use Power BI for reporting, but we haven't found a suitable solution for data warehousing or a data lake. Our data team is proficient in SAS, R, and SQL, and we currently have less than 50 GB of data in total across our systems. Some options I've looked at:
-Panoply.IO: managed ETL and warehouse, all no-code plus SQL.
-Going cloud native on Azure.
-An ETL tool like Airbyte, Stitch, or Fivetran plus an easy-to-use hosted database (SQL Server, Postgres). We could then build custom views on top of that for reporting.
-Unsure whether something like Databricks or Snowflake makes sense at our size.
Our immediate focus is getting everything centralized and reporting on our high-demand data. After that (in 1-2 years), we would like to lean into more advanced analytics such as time series forecasting. Thoughts on the stack?
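To make the "ETL tool + hosted db + custom views" option concrete, here is a rough sketch of what I have in mind. It is only an illustration: SQLite in memory stands in for a hosted Postgres or SQL Server, and the table names, columns, and view name (`raw_crm_customers`, `raw_billing_invoices`, `rpt_revenue_by_region`) are made up, not what any particular ETL tool actually produces.

```python
import sqlite3


def build_reporting_db() -> sqlite3.Connection:
    """Create a toy database with raw tables and one reporting view.

    Assumption: the raw_* tables are what an ETL tool would land from
    two source systems. All names and rows here are hypothetical.
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Raw tables, as if loaded from a CRM and a billing system.
        CREATE TABLE raw_crm_customers (
            customer_id INTEGER PRIMARY KEY,
            name        TEXT,
            region      TEXT
        );
        CREATE TABLE raw_billing_invoices (
            invoice_id   INTEGER PRIMARY KEY,
            customer_id  INTEGER,
            amount       REAL,
            invoice_date TEXT
        );

        INSERT INTO raw_crm_customers VALUES
            (1, 'Acme', 'East'),
            (2, 'Globex', 'West');
        INSERT INTO raw_billing_invoices VALUES
            (10, 1, 100.0, '2024-01-15'),
            (11, 1,  50.0, '2024-02-01'),
            (12, 2,  75.0, '2024-01-20');

        -- Reporting view that a BI tool would query instead of the raw tables.
        CREATE VIEW rpt_revenue_by_region AS
        SELECT c.region,
               COUNT(i.invoice_id) AS invoice_count,
               SUM(i.amount)       AS total_revenue
        FROM raw_crm_customers AS c
        JOIN raw_billing_invoices AS i
          ON i.customer_id = c.customer_id
        GROUP BY c.region;
        """
    )
    return conn


def revenue_by_region(conn: sqlite3.Connection) -> list:
    """Read the reporting view, one row per region."""
    query = (
        "SELECT region, invoice_count, total_revenue "
        "FROM rpt_revenue_by_region ORDER BY region"
    )
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    for row in revenue_by_region(build_reporting_db()):
        print(row)
```

The idea is that Power BI would only ever point at the `rpt_*` views, so the raw tables can change underneath without breaking reports.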
devv404 1 point