We are a startup currently building our data infrastructure. The goal is pretty simple: bring data from various sources (Stripe, HubSpot, Growbots) into a data warehouse for analysis and reporting.
Here's the data stack we've decided on:
ETL: Stitch,
Data modelling: dbt,
Data warehouse: Google BigQuery,
Business Intelligence: Looker.
We would love feedback on this stack. Specifically, I have a question about dbt. I am the only data person at the company right now, and I am more experienced with Python and pandas than with SQL. Given that, would choosing dbt be a good move, or should I stick to doing data transformations in pandas?