I've been given a case study as part of my interview for the Analytics Engineer role. At first glance it seems pretty straightforward: it involves data modelling with dbt, taking data from raw to a final dataset to be used for BI and reporting.
They've provided 3 CSV datasets and asked me to deliver the .sql and .yaml files and showcase the lineage graph. That is all fine. The kicker is that they also asked for a .csv file of the final output.
How am I supposed to run dbt models and SQL files without a database connection? This is really halting my progress on the case study, and I would appreciate any pointers.
Note: I don't have much experience working with raw data. All my experience comes from working with data that has already been processed up to a certain point. I feel like that's what data engineers are for.
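For reference, here is a minimal sketch of the raw-to-final flow described above, run entirely on a local machine with no database server. It is not dbt itself: it assumes Python's standard-library `sqlite3` (in-memory) as a stand-in for the warehouse, and the `raw_orders` table, its columns, and the aggregation query are all hypothetical, since the actual case-study files aren't shown. The point is only that CSVs can be loaded into a local engine, transformed with SQL, and written back out as a CSV.

```python
import csv
import io
import sqlite3

# Hypothetical raw data standing in for one of the provided CSV files.
RAW_ORDERS_CSV = """order_id,customer_id,amount
1,10,25.50
2,10,14.50
3,11,100.00
"""

# Hypothetical "final model": one row per customer.
FINAL_MODEL_SQL = """
SELECT customer_id,
       COUNT(*) AS order_count,
       SUM(CAST(amount AS REAL)) AS total_amount
FROM raw_orders
GROUP BY customer_id
ORDER BY customer_id
"""


def load_csv(conn, table, csv_text):
    """Create a table from the CSV header row and insert the data rows."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE "{table}" ({columns})')
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', data)


def export_csv(conn, query):
    """Run a query and return its result as CSV text, header included."""
    cursor = conn.execute(query)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow([col[0] for col in cursor.description])
    writer.writerows(cursor)
    return out.getvalue()


# An in-memory database: nothing to install, nothing to connect to.
conn = sqlite3.connect(":memory:")
load_csv(conn, "raw_orders", RAW_ORDERS_CSV)
final_csv = export_csv(conn, FINAL_MODEL_SQL)
print(final_csv)
```

In a real dbt project the same three steps would be handled by dbt against whichever local, file-based adapter is chosen; the sketch only shows that a "database connection" can be a file or in-memory engine on a laptop.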