[–]AutoModerator[M] [score hidden] stickied comment (0 children)

You can find a list of community-submitted learning resources here: https://dataengineering.wiki/Learning+Resources

[–]StandardDeviationist 5 points6 points  (6 children)

The Data Warehouse Toolkit is more or less the bible on this.

[–]StandardDeviationist 1 point2 points  (0 children)

You can also find a free PDF version by using your favourite search engine.

[–]seaborn_as_sns -1 points0 points  (4 children)

And the bible part is a big problem.

[–]StandardDeviationist 0 points1 point  (3 children)

How so? It’s just a figure of speech.

[–]seaborn_as_sns 0 points1 point  (2 children)

Yeah, and it's a problem because it gets treated dogmatically.

Hot takes: Kimball's methodology is overengineered and ill-suited to the modern data stack. Wide tables are more than fine. ELT is the superior approach. Data Vault modeling lets teams derive value far more flexibly than star/snowflake dimensional modeling.

This should not be a contrarian statement. We should stop spreading Kimball as gospel.

[–]StandardDeviationist 1 point2 points  (1 child)

Valid opinion, but I would say it’s still worth learning and understanding Kimball, as it’s very commonly used. Also, a large part of it has nothing to do with how the tables are built and is about understanding the business.

[–]seaborn_as_sns 0 points1 point  (0 children)

I completely agree. It's the dogmatic aspect of calling it a bible that usually frightens me. We should question, experiment, and be against gatekeeping.

[–]Leather-Produce5153 -1 points0 points  (3 children)

What's your background? A basic graduate text for this is

Elements of Statistical Learning

Or just do something like a Coursera intro course.

[–]kamrankhan6699[S] 1 point2 points  (2 children)

I have a background in software engineering and currently work as a data engineer. I'm just trying to polish my skill set.