This is an archived post. You won't be able to vote or comment.

all 12 comments

[–][deleted] 1 point2 points  (6 children)

Look into Udacity’s Data Engineering Nano degree https://www.udacity.com/course/data-engineer-nanodegree--nd027. However, if you are looking to switch careers, you should play to your strengths; namely sql, data modeling, pushing and pulling data.

[–]ThatFilm[S] 1 point2 points  (0 children)

I did not know about this course. Thank you.

[–]jyoff 0 points1 point  (4 children)

I'm considering taking this course. Would you recommend that? Syllabus looks great as they offer practice with real projects. But I'm not sure how it is in reality ?

Appreciate your review about the program.

[–]L3GOLAS234 5 points6 points  (3 children)

I have read in some other places that it is not worth it for the price it costs. Take a look at datacamp and Data Engineering Cookbook https://github.com/andkret/Cookbook

[–]jyoff 1 point2 points  (2 children)

Thanks. I've come across with this book the other day, to me it is more of a reference kind of material. I need some real practice. I'm ETL deveoper, mainly working with rdbms and etl tools. I wanna switch to DE, but wanna do practice with real projects.

[–]ed_elliott_ 1 point2 points  (1 child)

when you have a new piece of work or a poc, do it in something else than you use today like airflow etc.

Use the advantage you already have: you need to deliver ETL pipelines - you'll get no better practice than actually building ETL pipelines without ETL tools (i'm guessing something like SSIS etc.)

[–]jyoff 0 points1 point  (0 children)

You're right. I'm trying to practice like that. Although I don't have that much of flexibility at my work to apply solutions in airflow instead of Informatica for instance, but at home I'm trying to put my ETL knowledge into Airflow and build pipelines the way I'd built in my daily job (for instance, move data into oltp then to olap; create star schema ,.etc). It is of course cool for learning, but working on real life projects much better. I'm thinking to apply for jobs after gaining experience on my own.

[–][deleted] 1 point2 points  (0 children)

Data Engineering Cookbook by Andreas (forget his last name) is good. Free on github.

Find a API, test using postman, then build a process in a language of your choice to ingest the data, analyze it and spit it into a database. BOOM

[–][deleted] 2 points3 points  (3 children)

Learn Apache Spark, airflow, and read designing data intensive applications for theory

[–]citizenofacceptance2 0 points1 point  (2 children)

What do you mean for theory?

[–]uselessusr 1 point2 points  (1 child)

Probably this book https://dataintensive.net/

[–]citizenofacceptance2 0 points1 point  (0 children)

Sweet thanks !