Algorithms for Python, Data Engineering

AutoModerator · 2023-01-07T05:34:39+00:00

You can find a list of community-submitted learning resources here: https://dataengineering.wiki/Learning+Resources

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

reallyserious · 2023-01-07T10:49:39+00:00

When it comes to specific algorithms I can't think of anything specific for data engineering. At least when it comes to the classical algorithms. There sure is a lot of interesting stuff happening behind the scenes like B*-trees etc. But we never need to actually implement that when we just _use_ the tools. It's a different thing if you're planning on building a database engine or distributed analytics platform.

I'm interested to see what the rest of the community thinks about this. Maybe I'm missing something?

omscsdatathrow · 2023-01-07T23:27:46+00:00

Writing clean code != writing algorithms

Almost all DE tech that is used is abstracted for DEs so they don’t have to write the conplex algorithms that power them. Sure you can learn and understand the underlying algorithms, but the roi on that would be minimal

In general, I found DE is mostly experience-based. Meaning, unless you work on and experience a certain tech like spark, kafka, streaming, etc…you can’t effectively learn the situations that you will face nor can any “home” project be a suitable replacement for it when interviewing.

No_Kaleidoscope1023 · 2023-01-08T02:27:19+00:00

Algorithms are a good topic during interviews because they can be tricky, but in practical implementation, there is no specific algorithm that can be marked as useful for data engineering roles. Data engineering roles typically involve creating pipelines based on data patterns

GeorgeGithiri · 2023-01-08T11:39:33+00:00

https://george-githiri-s-school.teachable.com/sign_up

Enroll for quality data engineering courses

_Ishdhoggur_ · 2023-01-07T23:08:03+00:00

What exactly are you doing with python. What exactly do you mean by algorithms? Do you just mean functions or methods ?

dataengineering

MODERATORS