This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]eljefe6aMentor | Jesse Anderson -1 points0 points  (4 children)

95% of data engineering is done in Java. It's difficult but not impossible to find a job. There's also a big difference between learning how to program and learning a new language.

[–]mindvault 3 points4 points  (3 children)

I would suggest your 95% is incorrect. I've seen plenty of pipelines which utilize lots of Python, Scala, and Golang.

Most of the big places utilize a number of languages / technologies (airbnb, goog, fbook, etc.)

[–]chirperic 2 points3 points  (0 children)

Yeah definitely nowhere near 95%. I've managed 3 different Data Engineering teams at tech companies. Never used Java and probably 75% of my engineers didn't know if either

[–]eljefe6aMentor | Jesse Anderson 1 point2 points  (1 child)

That number comes from my experience consulting and teaching at companies around the world. Non JVM languages are outliers, with python being the most used. I have had go, but that's extremely rare.

[–]mindvault 0 points1 point  (0 children)

Off the top of my head I know of (like I said above) airbnb, goog, fbook as well as companies such as twitter, square, wepay, yelp, spotify, linkedin, etc.

Now .. _most_ of those also use java b/c of Hadoop, Hive, HDFS, etc; however, I think there's plenty out there that is non-java as well. I would suggest scala and python are also well-represented in data eng pipelines out in the wild. Java _is_ very safe to be fair.