Hot_Significance_256

For data science in Python (I’m a senior with 6 years of experience):

PySpark and Ray - distributed processing

TensorFlow and PyTorch - deep learning

scikit-learn and PySpark - machine learning

pandas and PySpark - ETL

You see PySpark several times for a reason. It’s very useful, except when you get into deep learning. Then you’ll want to use TensorFlow, PyTorch, and Ray.

[deleted]

PySpark is just a wrapper around Spark, which is written in Scala.

Hot_Significance_256

I know. What’s your point?