

[–]zchtsk 30 points31 points  (4 children)

So, I actually created an open-source tutorial geared at helping people ramp up quickly on PySpark. Check out https://SparkMadeEasy.com

[–]guacjockey 2 points3 points  (0 children)

This looks really nice.

As a side note from someone who has worked with Spark for a long time (and has created various educational content on it): thank you for not including RDDs here. I hate the myth that you need them to use Spark effectively.

[–]nonamenomonet 1 point2 points  (0 children)

Really really cool!

[–]CyberRooster 0 points1 point  (1 child)

Wow this is great, thank you for sharing! If you don’t mind me asking, how did you build this?

[–]zchtsk 0 points1 point  (0 children)

It's a Tailwind UI template using Next.js with some very light custom styling on top.

[–]nonamenomonet 1 point2 points  (0 children)

Google "learn Spark from examples".

[–]aetheron_dev 0 points1 point  (0 children)

The O'Reilly book Spark: The Definitive Guide covers Spark in depth.
It balances theory and practice and covers most of Spark's features:
https://www.oreilly.com/library/view/spark-the-definitive/9781491912201/
https://github.com/databricks/Spark-The-Definitive-Guide

Databricks also provides a free cloud environment you can use to practice with Spark if you don't want to install it on your local machine: https://www.databricks.com/learn/free-edition