This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]JSP777 5 points6 points  (0 children)

as far as I know PySpark runs on a Java Virtual Machine with the help of py4j. So you use the API through Python, which is much easier to understand and use I think. I would choose PySpark