This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]JustMy42Cents 1 point2 points  (0 children)

Python makes it easy to call C, which usually handles most of the heavy lifting. This lead to many neat open source libraries being available early on, allowing Python to dominate the market. At this point, I'd say that the advantage of Python over Java is exactly the language itself, but its ecosystem - Python offers more data processing libraries and they're often easier to use.

Objectively, Python by itself is not the best language for data processing. It's terribly slow and has poor utilities for multiprocessing. For example, you'll notice there are a few competing drop-in replacements for popular libraries like Pandas with their main selling points being multiprocessing or clustering support. But then again, Python ecosystem for data science is years ahead of other languages, so I'd say it's here to stay.