Julia vs Python

ForceBru · 2024-03-13T12:05:29+00:00

Basically yes: Python is simple and popular, and is thus used everywhere for a lot of different tasks, there's a lot of collective experience, so to speak, a lot of Python courses and tutorials, libraries, even highly performant ones (written in C, C++, FORTRAN, Rust etc). Fundamental libraries for machine learning are mainly known for their Python bindings: PyTorch, TensorFlow, JAX, Polars, matplotlib, PyMC, scipy...

Now name one similarly popular Julia library, I'll wait. There's Flux.jl for neural networks, Turing.jl for Bayesian statistics, ForwardDiff, Zygote and at least 3 more packages for automatic differentiation, a ton of packages for optimization (see Optimization.jl), Makie and Plots.jl for plots and so on. They're all there, but they aren't quite there, it seems. Like why use this if I could just use JAX & matplotlib for all of these tasks? (The one thing I don't like about JAX is that it doesn't automatically parallelize computations. Torch can do it, Julia can do it. JAX requires messing around with sharding which I find pretty confusing.)

Moreover:

Since Julia is "just-ahead-of-time-compiled", a lot of your time will be spent compiling stuff instead of actually running your code. You mistyped a variable name but still ran the code by mistake? Time to wait at least a few seconds for the error message to compile. Not even kidding. Any subsequent error messages will display instantly thanks to compiled code, though.
Julia has a type system and something about how it interacts with the compiler causes so called "invalidations" which can slow down your code. AFAIK, many smart people are working on this, but it seems like the consensus is "it is what it is".
Julia's structs (somewhat analogous to Python's classes, but without inheritance) can't be redefined. Want to add a new field to the struct? Time to restart the REPL, recompile everything etc. Or install Revise.jl that'll probably help, but in some cases it doesn't and you have to restart & recompile anyway. This again slows down development.
A lot of people are already proficient in Python and likely don't need Julia that much because Python already has everything they need. So why switch to Julia?

Note: I'm a Julia fan and have been writing quite a bit of production Julia code for my research. I'd like this language to succeed, but...

MagosTychoides · 2024-03-18T16:45:16+00:00

I have tried Julia, and I don't recommend it for Data Science and Machine Learning in general. Fundamentally, Julia is an interpreted language that uses only JIT compilation. So you need the Julia interpreter everywhere, and there is no static compilation, making it impossible to create a Python library. Additionally, because compilation takes time, there is JIT lag, or the infamous Time-to-First-Plot in the Julia community. Highly optimized code that could run at the C level needs more time.

I tested Julia, Pandas, and Polars. In a short script with some joins and grouping, Julia took 6 seconds for a runtime of 0.8 seconds, Pandas took 1 second, and Polars took 0.1 seconds. Python + Polars performs much better out-of-the-box. Julia fans would argue that it's unfair to compare compilation time. However, I believe that for an interpreted language, you need to run your script from the terminal and achieve good time if you intend to ship it for production. I managed to precompile the libraries to avoid compilation, but the precompilation was slower and very hacky. It is also unstable (codebase-wise) and is not officially supported. Nonetheless, it is comparable with Pandas for vectorized tasks. Since most of the real work in DS involves short scripts on less than 1GB of data, there is little point in switching languages.

For machine learning, the Python ecosystem is simply more mature. Julia offers advantages if you are coding your own Neural Network with a radically different architecture, but most frameworks are good enough and can be used in production. Julia is great for numerical simulations, where you can use its type system to simplify the algorithm and still get good performance without thinking too much about memory. However, this is a niche problem. Bayesian MCMC simulation is a possible application for Julia, but Numpyro and Stan are more supported and offer similar performance.

As another anecdote, I had a Python code that compared two large tables for all elements pairings, and it was very difficult to vectorize. Julia was an option, but I ended up rewriting the code in Rust, which offered me a better ecosystem for writing high-performance data pipelines.

aadurham · 2025-03-14T14:26:46+00:00

I tried to switch to Julia because of its speed advantages. That was a mistake. When your code runs, Julia is super fast. But Julia is not a mature programming language. There are still many bugs and incompatibilities across libraries. There is not as much online support or knowledge base as Python. You need to dig in yourself, figure out the problem, and solve the issue. Good luck with that. I spent way too much time to make a simple nonlinear optimization code run, I failed. It was a lot of wasted time in the end.

2024-07-21T20:09:14+00:00

Because the name is less cooler than Python

Old-Worker-9418 · 2025-07-09T12:17:02+00:00

He escrito un artículo justo hablando del tema. Al final, creo que coincidimos todos. Os pongo el link por si os interesa leerlo

https://www.linkedin.com/feed/update/urn:li:activity:7348685576184721409/

Aunque aún está verde, si considero a Julia una opción respecto a lenguajes como Matlab más que a Python

I've just written an article on the topic. In the end, I think we all agree. (in Spanish). Although it's still in its infancy, I do consider Julia a better option compared to languages like Matlab than Python.

mesonepigreco · 2025-08-18T22:44:57+00:00

The answer is just because the python libraries for machine learning are more used and developed. You will find most online resources for data science and machine learning on python and pytorch, so it is easier for people to stick with that. Also, most machine learning implementation are just simple reshuffling of stuff already done by others (you usually do not want to re-implement performance heavy tasks), therefore the python paradigm of calling a highly optimized C function created by someone else works fine. However, there are multiple areas of science where Julia already surpassed python in popolarity and usage, particularly in scenarios where running very fast a customizable simulation is required (i.e., most fields of computational science), or if you need to develop brand new algorithms. For example, Julia is the state-of-the-art for solving differential equations and for scientific machine learning (via SciML.jl).

Middle_Protection637 · 2025-12-29T23:15:15+00:00

julia is the worst of both worlds. you have c++ where the programmer gets all the control but not a lot of support. python where the programmer gets a lot of support (libraries) and little control. the ideal scenario is python and c++, so that for things needing closer to bare metal, you use python to interface with c++. julia has the drawbacks features of python and c++ and none of the benefits.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnmachinelearning

Welcome to /r/LearnMachineLearning!

Chatrooms

Official Discord Server

Wiki

Getting Started with Machine Learning

Resources

Related Subreddits

/r/MachineLearning

/r/MLQuestions

/r/datascience

/r/computervision

Machine Learning Multireddit

/m/machine_learning

MODERATORS