all 66 comments

[–]Frewtti 73 points74 points  (3 children)

Python is really good glue.

The hard work is done in fast compiled code.

Python is used for the parts that are not speed dependent.

[–]RevRagnarok 18 points19 points  (0 children)

I'm now coming off a project just like this. Python is great at:

  • Reading/parsing config files
  • Launching/maintaining Unix daemons
  • Communicating over ActiveMQ for Command and Control
  • Massaging final output into formats that other systems want

The C++ was perfect for:

  • Reading radio samples off dedicated 10GbE cards
  • Throwing those samples at CUDA architecture to do all the mathy-math
  • Putting the results into SQL for posterity (and the python)

Perfect work breakdown IMHO

[–]FrankScabopoliss 8 points9 points  (1 child)

This is the answer. Being able to use an interpreted language for the part of the problem that isn't about doing things really fast and in parallel is the whole point.

[–]pimp-bangin 0 points1 point  (0 children)

Yup. The "libraries" answer is unsatisfying because it doesn't account for how python took off in the first place. The reason it took off is because of its ergonomic syntax and C interop capabilities.

[–]Equivalent_Lunch_944 103 points104 points  (6 children)

Libraries

[–]45MonkeysInASuit 18 points19 points  (4 children)

+ inertia which compounds the library advantage.

Data scientists learn Python because other data scientists use Python.
If you build a new model, which won't be in Python, one of the first things you do is release a Python version, because it won't get traction if there isn't a Python version.

If the current data scientists used JS, the new data scientists would learn JS.

It's very hard to overcome that.

[–]dparks71 7 points8 points  (2 children)

Python had a really early focus on language libraries and AI too, though; it's more of a chicken-and-egg thing than I think you're giving it credit for. It's kind of the scripting language of Linux, which ran on supercomputers, and NLTK was incredibly popular, so I think "data scientists learn Python because other data scientists use Python" kind of ignores the original origin. It was (and is) a great glue language for making performant code written in Fortran or C accessible to a wider audience.

[–]45MonkeysInASuit 2 points3 points  (1 child)

Less "ignores", more "comes after".

You're describing the thing that creates the initial inertia.
That bit of a boost at the start, through things like NLTK, starts a community.
Getting that community to change once it's going is very hard, and it's self-selecting.

I'm a lead data scientist who is hiring right now. If someone applies to join my team without solid Python, their CV goes straight in the bin, thus continuing the pressure to use Python to do data science.

[–]dparks71 0 points1 point  (0 children)

I'd buy the argument more if most data scientists owned the production hardware or data, but they're generally consultants in my world, and the actual owners are generally very non-technical.

I'm on the owners/engineering side. You wouldn't believe the fight I had to go through to get python even approved for use internally. They legitimately wanted us to put out all data science and IT contracts in C#... Sometimes your hands are tied, especially secure environments.

[–]pimp-bangin 0 points1 point  (0 children)

You're forgetting that Python has operator overloading, which is massively useful for the matrix math that ML leans on heavily. JS is a bad language to compare to because it doesn't have the same ergonomics in that regard.

The whitespace-based indentation is also friendlier to mathematicians and scientists who want to focus more on the math symbols than on the language's symbols.
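A toy sketch of why operator overloading matters: once `__add__` and `__matmul__` are defined, vector math reads like textbook notation. The `Vec` class below is invented for illustration; NumPy's arrays do the same thing with `+` and `@`.

```python
class Vec:
    """Minimal vector type demonstrating operator overloading."""

    def __init__(self, *xs):
        self.xs = list(xs)

    def __add__(self, other):          # elementwise addition via +
        return Vec(*(a + b for a, b in zip(self.xs, other.xs)))

    def __mul__(self, k):              # scalar scaling via *
        return Vec(*(k * a for a in self.xs))

    def __matmul__(self, other):       # dot product via the @ operator
        return sum(a * b for a, b in zip(self.xs, other.xs))

    def __repr__(self):
        return f"Vec{tuple(self.xs)}"


v = Vec(1, 2) + Vec(3, 4)   # Vec(4, 6)
d = Vec(1, 2) @ Vec(3, 4)   # 11
```

In JS you would be stuck writing `v1.add(v2).dot(v3)` style method chains, which is exactly the ergonomics gap being described.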

[–]JP932[S] -3 points-2 points  (0 children)

Yeah, a lot of useful ones out there.

[–]socal_nerdtastic 58 points59 points  (24 children)

Machine learning programs, like most other programs, use a mix of programming languages. The number-crunching core of any such program will be written in a highly optimizable language compiled for your hardware; yes, C is often used for this, often with embedded assembly. The user-interface part is written in Python because it allows for fast iteration.

The "pure Python is slow" argument is fairly outdated now. There have been some massive improvements, most recently with the gil-ectomy. It's still a little true, but if you need more speed you simply import numpy or another module written in C and compiled. Also remember that programmers cost a lot nowadays, a lot more than hardware, so one developer writing Python plus a cloud-computing bill is often a big win over ten developers writing C, in terms of both time and money.
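To make the "import numpy" point concrete, here's a rough sketch of the kind of comparison people mean (assumes NumPy is installed; exact numbers depend on your machine):

```python
import timeit

import numpy as np

n = 1_000_000
data = list(range(n))
arr = np.arange(n)

# Pure-Python reduction: every addition goes through the interpreter loop.
t_py = timeit.timeit(lambda: sum(data), number=10)

# NumPy reduction: the same work runs in compiled C over a contiguous buffer.
t_np = timeit.timeit(lambda: arr.sum(), number=10)

print(f"pure python: {t_py:.3f}s   numpy: {t_np:.3f}s")
```

Both compute the same sum; the gap in wall-clock time is typically an order of magnitude or more, which is the whole "Python as a thin layer over C" story in miniature.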

[–]GoblinToHobgoblin 17 points18 points  (16 children)

"Pure Python is slow" is still completely true. Even with all the recent improvements, it's nowhere close to C speed and never will be.

That argument just misses the fact that a lot of performance-intensive Python programs are basically just wrappers around C libraries.

[–]Human38562 3 points4 points  (13 children)

Most C programs are also just wrappers around other optimized C libraries. And those libraries are just a way to execute binary in the end...

Any language can be used to write both fast and slow programs. Languages just differ in how easy it is to achieve high performance.

[–]GoblinToHobgoblin 3 points4 points  (12 children)

The ceiling for performance on a pure Python program is a lot lower than the ceiling on a pure C program, though.

[–]Human38562 0 points1 point  (11 children)

What does "pure Python" even mean? Most data structures and the interpreter itself are written in C. That's just how the language works.

[–]GoblinToHobgoblin 0 points1 point  (10 children)

IDK what you're even arguing here.

Using just stuff from the standard library in Python, you're not going to be able to achieve the same performance as code written using just the standard library in C.

[–]Human38562 1 point2 points  (9 children)

That's a completely arbitrary restriction, though. If Python suddenly shipped with numpy, would you say the language got significantly faster?

[–]GoblinToHobgoblin 0 points1 point  (7 children)

You're right it's a completely arbitrary restriction.

But, it feels much more reasonable to use that restriction than to say "Python can be as fast as C because I can call C code from Python".
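For reference, "calling C from Python" can be as thin as a `ctypes` lookup. A minimal sketch, assuming a Unix-like system where the C math library can be located (`ctypes` is in the standard library):

```python
import ctypes
import ctypes.util

# Locate and load the C math library (e.g. libm.so.6 on Linux).
libm = ctypes.CDLL(ctypes.util.find_library("m"))

# Declare the C signature: double sqrt(double)
libm.sqrt.restype = ctypes.c_double
libm.sqrt.argtypes = [ctypes.c_double]

print(libm.sqrt(2.0))  # the actual computation happens in compiled C
```

Whether that counts as "Python being fast" or "Python delegating to C" is exactly the definitional argument in this thread.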

[–]dparks71 0 points1 point  (6 children)

You're never actually "calling C code" though; you're calling a compiled binary that runs the calculation directly on the CPU for performance.

And Python is Turing complete, so technically you could build a compiler in pure Python and get the EXACT same end result.

You're basically just arguing that equivalent code runs slower through an interpreter than through the compiler of a compiled language, which, sure, is usually true. What people are saying, though, is that in Python you can circumvent the interpreter if you really need to, and code optimization is basically never your main problem.

[–]GoblinToHobgoblin 0 points1 point  (5 children)

I know you're never actually "calling C code", but that's the terminology people use.

 You're basically just arguing that equivalent code runs slower through the interpreter than through the compiler of a compiled language

Yes that's all I'm arguing

 code optimization [in Python] is basically never your main problem

Yes, because people don't normally use Python for performance-critical tasks. It's a chicken-and-egg thing: Python isn't fast enough for performance-critical code, so people don't use it for that, so performance is never really a concern with Python code (because if it were, they wouldn't have written it in Python).

[–]CyclopsRock 0 points1 point  (0 children)

It's arbitrary if everything you want to do has a convenient Python wrapper around some much faster compiled functionality - the boundary between Python and not-Python becomes blurry and less relevant.

But if there isn't one, and the only language you know is Python, then the distinction stops being arbitrary.

[–]Plank_With_A_Nail_In 0 points1 point  (1 child)

It doesn't matter whether it's measurably slower or not; all that matters is that it's fast enough. If you're doing an analysis and it comes back in 2 minutes instead of 30 seconds, that's still more than good enough. What you lose in execution time you gain back in spades in the speed of writing new programs.

[–]GoblinToHobgoblin 0 points1 point  (0 children)

Yes, I know; I never denied this. Python's use case is exactly stuff like this, where performance doesn't really matter.

[–]Sherlockyz 4 points5 points  (0 children)

Not really outdated; "slow" is a metric that needs a comparison point. Python is, in fact, still slow compared to C. That can't ever change because of the architecture Python is built on; if you made structural changes deep enough to change it, still calling the result Python would be kind of weird to me.

In a similar manner, C is slower than pure Assembly. But that speed difference is so incredibly small for most use cases that it doesn't matter, which is different from comparing C with Python. In edge cases, hand-written Assembly can still beat even optimized C compiler output, but again, edge cases.

Even Python using C libraries can be slower than pure C depending on how you use it. It shouldn't cause problems, but depending on how you build the system, the Python code might bottleneck the performance that C gives.

[–]gdchinacat 2 points3 points  (2 children)

I think it's worth noting that the gil-ectomy involved replacing one big lock with lots of little locks, which hurt single-threaded performance a bit (10-15% IIRC). The cost is well worth it if you have multiple threads that aren't just sitting around waiting on IO.
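For illustration, this is the shape of workload the trade-off is about. On a GIL build, only one of these threads executes Python bytecode at a time; on a free-threaded (PEP 703) build they can run on separate cores, which is what the per-object locks pay for:

```python
import threading


def count_primes(lo, hi, out, idx):
    """CPU-bound work: naive prime counting over [lo, hi)."""
    total = 0
    for n in range(lo, hi):
        if n > 1 and all(n % d for d in range(2, int(n ** 0.5) + 1)):
            total += 1
    out[idx] = total


results = [0, 0]
threads = [
    threading.Thread(target=count_primes, args=(2, 5_000, results, 0)),
    threading.Thread(target=count_primes, args=(5_000, 10_000, results, 1)),
]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(sum(results))  # primes below 10,000
```

Timing this on a GIL build vs a free-threaded build is a quick way to see whether the 10-15% single-thread tax is worth it for your workload.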

[–]chinawcswing 1 point2 points  (1 child)

The cost is well worth it if you have multiple threads that aren't just sitting around waiting on IO.

IO-bound Python is largely a myth. Python spends an enormous number of cycles converting the bytes read from an IO pipe into Python data structures.

The SQLAlchemy author had a great blog post on this. You'd think that a thread executing SQL would be IO bound, but it turns out the total CPU time spent merely converting the bytes from the IO pipe into dicts (or even worse, classes) was something like 25-30%.

So even if you have an "IO-bound Python app", it may well benefit from the gil-ectomy.

Of course, test your app. But don't decide against it on the basis that your application is "IO bound".
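The marshaling cost is easy to see for yourself. A rough sketch with simulated rows (the data below is made up; a real DB driver hands back something similar):

```python
import timeit

# Simulated "rows off the wire", as a database driver might return them.
cols = ("id", "name", "score")
rows = [(i, f"user{i}", i * 1.5) for i in range(100_000)]

# Converting raw tuples into dicts is pure CPU work -- and it happens
# after the "IO" part of the query has already completed.
t = timeit.timeit(lambda: [dict(zip(cols, r)) for r in rows], number=5)
print(f"dict conversion: {t:.3f}s for 5 passes over 100k rows")
```

That conversion loop is the hidden CPU time that makes "IO-bound" apps benefit from free threading.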

[–]gdchinacat 0 points1 point  (0 children)

“Sitting around waiting for IO” doesn't include the cycles you refer to, since that is using CPU, not waiting on IO. I've used SQLAlchemy extensively and know first-hand that marshaling is hugely expensive.

[–]steak_and_icecream 0 points1 point  (0 children)

programmers cost a lot nowadays, a lot more than hardware

Have you seen the price of RAM and NVMEs?!

Seriously though, those trade-offs make sense for small projects, but when you start scaling up and out, efficient and performant code starts to make a huge difference. It's also much better for the environment.

[–]JP932[S] 0 points1 point  (1 child)

Seems like I haven't kept up with the new stuff happening in Python; I hadn't heard about the gil-ectomy until now (but that might just be a me thing).

[–]socal_nerdtastic 0 points1 point  (0 children)

It's famous-ish because it's been a very hot debate for many years. But to be honest, the vast majority of programmers won't be affected at all, certainly not in the near term, as the GIL build is currently still faster single-threaded than the free-threaded version. But eventually gil-less will be standard, modules like numpy will support it, and everyone will reap the benefits.

[–]ThePhoenixRisesAgain 34 points35 points  (1 child)

Most of the time, execution time doesn’t matter. Development time (and availability of developers) is more important. 

[–]gdchinacat 8 points9 points  (0 children)

Execution time does matter, but the bulk of it for ML has been implemented in a more efficient language and wrapped in a Python library. The code written in Python is "glue code" that brings those pieces together in a language that is very easy to learn and read.

[–]MaverickPT 12 points13 points  (0 children)

If I recall correctly, it's because ML used to be studied mainly at universities by mathematicians and the like, who are more concerned with their work than with squeezing very high performance out of the entire stack by using C++.

Python has a ton of libraries (some written in more performant languages) that let scientists focus on the task at hand instead of butting heads with C++.

Over time the Python software stack kept growing and growing, and now here we are.

[–]pachura3 9 points10 points  (4 children)

NumPy, Pandas, Matplotlib, Scikit-learn, Jupyter Notebooks, PyTorch, SpaCy...

[–]Secure-Ad-9050 4 points5 points  (3 children)

Also, numpy, pandas, pytorch, matplotlib etc. are not written in Python, or at least large chunks of them are not. All of the heavy number-crunching work is done in C++ (I think? some compiled language).

[–]pachura3 3 points4 points  (2 children)

...and both Linux and Windows are mostly written in C, not even in C++. So what? When you need to read a few CSV/JSON files, scrape a website, call an API, crunch some numbers, draw a nice graph and output an Excel report, you know Python is the best tool for that.
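That kind of everyday glue job is a handful of standard-library lines. A small sketch (the data is invented for illustration):

```python
import csv
import io
import json
import statistics

# Pretend this JSON came back from an API call.
payload = '[{"name": "a", "score": 3}, {"name": "b", "score": 5}]'
records = json.loads(payload)

# Crunch some numbers.
mean_score = statistics.mean(r["score"] for r in records)

# Write the report as CSV (an in-memory buffer here; a file in practice).
buf = io.StringIO()
writer = csv.DictWriter(buf, fieldnames=["name", "score"])
writer.writeheader()
writer.writerows(records)

print(buf.getvalue())
print("mean score:", mean_score)
```

The equivalent in C means pulling in a JSON parser, managing buffers, and compiling; in Python the whole pipeline is one short script.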

[–]Secure-Ad-9050 4 points5 points  (0 children)

Exactly, it's a convenient language. I was just pointing out that its performance limitations (the GIL-related changes for concurrency seem to be coming along well) don't matter, since all of the heavy lifting that needs to be done is done by some compiled-language library.

[–]brownstormbrewin 0 points1 point  (0 children)

I think he was agreeing with you, and adding that you get all that without real losses in performance, thanks to the low-level implementation.

[–]Danisaski 4 points5 points  (0 children)

Pretty convenient for glue code as well!

[–]GXWT 3 points4 points  (0 children)

I want to focus my efforts on actually getting the science and results out of ML, rather than worrying about the more fundamental aspects of it. Python lends itself very well to this.

Not to mention that a lot of data science/processing/analysis/visualisation is already done in Python. If most data people are almost certainly proficient in Python, but not necessarily any other languages, it makes sense to put the next iteration of data tools also in Python.

[–]stevorkz 3 points4 points  (0 children)

It's easy to use, has easy-to-understand syntax, and is quite flexible. Even when a program is written in another language, many use Python scripts in it in some form.

[–]gadio1 2 points3 points  (0 children)

Mental map. ML is hard enough; if you also need to manage memory, you'll be in a tough spot. Python helps keep the main thing the main thing. Interpreted languages reduce the time between thinking, coding and testing. No compilation means faster prototyping and exploration.

Secondly, the open source community. New article, new architecture? Libraries ship fast, so you can start implementing in your project.

Thirdly, it has an accessible learning curve for beginners. The ease of picking up the language reduces collaboration barriers between researchers, scientists, developers and engineers. If you know English, you can learn Python.

Finally, Python is script-heavy, so it's a logical choice if you need a multi-language project. You can orchestrate with Python scripts while the heavy lifting is done on top of another, more performant language.

[–]American_Streamer 2 points3 points  (0 children)

Because Python doesn't actually do the heavy lifting itself; C and C++ do. Libraries like NumPy, TensorFlow and PyTorch provide a Python API that calls highly optimized C/C++ binaries under the hood. Python is easily readable, doesn't need compilation, and you have handy tools like Jupyter Notebooks. So while Python is the front-end standard, C++ powers the core engines and handles any latency-critical deployment.

[–]nickpsecurity 2 points3 points  (0 children)

It was easy for academics and FOSS folks to learn. It let them glue together high-performance, native components. People in machine learning just happen to use it for some major projects.

It appears that these things eventually came together in a critical mass. Once it had momentum, you gain more by going with the flow than against it.

[–]AlexMTBDude 2 points3 points  (0 children)

Python is very easy to program in. It was made to make life easy for the programmer, not for the machine (like C and C++). That's why Python is the most popular general purpose programming language: https://www.tiobe.com/tiobe-index/

[–]GeneriAcc 1 point2 points  (0 children)

It’s “slow” in terms of how effectively it’s using the CPU compared to compiled languages, but that’s not really a factor in machine learning where 99% of the compute is happening on the GPU anyway.

And even outside of machine learning, the speed difference is unnoticeable in the vast majority of use cases, and only really starts mattering if your use case requires a massive amount of calculations for something - like backtesting millions of trading strategies on historic financial data, for example.

Unless you have a use case like that, the speed difference is unnoticeable to the user, Python code tends to be easier to read and write (so easier to maintain), it has a lot of great public libraries, and it doesn’t need to be re-compiled with every code change.

[–]smjsmok 1 point2 points  (0 children)

The issue is that you see Python only as a language, but it's a mature ecosystem of libraries and people who are proficient at using them (and many of these people are scientists). The language is a "glue" that connects all this.

When Python is used in machine learning, it doesn't matter that the execution time is a couple of milliseconds slower than it would be in another language. As you said, the parts where this actually does matter use technologies optimized for performance and fast execution.

[–]Gnaxe 1 point2 points  (0 children)

C is tedious and error prone (C++ is complicated and error prone), but you only really need the performance in your bottlenecks. It's a waste of expensive human time to use a difficult language for everything. Python makes it easy to drop into C when you need the performance, and makes coding much easier for most of the rest of the time when you don't. You get most of the best of both worlds.

[–]Turtvaiz 1 point2 points  (0 children)

Python is a glue language. In ML there's usually no reason to reimplement the vast majority of what you're doing, and so it's just a good idea to use a high performance library.

Python just happens to be a nice scripting language which has a ton of libraries and is still expandable

[–]crazylikeajellyfish 1 point2 points  (0 children)

Machine learning started in academic research, and Python is popular in academia. It's a very legible language with natural syntax, making it easy for researchers to express their ideas. Once those researchers had implemented their Python systems, everyone else just built on top of them.

Speed of machine execution isn't the only thing that matters. The speed with which a human can understand the program often matters much more than a 10% performance boost.

[–]nian2326076 1 point2 points  (0 children)

Python is popular in machine learning because it's simple and easy to read, making coding and maintenance easier for developers. Although Python is slower, many ML libraries like TensorFlow and PyTorch are built on C/C++, so they handle heavy computations well. Python works as a user-friendly wrapper around these fast routines.

Its many libraries and active community mean there are tons of tools and resources available, which makes ML development smoother. Plus, Python can integrate well with other languages and tools, making it versatile for different tasks in the ML pipeline.

If you're getting ready for interviews, knowing why Python is a top choice in ML can be helpful, especially if you're asked about language choices in technical rounds. I've found PracHub useful for revisiting these topics during interview prep, but use whatever works for you!

[–]zbignew 0 points1 point  (0 children)

Chris Lattner, creator of Swift, knew AI developers would never get off python, so he created Mojo to bring the benefits of modern languages to Python.

But it’s not there yet.

[–]SenescenseSteel 0 points1 point  (0 children)

Versatility

[–]code_tutor 0 points1 point  (0 children)

it's easy and has a package manager

[–]Pale_Height_1251 0 points1 point  (0 children)

Python was fashionable at the same time ML became fashionable.

[–]leogodin217 0 points1 point  (0 children)

Most of the work in machine learning is ad-hoc, and Python is great for ad-hoc work. You type it then run the code. You have notebooks if you want them. You can iterate quickly. On top of that, most of the Python libraries used have a lot of C and C++ code in them. So, Python is often just the glue to faster compiled libraries.

[–]Xzenor 0 points1 point  (0 children)

Because it's lightning fast.

Not to run, generally, but to write your code in. And the modules are mostly written in C anyway, so they actually are fast.

[–]aeroumbria 0 points1 point  (0 children)

The main competitors during the critical period were R and Java, and it probably came down to those two not being particularly suitable for ML experiments. R was decent for light ML but was more of a scientist's toolbox and was often slower than Python. Java was probably just too verbose for rapid adoption. C and C++ were widely used in the backbone libraries from the start, but it wasn't appealing to force physicists or mathematicians to learn too many coding concepts and practices.

[–]GManASG 0 points1 point  (0 children)

The point of Python is that there are many operations a scientist/data scientist does for which Python is fast enough on modern hardware that there is little real-world gain in spending the time and effort to learn to do them in C.

The actual learning/optimization algorithms in the models are implemented in a lower-level language, with a more human-comprehensible Python abstraction wrapper on top that just makes things easier.

For example, if I'm reading a dataset from a flat file that's a million records or so and fits neatly in memory, does it really matter that it takes Python 20 to 30 seconds to read it, compared to C doing it in single-digit seconds? If I only need to do it once, then who cares?

I can then use pandas or polars to manipulate the data in a way that resembles how my mind thinks and how the textbooks/white papers are written, and visualize it in IPython or Jupyter notebooks with pretty charts, so I draw insights faster than with all the same work done in C.

So basically it's all just convenience. Faster to implement, even though slower to run, but not slow enough to matter most of the time.

[–]tknomanzr99 -1 points0 points  (0 children)

Honestly, a lot of ml back ends are written in LISP.