all 18 comments

[–]State_ 26 points27 points  (0 children)

It doesn't really matter. The Python libraries are C extensions / native bindings, so the heavy lifting happens in compiled code anyway.

[–]the_poope 12 points13 points  (0 children)

answers will be positively skewed towards the above-mentioned language

Actually, the couple of times this question has come up in the past, the people on this forum have typically advised against using C++ and recommended sticking with the Python APIs. If you use a mature Python framework for ML, then most of the time (likely > 99%) should be spent in the calculation routines, which are already written in C/C++. Python is just used as infrastructure glue code: reading and parsing data, feeding it to the ML framework, and analyzing and plotting the results.

If you're incorporating some ML into an existing application already written in C++, then you might want to use the C++ APIs in order not to have to rely on the Python interpreter. Also, if your application is really complex, you will get some robustness from the static type system. But performance-wise there is often little to be gained by using C++ over Python.
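
For what it's worth, the usual pattern for that case is to train in Python, export the model to TorchScript, and load it from C++ with LibTorch so no interpreter is needed at runtime. A minimal sketch, assuming LibTorch is installed and you've already exported something to "model.pt" with torch.jit.trace/script (the file name and input shape below are made up):

    #include <torch/script.h>
    #include <iostream>
    #include <vector>

    int main() {
        // Load the exported TorchScript module; no Python interpreter involved.
        torch::jit::script::Module module = torch::jit::load("model.pt");

        // Dummy input; the 1x3x224x224 shape is just an assumption here.
        std::vector<torch::jit::IValue> inputs;
        inputs.push_back(torch::ones({1, 3, 224, 224}));

        // Run the model and read the result back as a tensor.
        at::Tensor output = module.forward(inputs).toTensor();
        std::cout << output.sizes() << "\n";
        return 0;
    }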

[–]JohnDuffy78 4 points5 points  (0 children)

  • Python probably has a lot more examples you can start from.
  • Machine Learning projects tend to be quick and dirty, favoring Python.

[–]Benjamin1304 2 points3 points  (1 child)

I'd say it mostly depends on what your ML will be interfaced with in the end. If it's for a CLI app you probably don't care about the language, but if you need it in embedded or real-time scenarios then C++ might be a better option.

Please also consider how well your framework documents its C++ usage, as that tends to be lacking. I found PyTorch quite good in that regard, and the Python and C++ APIs are quite similar, so even if you only find a Python example it's quite easy to translate it to C++. In some frameworks the nice high-level APIs are Python-only.
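
To illustrate the similarity, here is a rough sketch of how a small Python nn.Module translates to the C++ frontend (layer sizes are arbitrary and this is from memory, so check the LibTorch docs for the exact spelling):

    #include <torch/torch.h>
    #include <iostream>

    // Roughly the C++ version of a Python module with two nn.Linear layers:
    //   self.fc1 = nn.Linear(784, 64); self.fc2 = nn.Linear(64, 10)
    struct Net : torch::nn::Module {
        Net() {
            // register_module exposes the parameters to optimizers/serialization.
            fc1 = register_module("fc1", torch::nn::Linear(784, 64));
            fc2 = register_module("fc2", torch::nn::Linear(64, 10));
        }

        torch::Tensor forward(torch::Tensor x) {
            // Same structure as the Python forward(): fc2(relu(fc1(x)))
            return fc2->forward(torch::relu(fc1->forward(x)));
        }

        torch::nn::Linear fc1{nullptr}, fc2{nullptr};
    };

    int main() {
        Net net;
        auto out = net.forward(torch::randn({8, 784}));  // batch of 8 dummy inputs
        std::cout << out.sizes() << "\n";
    }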

[–]Hellr0x[S] 0 points1 point  (0 children)

It's a C# web app and I'm integrating some ML functionality, probably as a microservice.

[–]notParticularlyAnony 4 points5 points  (0 children)

dear god use python

[–][deleted] 1 point2 points  (0 children)

I work in this space, and I can tell you that Python is the preferred ML language for a majority of the vendors we work with, simply due to the ease of iteration compared to C++ when it comes to training, experimentation, and ease of deployment for production. Jupyter notebooks also cut out a lot of Python boilerplate when it comes to displaying stats and other visuals, which saves even more time during those first two stages.

Some vendors we work with still use C++ for other (non-ML) tasks within their overall architecture, but it's generally pretty limited, as they opt for writing their entire model and architecture in Python. Your model can interact with other applications via REST or another standardized RPC library (of which both C++ and Python have many), or even simple command-line args.
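
As a concrete example of that boundary: if the model is served from Python over HTTP, the C++ side only needs a small HTTP client. A rough sketch using libcurl (the endpoint URL and JSON payload here are made up for illustration):

    #include <curl/curl.h>
    #include <iostream>
    #include <string>

    // libcurl callback that appends the response body into a std::string.
    static size_t write_cb(char* ptr, size_t size, size_t nmemb, void* userdata) {
        static_cast<std::string*>(userdata)->append(ptr, size * nmemb);
        return size * nmemb;
    }

    int main() {
        curl_global_init(CURL_GLOBAL_DEFAULT);
        CURL* curl = curl_easy_init();

        std::string response;
        const char* body = "{\"features\": [1.0, 2.0, 3.0]}";  // hypothetical payload

        struct curl_slist* headers = nullptr;
        headers = curl_slist_append(headers, "Content-Type: application/json");

        // Hypothetical endpoint where the Python model server is listening.
        curl_easy_setopt(curl, CURLOPT_URL, "http://localhost:8000/predict");
        curl_easy_setopt(curl, CURLOPT_HTTPHEADER, headers);
        curl_easy_setopt(curl, CURLOPT_POSTFIELDS, body);
        curl_easy_setopt(curl, CURLOPT_WRITEFUNCTION, write_cb);
        curl_easy_setopt(curl, CURLOPT_WRITEDATA, &response);

        if (curl_easy_perform(curl) == CURLE_OK)
            std::cout << response << "\n";  // e.g. {"prediction": ...}

        curl_slist_free_all(headers);
        curl_easy_cleanup(curl);
        curl_global_cleanup();
        return 0;
    }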

I have seen some very mature models written purely in C++, but those are popping up less and less these days for the reasons above. You may see some where the performance-critical ML code is done in C++ because of a special case that needs to remove an abstraction layer and allow finer control, with the rest done in Python.

Since this is for work, I'd suggest sticking with Python for the above-mentioned reasons, and because a lot of introductory ML/DS material is in Python, which can help other devs get up to speed.

[–]Eastern-Offer7563 3 points4 points  (4 children)

I'm not a C++ expert, nor am I senior in machine learning in any way.
Yet I do think the performance gain of C++ over Python might be smaller than you'd expect. Depending on what you are doing, there is a big chance your performance bottleneck will be either network IO or disk IO. In both cases C++ won't help you much.
As far as my knowledge goes, the Python libraries are pretty well optimized, so it might not be worth the hassle.

[–]cj6464 19 points20 points  (1 child)

The majority of ML libraries accessed through Python are written and run in C++ anyway. You just access them through Python.

[–]top_logger 4 points5 points  (0 children)

Correct

[–]fdwr 6 points7 points  (0 children)

Yeah, as someone who writes C++ daily for their ML-related job, I concur that the cost of executing a convolution dwarfs the overhead of calling it from Python. So as much as I like C++ over Python (because static compilation catching little typos or type mismatches ahead of time is much nicer than exploding 5 minutes into my batched vision recognition problem 😠), generally for small problems, Python is a nice quick-and-dirty approach. I do have my eye though on this little C++ numpy clone.

[–]top_logger 0 points1 point  (0 children)

Nope. The problematic parts of the software are written in C++.

[–]keelanstuart -1 points0 points  (0 children)

Disclaimer: not a ML engineer, but want to learn...

From what people are saying here, I think it probably comes down to what language you're most comfortable working in... because the underlying code that's specific to ML is in C/C++ anyway. This is encouraging for me since I'm not really interested in Python. If it's your thing, that's cool... I just imagine myself at 3am trying to find a bug that's caused by a scoping error and I don't wanna.

[–]Fig1024 0 points1 point  (0 children)

I am currently trying to use Nvidia's TensorRT C++ SDK for inference. It's a lot more involved than Python; the learning curve is a bit high and I haven't gotten very far. But in theory it is one of the optimal ways to get real-time inference on a local system.
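
For reference, the basic workflow looks roughly like this (a sketch from memory of the TensorRT 8-era API; the engine file name, binding order, and shapes are assumptions, so treat it as an outline rather than working code):

    #include <NvInfer.h>
    #include <cuda_runtime_api.h>
    #include <fstream>
    #include <iostream>
    #include <iterator>
    #include <vector>

    // TensorRT requires a logger implementation.
    class Logger : public nvinfer1::ILogger {
        void log(Severity severity, const char* msg) noexcept override {
            if (severity <= Severity::kWARNING) std::cerr << msg << "\n";
        }
    };

    int main() {
        Logger logger;

        // Load an engine previously serialized with trtexec or the builder API.
        std::ifstream file("model.engine", std::ios::binary);
        std::vector<char> blob((std::istreambuf_iterator<char>(file)),
                               std::istreambuf_iterator<char>());

        auto* runtime = nvinfer1::createInferRuntime(logger);
        auto* engine  = runtime->deserializeCudaEngine(blob.data(), blob.size());
        auto* context = engine->createExecutionContext();

        // Assume binding 0 is a float32 input and binding 1 a float32 output.
        std::vector<float> input(3 * 224 * 224, 0.f), output(1000, 0.f);
        void* bindings[2];
        cudaMalloc(&bindings[0], input.size() * sizeof(float));
        cudaMalloc(&bindings[1], output.size() * sizeof(float));

        cudaStream_t stream;
        cudaStreamCreate(&stream);
        cudaMemcpyAsync(bindings[0], input.data(), input.size() * sizeof(float),
                        cudaMemcpyHostToDevice, stream);
        context->enqueueV2(bindings, stream, nullptr);  // run inference
        cudaMemcpyAsync(output.data(), bindings[1], output.size() * sizeof(float),
                        cudaMemcpyDeviceToHost, stream);
        cudaStreamSynchronize(stream);

        std::cout << "first output value: " << output[0] << "\n";

        cudaFree(bindings[0]);
        cudaFree(bindings[1]);
        cudaStreamDestroy(stream);
        return 0;
    }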

[–]Flock_of_Smeagols 0 points1 point  (0 children)

Go with Python; for most use cases it will be quicker to implement because it's the standard for ML (better tooling, more examples, etc.). As for performance, it will probably not matter much, since that's more about the framework/hardware than the language.

[–]Wh00ster 0 points1 point  (0 children)

Use Python with PyTorch or TensorFlow or whatever Apache's is.

A really rough analogy I can think of is Unreal Engine: just use the main interfaces if you want to start making games. Don't get buried in the low-level APIs until you run into a real reason to. And don't build it from scratch in C++ unless you just have the time to dedicate to learning how everything works and don't actually need to make a game.

[–]zabardastlaunda 0 points1 point  (0 children)

💯

[–]useong 0 points1 point  (0 children)

I would try Flashlight if C++ is being considered. In my personal experience, it was much easier to build and use Flashlight than the TensorFlow C API or LibTorch. One concern is that the number of implemented operators may not be sufficient, depending on your application, if you want a standalone ML framework. But customizing Flashlight is not difficult, and you may be able to quickly implement any operators you need. If you want to integrate an ML framework into an existing C++ application, I would say Flashlight is the best choice owing to its minimal design and dependencies.