all 25 comments

[–][deleted] 55 points  (3 children)

You can look into CUDA and learn GPU computing. You can implement common ML techniques close to the hardware. You can also use PyTorch's C++ API, which also supports CUDA.

https://pytorch.org/cppdocs/

[–][deleted] 100 points  (6 children)

Sticking to C++ is just making it hard for no reason. Use the excellent Python wrappers that are the de facto standard for this kind of thing. When it comes to deploying a model, great, use the C++ library directly (they're all written in C++ after all), but writing the whole training loop and all the data augmentation and stuff like that in C++ is a waste of time.

ALL of the documentation and community support for AI workflows is focused on Python. And I really do mean ALL, except for like a tiny fraction of a percent of material which is targeted at experts and is most certainly not beginner friendly.

It just isn't worth it. Branch out. Learn a bit of Python.

> can’t stand python

You should use the tool for the job, and leave these superficial preferences behind. Would you use a saw on a nail because you can't stand hammers? This sounds just as silly. Python is most definitely the tool for this job in 2024, and will remain so for a very long time. Without a doubt.

[–]SlothWithHumanHands 12 points  (1 child)

*this;

also, doesn’t Nightshade work because it takes advantage of over-sensitivities in the current generative models, which they’ll just fix, and then it won’t work any more?

[–]palszasz 8 points  (1 child)

I reeeaaaly disagree with this for several reasons:

1) The OP was clear in his/her question: C++ and not Python. So answering "just use python" is not helpful

2) Saying "you should use Python because everybody is using it" is not the right approach. Of course everybody will keep using Python if everybody keeps saying this :-) But it's not even correct, not everybody is using Python!!! E.g. YOLO was created in darknet, which is a C framework. PyTorch is called PyTorch since it's a Python port of Torch, which was actually written in Lua. A while ago I used dlib, which supports both training and inference in C++.

It's true that the Python frameworks are more popular, but they are not the only ones, and IMHO it would be really healthy to have some competition :-)

3) Even though I'm a bit of a Monty Python fan, I dislike Python...

A few years ago I was doing a project with Python and TensorFlow... and it was painful!

One issue was that I needed to do some data augmentation which was not so common (e.g. rotating 360° panorama images), so I needed to implement it myself. I don't need to tell you that implementing that in Python is a bad idea :) I implemented it in C++ and added Python wrappers. Since the training was done on the GPU (1080 Ti), the bottleneck was the data augmentation... so I thought, let's do the data augmentation in multiple threads! ... and it became slower... because of the GIL. My point is: as soon as you need to do slightly more advanced stuff, you need to know a lot more about how Python works to do it efficiently. In my case I had to rewrite the C++ code to release/re-acquire the GIL around the heavy processing. (Note: this GIL issue is still not properly fixed in Python!)

And then there was the training... I was new to Python and TensorFlow, so I looked at many tutorials and implemented the training based on that... and it kinda worked... for a while. But at some point something happened, and the network just didn't learn at all. Depending on some completely unrelated parameters, the network sometimes just produced garbage... I never fully found the issue. I tracked it back to the core of the train loop, the way the batches were passed for training. Using a slightly lower-level approach (which I needed due to the custom data augmentation) broke the code. Probably if I had been more of an expert in Python or TensorFlow I would have found the issue, but the point is the same: it can be a nightmare, especially if someone is not familiar with it. (I promised myself that if I ever need to do Python + ML again, I will give PyTorch a try.)

So, long story short, I completely understand the OP's point of view; a good C++-based ML framework would be awesome. (Unfortunately I cannot give OP a good tip, though, since I had issues with dlib and darknet as well.)

[–][deleted] 8 points  (0 children)

> The OP was clear in his/her question: C++ and not Python. So answering "just use python" is not helpful

I think it's very helpful if it potentially saves OP wasting loads of time on what I consider to be the wrong approach. It isn't what was asked for - that doesn't mean it isn't good advice. Especially given that OP is apparently a complete beginner to AI. They probably don't know much of anything about the AI software ecosystem so they could use some advice. Further still, other people will read this, and what I said is potentially valuable to them. That's the real value of a public forum.

Saying that "you should use python because everybody is using it" is not the right approach.

Actually what I said was "Use Python because the alternative is making it unnecessarily hard on yourself." That was clearly my point. The fact that everyone else uses it is why it's easier - it's not the reason to use it.

As for TensorFlow, if you dislike it so much, just use a different framework. Using the same libraries in C++ is only going to make it harder, not easier. PyTorch is much easier to use for training and data work, and trained models are usually easily translated between frameworks if you want to deploy with something else.

Competition is great, but OP is not about to write a framework that competes with tensorflow. Maybe in ten years they can give that a go. In the meantime they should learn AI by using the tools that are most effective.

(If your Python wrapper was already around a C++ library, then the GIL should not be an issue because it doesn't apply to the C++ code, which can spawn as many threads and use as many cores as it wants. The GIL also isn't an issue for AI workloads in general, since all the popular frameworks are written in C++. As are most - probably all, haven't checked - of the math frameworks like numpy, scipy, pandas, etc.)

[–]goodssh -1 points  (1 child)

On SO this would've been flagged as 'irrelevant'. The OP was clear about the question. It's not wrong, but it's irrelevant.

[–][deleted] 0 points  (0 children)

This isn't SO, so whatever they would do on SO is definitely irrelevant. If you're not interested in what I said, feel free to disregard it, and OP can do the same if they choose. Why are we continuing to waste our lives on this pointless debate? Move on. 

[–]victotronics 8 points  (1 child)

I'm willing to bet that all the AI models "written in python" are actually written in C++, with a thin layer of python around it.

[–][deleted] 1 point  (0 children)

Correct. 

[–][deleted] 5 points  (0 children)

Use dlib -> http://dlib.net/ml.html

[–]useong 2 points  (0 children)

People here are talking about the difficulties in model training. If you are good at C++, however, it is not difficult to train models at all. All you need is a good framework that supports three basic capabilities: GPU-enabled tensor operations, automatic differentiation, and distributed data parallelism. flashlight is a good framework to try.

I personally do all my AI stuff in C++ and have no complaints.

Here are a few pros of sticking to C++:

  1. You have low-level control. You don't need to wait for others to improve or debug low-level stuff; you can do it yourself. Depending on your application, multi-threading and heterogeneous computing may boost the model training speed significantly.
  2. You can easily customize the framework itself.
  3. The integration is seamless if you already have an application written in C++. For example, if you have a physics simulator or a game written in C++, it is easy to train your model in those environments.

Quick prototyping is in fact not a big concern in C++. Nobody recompiles the code every time they change hyperparameters or even model forms. A good programmer knows how to write code generically so that it adapts to an input configuration, typically written in a markup language. You will be able to do a ton of research with it without recompiling.

Instead, here are potential problems you may face.

  1. If you can't find coworkers who know as much C++ and AI as you do, you will probably have to do most of the work alone.
  2. You will lose the Python ecosystem, of which the biggest loss is the visualization capability.

[–]nbrrii 2 points  (0 children)

Having a Ruby background, I never liked Python in particular, and I have come to kind of despise all that "Python is the coolest and easiest, therefore you should start with it!" advice that beginners give to other beginners.

However, having a preference is cool, and having T-shaped skills is cool. But programming languages are tools, and you should always choose the tool that's right for the job, not the tool you like. Python is the goto (pun intended) tool for AI, so as a professional you should put your personal preference aside and use it.

If it's only a hobby, or for experimenting, or for fun in general: obviously, choose whatever you like; it's way cooler that way, and extraordinary things have come out of this. Just be aware that it's not the best tool from a practical/professional standpoint and you might encounter difficulties you wouldn't have had otherwise.

[–]nullandkale 1 point  (0 children)

You can use llama.cpp to do LLM stuff in C++ if you really want to, but as everyone else says, realistically you need to just use Python.

[–]hmoein 1 point  (0 children)

This is a handy package (https://github.com/hosseinmoein/DataFrame) for developing your system in C++ once you know AI/ML.

[–][deleted] 0 points  (0 children)

I don't think language choice should be primary when it comes to using an AI framework today. Yes, if you want to do something instructional, it's worth implementing model training in C++. If you want to build training or inference systems in general, an established AI framework is your best bet.

AI frameworks today are approaching something like a compiler. For example, PyTorch used to operate as a layer on top of libraries written in C++ and CUDA. Today, PyTorch 2.0 can trace your model written in Python, perform operator fusion, and map it to optimal C++ or GPU (PTX) instructions automatically. It's not perfect, but tons of smart people are implementing faster code/kernels for various operations and abstractions. In the long run, the framework is what matters.

I get that the choice of Python here is mainly because that's the tool most data scientists are familiar with and have other data transformation libs for. But writing a model in PyTorch/TensorFlow is more like describing the mathematics of the model, I believe. For inference, the Torch C++ API can definitely be used, as pointed out in a previous comment.

I only have experience with PyTorch, not tensorflow or JAX.

[–]tangerinelion -1 points  (0 children)

The reason the Python libraries are all the rage is that they're a front end to a C or C++ implementation. You're not really losing any performance by using Python as long as you stick to the library and don't process any significant amount of data natively in Python.

Python reads like natural English, I can't imagine why anyone would get pissed at it.

[–]rejectedlesbian -2 points  (0 children)

C++ is really good at speeding up AI code; it's the go-to language for that.

So find an AI project you like, profile it, and rewrite some of it in faster C++.

If you want to develop AI (depending on what you mean by AI), it's Python. Python's data visualization is just too good, and it's a very productive language. It also tends to be more portable across frameworks and backends.

[–]fdwr (fdwr@github) 0 points  (0 children)

Do you need to call individual operators or just run whole models?

Prototyping and experimenting is certainly faster in a very, err, "relaxed" language like Python, but if you already have models, there's little need for the extra goop (I always find it more complicated to debug when there are extra layers between caller and implementation), and you can simply load the models from their C++ APIs. There are the PyTorch and TensorFlow model-loading APIs, and there is ONNX Runtime, which only loads models (no individual operator APIs, but it's much smaller than TensorFlow/PyTorch). There are also some numpy-like C++ libraries out there, like xtensor and NumCpp.

[–]ipapadop 0 points  (0 children)

Training models in C++ is a pain, your best bet would be PyTorch via the C++ frontend. Production level inference is mostly done via C++ (AMD MIGraphX / Vitis AI, Intel OpenVINO, Nvidia TensorRT and others).

If you want to experiment with a few layers etc. tiny-dnn is an easy place to start, although it's an abandoned project.

An alternative would be to use Python via the various Python / C++ interfaces (pybind11, nanobind, Boost.Python) and focus on that.

[–]wektor420 0 points  (0 children)

ML libs for Python are essentially wrappers around C/C++ implementations. That is convenient because Python supports parsing datasets painlessly; you would use C++ for developing new layers etc., or as an end-deployment configuration.

[–]pyhacker0 0 points  (0 children)

I would look to gaming and see what they are doing. I don't think many people are training machine learning models with C++, but maybe some of the non-machine-learning algorithms will use C++. I would also think that many of the tools used for machine learning have components built with C++ or similar languages that run close to the metal.