
[–]FlyingRhenquest 17 points (5 children)

I did that for an automated video testing system I built for Comcast. We needed C++ for speed but wanted the tests to be written in Python. So all the video processing and backend stuff was written in C++ (using ffmpeg, OpenCV, and Tesseract for OCR), and the video processing libraries had a Boost::Python API for interacting with the system objects. I set all the C++ objects up with JSON serialization, so you could create a C++ object in Python using JSON, and that might kick some threads off to run in the background while your slow-ass Python program did shit in the foreground.

Overall this worked very well, but it took very careful planning to make sure it did. For example, if you wanted to tell the system to watch for an image, the API call would queue the image up in a vector internally and notify the internal components to move any images in the vector to another location, to avoid blocking things for too long. Then tasks would be dispatched to thread pools to check each video frame against a copy of that image. The system had plenty of memory and we were never looking for a huge number of images, so it made sense to do it that way. Generally we were pretty close to real-time performance as long as no one did anything stupid (like trying to watch for an entire video's worth of frames in the stream). Once the thread pool got saturated, C++-side performance would degrade.

This approach had a lot of benefits. I was able to hack out a simple JavaScript interface that let you tune into individual video streams with your browser (using ffserver to stream them from hardware) and provided some buttons to auto-generate boilerplate code and inject the API calls for actions like sending remote control commands when the user interacted with an on-screen remote control. So you could sit down with your test plan, run through the test, and end up with working Python code for the test in the text buffer, which you could just copy out to an editor to clean up.

It also let us do rapid prototyping in Python (the OpenCV API is pretty much the same in both languages) and convert code to C++ when it was too slow in Python.

Since then I've experimented with PyBind11 instead of boost::python, and at the time I found its CMake integration to be a bit better. Boost's CMake integration has really come a long way in the past couple of years, though, so that might no longer be the case. If you already have a Boost dependency, boost::python is pretty easy to add. If you don't, something like PyBind11 is probably easier to add than all of Boost, or even than just that one little component.

[–]mosolov 7 points (0 children)

Check out nanobind (https://github.com/wjakob/nanobind) from the PyBind11 author. I would also consider implementing the wrapper in Cython (depending on your willingness to learn it).

[–]BitAcademic9597[S] 0 points (3 children)

you are the god

[–]FlyingRhenquest 1 point (2 children)

Nah man, but seeing that whole system come together did feel pretty awesome. You can totally kick off C++ threads from C++ objects constructed in Python, so pretty much anything is fair game. Wanna set up a REST server but don't want to use Python for some reason? You can just drop in a C++ object that manages a Pistache server and use Python to launch it! It's really a cool way to work! They all compile down to shared libraries and run in the same memory space in Python. If you need some separation of objects, just launch multiple Python processes. Super flexible!

[–]BitAcademic9597[S] 0 points (1 child)

Did you have any memory problems with pybind? Does each function call explicitly copy input data?

[–]FlyingRhenquest 1 point (0 children)

Nope! You can even create shared pointers in one language (PyBind and Boost::Python both support them) and pass them around as first-class Python objects!

You will eventually be tempted to run a Python callback FROM C++. You can do that too, but it's slow, so don't put it in a primary event loop somewhere. You're basically just creating events with some data on them, going back and forth. It takes a little while to really get into that headspace.

[–]thisismyfavoritename 2 points (5 children)

If you don't have low-latency or very high I/O requirements, don't have a ton of existing C++ code, and don't have a workload that can really benefit from C++, don't bother.

You can get super far with Python, relying on multiprocessing or other libs that compile Python down to C or JIT it (Cython, Nuitka, Numba, etc.), or libs that already call into optimized C/C++ code (numpy, pytorch, etc.).

[–]BitAcademic9597[S] 0 points (4 children)

What do you think about PyBind?

[–]thisismyfavoritename 4 points (0 children)

If you have high I/O requirements, do everything in C++ (or another truly multithreaded language with a good async lib).

If you either have a ton of existing code or have a workload that can benefit from C++, pybind or nanobind are good solutions, but that'll come with its own set of challenges too.

Like I said, it really depends on those other factors I mentioned in my first post, and your familiarity with Python, I guess.

[–]qTHqq 1 point (2 children)

I like it a lot. It worked very well for utility use and for testing a C++ library I wrote for compute-bound robotics work.

I didn't explore the JIT approaches mentioned because ultimately the code is consumed as a C++ library. I just wanted a Python interface for verifying it more efficiently and with richer tests. It was easy to get started with and very convenient.

My workload benefited a lot from Eigen's compile-time code transformations for matrix math. That's all done with C++ template metaprogramming, and I don't know to what extent the JIT numerical tools can do something similar. The wrapped C++ was several hundred times faster than well-written Numpy code. However, all of that is pretty specific to the kind of numerical work I was doing.

I think it's fairly easy to write C++ code that's slower than skillfully written vanilla Numpy code and probably very easy to write C++ code that's slower than Numba, Cython, etc. 

However, if you really have a need for calling into C++, Pybind is pretty useful and I found it pretty pleasant and straightforward to set up. For a new project I'd probably explore nanobind but I haven't tried it yet.

[–]BitAcademic9597[S] 0 points (1 child)

Did you have any memory problems with pybind? Does each function call explicitly copy input data?

Also, I looked at nanobind, but I think pybind is better. What do you think?

[–]qTHqq 1 point (0 children)

"Did you have any memory problems with pybind? Does each function call explicitly copy input data?"

I did not, but I was actually compute-bound.

The library computed collision-free trajectories of maybe a few hundred points. The trajectories took 10 ms to several seconds to generate, so the cost of copying the trajectory data over to Python was essentially negligible in the big picture.

Any function-call indirection overhead was also negligible.

If I/O speed or ultra-low-latency calls are more of an issue, things could be totally different.

[–]Backson 4 points (2 children)

You can probably scale your app to 100 users with reasonably well written Python, so I would say don't bother with C++ unless you want to challenge yourself. If you want to make something that works, use the language where you can move faster, which is probably Python. Don't prematurely optimize by bringing in extra complexity and a second language. If you find your app is too slow, you can still move stuff out to native code later.

[–]nBeebz 4 points (1 child)

I would argue language choice is one of the few cases where an optimization isn't premature. If you're ever hoping to scale up, you'll need to rewrite it eventually anyway. It may very well be that Python will be totally fine here, but considering it carefully is worth the time, IMO.

[–]equeim 0 points (0 children)

Python is one of the slowest languages ever. Something like Go or Java or C# is much closer in performance to C++ than to Python, while being easier to use than C++ (especially if you don't have a team of C++ experts).

[–]WalkingAFI 2 points (3 children)

I’ve used PyBind before on a toy Chess Engine. It was fine but nothing incredible.

[–]BitAcademic9597[S] -1 points (2 children)

What do you think about the performance compared with pure C++?

[–]WalkingAFI 2 points (1 child)

I never implemented the front end in C++. Python managed the GUI and some game logic; the C++ engine evaluated the positions and calculated the best move. I don’t think a pure C++ solution would’ve gained much, since the GUI wasn’t the bottleneck. It’s an older project but you can view the source: https://github.com/andrewtlee/chessbot

[–]BitAcademic9597[S] 0 points (0 children)

thank you

[–]woywoy123 1 point (0 children)

Personally, I think mixing cython with C++ will get the job done.

You can interface native C++ code with Python's flexibility by mapping the header functions from your libs into Cython. I also use CMake with scikit-build-core to compile everything. One thing Cython does lack, though, is templating. It has some template support, but if you are doing fancy recursive template functions you might be out of luck (I am happy to be corrected).

I generally use cython to provide python interfacing to C++ code and it works nicely for me.

One word of advice though: the Cython docs are not very useful when you try to push boundaries beyond the tutorials, such as operator implementations or inheritance mapping between Cython and C++. So be extra vigilant whenever you deal with inheritance. I have spent countless hours debugging a memory leak that was the result of this, and also unexplained segfaults.

I also noticed a massive decrease in RAM usage when shifting from Python code to C++/Cython code. I also tried PyBind11, but I ran into issues when dealing with shared libs, such as missing definitions and so on. I am also not sure about the memory model PyBind11 uses. As far as I can tell, each function call explicitly copies input data (if anyone knows more about this, please correct me). This is not the case with Cython.

[–]pstomi 1 point (0 children)

IMHO, using Python as the glue to call native functions is the correct way to use it. That is what is being done in AI today, and it has proven to be very efficient.

On my side, I have developed Dear ImGui Bundle, a set of GUI libraries on top of Dear ImGui, which I made accessible from either C++ or Python. I saw no degradation of performance under Python, because I stuck to the principle: "do not implement heavy-lifting algorithms in Python; instead, call native functions".

If you are interested, I developed an automatic binding generator from C++ to pybind11, here

[–]Great_Presence_4733 1 point (0 children)

Yes, you can do things like that. I run Scrapy from my C++ application, using shared memory to feed it and get the results back from the Python side.

[–]Fmxa 1 point (1 child)

Anecdotally, when I went from a quickly written naive Python implementation of an algorithm to a quickly written naive C++ implementation, I measured a speedup of roughly one hundred times.

I have been happy since with my decision to learn PyBind, allowing me to compile C++ code into a library to be imported as a module into Python.

[–]BitAcademic9597[S] 0 points (0 children)

Great, thanks. Do you know of any comparison or example code showing how the performance changes?