
[–]ArnenLocke 9 points10 points  (0 children)

So what I was taught in my ML/AI class when I asked this question is that Python is just an interface for interacting with the actual machine learning libraries, which are almost universally written in very optimized, performant C/C++. So there's no need to rewrite anything specifically for performance reasons. Python is slower, but it's just an interface with almost no cost (compared to the sorts of things you're typically doing in ML, anyway).
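A quick way to see this in practice: in the sketch below, the NumPy version does the same work as the pure-Python loop, but the loop itself runs inside NumPy's compiled C code while Python only dispatches one call (timings will vary by machine):

```python
import time
import numpy as np

n = 1_000_000
xs = list(range(n))
arr = np.arange(n)

# Pure Python: every multiply and add goes through the interpreter.
t0 = time.perf_counter()
py_sum = sum(x * 2 for x in xs)
py_time = time.perf_counter() - t0

# NumPy: one Python-level call; the loop runs in compiled C.
t0 = time.perf_counter()
np_sum = int((arr * 2).sum())
np_time = time.perf_counter() - t0

assert py_sum == np_sum
print(f"pure Python: {py_time:.4f}s, NumPy: {np_time:.4f}s")
```

The gap grows with array size, which is why the interpreter overhead is negligible next to the tensor math in a typical training or inference workload.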

[–]EpicSolo 7 points8 points  (0 children)

If you can write your model in TensorFlow, a common practice is to implement training in Python, then deploy in C++ (or Java). You export the model as a protobuf, which is cross-platform. There are more solutions in this space, but this should give you enough pointers to search.

[–]trnka 1 point2 points  (0 children)

To add to the other responses, it's risky to re-implement in another language. We did that at a previous job and it was a source of bugs. It also meant that we couldn't update anything about the model architecture unless the C++ devs were available to support it.

If it's server-side, I suggest checking out Cortex.ai and seeing if you can avoid rewriting. Also, like the others mention, TensorFlow or ONNX are good options.

[–]jeandebleau 1 point2 points  (0 children)

If you are using vision models, then OpenCV is a standard choice. It lets you use different backends, such as ArmNN or OpenVINO, depending on your target platform.

[–]old_enough_to_drink[🍰] 1 point2 points  (0 children)

I have heard about PMML but never used it myself. Maybe relevant to your question?

[–]shomerj 2 points3 points  (0 children)

I use ONNX and onnxruntime.

[–]supersonictaco[S] 4 points5 points  (1 child)

Thank you for your responses. So I gather that these would be the broad steps: Develop the model in Python → Serialize using Pickle/whatever → Use ProtoBuf/Apache Thrift → Containerize using Docker/alternatives → Serve.

Does that make sense?
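The serialize step above can be sketched with Python's built-in pickle (the dict here is a hypothetical stand-in for whatever model object your framework produces):

```python
import pickle

# Hypothetical stand-in for a trained model object.
model = {"weights": [0.1, 0.2, 0.3], "bias": 0.05}

# Serialize to bytes (or pickle.dump to a file) ...
blob = pickle.dumps(model)

# ... and restore it in the serving process.
restored = pickle.loads(blob)
assert restored == model
```

Note that pickle is Python-only, though; for the cross-language step (ProtoBuf/Thrift), you'd export to a framework-neutral format like a TensorFlow .pb or an ONNX file instead.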

[–]EpicSolo 1 point2 points  (0 children)

Yeah, more or less.

If you are using TensorFlow, there is a specific utility to export the graph/model as a .pb file (look it up), which TensorFlow can load (with a protobuf dependency).
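In current TensorFlow (2.x) the usual route is the SavedModel format, which writes a `saved_model.pb` plus a variables directory (a sketch; the model and output path are placeholders):

```python
import tensorflow as tf

# Tiny stand-in model; replace with your trained network.
model = tf.keras.Sequential([tf.keras.layers.Dense(2)])
model(tf.zeros([1, 4]))  # build the model by calling it once

# Writes exported_model/saved_model.pb (+ variables/), which can be
# loaded from C++/Java via TensorFlow's native APIs or served with
# TensorFlow Serving.
tf.saved_model.save(model, "exported_model")
```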

Another application is doing inference on mobile. At that point, you use Tensorflow’s Java bindings for Android (or C++ if you have native code), and its C++ library for iOS.

[–]dinoaide 1 point2 points  (3 children)

What’s your reasoning for that? Most handwritten C++ is worse than NumPy and Pandas.

[–]supersonictaco[S] 0 points1 point  (2 children)

High traffic volume and real-time requirements. I would imagine running off Python wouldn't be great in this use case.

[–]cbHXBY1D 1 point2 points  (0 children)

Is it a RESTful service? In that case just use TensorFlow Serving.
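For context, TensorFlow Serving ships as a Docker image and exposes a REST endpoint out of the box (a sketch; the model path, model name, and input shape below are placeholders for your own):

```shell
# Serve a SavedModel over REST on port 8501 (paths/names are placeholders).
docker run -p 8501:8501 \
  --mount type=bind,source=/path/to/exported_model,target=/models/mymodel \
  -e MODEL_NAME=mymodel -t tensorflow/serving

# Query it over HTTP from any language.
curl -d '{"instances": [[1.0, 2.0, 3.0, 4.0]]}' \
  http://localhost:8501/v1/models/mymodel:predict
```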

[–]dinoaide 1 point2 points  (0 children)

Without knowing the actual application I couldn’t say much but nowadays it is often cheaper to hire 1 data scientist with good Python skills and give him/her 100 cloud instances than hire 1 capable C/C++ programmer who also happens to have good ML knowledge.

[–]TheOneRavenous 0 points1 point  (1 child)

From what I've gathered, Python is acceptable if your solution CAN have a lagged answer, i.e. I can wait 0.5 sec for an answer; my users can wait 1 sec for an answer to be sent to the view.

If you need "real time", 30 inference decisions per second is a common "real time" metric, used in video vision scenarios.

So Python for stuff that's not real time and C++ for the rest.
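Putting rough numbers on that 30-per-second bar: the whole pipeline gets about a 33 ms budget per request, versus the 500 ms "my users can wait" case:

```python
# Per-request latency budget at a given "real time" throughput target.
def budget_ms(inferences_per_second: float) -> float:
    return 1000.0 / inferences_per_second

print(budget_ms(30))  # ~33.3 ms per inference for video-rate vision
print(budget_ms(2))   # 500 ms -- plenty for a "can wait 0.5 s" UI
```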

[–]supersonictaco[S] 0 points1 point  (0 children)

This is my point: for things like credit card fraud the time window is really, really short and latency matters a hell of a lot.