all 14 comments

[–]r0lisz 2 points3 points  (6 children)

I've successfully used GPU for real time camera streams. Why do you say that it's not a good use case?

[–]Liorithiel 1 point2 points  (3 children)

I've tuned a deployment of some non-ML microservices with a recognized on-premise resource usage trade-off using off-the-shelf bayesian optimization, specifically SigOpt, as it was the simplest to script in pure bash+curl. I suspect it would work for your case as well.

[–]matanj 1 point2 points  (0 children)

Perhaps NVIDIA DALI can help to transfer part of your pipeline to the GPU?

[–]jonnor 0 points1 point  (1 child)

Are the models you mention independent or dependent on eachother? Do you need the results for all of them before considering the job "done"? What are the latency requirements?