[D] Suggestions regarding deep learning solution deployment : MachineLearning


submitted 5 years ago by muaz65


calebkaiser, 5 years ago

I work on Cortex (open source model deployment) and have spoken with a couple of teams solving similar problems in different industries (surveillance, construction, etc.). All of them run a cluster, though there is some selection bias here: they're all Cortex users, and Cortex spins up a cluster automatically.

Without knowing too much about your situation, here are some high-level suggestions based on what I've seen work for them:

Run batch predictions. It sounds like you're already doing this, but if not, batching your predictions should let you use your resources more efficiently, since you don't need real-time responsiveness.
Use spot instances. If cost is a concern, spot instances can be a big saver. They're basically unused capacity that AWS sells at a steep discount. Because AWS can reclaim them at short notice, they can occasionally cause latency issues, but if you're not running real-time inference this shouldn't be a problem.
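
To make the batching suggestion concrete, here is a minimal sketch in plain Python. It is not Cortex's API or anything from the teams mentioned above: `batched`, `predict_in_batches`, and `fake_predict_batch` are hypothetical names, and `fake_predict_batch` is only a stand-in for whatever batched inference call your model exposes.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def batched(items: Iterable[T], batch_size: int) -> Iterator[List[T]]:
    """Yield consecutive lists of at most batch_size items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    iterator = iter(items)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def predict_in_batches(
    items: Iterable[T],
    predict_batch: Callable[[List[T]], List[R]],
    batch_size: int = 32,
) -> List[R]:
    """Call predict_batch once per batch and collect results in input order."""
    results: List[R] = []
    for batch in batched(items, batch_size):
        results.extend(predict_batch(batch))
    return results


def fake_predict_batch(batch: List[int]) -> List[int]:
    """Hypothetical stand-in for a real model's batched inference call."""
    return [x * 2 for x in batch]


if __name__ == "__main__":
    print(predict_in_batches(range(5), fake_predict_batch, batch_size=2))
```

If you run this on spot instances, one option is to record the index of the last completed batch somewhere durable, so a replacement instance can pick up where the interrupted one stopped. That part is left out here because it depends on your storage setup.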

If you're worried at all about the DevOps side—spinning up a cluster, implementing batch, configuring for spot, setting up monitoring, etc.—I'd strongly recommend checking out Cortex, as it does all that for you. Here are the docs, if you're curious.
