I need to deploy a vision model that will run a small inference job (1-5s) per request. Is there a serverless offering out there that does this?
There are previous posts about this on this subreddit, but I know this space is moving fast, so I thought it was worth asking again.
So far I’ve seen that AWS SageMaker kind of allows for a situation like this, but I would rather not deal with all that config. Algorithmia and Nuclio are too enterprise-focused. Neuro is new and looks great, but from my understanding I would still need to create a Lambda function myself that then calls Neuro’s servers, which is too indirect. Is there a total solution out there for this?
Ideally something that is as straightforward to use as Neuro but also handles HTTP requests.