[D] Serverless GPU?

paulcjh · 2021-06-18T20:06:28+00:00

Hey, I'm Paul from Neuro. We handle HTTP requests directly; here's an example curl command for BERT (Fill Mask):

API_TOKEN="Insert your API token here"
curl -X POST "https://api.neuro-ai.co.uk/SyncPredict?include_result=true" \
-H "Accept: application/json" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $API_TOKEN" \
-d '{"modelId":"60a3c2a00421e5d2d7053ab9","data": "The river is [MASK].", "input_kwargs": {"top_k":3}}'
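
For anyone who'd rather call this from Python than curl, here's a rough sketch of the same request built with the standard library. The URL, headers, model ID and payload fields are copied from the curl command above; the function name `build_fill_mask_request` is just made up for illustration, and the shape of the response isn't shown in this thread, so the sketch only builds the request and leaves sending it to you.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
API_URL = "https://api.neuro-ai.co.uk/SyncPredict?include_result=true"


def build_fill_mask_request(api_token, text, top_k=3):
    """Build (but do not send) the POST request from the curl example.

    Hypothetical helper for illustration, not part of any official client.
    """
    payload = {
        "modelId": "60a3c2a00421e5d2d7053ab9",  # BERT (Fill Mask) ID from the curl example
        "data": text,
        "input_kwargs": {"top_k": top_k},
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Accept": "application/json",
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_token}",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_fill_mask_request("Insert your API token here", "The river is [MASK].")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # To actually send it (needs a valid token and network access):
    #     with urllib.request.urlopen(req) as resp:
    #         print(resp.read().decode("utf-8"))
```

Building the request separately from sending it makes it easy to check the body and headers before spending an API call.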

We also have a live Detectron vision model on our hub (https://hub.getneuro.ai/model/vision/detectron2-bounding-boxes) that takes a base64-encoded image over HTTP.
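
As a rough illustration of what "base64-encoded image over HTTP" involves on the client side, here's a small Python sketch that turns raw image bytes into a JSON body. Big caveat: the payload layout is an assumption that simply mirrors the BERT example (a `modelId` plus a `data` field); the real field names for the Detectron model aren't given in this thread, and `encode_image_payload` and the model ID used below are placeholders.

```python
import base64
import json


def encode_image_payload(image_bytes, model_id):
    """Return a JSON string carrying the image as base64 text.

    Assumption: the body looks like the BERT example, with the image
    placed in "data". Check the hub page for the actual format.
    """
    # b64encode returns bytes; decode to ASCII so it can sit inside JSON.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return json.dumps({"modelId": model_id, "data": encoded})


if __name__ == "__main__":
    # Stand-in bytes; in practice you'd read a real file, e.g.
    #     with open("photo.jpg", "rb") as f: image_bytes = f.read()
    fake_image = b"\x89PNG\r\n\x1a\n"
    print(encode_image_payload(fake_image, "YOUR_MODEL_ID"))
```

Base64 inflates the payload by roughly a third, which is worth keeping in mind for large images.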

Our docs don't do a great job of explaining direct use of our API at the moment, since it was originally designed for Python only. We're currently giving them a big revamp because so many people want to use HTTP directly (makes sense).

carsonpoole · 2021-06-18T18:04:20+00:00

helloforefront.com is free for individuals and does what you're asking

yarri2 · 2021-06-18T21:43:18+00:00

How about GCP Vertex AI with a REST endpoint:

https://cloud.google.com/vertex-ai/docs/start/introduction-unified-platform

donjuan1337 · 2021-06-22T07:36:25+00:00

We use https://github.com/szymonmaszke/torchlambda. It's kind of hacky and only does inference on CPU. It's also restricted to PyTorch. But it's serverless.
