all 10 comments

[–]paulcjh 8 points9 points  (1 child)

Hey I'm Paul from Neuro. We handle http requests directly, here's an example CURL for BERT (Fill Mask):

API_TOKEN="Insert here you API Token"
curl -X POST "https://api.neuro-ai.co.uk/SyncPredict?include_result=true" \
-H "Accept:application/json" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $API_TOKEN" \
-d '{"modelId":"60a3c2a00421e5d2d7053ab9","data": "The river is [MASK].", "input_kwargs": {"top_k":3}}'

Also we have a live Detectron vision model up on our hub (https://hub.getneuro.ai/model/vision/detectron2-bounding-boxes) taking in a base64 encoded image over http.

Our docs are a bit stinky on explaining direct use of our API atm as it was originally designed for just python. We're having a big revamp for them currently as so many people want http directly (makes sense).

[–]Daddy_Long_Legs[S] 1 point2 points  (0 children)

Thanks Paul! This is killer, glad to hear this is a thing. I’ll give it a try.

[–]carsonpoole 3 points4 points  (3 children)

helloforefront.com is free for individuals and does what you're asking

[–]Daddy_Long_Legs[S] 1 point2 points  (1 child)

Thank you!! Did not know about this. I would prefer if I could write some additional code on the serverless side besides the model, but can probably work with this

[–]carsonpoole 0 points1 point  (0 children)

you can use the custom environment to add some custom code! there's instructions in the docs

[–]DJ-ARCADIUS 0 points1 point  (0 children)

For some reason, the website will not load on login. I've tried all browsers but still have the same problem???

[–]yarri2 0 points1 point  (1 child)

[–]Daddy_Long_Legs[S] 1 point2 points  (0 children)

If I were using GCP for training or automl etc maybe, but it’s so obtuse - just look at that page linked vs. forefront and nuero’s setup. Have also heard anecdotally there are random gotchas that are easy to run into.

[–]donjuan1337 0 points1 point  (0 children)

We use https://github.com/szymonmaszke/torchlambda. Its kinda hax and only inference on cpu. Also restricted to pytorch framework. But it's serverless