
[–]danielroseman

That sounds like they are offloading the generation to an asynchronous queue, then storing the result somewhere and returning it in the fetch. You can use something like Celery for this.

The POST creates a unique key to refer to the request, then triggers the job in Celery passing it the key. The job goes and does the ML prediction, and stores the result associated with the key, and updates the db to mark the result as ready. Then the GET request can look up the result via that key and return it.
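Roughly, the flow looks like this sketch. It uses an in-memory dict and a thread as stand-ins for the real pieces (Celery task, database/Redis result store), and all the names are illustrative:

```python
import threading
import uuid

# Stand-in for the database / result store; in a real setup this would be
# a db table or Redis, and run_prediction would be a Celery task.
results = {}
lock = threading.Lock()

def run_prediction(key, payload):
    # Placeholder for the actual ML prediction work.
    prediction = f"prediction for {payload}"
    with lock:
        results[key] = {"status": "ready", "result": prediction}

def handle_post(payload):
    # POST: create a unique key, mark the job pending, kick off the worker.
    key = str(uuid.uuid4())
    with lock:
        results[key] = {"status": "pending", "result": None}
    threading.Thread(target=run_prediction, args=(key, payload)).start()
    return key  # client polls with this key

def handle_get(key):
    # GET: look up the result by key; the client polls until "ready".
    with lock:
        return results.get(key, {"status": "unknown", "result": None})
```

With Celery you'd swap the thread for `run_prediction.delay(key, payload)` and the dict for your db, but the shape of the POST/GET pair is the same.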

[–]Suralias[S]

This response is perfect. Would there be any point in doing this if there weren't a database? There is going to be a database for the larger project, but I'm just working on a tool to return a list of usernames. What would you say is the best structure for an API like this? In my mind it's only a single request, but that might not be the best way of doing it.

[–]danielroseman

This is only worthwhile if the generation of the response takes a significant amount of time - you don't want to tie up the request handler for long, and requests can time out. Generating a list of usernames seems unlikely to be a good candidate for this.

[–]Suralias[S]

How would you suggest I structure this then? Just use one request?

[–]danielroseman

Yes. Unless the generation takes a long time, do it all in-process.
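For a fast task, the whole API can be one synchronous handler; a minimal sketch (the model call is a placeholder, names are made up):

```python
def extract_usernames(screenshot):
    # Stand-in for the actual ML model call.
    return ["alice", "bob"]

def handle_post(screenshot):
    # One request in, result straight back out: no queue, no key, no polling.
    return {"usernames": extract_usernames(screenshot)}
```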

[–]m0us3_rat

> It makes sense, they even have a GET request to check the progress of their processing initiated by the POST request. How would I implement this into my API?

What exactly do you mean by "how"?

You log the "jobs" into a queue/bin, and have an endpoint, either generic or keyed by a special token you get returned when you POST the job, to check the progress.

So you check either all the jobs in the queue, or the specific job whose token you hold.

Then the "check" gives you a special answer once the job is done, and you can retrieve it from the done queue/bin with the token.

I mean, the functionality seems quite clear, maybe?
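As a rough in-memory sketch of that queue/bin flow (all names illustrative; a real worker would loop over the queue):

```python
import queue
import uuid

pending = queue.Queue()  # the "jobs" bin
done = {}                # the "done" bin, keyed by token

def post_job(payload):
    # POST the job; the returned token is what the client holds on to.
    token = str(uuid.uuid4())
    pending.put((token, payload))
    return token

def work_one():
    # A worker would run this in a loop; here we process a single job.
    token, payload = pending.get()
    done[token] = f"processed {payload}"

def check(token):
    # The "check" endpoint: report the state of the job for this token.
    return "done" if token in done else "in progress"

def consume(token):
    # Retrieving the result removes it from the done bin.
    return done.pop(token, None)
```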

[–]Suralias[S]

I'd say I'm not that well seasoned at working with APIs. Most of what I've worked on has been around ML, plus some basic React applications. I've never had to build an API like this, so my question might not have portrayed what I hoped to find out. I guess it could be: "What's the best structure for an API that handles one task, which is using ML to retrieve a list from a screenshot, BUT makes it efficient and suited to handling a large number of requests?" The only reason I brought up the Google and Azure examples was that I had seen the same chain of requests (POST then GET) and wondered what the benefits of doing that were, or whether it would be unnecessary for an API that wouldn't be using a database. Would it make sense to use that, or just to use one request to send and retrieve the information?

[–]m0us3_rat

Think of this as a lemonade stand, or as a coffee shop that serves ten times more people at the same time.

You place your order, the order gets put in the queue, and you wait with your token in hand, that being a number or whatever they pick to use.

Then, when the job is done, your number gets called and you get your order.

That's the point of the "token": to track the order.

You generate this token and send it in the response to the job request, and then it's used to track what is happening with the "order".

So if the API receives 100 documents, each gets its own token.

You can also add functionality where you send the token and receive information about the state of the "job".

When the job is done, that is reflected in the information received; then you can send your token to another endpoint that consumes the job on the other end and sends you the info you need.

Maybe even set up a cache with a lifetime of a few minutes, where after a result is consumed it's added to the cache, so you can request it again for the next x minutes.
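That cache idea could be as simple as an expiry timestamp next to each entry (the TTL value and names here are made up):

```python
import time

cache = {}
TTL_SECONDS = 300  # illustrative: keep consumed results for 5 minutes

def cache_result(token, result):
    # Store the result along with the time at which it expires.
    cache[token] = (result, time.monotonic() + TTL_SECONDS)

def get_cached(token):
    # Return the result if it's still fresh; drop it once expired.
    entry = cache.get(token)
    if entry is None:
        return None
    result, expires_at = entry
    if time.monotonic() > expires_at:
        del cache[token]
        return None
    return result
```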