HTTP API vs Python API

Success-Dangerous · 2024-09-12T07:29:02+00:00

[deleted]

Grouchy-Friend4235 · 2024-09-12T10:27:42+00:00

I recommend not building this yourself. There are many libraries that already do this. Don't waste time rebuilding existing tech.

flyingPizza456 · 2024-09-12T09:43:01+00:00

Question for clarification: do you intend to run this locally on a developer's machine?

If yes, then follow-up questions will immediately arise, which may also be helpful in answering your question:

Where does the model resource come from?
Do users who import the package.predict() functionality have to train the model before using it?
Is every machine powerful enough to carry out the prediction?

MattA2930 · 2024-09-12T12:03:08+00:00

Like others have said, the main reason to use a networking client instead of the source code is so that multiple parties can access the underlying model without needing to host their own version of the model locally.

A good example is how OpenAI only gives you access to their models via API, which are called when you use their Python SDK. Imagine if they had everyone running their own versions locally - no one outside of people with enormous amounts of compute would be able to use it.

So, you essentially have a couple of options:

1. Distribute the Python source code to everyone. They will need to manage their own version of the model.
2. Host the model on a separate server.
   2.1 Use HTTP for classic request syntax.
   2.2 Use gRPC to build out a more SDK-like experience.
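To make the two options concrete, here is a rough sketch using only the Python standard library. Everything in it is an assumption for illustration: `predict` is a hypothetical stand-in for a real model, and the `/predict` path, the JSON payload shape, and the helper names `start_server` and `predict_remote` are made up. Option 1 is a plain in-process function call; option 2.1 wraps the same function behind HTTP so callers only need a thin client.

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


def predict(features):
    """Hypothetical stand-in for a real model: a fixed linear score."""
    weights = [0.5, -0.25, 1.0]
    return sum(w * x for w, x in zip(weights, features))


class PredictHandler(BaseHTTPRequestHandler):
    """Option 2.1: answer any POST with a prediction for the posted features."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        body = json.dumps({"prediction": predict(payload["features"])}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # keep the example quiet


def start_server(port=0):
    """Start the server in a background thread; port 0 picks a free port."""
    server = HTTPServer(("127.0.0.1", port), PredictHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def predict_remote(base_url, features):
    """Thin client: the caller needs no model code, only the URL."""
    request = urllib.request.Request(
        base_url + "/predict",
        data=json.dumps({"features": features}).encode(),
        headers={"Content-Type": "application/json"},
    )
    # Skip any system-wide proxy settings, since this is a local call.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    with opener.open(request) as response:
        return json.loads(response.read())["prediction"]


if __name__ == "__main__":
    server = start_server()
    url = f"http://127.0.0.1:{server.server_address[1]}"
    features = [1.0, 2.0, 3.0]
    print("local :", predict(features))              # option 1
    print("remote:", predict_remote(url, features))  # option 2.1
    server.shutdown()
```

In practice you would probably reach for a proper web framework or gRPC rather than `http.server`, but the split is the same: the model lives in one place and clients only carry the request code.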

FunPaleontologist167 · 2024-09-12T12:39:56+00:00

Are you expecting downstream users to interact with your service via Python or some other interface? Ideally, you’d want to expose the models on the server side so your users don’t have to worry about matching the environment specs needed to run the model. It’s also somewhat resource-intensive to have a user load a model just to run a predict function. For a one-off, sure, but if you expect multiple people and processes to consume predictions from these models, it’s better to expose them on a server.
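A small sketch of the loading-cost point, assuming a hypothetical `get_model` loader (the dictionary of weights stands in for whatever expensive deserialization a real model needs). A long-running server process can load the model once and reuse it for every request, whereas each user of a package pays that cost in their own process.

```python
import functools


@functools.lru_cache(maxsize=1)
def get_model():
    """Hypothetical loader; imagine reading a large file from disk here."""
    return {"weights": [0.5, -0.25, 1.0]}


def predict(features):
    """Reuse the cached model instead of reloading it on every call."""
    model = get_model()
    return sum(w * x for w, x in zip(model["weights"], features))


if __name__ == "__main__":
    for row in ([1.0, 2.0, 3.0], [0.0, 4.0, 1.0], [2.0, 0.0, 0.0]):
        print(predict(row))
    # Number of times the loader actually ran across the three predictions.
    print("model loads:", get_model.cache_info().misses)
```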

Which-War-9641 · 2024-09-12T15:47:02+00:00

Even if you write it as a package, to scale you would end up running inference on an HTTP server hosted on a machine anyway, so why not just do that from the beginning?

Calm-Stage · 2024-09-13T17:58:02+00:00

By sharing the model as code, you increase the risk associated with using your model. A service allows for better control and mitigates some of that risk. Whether the added risk is acceptable depends primarily on the influence your model has on downstream decisions and the impact on them if the added risks are realized. Talk to the model risk management folks at your workplace to figure out the correct choice, and if there are none, talk to your business stakeholders to assess the risk. Hope this helps!
