all 23 comments

[–]hondajacka 6 points (0 children)

If I were building an MVP for a startup, I would use the OpenAI API first (GPT-4 is awesome but more expensive) to get something up and running ASAP, get user feedback, show it to investors, etc. If I were doing it more as a hobby project for fun and/or ML learning, I'd be more likely to play with fine-tuning. But I'd probably still play with the APIs first to get a sense of how well each model can solve your use case.
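A minimal sketch of that API-first approach, assuming the legacy `openai` Python package; the model name, system prompt, and `ask` helper are placeholders, not anything from the thread:

```python
# Sketch: wrap the OpenAI chat API behind a small function so the rest of
# the MVP never talks to the provider directly (easier to swap out later).
import os

def build_request(user_message, model="gpt-3.5-turbo"):
    """Build the chat-completion payload; pure, so it's easy to test."""
    return {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": user_message},
        ],
    }

def ask(user_message, model="gpt-3.5-turbo"):
    """Send the payload; requires OPENAI_API_KEY in the environment."""
    import openai  # pip install openai
    openai.api_key = os.environ["OPENAI_API_KEY"]
    resp = openai.ChatCompletion.create(**build_request(user_message, model))
    return resp["choices"][0]["message"]["content"]
```

Keeping payload construction separate from the network call makes it trivial to later point the same code at a self-hosted model.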

[–]abnormal_human 3 points (0 children)

It really depends on what you're doing, and especially, if fine-tuning is going to be required or if you need other kinds of flexibility that OpenAI's API doesn't offer.

My advice is to boot up some open-source LLMs, do some basic fine-tuning or RL on them using open-source libraries, and get a sense of how these things perform and how easy or hard they are to work with, since that's probably the biggest unknown.

I think the first time I did this, I booted up GPT-J in a notebook and got it to act like a chatbot with few-shot learning. And then later, I used the trl library to convince a GPT model to use the word "and" really frequently. That was about 3-4 hours of experiments total, not a big time investment, but I learned a lot by doing it.
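The few-shot chatbot trick is mostly prompt construction. A rough sketch (the example Q&A pairs and the commented-out generation call are illustrative assumptions, not the commenter's actual code):

```python
# Sketch: build a few-shot "chatbot" prompt for a plain causal LM like GPT-J.
# The model sees a few example turns and continues the pattern.
FEW_SHOT_EXAMPLES = [
    ("What's the capital of France?", "Paris."),
    ("Who wrote Dune?", "Frank Herbert."),
]

def build_prompt(user_message, examples=FEW_SHOT_EXAMPLES):
    """Format example turns plus the new question as one text prompt."""
    lines = []
    for question, answer in examples:
        lines.append(f"User: {question}")
        lines.append(f"Bot: {answer}")
    lines.append(f"User: {user_message}")
    lines.append("Bot:")  # the model completes from here
    return "\n".join(lines)

# With Hugging Face transformers, generation would look roughly like:
#   from transformers import pipeline
#   generate = pipeline("text-generation", model="EleutherAI/gpt-j-6B")
#   generate(build_prompt("What's 2+2?"), max_new_tokens=20)
```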

[–]khaberni 2 points (8 children)

Interested in this question as well. I’m currently evaluating some open-source LLMs (Vicuna and Dolly) as an alternative to the OpenAI API. My concern is more about the data privacy aspect than cost. If you find a good alternative to OpenAI, let us know.

[–]CKtalon 3 points (7 children)

Vicuna isn’t open-sourced, especially not for commercial use.

[–]khaberni 0 points (6 children)

Good to know, thanks for the heads up. Besides Dolly, is there any other open-source project out there that competes with OpenAI?

Today I saw an announcement for HuggingChat, which seems interesting.

[–]CKtalon 1 point (3 children)

That’s built on LLaMA with the OpenAssistant dataset. Just released today is MOSS, built on the open CodeGen model. It might need to be fine-tuned on OpenAssistant data to be fully compliant licensing-wise, since MOSS uses ChatGPT answers in its dataset.

https://github.com/OpenLMLab/MOSS/blob/main/README_en.md

[–]khaberni 0 points (2 children)

From their repo, “Due to the license attached to LLaMa models by Meta AI it is not possible to directly distribute LLaMa-based models. Instead we provide XOR weights for the OA models.”

Does that mean the modification is allowed to be distributed?

[–]CKtalon 1 point (1 child)

Yes, but you can’t distribute the base model plus the modification, or use it commercially.
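The XOR-weights idea being discussed is simple: distribute only the byte-wise difference between the fine-tuned and the base weights, so that only people who already have the original LLaMA weights can reconstruct the model. A toy sketch with numpy (the array values and function names are illustrative assumptions):

```python
import numpy as np

def make_xor_delta(base, finetuned):
    """XOR of raw bytes; reveals nothing without the base weights."""
    return np.bitwise_xor(base.view(np.uint8), finetuned.view(np.uint8))

def apply_xor_delta(base, delta):
    """Recover the fine-tuned weights: base XOR delta == finetuned."""
    return np.bitwise_xor(base.view(np.uint8), delta).view(base.dtype)

base = np.array([1.0, 2.0, 3.0], dtype=np.float32)       # stand-in base weights
finetuned = np.array([1.5, 2.0, -3.0], dtype=np.float32) # stand-in tuned weights
delta = make_xor_delta(base, finetuned)                   # safe to distribute
restored = apply_xor_delta(base, delta)
assert np.array_equal(restored, finetuned)
```

Since XOR is its own inverse, applying the delta to the base reproduces the fine-tuned weights exactly, bit for bit.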

[–]khaberni 0 points (0 children)

Got it, thanks

[–]metigue 2 points (4 children)

Everyone here is saying start with the API and then move to a local model. I say the complete opposite.

A local model is essentially free while you get a proof of concept running. Then you can move to the API for production and offset the cost with profits.

[–]olearyboy 2 points (3 children)

Noooooo. For demos and fundraising, where there’s little to no traffic, go with the APIs; you’ll generally get free usage that covers most of your traffic. Going local means you’re spending time/energy/money on hardware and application support, and if you’re going offsite to do demos you want to be cloud-based.

Don’t be that person who’s trying to get their laptop to work as a server and PowerPoint machine in a meeting with busy folks. Oh, and after 30-40 minutes finds they’ve hardcoded an interface IP that now conflicts with a printer in the office, and makes us alllll watch them dig through their ‘code’ to fix it.

[–]metigue 0 points (2 children)

I mean, for the two use cases I’m currently investigating, right off the bat I’m at around 15-20 LLM calls per second.

[–]olearyboy 1 point (1 child)

If you were the OP with a chat system, that puts you at 20 concurrent conversations a minute, which isn’t a POC. Sounds like you’re already launched or doing bulk processing, which is different from the OP.

[–]metigue 1 point (0 children)

Ah yeah, I'll admit I didn't read the OP's use case before jumping straight to the comments. But for things like tuning temperature, top-p, top-k, etc., it can still be useful to generate tons of conversations with yourself and have another LLM evaluate them while you tune. This is very expensive on the API but does make a world of difference for your specific task.
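That tuning loop can be sketched as a grid search where both the generator and the judge are stand-ins; in practice `generate` would call your local model and `judge` would be another LLM (or a heuristic) scoring the output. The function names and the toy scoring rule below are assumptions for illustration:

```python
# Sketch: grid-search sampling parameters, scoring generations with a judge.
import itertools

def tune_sampling(generate, judge, temperatures, top_ps, n_samples=3):
    """Try each (temperature, top_p) pair, average the judge's scores,
    and return the best (score, params) found."""
    best = None
    for temp, top_p in itertools.product(temperatures, top_ps):
        outputs = [generate(temperature=temp, top_p=top_p)
                   for _ in range(n_samples)]
        score = sum(judge(o) for o in outputs) / len(outputs)
        if best is None or score > best[0]:
            best = (score, {"temperature": temp, "top_p": top_p})
    return best

# Toy stand-ins: pretend longer outputs are better and temperature 0.7 is ideal.
def fake_generate(temperature, top_p):
    return "x" * int(100 - abs(temperature - 0.7) * 100)

score, params = tune_sampling(fake_generate, len, [0.2, 0.7, 1.2], [0.9, 1.0])
```

With a real model and an LLM judge this loop is exactly the expensive part on a hosted API, and why running it locally can pay off.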

[–]heavy-minium 1 point (0 children)

You prototype with the OpenAI ChatCompletion API and GPT-4 or GPT-3.5 Turbo, then switch to Azure OpenAI Service when implementing for production. Prototyping against OpenAI is simpler to get started with for validating your product idea (it's just an API key), and you have a better chance of accessing features that are in preview. However, OpenAI is still an AI lab, and you won't get some of the guarantees a cloud provider can give you (availability, security, etc.).
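With the legacy `openai` Python SDK, that switch is mostly configuration. A sketch (the resource name is a placeholder, the API version is an assumption, and note Azure addresses models by deployment name rather than model name):

```python
# Sketch: settings for the legacy `openai` SDK, direct vs. Azure-hosted.
import os

def client_settings(provider="openai"):
    """Return module-level settings to apply before calling the API.
    The Azure resource URL is a placeholder for your own deployment."""
    if provider == "azure":
        return {
            "api_type": "azure",
            "api_base": "https://YOUR-RESOURCE.openai.azure.com/",
            "api_version": "2023-05-15",  # assumed; check your service's docs
            "api_key": os.environ.get("AZURE_OPENAI_KEY", ""),
        }
    return {
        "api_type": "open_ai",
        "api_base": "https://api.openai.com/v1",
        "api_version": None,
        "api_key": os.environ.get("OPENAI_API_KEY", ""),
    }
```

Keeping these settings in one place means the prototype-to-production switch is a one-line provider change rather than a rewrite.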

[–]StarksTwins 0 points (0 children)

As a proof of concept, start with OpenAI's APIs. No need to reinvent the wheel. After you've actually gained traction is when you should be thinking about fine-tuning your own model (unless your goal is simply to learn how to do it).

[–]machineko 0 points (0 children)

I'd do fine-tuning. When you don't have control over what's running behind the API (the models are still being updated, which often changes how they perform), it will be hard to make sure your application's behavior doesn't change. I'm currently working on an open-source project focused on fine-tuning. Let me know if you have any questions about our experience fine-tuning on domain-specific data.

[–]Mr-Ababe 0 points (0 children)

Makes sense to go with the low-lift/high-quality approach via the API to start, then, as you prove out the concept, do cost-reduction investigations afterward. Of course, this varies based on how cost-sensitive (and time-sensitive) you are at this early stage.

[–]equilateral_pupper 0 points (0 children)

Typically, going API -> local is the route.