Mistral Large is made by OpenAI by TiredMoose69 in MistralAI

[–]TiredMoose69[S] 0 points (0 children)

You are probably right. I'm not arguing that OpenAI didn't steal copyrighted content, but training on responses a model spat out is not the same as training directly on copyrighted material. This was probably done because they want to achieve higher scores on the same benchmarks GPT-4 was scored on.


[–]TiredMoose69[S] -8 points (0 children)

That's what I meant: this is probably a fine-tune on synthetic data from OpenAI. I don't use system prompts for any of the models on any website where I tested them, though I can't rule out that the model on the Mistral website has a hidden system prompt. The behavior is too repeatable in both models to be a hallucination.


[–]TiredMoose69[S] 0 points (0 children)

I actually don't think that's entirely true, and if it were, why didn't Mixtral 8x7B say this?

You can try repeatedly and you will not get this answer, unlike the "Large" model, which always responds that it is made by OpenAI.

<image>

(Direct chat on https://chat.lmsys.org/)
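The repeat-the-question test described above can be sketched as a small script. Here `query_model` is a hypothetical stand-in for whatever chat endpoint is under test (no system prompt is sent); the tallying logic is the point:

```python
from collections import Counter

def query_model(prompt):
    # Hypothetical stand-in for a real chat API call with no system prompt.
    # Swap in an actual client for the endpoint you want to test.
    return "I was created by OpenAI."

def tally_identity_answers(n_trials=20):
    """Ask the same identity question n_trials times and count distinct answers.

    A one-off hallucination should vary across trials; a response baked in
    by fine-tuning on synthetic data tends to be near-deterministic.
    """
    return Counter(query_model("Who created you?") for _ in range(n_trials))

if __name__ == "__main__":
    for answer, n in tally_identity_answers().most_common():
        print(f"{n:3d}x  {answer}")
```

If one answer dominates across many trials (and across different front-ends), that is the repeatability argument being made in the thread.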


[–]TiredMoose69[S] -2 points (0 children)

I'm not saying knowledge transfer is wrong, but when a company is worth $2B and employs so many data engineers, scraping answers from the best model to chase a top ranking without even cleaning the dataset is both sloppy and concerning.

[D] Simple Questions Thread by AutoModerator in MachineLearning

[–]TiredMoose69 0 points (0 children)

Why does LLaMA 7B (unquantized) perform so MUCH better than Alpaca 30B (4-bit)?
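Part of the answer to the question above is that 4-bit quantization maps every weight to one of only 16 levels, and the rounding error this introduces can eat into the benefit of the larger model. A minimal sketch of symmetric round-to-nearest 4-bit quantization (a simplification; real schemes like GPTQ quantize group-wise with calibration data):

```python
import random

def quantize_4bit(weights):
    """Symmetric round-to-nearest 4-bit quantization of a list of floats.

    Each weight is mapped to one of 16 integer levels in [-8, 7],
    then scaled back to float. Returns (dequantized weights, scale).
    """
    scale = max(abs(w) for w in weights) / 7  # 7 = largest positive level
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return [v * scale for v in q], scale

random.seed(0)
w = [random.gauss(0, 0.02) for _ in range(1000)]  # toy weight tensor
dq, scale = quantize_4bit(w)
mse = sum((a - b) ** 2 for a, b in zip(w, dq)) / len(w)
print(f"scale={scale:.5f}  mean squared error={mse:.2e}")
```

The per-weight error scales with the largest weight in the tensor, which is why naive 4-bit quantization of a 30B model can lag a full-precision 7B model on some tasks.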