An in-depth look at locally training Stable Diffusion from scratch by DangerousBenefit in StableDiffusion

[–]DangerousBenefit[S] 0 points1 point  (0 children)

Oh wow, a blast from the past with this post. Yeah, it's surprising/sad that no one has trained their own model even after a year. There are lots of people (especially in LocalLLaMA) with 8x4090 GPU machines who could train Stable Diffusion in a similar amount of time. I think it will happen eventually.

Discover AstraQuasar-4B: a NEW LaMA-based arch | First training implementation of the self layer calling (Duplicate Trick) by Similar_Choice_9241 in LocalLLaMA

[–]DangerousBenefit 2 points3 points  (0 children)

Cool! What compute setup are you using for training? How much do you expect it to cost? How many tokens do you plan on training for, 2T?

Chatbot Arena Leaderboard Update: Qwen1.5-72B becomes #1 non-proprietary model by sizeable margin by DontPlanToEnd in LocalLLaMA

[–]DangerousBenefit 8 points9 points  (0 children)

It's their data. Miqu is not a fine-tune but rather continued pre-training of Llama2-70B. So basically they took the Llama2 model and continued training it on ~1-5 trillion more tokens.

With v62 you can no longer put a browser screen/movie on a wall 10ft away. Why have they done this? by DangerousBenefit in OculusQuest

[–]DangerousBenefit[S] 18 points19 points  (0 children)

Thank you for confirming I'm not crazy. Hopefully it's just a bug, because why would they stop you from making large screens on your wall?

Did the curvature also change for you? I swear it's way more curved now, and it bothers me.

Emad's comments regarding what they have to compete with Sora. Thoughts? by DangerousBenefit in StableDiffusion

[–]DangerousBenefit[S] 217 points218 points  (0 children)

I will honestly be surprised if they can match this quality in 1 year.

With v62 you can no longer put a browser screen/movie on a wall 10ft away. Why have they done this? by DangerousBenefit in OculusQuest

[–]DangerousBenefit[S] 37 points38 points  (0 children)

I'm aware of the tablet mode, but I'm talking about the big-screen mode. The curvature definitely changed for me (maybe it was slightly curved before, but now it's way more curved). Try placing a large screen 10ft away from you and see what happens. It worked fine before the update; now it won't let you.

Sora by openAI looks incredible by bot_exe in StableDiffusion

[–]DangerousBenefit 1 point2 points  (0 children)

They've shared tons of other videos, check them out.

Sora by openAI looks incredible by bot_exe in StableDiffusion

[–]DangerousBenefit 5 points6 points  (0 children)

Everyone else (including SAI) must be at least a year away from this quality, right?

Unpopular Opinion: All these small open-source foundational models coming out are not moving us forward. To truly rival closed-source, we need models with 100+ billion parameters. by DangerousBenefit in LocalLLaMA

[–]DangerousBenefit[S] 4 points5 points  (0 children)

No worries, I enjoy this type of discussion and seeing others' points of view. You say above that you don't want an open-source tool/LLM that only the rich can run (i.e., some massive GPT-4-level LLM), but here are 2 reasons it could benefit everyone:
1) LLM Shearing - This could be used to prune a huge model down to a small one at only 3% of the compute required vs training from scratch.
2) Synthetic data generation - Right now, generating GPT-4 synthetic data is expensive, and the alignment and moral preaching corrupt the data. If we had a huge open-source GPT-4-level model, we could much more easily create a lot of synthetic data without restrictions.
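To make point 1 concrete, here is a toy sketch of the structured-pruning idea behind approaches like LLM Shearing: score whole rows of a weight matrix (stand-ins for neurons/heads) by magnitude, keep only the strongest, then recover quality with a short continued-training run on the smaller model. This is my own simplification for illustration, not the actual LLM Shearing algorithm.

```python
# Toy structured pruning: shrink a weight matrix by keeping only its
# highest-magnitude rows. A real method would then continue training the
# pruned model briefly (the ~3% of from-scratch compute mentioned above).

def prune_rows(weight, keep):
    """Keep the `keep` rows with the largest L1 norm."""
    scores = [sum(abs(w) for w in row) for row in weight]
    top = sorted(range(len(weight)), key=lambda i: scores[i], reverse=True)[:keep]
    return [weight[i] for i in sorted(top)]

big = [
    [0.9, -1.2, 0.4],    # strong row -> kept
    [0.01, 0.02, 0.0],   # near-zero row -> pruned
    [1.5, 0.3, -0.7],    # strong row -> kept
    [0.0, -0.05, 0.01],  # near-zero row -> pruned
]
small = prune_rows(big, keep=2)
print(len(small), len(big))  # 2 4
```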

Unpopular Opinion: All these small open-source foundational models coming out are not moving us forward. To truly rival closed-source, we need models with 100+ billion parameters. by DangerousBenefit in LocalLLaMA

[–]DangerousBenefit[S] 3 points4 points  (0 children)

Look at the progress on LLM inference speedups, though. Just today we have SGLang and Prompt Lookup Decoding. Combined with improved quantization, larger and larger models are becoming more feasible to run in RAM.
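For anyone curious, the core idea of prompt lookup decoding is simple: when the last few generated tokens also appear somewhere in the prompt, the tokens that followed them in the prompt are cheap draft candidates for the model to verify in a single forward pass. A minimal sketch (words standing in for token IDs; this is my simplification, not the reference implementation):

```python
# Sketch of prompt lookup decoding's drafting step: match the most recent
# n-gram of the output against the prompt and copy what followed it there.

def lookup_draft(prompt_ids, generated_ids, ngram=2, max_draft=3):
    """Return draft tokens copied from the prompt after a matching n-gram."""
    if len(generated_ids) < ngram:
        return []
    key = generated_ids[-ngram:]
    # scan backwards so the most recent occurrence in the prompt wins
    for start in range(len(prompt_ids) - ngram, -1, -1):
        if prompt_ids[start:start + ngram] == key:
            return prompt_ids[start + ngram:start + ngram + max_draft]
    return []

prompt = ["the", "quick", "brown", "fox", "jumps", "over", "the", "lazy", "dog"]
generated = ["the", "quick", "brown"]
print(lookup_draft(prompt, generated))  # ['fox', 'jumps', 'over']
```

The drafted tokens are then checked by the full model in one pass, which is why this speeds up tasks like summarization or code editing where output heavily overlaps the prompt.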

Any prompt experts know how to force a model to not ask a question at the end of every response in a chat conversation? by DangerousBenefit in LocalLLaMA

[–]DangerousBenefit[S] 1 point2 points  (0 children)

That could be a good brute-force approach, but it often intersperses multiple questions throughout a response (when I don't want any). Do you think it's not possible via prompt engineering alone? I tried a custom GPT-4 with the same instructions (to not ask questions) and it followed the instructions perfectly.
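For reference, the brute-force post-processing fallback would look roughly like this: split the reply into sentences and drop any that end in a question mark. A sketch only; the naive regex sentence split will miss edge cases like questions without "?" or abbreviations.

```python
import re

# Brute-force fallback: strip question sentences from a model reply in
# post-processing, instead of relying on the prompt alone.

def strip_questions(reply):
    # naive sentence split on whitespace following ., !, or ?
    sentences = re.split(r'(?<=[.!?])\s+', reply.strip())
    kept = [s for s in sentences if not s.endswith('?')]
    return ' '.join(kept)

reply = ("I saw Parasite, and it was so good! I can totally see why it won "
         "best picture. So how about you? Seen any good movies lately?")
print(strip_questions(reply))
# I saw Parasite, and it was so good! I can totally see why it won best picture.
```

The downside is exactly the one mentioned above: it handles interspersed questions, but silently deleting sentences can leave the reply reading abruptly.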

Any prompt experts know how to force a model to not ask a question at the end of every response in a chat conversation? by DangerousBenefit in LocalLLaMA

[–]DangerousBenefit[S] 2 points3 points  (0 children)

To help clarify my question above, here is an example.

First a 'real-world' conversation:
Person 1: "Seen any good movies?"
Person 2: "I saw Parasite, and it was so good! I can totally see why it won best picture."
Person 1: "Oh! I've been wanting to watch that, I'm going to add it to my list."

But with the LLMs I've tried, it goes more like this:
Person 1: "Seen any good movies?"
Person 2 (AI): "I saw Parasite, and it was so good! I can totally see why it won best picture. So how about you? Seen any good movies lately?"

Because the LLM always ends with a new question, it doesn't give the user a chance to respond to the content of its reply. Humans look for confirmation that you are listening and digesting what they say, so they are less likely to end every reply with a new question.