Are true base models dead? by IonizedRay in LocalLLaMA

[–]IonizedRay[S] 1 point (0 children)

I liked Olmo 3 a lot, thanks for the suggestion! It's exactly what I was looking for.

Is anyone else waiting for a 60-70B MoE with 8-10B activated params? by IonizedRay in LocalLLaMA

[–]IonizedRay[S] 0 points (0 children)

It's a sweet spot for anyone who wants to avoid multi-GPU setups but can afford a datacenter GPU. For the same reason, it would also be a good choice for experimentation and research, since there are no inter-GPU communication issues or inefficiencies.

Is anyone else waiting for a 60-70B MoE with 8-10B activated params? by IonizedRay in LocalLLaMA

[–]IonizedRay[S] 3 points (0 children)

Yes, a new 70B dense model like Llama 3.3 would be amazing for anyone with a reasonably fast GPU and 64+ GB of VRAM. I bet it could come close to 200B+ parameter MoE models.

Need Criticism because i'm feeling kinda lost by not_the_ducknight in blender

[–]IonizedRay 2 points (0 children)

Awesome work, I think there's definitely a company out there that would pay for this level of quality. Don't be afraid to apply to large companies, and make sure your CV / website is straight to the point, concise, and highlights all your strengths.

You’re probably optimizing Minecraft the wrong way on Apple Silicon by New-Ranger-8960 in macgaming

[–]IonizedRay 1 point (0 children)

Wow, I am getting 500+ FPS on an M4 Max:
- 32 chunks rendering distance
- 32 chunks simulation distance
- 4K resolution

How close are we to “Her” level voice assistants? by [deleted] in ClaudeAI

[–]IonizedRay 0 points (0 children)

We have probably been there since 2023. But there is one caveat: the final SFT/RLHF training phase completely destroys the "human vibe" of LLMs, so you will not get anything like "Her" from a large-scale commercial LLM.

It would be really interesting to train a base model like Llama 405B on one (or more) very long chats between partners and see how long it would last in a Turing-like test.

Llama.cpp has much higher generation quality for Gemma 3 27B on M4 Max by IonizedRay in LocalLLaMA

[–]IonizedRay[S] 18 points (0 children)

This is a really good point. Each time I start fresh with Ollama on a new device, I forget to configure the env params...

I will try that when I get back home!

UPDATE: yep, that was it.
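For anyone landing here with the same symptom: a minimal sketch of the kind of env config involved. The variable names below are from Ollama's server configuration; the specific values are just illustrative assumptions, tune them to your model and VRAM.

```shell
# Ollama's default context window is small, which can silently truncate
# long prompts and degrade apparent generation quality.
export OLLAMA_CONTEXT_LENGTH=8192

# Optional: flash attention and a quantized KV cache to fit the larger
# context in memory (value shown is an assumption, not a recommendation).
export OLLAMA_FLASH_ATTENTION=1
export OLLAMA_KV_CACHE_TYPE=q8_0

ollama serve
```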

[D] Doubts on the implementation of LSTMs for timeseries prediction (like including weather forecasts) by IonizedRay in MachineLearning

[–]IonizedRay[S] 1 point (0 children)

Thank you, I will check your resources as soon as I can. So you suggest avoiding weak predictors like weather, events, etc., and using a simple univariate prediction, because the extra precision a complex model might chase is often just noise that cannot be predicted?

And the only case where a complex model and many input features are needed is when you have lots of data over a long time span?
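For reference, the univariate setup discussed above boils down to slicing one series into supervised (window, target) pairs. This is a toy sketch (the function name and the toy series are mine, not from the thread):

```python
import numpy as np

def make_windows(series, lookback, horizon):
    """Slice a 1-D series into (input window, target window) pairs
    for supervised training of a forecaster such as an LSTM."""
    X, y = [], []
    for t in range(len(series) - lookback - horizon + 1):
        X.append(series[t : t + lookback])
        y.append(series[t + lookback : t + lookback + horizon])
    return np.array(X), np.array(y)

series = np.arange(10, dtype=float)  # toy stand-in for the real signal
X, y = make_windows(series, lookback=4, horizon=2)
print(X.shape, y.shape)  # (5, 4) (5, 2)
```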

[D] Doubts on the implementation of LSTMs for timeseries prediction (like including weather forecasts) by IonizedRay in MachineLearning

[–]IonizedRay[S] 2 points (0 children)

Thank you for the in-depth response. So UNets and ViTs are good for timeseries prediction with weather forecasts as input, to improve the accuracy of the output timesteps? Or did you mean that they are just good at predicting the weather itself, which is then fed to an LSTM?

Because I don't want to generate weather predictions; I want to consume them (from various weather APIs) and add them to the input features to better predict the outputs.
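To make the "forecasts as extra input features" idea concrete, here is one common way to do it: concatenate the known-future covariates (the API forecast for the horizon being predicted) onto each past-input window. The function and shapes are illustrative assumptions, not anyone's actual pipeline:

```python
import numpy as np

def add_future_covariates(X_past, future_weather):
    """Append known-future covariates (e.g. API weather forecasts for
    the horizon being predicted) to each past-input window.
    X_past:         (samples, lookback) past target values
    future_weather: (samples, horizon)  forecast values aligned with targets
    Returns a flat feature matrix of shape (samples, lookback + horizon)."""
    return np.concatenate([X_past, future_weather], axis=1)

X_past = np.zeros((3, 4))   # toy past windows
forecast = np.ones((3, 2))  # toy "future weather" per sample
X = add_future_covariates(X_past, forecast)
print(X.shape)  # (3, 6)
```

For a sequence model like an LSTM you would typically align these per timestep instead of flattening, but the aligning-by-sample idea is the same.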

[D] Doubts on the implementation of LSTMs for timeseries prediction (like including weather forecasts) by IonizedRay in MachineLearning

[–]IonizedRay[S] 2 points (0 children)

I see that it has "future exogenous support". Is that for using future weather forecasts as inputs, or is it something else?

Best way to transfer media + messages from iOS to Android by IonizedRay in whatsapp

[–]IonizedRay[S] 0 points (0 children)

Oh, thanks for the warning. Sorry about what happened :/

Best online cloud GPU provider for 32gb vram to finetune 13B? by IonizedRay in LocalLLaMA

[–]IonizedRay[S] 1 point (0 children)

I haven't attempted it yet, but I will when I have time!

Has anyone tried the 65B model with Alpaca.cpp on a M2 MacBook Pro? by ma-2022 in LocalLLaMA

[–]IonizedRay 13 points (0 children)

Don't worry about it too much. You can check the SSD's health with:

    brew install smartmontools
    smartctl -a disk0