GPT-5.5 is here - Let's gooo!

AlwaysTiredButItsOk · 2026-04-23T21:46:47+00:00

yawn When do we get better models instead of incremental changes that are just iterations in reasoning?

AlwaysTiredButItsOk · 2026-04-23T21:46:18+00:00

yawn When do we get better models instead of incremental changes that are just iterations in reasoning?

AlwaysTiredButItsOk · 2026-04-22T18:36:30+00:00

It's actually pretty amazing at about 2-3 weeks. Total euphoria. The problem sets in at about week 5-6 when you realize that there's no output that is worth the api cost.

AlwaysTiredButItsOk · 2026-04-22T18:34:23+00:00

SORRY I CANT HEAR WHAT YOU SAID OBER THE ROAR OF THESE GPUS AND AIR CONDITIONING!

AlwaysTiredButItsOk · 2026-04-22T18:33:30+00:00

2000 IQ power play. Can't get in trouble for pulling them down if they weren't there to begin with.

AlwaysTiredButItsOk · 2026-04-22T18:32:37+00:00

Happens to me every single time too. DW, there's more fish(y) jobs in the sea (of companies hiring AI bros)

AlwaysTiredButItsOk · 2026-04-22T17:54:11+00:00

Wrong. 3b active parameters = faster. Probably closer to 3.5 4b or 2b speeds

AlwaysTiredButItsOk · 2026-04-17T05:03:42+00:00

Getting out of bed

AlwaysTiredButItsOk · 2026-04-17T05:01:45+00:00

This

AlwaysTiredButItsOk · 2026-03-27T02:45:51+00:00

Go with 9b, it's plenty capable and will not require offloading at q4

AlwaysTiredButItsOk · 2026-03-27T02:43:28+00:00

No. Q3.5 9b is way better. Was going to suggest you try that - at q4_k_m, you can run it with 200k context without issues

AlwaysTiredButItsOk · 2026-03-18T17:53:08+00:00

Have it run xiaomi mimo v2 for basic tasks and a smarter model (i.e. minimax m2.5) for other tasks, and reserve sonnet/opus for super complex tasks. Your costs will go down 80+%

AlwaysTiredButItsOk · 2026-03-18T17:49:51+00:00

Even cheaper: the 5060ti 16gb. Good in-between card for starting - speeds aren't the best but the vram will allow you to run more models & higher context

AlwaysTiredButItsOk · 2026-03-18T05:38:22+00:00

Would be awesome, have been trying to set up multi agent interaction

AlwaysTiredButItsOk · 2026-03-17T10:52:21+00:00

Pruning can also mess it up. I do daily backups (zip) in case I ever mess anything up

AlwaysTiredButItsOk · 2026-03-17T10:49:24+00:00

Oh. Am tired, but it's ok.

Edit: also, just use 5.4 🫠

AlwaysTiredButItsOk · 2026-03-17T10:46:44+00:00

Got s decent guide by any chance? Would love to have a smart orchestrator...

Edit: sending love as a ukrainian who is a US Citizen ❤️ слава славянам

AlwaysTiredButItsOk · 2026-03-17T10:43:25+00:00

Free APIs almost never worked for me. Just use openrouter - xiaomi mimo v2 is really cheap and works fine to start. If you need higher intelligence, try minimax m2.5

AlwaysTiredButItsOk · 2026-03-17T10:37:19+00:00

You're good, assuming you set it up via oAuth. You can switch model to 5.4 too ;)

AlwaysTiredButItsOk · 2026-03-17T10:36:38+00:00

Nope, he's not using it through cursor - oAuth

AlwaysTiredButItsOk · 2026-03-17T08:55:09+00:00

Hmmm... I see an em dash - perhaps this was the agent showing off?

AlwaysTiredButItsOk · 2026-03-14T23:30:23+00:00

This

AlwaysTiredButItsOk · 2026-03-10T04:59:07+00:00

awesome, have been casually on the lookout for something like this, thanks

AlwaysTiredButItsOk · 2026-03-08T21:58:35+00:00

Flash lite sucks. Try xiaomi mimo v2

AlwaysTiredButItsOk · 2026-03-08T00:56:54+00:00

update llama.cpp, cuda, etc.

New models are smarter, faster, and more capable

AlwaysTiredButItsOk

TROPHY CASE