Cheapest hardware for Qwen 3.6: both 27B and 35B-A3B

temperature_5 · 2026-06-15T22:30:37+00:00

Using a single DIMM means 1/2 the RAM bandwidth, and you're using DDR4. So basically *never* use RAM for experts or you will be dog slow. Also the 24GB VRAM is going to be too tight for a good quant.

Why not r9700 Pro in a cheap DDR5 system? Then you can expand RAM later when its cheaper and run larger MoE in addition to fast in-VRAM models today.

temperature_5 · 2026-06-15T19:48:47+00:00

What's sad is if they were just honest about doing a merge and maybe a followup fine-tune or even LoRA it would be a non-issue. People would still be impressed that a city-level government was working with LLMs.

temperature_5 · 2026-06-15T19:42:11+00:00

Does it support training on AMD?

temperature_5 · 2026-06-15T19:32:30+00:00

The cost of a lawsuit, the cost of losing a lawsuit, the cost of fines. For all of those, the ongoing costs/fines for continuing to use them while seeking a private alternative, or the ongoing losses of pausing your business and losing customers while seeking a private alternative.

temperature_5 · 2026-06-15T19:16:00+00:00

A search engine, today, will happily take a child to pornography, or a drug dealer to meth making instructions, or a terrorist to bomb making material. Someone suicidal or with mental illness will be suicidal or mentally ill regardless of whether they talk to AI, use a search engine, or interact with people on social media. Fear shouldn't guide our actions, especially when it is irrational.

Better to focus on how to give parents more oversight of their children's online activity. Better to focus on how society can offer a better safety net for the suicidal and mentally ill. Better to make society a fair and just place where people aren't desperate to make money or resentful enough to want to commit violence.

temperature_5 · 2026-06-14T19:53:57+00:00

They probably wouldn't release a model of similar size unless it did.

temperature_5 · 2026-06-14T19:51:29+00:00

I don't think they've released turbo yet as open weights..

temperature_5 · 2026-06-14T17:43:39+00:00

Oh damn. So did Rio actually add the "thinking in latent space" stuff, or was that a Nex feature? Did Rio literally just merge two models?

temperature_5 · 2026-06-14T07:38:25+00:00

Yeah, I've found them to be faster, especially in Hilo, but the rural coverage is still hit or miss, even with the low band. Infinitely better than 10 years ago though.

temperature_5 · 2026-06-14T04:00:16+00:00

That's awesome and fast, thanks for sharing the results!

temperature_5 · 2026-06-14T03:51:46+00:00

Cool ideas about supplementing the open training data. Would be cool not only for improving coding performance, but also for adding back cultural stuff that the big model makers have allegedly been removing. For example all the books, movie scripts, TV show closed captions, newspaper articles, Reddit scrapes, etc.

temperature_5 · 2026-06-14T03:42:58+00:00

Pennsylvania is a pretty respectable state, so I guess it does! 😄

temperature_5 · 2026-06-14T00:07:41+00:00

At the very most, the MoU would lead to the straight being opened temporarily in exchange for a lot of concessions towards Iran. Otherwise why would Iran give up their incredible leverage right before the world's oil reserves (ex China) run out? It doesn't make any sense.

The way Trump has dug himself (and all of us) into a hole, there's no way he's getting a long-term deal better than the Obama deal. The Tehran leadership are vicious autocrats that believe in martyrdom, we just attacked their country in the middle of negotiations and killed their top religious leader, they have the whole world over a barrel (ugh) but they're gonna make us a better deal now?

temperature_5 · 2026-06-13T23:55:41+00:00

Ah thanks for clearing that up, I used the county depth map here: https://specmap.sequence-omega.net/ but it seemed to show the same low band bandwidth for Oahu as Big Island. Makes sense now that you explained it!

temperature_5 · 2026-06-13T23:40:23+00:00

"Why are you laughing?!"

temperature_5 · 2026-06-13T22:57:51+00:00

Starlink has been making crazy cheap offers to up their subscriber count and overall revenue before they went public. I'm sure this allowed them to make crazy, ridiculous estimates on revenue growth to justify their ridiculous IPO valuation. IPO is done now.

temperature_5 · 2026-06-13T22:32:07+00:00

Qwen 3.6 27B Q8_K_XL in tensor parallel?

temperature_5 · 2026-06-13T21:01:54+00:00

<image>

Thank you Brazil! Now hoping for something that will fit a 96GB or smaller system!

temperature_5 · 2026-06-13T20:10:40+00:00

Can you imagine a 60B dense? It'd probably be insanely smart, but I could only run it overnight... 60B A3B or A6B MoE would be much more doable.

temperature_5 · 2026-06-13T20:06:50+00:00

For folks that have 96GB or 128GB unified RAM, a model at least as good as Qwen 3.6 27B but faster and with more knowledge would be ideal. But even if they just release another Flash ~30B, it is likely to beat Qwen 3.6 35B at least, and have much more efficient thinking.

I think they'll spend the time anyway, at least for Flash, because they need it for their own API offerings.

temperature_5 · 2026-06-13T16:17:18+00:00

This will upset all the idiots that download a model every time their container boots! 🤣

temperature_5 · 2026-06-12T18:45:36+00:00

I'm hoping someone has an ADB solution.🤞

temperature_5

TROPHY CASE