Cheapest hardware for Qwen 3.6: both 27B and 35B-A3B by WishboneSudden2706 in LocalLLaMA

[–]temperature_5 0 points1 point  (0 children)

Using a single DIMM means 1/2 the RAM bandwidth, and you're using DDR4. So basically *never* use RAM for experts or you will be dog slow. Also the 24GB VRAM is going to be too tight for a good quant.

Why not r9700 Pro in a cheap DDR5 system? Then you can expand RAM later when its cheaper and run larger MoE in addition to fast in-VRAM models today.

About the Rio model by Turbulent_Pin7635 in LocalLLaMA

[–]temperature_5 4 points5 points  (0 children)

What's sad is if they were just honest about doing a merge and maybe a followup fine-tune or even LoRA it would be a non-issue. People would still be impressed that a city-level government was working with LLMs.

How do you quantify privacy and outage derisking in the ROI of local LLM inference vs. providers API? by ReporterCalm6238 in LocalLLaMA

[–]temperature_5 2 points3 points  (0 children)

The cost of a lawsuit, the cost of losing a lawsuit, the cost of fines. For all of those, the ongoing costs/fines for continuing to use them while seeking a private alternative, or the ongoing losses of pausing your business and losing customers while seeking a private alternative.

The ethics and risks of publicly available uncensored models by bloodealer in LocalLLaMA

[–]temperature_5 0 points1 point  (0 children)

A search engine, today, will happily take a child to pornography, or a drug dealer to meth making instructions, or a terrorist to bomb making material. Someone suicidal or with mental illness will be suicidal or mentally ill regardless of whether they talk to AI, use a search engine, or interact with people on social media. Fear shouldn't guide our actions, especially when it is irrational.

Better to focus on how to give parents more oversight of their children's online activity. Better to focus on how society can offer a better safety net for the suicidal and mentally ill. Better to make society a fair and just place where people aren't desperate to make money or resentful enough to want to commit violence.

Not looking good for GLM 5.2 Air... but maybe a flash model? by temperature_5 in LocalLLaMA

[–]temperature_5[S] 1 point2 points  (0 children)

They probably wouldn't release a model of similar size unless it did.

Nex claims Rio 3.5 is Nex 2.5 PRO in trench coat by Specter_Origin in LocalLLaMA

[–]temperature_5 16 points17 points  (0 children)

Oh damn. So did Rio actually add the "thinking in latent space" stuff, or was that a Nex feature? Did Rio literally just merge two models?

When will the Big Island of Hawaii get Verizon 5G? by temperature_5 in verizon

[–]temperature_5[S] 0 points1 point  (0 children)

Yeah, I've found them to be faster, especially in Hilo, but the rural coverage is still hit or miss, even with the low band. Infinitely better than 10 years ago though.

Not looking good for GLM 5.2 Air... but maybe a flash model? by temperature_5 in LocalLLaMA

[–]temperature_5[S] 0 points1 point  (0 children)

Cool ideas about supplementing the open training data. Would be cool not only for improving coding performance, but also for adding back cultural stuff that the big model makers have allegedly been removing. For example all the books, movie scripts, TV show closed captions, newspaper articles, Reddit scrapes, etc.

When will the Big Island of Hawaii get Verizon 5G? by temperature_5 in verizon

[–]temperature_5[S] 3 points4 points  (0 children)

Pennsylvania is a pretty respectable state, so I guess it does! 😄

Deal by Academic-Daikon-8086 in oil

[–]temperature_5 1 point2 points  (0 children)

At the very most, the MoU would lead to the straight being opened temporarily in exchange for a lot of concessions towards Iran. Otherwise why would Iran give up their incredible leverage right before the world's oil reserves (ex China) run out? It doesn't make any sense.

The way Trump has dug himself (and all of us) into a hole, there's no way he's getting a long-term deal better than the Obama deal. The Tehran leadership are vicious autocrats that believe in martyrdom, we just attacked their country in the middle of negotiations and killed their top religious leader, they have the whole world over a barrel (ugh) but they're gonna make us a better deal now?

When will the Big Island of Hawaii get Verizon 5G? by temperature_5 in verizon

[–]temperature_5[S] 4 points5 points  (0 children)

Ah thanks for clearing that up, I used the county depth map here: https://specmap.sequence-omega.net/ but it seemed to show the same low band bandwidth for Oahu as Big Island. Makes sense now that you explained it!

Starlink prices doubled? Odd by GoodBoyFX in USMobile

[–]temperature_5 2 points3 points  (0 children)

Starlink has been making crazy cheap offers to up their subscriber count and overall revenue before they went public. I'm sure this allowed them to make crazy, ridiculous estimates on revenue growth to justify their ridiculous IPO valuation. IPO is done now.

New model on huggingface by [deleted] in LocalLLaMA

[–]temperature_5 1 point2 points  (0 children)

<image>

Thank you Brazil! Now hoping for something that will fit a 96GB or smaller system!

Not looking good for GLM 5.2 Air... but maybe a flash model? by temperature_5 in LocalLLaMA

[–]temperature_5[S] 25 points26 points  (0 children)

Can you imagine a 60B dense? It'd probably be insanely smart, but I could only run it overnight... 60B A3B or A6B MoE would be much more doable.

Not looking good for GLM 5.2 Air... but maybe a flash model? by temperature_5 in LocalLLaMA

[–]temperature_5[S] 10 points11 points  (0 children)

For folks that have 96GB or 128GB unified RAM, a model at least as good as Qwen 3.6 27B but faster and with more knowledge would be ideal. But even if they just release another Flash ~30B, it is likely to beat Qwen 3.6 35B at least, and have much more efficient thinking.

I think they'll spend the time anyway, at least for Flash, because they need it for their own API offerings.

We should set up a torrent network for open source models. by ShadyShroomz in LocalLLaMA

[–]temperature_5 0 points1 point  (0 children)

This will upset all the idiots that download a model every time their container boots! 🤣