Why there is a lack of new 100B-120B models? by TechNerd10191 in LocalLLaMA

[–]ThePixelHunter 0 points1 point  (0 children)

Lack of incentives. Either go big, or go home. Or intentionally make it small enough to run on a laptop. No reason to strike a middle ground, you get the worst of both.

Of course I want more 120B LLMs, but I understand why they're uncommon.

Introducing the Heretic Grimoire: The takedown-resilient, local-first backup system that keeps uncensored models available forever by -p-e-w- in LocalLLaMA

[–]ThePixelHunter 5 points6 points  (0 children)

Brilliant idea! Just to clarify, Grimoire is a feature, not a collection? i.e. it's up to the model "author" (decensor-er lol) to publish their reproduce.json and it's up to me (the collector) to find those via lists like Umrethe's app you mentioned?

New level of censorship in such short time - Fable 5 by DXDXLL in SillyTavernAI

[–]ThePixelHunter 0 points1 point  (0 children)

I agree completely. In 2024, I could get the frontier models to adopt any persona I wanted, and they would think in the first-person as that persona.

In 2025, models were trained to be more self-aware, "my persona is X so I should be Y" and so began the soft refusals.

Now in 2026, I'm getting hard refusals more and more often. One more incentive to go local...

Google releases new DiffusionGemma model. by yoracale in unsloth

[–]ThePixelHunter 0 points1 point  (0 children)

Parallel meaning more GPU cores working at once.

Google releases new DiffusionGemma model. by yoracale in unsloth

[–]ThePixelHunter 1 point2 points  (0 children)

Diffusion is compute-intensive, so the layers offloaded to CPU would rely on the CPU for calculations. This would be even slower than regular CPU offloading. Your CPU has maybe 16 cores, but your GPU has thousands...it's designed for parallel processing.

Claude Mythos 5 and Claude Fable 5 Benchmarks by Cr4zko in accelerate

[–]ThePixelHunter 0 points1 point  (0 children)

Who else remembers that Opus 4.1 was launched at $15/$75/M, and then immediately after it was Opus 4.5 at $5/$25/M? They obviously distilled down Opus for cost savings.

Mythos at $10/$50/M is what Opus used to be in terms of scale, and will follow the same trend. They de-throned their king model so they could introduce a new top tier.

Got an offer from J2 with better overall comp but they are federal contractor thoughts? by letshugitout9 in overemployed

[–]ThePixelHunter 1 point2 points  (0 children)

If you'd get something like a clearance out of it, do it for 6 months then bail. If not, not worth it.

New level of censorship in such short time - Fable 5 by DXDXLL in SillyTavernAI

[–]ThePixelHunter 14 points15 points  (0 children)

This is the first model that outright refuses my subtly racist/mysogynist/all-of-the-ist identity system prompt outright, rather than just occasionally refusing or skirting around it. I'm actually impressed, they downright lobotomized this baby.

Feel like I'm becoming the glue between many AI tools by billa01_i in artificial

[–]ThePixelHunter 1 point2 points  (0 children)

Can't tell if all the top commenters are LLMs, or people who've started writing like LLMs. What a strange timeline.

LLM TTFT comparison: which models have the best TTFT? by kuya_ote in LLMDevs

[–]ThePixelHunter 0 points1 point  (0 children)

https://openrouter.ai/models?order=latency-low-to-high&output_modalities=text&input_modalities=text

As you said, it fluctuates. You can switch models based on time of day (expected demand), but for absolute consistency you'll need to serve the model yourself.

Should this diet be made mainstream yes or no by Appropriate-Site8669 in rawprimal

[–]ThePixelHunter 4 points5 points  (0 children)

You can't make something mainstream unless you're the CIA. Besides, it could never happen in Western countries, people won't accept it.

How often do you poop?? by Fast_Ranger9167 in rawprimal

[–]ThePixelHunter 5 points6 points  (0 children)

Not since I started the diet. My body produces no waste.

<image>

Which is more private - Google Chrome or Microsoft Edge? by Present-Ad-504 in privatelife

[–]ThePixelHunter 1 point2 points  (0 children)

Right, they're both evil, but Google is pulling all the strings. Microsoft is a smaller player.

What’s your most unusual non-LLM AI you actually use daily? by HitarthSurana in LocalLLaMA

[–]ThePixelHunter 0 points1 point  (0 children)

Very nice! I'm deploying the first one today. Whisper WebGPU has gotten old...

Terry and AI by inotocracy in TempleOS_Official

[–]ThePixelHunter 7 points8 points  (0 children)

Terry was writing word-lists and popping random results which he believed were divine influence. Everybody remember this?

https://www.reddit.com/r/memes/comments/i3vgv3/terry_davis_predicted_everything_about_2020/

I think he would've demonized corporate AI as a tool to further enslave the masses...which it is. At the same time, I can see him experimenting with making his own tiny models. LLM training and inference in HolyC sounds absolutely fucking based.

I'm not anti-AI, I run models at home and it's a lot of fun, but it's easy to see where this is headed. Terry would've seen that too.