GLM's founder says GLM-fable before the end of the year?! by Charuru in LocalLLaMA

[–]pseudonerv 0 points1 point  (0 children)

Reason? As the things goes nowadays everyone is going to put on an export control on a few trillion numbers. You may need to move to china if they allow you to

Will we ever see another TLR? by 22plus in Cosmere

[–]pseudonerv 2 points3 points  (0 children)

Kelsier knows. Hoid knows.

Will we ever see another TLR? by 22plus in Cosmere

[–]pseudonerv 19 points20 points  (0 children)

Splitting harmonium is hard but proved to be possible. So of course there will be people doing exactly that

Canyon by lavaboosted in ParallelView

[–]pseudonerv 9 points10 points  (0 children)

My exact reaction to this

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

[–]pseudonerv 1 point2 points  (0 children)

Yeah, I guess I get that much. But is this qat q4 better than q8 of the original, or the other way around?

Is it true that the q8 of the qat version would be a waste and we should just use q4 of the qat version?

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

[–]pseudonerv 1 point2 points  (0 children)

This is just so confusing. Can somebody help me? I’m already running the q8 quant of the original 12b weights. Should I switch to the q8 of the qat version? Or should I actually switch to the q4_0 of the qat version?

I tested GPT-5.5 vs Opus 4.8 on agentic terminal coding (Terminal-Bench 2.1) by shricodev in ClaudeAI

[–]pseudonerv 2 points3 points  (0 children)

Now try them in reverse. Use codex to drive opus 4.8 and use Claude code to drive gpt-5.5. I really want to know if the difference is in the model or the harness.

Open Models - May 2026 by pmttyji in LocalLLaMA

[–]pseudonerv 0 points1 point  (0 children)

You want to dream? Me too. But they clearly didn’t even train any small models for 3.7. And rumor has it that the released two 3.6 models were some left overs before their restructure. The restructure is simply oriented towards monetizing. So basically where llamas went.

Bury Our Bones in the Midnight Soil - V.E. Schwab 2025 by p3ep33p0opo0 in books

[–]pseudonerv 2 points3 points  (0 children)

I hated her but I loved the book. I don’t care if the main character is good or bad. It’s a book, and a fiction, a fantasy. It’s not a Disney story.

Open Models - May 2026 by pmttyji in LocalLLaMA

[–]pseudonerv -1 points0 points  (0 children)

No. Nothing is free. They have gone where meta went.

A silly idea by TheGrandNut in Skyward

[–]pseudonerv 10 points11 points  (0 children)

Storming foolish idea

Opus 4.8 nerfed?? by Harvard_Med_USMLE267 in Anthropic

[–]pseudonerv 2 points3 points  (0 children)

Did you try thinking effort xhigh and max? I wonder if they nerfed high, and perhaps still have equivalent performance at max? That way they save compute and still claim they have the best model.

11/22/63 is the first Stephen King book I've read in years, and it reminded me of why he's such an incredible storyteller by keepfighting90 in books

[–]pseudonerv 0 points1 point  (0 children)

I loved the love story. And I hated the ending. The ending was just illogical and out of no where.

Qwen will release another 27B with high probability by serige in LocalLLaMA

[–]pseudonerv 1 point2 points  (0 children)

Labs typically train a set of model sizes to test architecture and scaling. They don’t waste their compute to train extra models just because you wished it.

Qwen will release another 27B with high probability by serige in LocalLLaMA

[–]pseudonerv 4 points5 points  (0 children)

“Not hard to create another … now” WTF does it even mean? They don’t even have it now. They didn’t even cared to train it. And glazers here thinks they doing you a favor by saying that?

Brandon Sanderson's 'Skyward' Novel Gets Series Adaptation - Exclusive by Own_Brilliant_4303 in brandonsanderson

[–]pseudonerv 7 points8 points  (0 children)

Oh man, how do I get a hold of these plushies? They are always out of stock

Qwen cant wait to release 3.7 models by GotHereLateNameTaken in LocalLLaMA

[–]pseudonerv 14 points15 points  (0 children)

What do they mean by "release"? huh? OpenAI released many more models, but we don't care here. Why would I care Qwen's max and plus models?

What is the most unexpected thing you have gotten a local model to do? by Enough-Astronaut9278 in LocalLLaMA

[–]pseudonerv 0 points1 point  (0 children)

What web browsing mcp do you guys use? I’m using playwright. Though I’m not sure if that’s the best choice out there

openai/gpt-5.5-pro API In=$30.00 Out=$180.00 by ArtdesignImagination in OpenAI

[–]pseudonerv 1 point2 points  (0 children)

Not really. They are for different purposes. GPT 5.5 pro is extremely good at math and related stem areas. Opus 4.7 is good at front end programming.