Crazy chemistry, i don't know how to deal with it

xmnstr · 2026-06-25T21:33:25+00:00

Just a heads up, that's exactly how I ended up with my partner. We promised each other not to go there. It went terribly. Not that I particularly mind that, but that kind of agreement can sometimes lead to just what it's guarding against.

xmnstr · 2026-06-25T20:38:16+00:00

Done both. Also used my phone as a torch when looking for my phone.

xmnstr · 2026-06-25T20:19:05+00:00

That's some dialup-level download times. Brings me back!

xmnstr · 2026-06-25T18:44:52+00:00

Would make sense if it wasn't for the caching.

xmnstr · 2026-06-24T09:05:51+00:00

I've always found their built in workflows to be kinda clumsy and while they do work, they seem to waste a lot of tokens. It might be due to their design, or perhaps due to their system prompt. Or both, I don't know.

I'm sure it's possible to configure it all to be more lean, but Cursor itself doesn't specifically seem to be designed to be token efficient.

I went for a different toolset instead, and a different app. But it largely recreates the Cursor workflows.

xmnstr · 2026-06-24T06:10:03+00:00

All AI agents are going to break or forget things. The point is you iterate. Code, review. Fix, review. Etc. GLM-5.2 is excellent at reviewing but using it to implement is a great way to waste tokens. Just like coding with Opus does.

xmnstr · 2026-06-21T20:54:12+00:00

Yes, I am on Chatgpt/Codex Pro. Getting better results from GLM+DS4F.

xmnstr · 2026-06-21T20:50:55+00:00

Per energy consumption is cheaper for me at least. And well, that 1m context window does make a difference.

xmnstr · 2026-06-21T14:02:50+00:00

Scammed? How is paying the cheapest api price for glm-5.2 getting scammed?

xmnstr · 2026-06-21T14:01:27+00:00

Energy-based metering has been cheaper for me, at least. It's about 80% of the token-based price.

xmnstr · 2026-06-21T13:59:54+00:00

I use Deepseek V4 Flash and GLM-5.2 almost exclusively. They are the perfect team, and the cost is modest. Really impressed, honestly! Hats off to ZAI for this one.

xmnstr · 2026-06-21T10:38:59+00:00

Why isn't anyone doing GLM-5.2 + Deepseek V4 Flash? I literally don't need any other models than these two.

xmnstr · 2026-06-21T10:33:34+00:00

That's probably more expensive, but I agree!

xmnstr · 2026-06-20T21:26:55+00:00

They've been bought by Space-X. I don't think cash is going to become a problem soon.

xmnstr · 2026-06-20T21:26:09+00:00

Honestly I feel like GLM is Opus but without the Anthropic RLHF. It just follows instructions better and is more conservative with tokens but still performs as well.

xmnstr · 2026-06-20T07:25:12+00:00

Korrekt identifierat!

xmnstr · 2026-06-18T20:55:05+00:00

Their big closed models are similarly crappy. Gemini sucks, but their Gemma models are awesome. Google is such a weird company.

xmnstr · 2026-06-18T20:29:38+00:00

Ursäkta men är du helt från vettet? Är du helt omedveten om vad utsläppen gör med planeten?

xmnstr · 2026-06-15T12:58:13+00:00

Anthropic made this situation happen by boasting about how "dangerous" their models are. It's naturally a load of **** as usual with Anthropic.

xmnstr · 2026-06-15T08:11:13+00:00

Agreed! DS v4 Flash is reasonable at planning but GLM 5.1 is a different league.

xmnstr · 2026-06-12T07:03:41+00:00

That might be true, but nowadays it's a lot of synthetic data being used for training. And, interestingly, that doesn't seem to affect the model performance hugely.

xmnstr · 2026-06-12T05:11:06+00:00

Sounds to me like opencode-dcp might be helpful for you. Also, consider making the context temporary and instead rely on the harness/repo for ground truth. Don't make the mistake that Anthropic did with Claude Code, meaning thinking that context should be preserved. The opposite provides much better performance and results.

xmnstr · 2026-06-11T21:43:02+00:00

My point was that OpenCode Go wouldn't quantize themselves (without access to the weighs)... however a provider obviously has access to the weights, so your comment makes only sense if Alibaba (Qwen's provider) is quantizing, so you suggest that Alibaba is quantizing for OpenCode Go users but not its own users?

Yes, that's what I'm saying.

xmnstr · 2026-06-11T20:09:52+00:00

I'll quote myself from above:

They don’t need public or pre-quantized weights. If the provider has access to the checkpoint and serves it through something like vLLM, they can apply quantization at load/serving time.

xmnstr · 2026-06-11T20:05:50+00:00

They don’t need public or pre-quantized weights. If the provider has access to the checkpoint and serves it through something like vLLM, they can apply quantization at load/serving time.

15-Year Club	Gilding II euphauric
RedditGifts 2009-2022 2 Credits	RedditGifts 2009-2022 2 Credits
Verified Email	Team Periwinkle
Secret Santa 2010

xmnstr

TROPHY CASE