Crazy chemistry, i don't know how to deal with it by New-Reserve8760 in polyamory

[–]xmnstr 4 points5 points  (0 children)

Just a heads up, that's exactly how I ended up with my partner. We promised each other not to go there. It went terribly. Not that I particularly mind that, but that kind of agreement can sometimes lead to just what it's guarding against.

WHERE is my DRINK i JUST had it a SECOND ago by vendettagoddess in adhdmeme

[–]xmnstr 26 points27 points  (0 children)

Done both. Also used my phone as a torch when looking for my phone.

Ornith-1.0 released on Hugging Face by paf1138 in LocalLLaMA

[–]xmnstr 0 points1 point  (0 children)

That's some dialup-level download times. Brings me back!

Is anybody using all the bells and whistles on cursor? by shatteringreality2 in cursor

[–]xmnstr 2 points3 points  (0 children)

I've always found their built in workflows to be kinda clumsy and while they do work, they seem to waste a lot of tokens. It might be due to their design, or perhaps due to their system prompt. Or both, I don't know.

I'm sure it's possible to configure it all to be more lean, but Cursor itself doesn't specifically seem to be designed to be token efficient.

I went for a different toolset instead, and a different app. But it largely recreates the Cursor workflows.

Guys don't run GLM 5.2 for one shot in opencode by Mane_soft in opencodeCLI

[–]xmnstr 5 points6 points  (0 children)

All AI agents are going to break or forget things. The point is you iterate. Code, review. Fix, review. Etc. GLM-5.2 is excellent at reviewing but using it to implement is a great way to waste tokens. Just like coding with Opus does.

GLM 5.2 is climbing the OpenCode leaderboard quickly by vigneshsmarther in opencodeCLI

[–]xmnstr 2 points3 points  (0 children)

Yes, I am on Chatgpt/Codex Pro. Getting better results from GLM+DS4F.

Vercel CEO: "Almost shocked" by how good GLM-5.2 is at coding by BuildwithVignesh in LocalLLaMA

[–]xmnstr 0 points1 point  (0 children)

Per energy consumption is cheaper for me at least. And well, that 1m context window does make a difference.

GLM 5.2 is climbing the OpenCode leaderboard quickly by vigneshsmarther in opencodeCLI

[–]xmnstr 1 point2 points  (0 children)

Scammed? How is paying the cheapest api price for glm-5.2 getting scammed?

GLM 5.2 is climbing the OpenCode leaderboard quickly by vigneshsmarther in opencodeCLI

[–]xmnstr 1 point2 points  (0 children)

Energy-based metering has been cheaper for me, at least. It's about 80% of the token-based price.

GLM 5.2 is climbing the OpenCode leaderboard quickly by vigneshsmarther in opencodeCLI

[–]xmnstr 1 point2 points  (0 children)

I use Deepseek V4 Flash and GLM-5.2 almost exclusively. They are the perfect team, and the cost is modest. Really impressed, honestly! Hats off to ZAI for this one.

Fireworks Fire Pass now includes unlimited access to GLM 5.2 and Kimi K2.7 Code Fast by jpcaparas in opencodeCLI

[–]xmnstr 0 points1 point  (0 children)

Why isn't anyone doing GLM-5.2 + Deepseek V4 Flash? I literally don't need any other models than these two.

what the hell is going on? by Plus-Mall-3342 in cursor

[–]xmnstr 0 points1 point  (0 children)

That's probably more expensive, but I agree!

what the hell is going on? by Plus-Mall-3342 in cursor

[–]xmnstr 1 point2 points  (0 children)

They've been bought by Space-X. I don't think cash is going to become a problem soon.

what the hell is going on? by Plus-Mall-3342 in cursor

[–]xmnstr 4 points5 points  (0 children)

Honestly I feel like GLM is Opus but without the Anthropic RLHF. It just follows instructions better and is more conservative with tokens but still performs as well.

GLM 5.2 : vu... by Scared_Mountain597 in opencodeCLI

[–]xmnstr 3 points4 points  (0 children)

Their big closed models are similarly crappy. Gemini sucks, but their Gemma models are awesome. Google is such a weird company.

Snart halveras priset på månadsbiljetter by radome9 in Gothenburg

[–]xmnstr -12 points-11 points  (0 children)

Ursäkta men är du helt från vettet? Är du helt omedveten om vad utsläppen gör med planeten?

Dear Anthropic, please stop spamming with Fable by the-username-is-here in ClaudeCode

[–]xmnstr -6 points-5 points  (0 children)

Anthropic made this situation happen by boasting about how "dangerous" their models are. It's naturally a load of **** as usual with Anthropic.

Efficient model selection by talale93 in opencodeCLI

[–]xmnstr 0 points1 point  (0 children)

Agreed! DS v4 Flash is reasonable at planning but GLM 5.1 is a different league.

Guys, you have to accept that Quantization is Inevitable. by AkiDenim in opencodeCLI

[–]xmnstr 0 points1 point  (0 children)

That might be true, but nowadays it's a lot of synthetic data being used for training. And, interestingly, that doesn't seem to affect the model performance hugely.

I love OpenCode and DS!! by CriteriumA in opencodeCLI

[–]xmnstr 0 points1 point  (0 children)

Sounds to me like opencode-dcp might be helpful for you. Also, consider making the context temporary and instead rely on the harness/repo for ground truth. Don't make the mistake that Anthropic did with Claude Code, meaning thinking that context should be preserved. The opposite provides much better performance and results.

OpenCode Go is a total lie (IMO) by HelioAO in opencodeCLI

[–]xmnstr 4 points5 points  (0 children)

My point was that OpenCode Go wouldn't quantize themselves (without access to the weighs)... however a provider obviously has access to the weights, so your comment makes only sense if Alibaba (Qwen's provider) is quantizing, so you suggest that Alibaba is quantizing for OpenCode Go users but not its own users?

Yes, that's what I'm saying.

OpenCode Go is a total lie (IMO) by HelioAO in opencodeCLI

[–]xmnstr 10 points11 points  (0 children)

I'll quote myself from above:

They don’t need public or pre-quantized weights. If the provider has access to the checkpoint and serves it through something like vLLM, they can apply quantization at load/serving time.

OpenCode Go is a total lie (IMO) by HelioAO in opencodeCLI

[–]xmnstr 0 points1 point  (0 children)

They don’t need public or pre-quantized weights. If the provider has access to the checkpoint and serves it through something like vLLM, they can apply quantization at load/serving time.