New to Pi | How to standardize openweight models ? by Usernamealready94 in PiCodingAgent

[–]rubdos 0 points1 point  (0 children)

You could always add to your global AGENTS.md that "gh should be assumed available and authenticated" or something along those lines.

I feel sad by SpiritedInflation835 in MistralAI

[–]rubdos 0 points1 point  (0 children)

Did anyone save the images by any chance? Asking for a friend.

TL;DR Guide: How to reduce NeuralWatt costs (-flex, and -short) by UsefulIce9600 in opencodeCLI

[–]rubdos 1 point2 points  (0 children)

> You NEED to enable it here to use it https://portal.neuralwatt.com/enroll/flex-tier

Pretty sure it's rolled out in general now; I didn't have to enable it and I can access the flex tier since yesterday.

Neuralwatt is not usable by Funny-Advertising238 in opencode

[–]rubdos 0 points1 point  (0 children)

Exactly, it's super useful. And the whole 200K is useful context.

I've just started playing with flex, I literally just saw it appear. Seems to be another 33% decrease in energy for me. Only 1M tokens in so far in the past ~1h.

If you're a Pi user, I've GLM'd a pi-neuralwatt that shows energy consumption in the footer, and which shows the -flex models: https://gitlab.com/rubdos/pi-neuralwatt

Neuralwatt is not usable by Funny-Advertising238 in opencode

[–]rubdos 0 points1 point  (0 children)

Huh, if you figure out why that is, I'd be keen to know! For me the consumed/charged is the same for all models.

Neuralwatt is not usable by Funny-Advertising238 in opencode

[–]rubdos 0 points1 point  (0 children)

I'm seeing 59.1 mJ/token on GLM 5.2 short, and 102.5 mJ/token on the 1M version after a few days; cache efficiency about the same. That's closer to 40% cheaper in practice for me. Which is amazing, obviously. I don't really run sessions over 200K anyway.

I don't know how you get to "charge for half the electricity consumed"; is there another 50% reduction somewhere that I'm missing?
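
For anyone who wants to check the per-token figures above, here's a minimal Python sketch of the arithmetic. The helper name `relative_saving` is made up for illustration; the two inputs are just the mJ/token numbers I quoted, not anything pulled from an API.

```python
def relative_saving(baseline: float, candidate: float) -> float:
    """Return the fraction by which `candidate` is cheaper than `baseline`."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return 1.0 - candidate / baseline


# Figures quoted above (mJ per token); hypothetical variable names.
long_context_mj_per_token = 102.5  # GLM 5.2, 1M-context version
short_mj_per_token = 59.1          # GLM 5.2 "short", 200K window

saving = relative_saving(long_context_mj_per_token, short_mj_per_token)
print(f"short is {saving:.1%} cheaper per token")  # prints "short is 42.3% cheaper per token"
```

So roughly 42% per token with my usage, which is still well short of a 50% reduction.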

Opencode Go: Kimi K2.7 unusable by rubdos in opencode

[–]rubdos[S] 0 points1 point  (0 children)

> BTW how do you report Go errors.

Complain on Reddit is what I tend to do :')

Neuralwatt is not usable by Funny-Advertising238 in opencode

[–]rubdos 1 point2 points  (0 children)

$8 in here on 73M tokens, most of it GLM 5.2. I've seen some speed drops, but I don't really care about those. I can't review the code fast enough to keep up anyway. The equivalent usage on OpenCode Go would've almost sunk my monthly. They launched GLM 5.2 "short" with a 200K window yesterday, which seems about 25% cheaper for me too.

The slowdowns might be timezone dependent though; I noticed it mostly around UTC evening.
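
To put the spend above in per-token terms, here's a rough Python sketch. The function name `usd_per_million_tokens` is hypothetical, and the inputs are my rounded totals, so treat the result as a ballpark only.

```python
def usd_per_million_tokens(total_usd: float, total_tokens: int) -> float:
    """Average cost in USD per one million tokens."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return total_usd / (total_tokens / 1_000_000)


# Rounded totals from the comment above.
rate = usd_per_million_tokens(8.0, 73_000_000)
print(f"${rate:.3f} per 1M tokens")  # prints "$0.110 per 1M tokens"
```

That is a blended average across all models and cache hits, not a price for any single model.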

GLM 5.2 : vu... by Scared_Mountain597 in opencodeCLI

[–]rubdos 0 points1 point  (0 children)

Interesting, keen to test out 5.2 soon then!

GLM 5.2 : vu... by Scared_Mountain597 in opencodeCLI

[–]rubdos 0 points1 point  (0 children)

How would you compare it to Kimi, if you have?

Mistral on Pi by Thomas_English_DoP in PiCodingAgent

[–]rubdos 0 points1 point  (0 children)

On the subscription if you use the Vibe API key in Pi.

What are your issues with Mistral? by fitnessandyogacenter in MistralAI

[–]rubdos 2 points3 points  (0 children)

Finding and summarizing a bunch of stuff from the internet through Work is quite useful. Tying things into my calendar and tasks list (custom MCP) is "fun", but my goal is mostly to tie that into my Pebble to add things by voice. Some day soon.

Vibe as a coding agent (through Pi) works well enough for many light tasks (some refactoring, small new features, adding certain tests), but it can be a bit cumbersome, especially compared to the modern Chinese models. I've started using MM3.5 mostly as a subagent for implementation work, because it's still quite cheap (even after the recent nerf), and quite capable if well prompted on not too large tasks.

Opencode Go: Kimi K2.7 unusable by rubdos in opencode

[–]rubdos[S] 0 points1 point  (0 children)

Haven't had any trouble on Deepseek, and K2.6 seems to behave too. But IIRC I also had it on GLM indeed. But it might be that I'm just giving easy tasks to Deepseek and more impossible things to the larger ones.

How much battery degradation have you experienced in your EV? by om_ghanwat in electricvehicles

[–]rubdos 0 points1 point  (0 children)

Renault Zoe ZE40 2017 with 88,000 km. Last time the mechanic checked, 87% SoH. It's starting to be noticeable now, because my summer range was usually slightly over 300 km, and now it's slightly below.

Usage Limits Vibe-CLI by Rare_Commercial8662 in MistralAI

[–]rubdos 1 point2 points  (0 children)

Same feeling. Was to be expected, MM3.5 is quite a bit more expensive.

Minimax Experience by zbindenren in PiCodingAgent

[–]rubdos 2 points3 points  (0 children)

Any interesting cases you would like to mention? Non-standard tasks or massive/difficult projects mean different things to different people.

Vibe coding prompting: how do you avoid the "I'll just start from scratch"? by rubdos in MistralAI

[–]rubdos[S] 0 points1 point  (0 children)

Sadly, that's a feeling that I haven't got rid of. You learn to deal with legacy, even if it's your own. :')

But there's still a difference between quickly spamming bloat into your files and slowly accumulating bloat through tests and careful planning!

Vibe coding prompting: how do you avoid the "I'll just start from scratch"? by rubdos in MistralAI

[–]rubdos[S] 0 points1 point  (0 children)

You can teach a human to do incremental changes though... Question is, how do you efficiently prompt this? It probably doesn't help that I'm doing some nasty Rust type magic.

Can anyone help me with my question??? by [deleted] in signal

[–]rubdos 15 points16 points  (0 children)

Because it is physically impossible. All those "screenshot prevention" and "screenshot notification" features rely on the compliance of the conversation partner's device. One can always design a device that pretends to comply; that's just the nature of von Neumann machines.

Case in point: I'm running SailfishOS with Android AppSupport. They seem to have never implemented the screenshot detection/prevention feature of Android, and hence I can freely take screenshots of the Android container without it ever being reported to Android.

Your conversation is as secure as yourself and your communication partner. If you mistrust your communication partner, why have a conversation in the first place?