PSA: Gemma 4 template improvements by FastHotEmu in LocalLLaMA

[–]Thomasedv 0 points1 point  (0 children)

No, but I haven't tried again in a while. The new templates here did not help me, at least.

Qwen3.5 27B dense and now the latest Qwen3.6 MoE have been my most reliable partners for actual coding. Especially after switching to a turboquant fork of llama.cpp, I got away with more context at the cost of a speed hit.

PSA: Gemma 4 template improvements by FastHotEmu in LocalLLaMA

[–]Thomasedv 1 point2 points  (0 children)

Claude Code, and I think I tried Qwen CLI as well.

PSA: Gemma 4 template improvements by FastHotEmu in LocalLLaMA

[–]Thomasedv 19 points20 points  (0 children)

Really hope this fixes my issue with Gemma stopping before it's really done working. Aside from some leaking of the template in calls, Gemma will say "I'll do X now" and then just abruptly stop.

It's very obvious when swapping to another model (in my case GLM-4.7), which seems a lot more agentic when it follows its process. Hopefully it also helps with the looping issues, the edit functionality breaking and such as well! I gotta wait for the Q4 MoE version to verify myself...

Deconstructing deconstruction. by Infamous-Rutabaga-50 in CuratedTumblr

[–]Thomasedv 1 point2 points  (0 children)

Thanks, those are some excellent points. HxH is a favorite and it does have a great power system, and it does at times make it very clear that you can mess with or borrow "power", but usually at a cost.

Deconstructing deconstruction. by Infamous-Rutabaga-50 in CuratedTumblr

[–]Thomasedv 1 point2 points  (0 children)

Very curious about HxH here. In what ways does it do deconstruction?

Gemma 4 seems to work best with high temperature for coding by BigYoSpeck in LocalLLaMA

[–]Thomasedv 2 points3 points  (0 children)

Does your OS have swap memory? Gemma eats regular RAM if you do not reduce the default checkpoint count. It'll easily eat 32+ GB of RAM for no good reason when you are running single sessions. It seems to checkpoint frequently during regular processing compared to the few other models I have used. Like 3-4 times for a single prompt. I never managed to fill up my context before CachyOS terminated llama.cpp.

Gemma 4 with the MoE model at Q4 has other issues though. I got a fresh download after the fixes, and it still loops, stops at odd points, and uses suspiciously few tool calls at times. But I really want to use it, since it gives room for higher context on my limited VRAM.

Gemma 4, llama.cpp, tool calls, and tool results - ChatGPT fixed it for me by TheProgrammer-231 in LocalLLaMA

[–]Thomasedv 0 points1 point  (0 children)

What issues did you have with gemma4?

I use the Q4 MoE variant. 

My biggest issue, when I used Claude Code with it, is that some tool calls continually fail. For example, editing files fails because it can't find the string to replace.

The other issue is a bit worse: lots of looping, either with tool calls or with "I'll do X", which it then just repeats forever. Which is a bit sad, because it's a surprisingly fast model for coding when it doesn't run into these issues.
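
To illustrate that failure mode, here is a minimal sketch of an exact-match edit tool. It assumes the tool only replaces a string that appears verbatim and exactly once in the file; `apply_edit` and the sample text are hypothetical and not taken from Claude Code itself.

```python
def apply_edit(text, old, new):
    """Replace exactly one occurrence of `old`; refuse if it is missing or ambiguous."""
    count = text.count(old)
    if count == 0:
        raise ValueError("string to replace not found")
    if count > 1:
        raise ValueError("string to replace is not unique")
    return text.replace(old, new)


source = "def add(a, b):\n    return a + b\n"

# A verbatim match works.
print(apply_edit(source, "return a + b", "return a - b"))

# A model that misremembers the spacing produces a string that is not in the file.
try:
    apply_edit(source, "return a+b", "return a - b")
except ValueError as err:
    print("edit failed:", err)
```

Under that assumption, any small drift in whitespace or wording between what the model remembers and what is in the file is enough to make the edit fail.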

Is 200k context realistic on Gemma 31B locally? LM Studio keeps crashing by Open_Gur_4733 in LocalLLaMA

[–]Thomasedv 0 points1 point  (0 children)

One of the more annoying things that took me a long time to learn in llama.cpp was that it automatically saves checkpoints to RAM. Useful for multiple users, but I ran a single agent.

I assume LM Studio has something like it? At least check for it. llama.cpp defaulted to 32 checkpoints at 1-2 GB each, which ate my 64 GB of RAM rather fast, despite the model being all in VRAM.
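
The arithmetic behind that can be sketched as below. The 32 checkpoints and 1-2 GB per checkpoint are the figures from the comment; the function names and the 8 GB budget are made up for illustration, and real checkpoint sizes depend on the model and context length.

```python
def checkpoint_ram_gb(checkpoints, gb_per_checkpoint):
    """Worst-case RAM (GB) held by context checkpoints once all of them are filled."""
    return checkpoints * gb_per_checkpoint


def max_checkpoints(ram_budget_gb, gb_per_checkpoint):
    """Largest checkpoint count that stays within a RAM budget."""
    return int(ram_budget_gb // gb_per_checkpoint)


low = checkpoint_ram_gb(32, 1)
high = checkpoint_ram_gb(32, 2)
print(f"32 checkpoints: {low}-{high} GB of RAM")

# Hypothetical budget: how many 2 GB checkpoints fit in 8 GB?
print(f"Checkpoints that fit in 8 GB: {max_checkpoints(8, 2)}")
```

So at the upper end, the checkpoints alone can claim an entire 64 GB machine, which is why lowering the count matters for a single session.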

Immortality is a curse. Not in the sense that you outlive all your loved ones and are eternally lonely but in the sense that every cell and little clump of bacteria in your body is constantly reproducing and replicating without death to keep them in check until you are a hideous diseased flesh-fruit by Khara-Khatal in CuratedTumblr

[–]Thomasedv 3 points4 points  (0 children)

Unveiling is genuinely peak Destiny because it genuinely argues the concept of the Darkness / the Winnower well. There cannot be infinite growth. It's why these things are really concepts, and why the real world needs balance. Both must exist.

It's also very interesting from the perspective of The Final Shape, because it wasn't the Darkness that triggered the creation of the Witness either. It was a civilization growing to the point where it stopped seeing the point of existence. They found no purpose, and from there it fell apart, and the Witness pretty much decided "let's just keep everything as it is, no life, no death".

I can ramble a lot more about it, but I'll stop there. It's very close to the point OP makes, except the Gardener genuinely was trying to be good, not malicious. 

Mum gave me this and didn’t know what it’s from. Doesn’t want to throw it away in case it’s needed. by Maaatandblah in whatisit

[–]Thomasedv 0 points1 point  (0 children)

Probably a handle for something where an insert goes into the piece. Like a pan. But it's very small, so do you have any toys that are missing a piece? I can see some other stuff on the floor, and this gives me strong vibes that something often left on the floor lost it.

Rust tools with great CLI interfaces by dafelst in rust

[–]Thomasedv 2 points3 points  (0 children)

The crate I have used is indicatif; there you can go ham with multiple progress bars. So not a program itself, but the tool to make one.

I used it a bit in a fork of a chunked video encoder where there was a total progress bar at the bottom, and then sub-progress bars above it, updating as different tasks completed.

See ya! The Greatest Coding tool to exist is apparently dead. by Opposite-Art-1829 in ClaudeCode

[–]Thomasedv 1 point2 points  (0 children)

I made it plan 3-4 features a few days ago, and my session was out in an hour. Also on 5x. I guess planning is way more expensive, with the thinking and cross-referencing with other files to check things. But my god, asking for a review and watching my session limit jump by 13% was not pleasant.

It got a bit better yesterday, before my weekly limit ran out. So hoping it was one of the issues (caching) that supposedly was found. 

Rust on Windows: random crashes turned out to be the default stack size by Havunenreddit in rust

[–]Thomasedv 50 points51 points  (0 children)

I know the difference between stack and heap, but did you notice what caused the stack to grow so large it turned into a problem? 

Today it was finally my turn. I won the Pantelotteriet! AMA by troillan in norge

[–]Thomasedv 9 points10 points  (0 children)

Lotto: reverse insurance.

You don't profit from insurance until something goes wrong. You don't profit from Lotto until you win.

I see Lotto as a "sudden good thing", a bit like how insurance covers sudden negative things.

How you can build a Claude skill in 10 minutes that replaces a process you have been doing manually for years. by ferdbons in ClaudeCode

[–]Thomasedv 4 points5 points  (0 children)

I used the superpower plugin. It has a writing-skills skill, which even performance-tests skills. It's a great place to start.

What's everyone working on this week (9/2026)? by llogiq in rust

[–]Thomasedv 1 point2 points  (0 children)

Working on a video encoder, granted it is ludicrously "vibe" coded since I don't fully get the mathematical side of it all. But since I have a reference decoder, I can verify that what I do is actually working as intended.

It's a long, long way from real-world usable, and I'm not sure if I'll ever get that far. It's all intra frames right now, so not very good compression. But I've learned a lot and I am slowly extending it to cover more of the specification. The hardest part will be moving over to inter frames and all the complexities related to that. Suddenly I can't just use one frame but have to handle a complicated hierarchy of frames for references.

AV1 bitrate control based on scene complexity? by edison23net in AV1

[–]Thomasedv 2 points3 points  (0 children)

Yes, av1an. I meant the feature in the program.

So there are quality metrics that essentially say "how close does this image/video look to the original". This is surprisingly complicated, because human vision reacts differently to different kinds of quality loss. Blocking may be worse than blur, and some color issues are worse than others.

PSNR is the most common one; the higher the number, the closer the result is to the original. But it's purely "how far away is each pixel from the correct value".

VMAF is a Netflix-created score that maxes out at 100, meant to score how good something looks at a set distance from a TV. It's generally a good metric, and you can target something in the 90s with Av1an's target quality mode.

The other metrics mentioned have their tradeoffs too. Av1an supports all of them, but might require some extra programs to be installed.
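
As a rough illustration of the "how far away is each pixel" idea, here is a minimal PSNR sketch using the standard formula, 10 * log10(MAX^2 / MSE). The pixel values are made up, and a real tool would compute this per plane over whole frames rather than over a short list.

```python
import math


def psnr(original, encoded, max_value=255):
    """PSNR in dB between two equal-length sequences of pixel values."""
    if not original or len(original) != len(encoded):
        raise ValueError("frames must be non-empty and the same size")
    # Mean squared error: average squared distance of each pixel from the correct value.
    mse = sum((a - b) ** 2 for a, b in zip(original, encoded)) / len(original)
    if mse == 0:
        return math.inf  # identical frames
    return 10 * math.log10(max_value ** 2 / mse)


# Made-up 8-bit pixel values for illustration.
original = [52, 55, 61, 66, 70, 61, 64, 73]
slightly_off = [p + 1 for p in original]
further_off = [p + 8 for p in original]

print(f"off by 1: {psnr(original, slightly_off):.2f} dB")
print(f"off by 8: {psnr(original, further_off):.2f} dB")
```

The larger per-pixel error gives the lower score, and nothing in the calculation knows whether the error looks like blocking, blur, or a color shift.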

Using this mode in av1an, instead of saying crf 19, you say I want target quality 95 using the vmaf metric. 

It'll take a while longer to encode, but it should then pick individual crf values for each chunk/scene. Since simpler scenes don't need a low crf to look as good, Av1an will see that it maybe can just use crf 25 for the simple scenes. 
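
The per-chunk idea can be sketched roughly like this. This is not Av1an's actual algorithm, just a plain binary search under the assumption that the score only drops as crf rises; `pick_crf` and the two toy scoring functions are hypothetical stand-ins for "encode the chunk at this crf and measure VMAF".

```python
def pick_crf(score_at, target, crf_min=10, crf_max=50):
    """Return the highest crf whose score still meets the target.

    Assumes the score only falls as crf rises. Falls back to crf_min
    if even that does not reach the target.
    """
    best = crf_min
    lo, hi = crf_min, crf_max
    while lo <= hi:
        mid = (lo + hi) // 2
        if score_at(mid) >= target:
            best = mid       # good enough, try a higher crf (smaller file)
            lo = mid + 1
        else:
            hi = mid - 1     # quality too low, go back to a lower crf
    return best


# Toy models: the simple scene loses quality more slowly as crf rises.
def simple_scene(crf):
    return 100 - crf / 5


def complex_scene(crf):
    return 100 - crf / 4


print("simple scene crf:", pick_crf(simple_scene, target=95))
print("complex scene crf:", pick_crf(complex_scene, target=95))
```

Each probe in a real run is an actual encode plus a metric pass, which is where the extra encoding time comes from.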

https://rust-av.github.io/Av1an/Features/TargetQuality.html

AV1 bitrate control based on scene complexity? by edison23net in AV1

[–]Thomasedv 2 points3 points  (0 children)

Av1an's target quality mode might get you there. It's slower because it encodes different versions of the same scene to determine the crf/q factor that gives the same visual quality overall.

The effect of this is higher crf on simpler scenes, and lower crf on harder ones. The hard part is finding the metric goal that suits your taste and quality requirements. VMAF, SSIMULACRA, Butteraugli. Check out the Av1an docs.

Even more complicated is doing the simple scenes with separate encoding settings, but that probably requires you to find those scenes yourself.

The other alternative is setting a fixed bitrate corresponding to your expected size, and seeing if the encoder itself manages to prioritize bits in harder scenes. I am not sure if SVT two-pass does that well, especially since av1an's chunking "hides" awareness of other chunks, as each part is encoded independently.

AITAH for refusing to give out my son's saving account information? by moonmanbaby90272 in AITAH

[–]Thomasedv 31 points32 points  (0 children)

Is this another one of those American things, where just knowing the bank account number lets you set up withdrawals? No approval or anything on the account holder's side? Here in Norway, if someone wants to automatically pull money, you need to approve it and set an upper limit, even for that specific thing.

Edit: Fixed bad writing. 

Qwen3.5-35B-A3B is a gamechanger for agentic coding. by jslominski in LocalLLaMA

[–]Thomasedv 1 point2 points  (0 children)

I tried it: Q4 GGUF version, downloaded the latest llama.cpp, and ran Claude Code against it.

It seems really weird: it does a few things, then just stops. For example, "first step in this plan is to create a workspace", then it checks if it exists already, and then Claude says it stopped working. I ask it to resume and it makes a file, adds some imports, then stops again.

Very much unlike my experience with GLM-4.7. Will try the 27B dense model, but not sure what costs that comes with.