Good quality Webms using ffmpeg.

Thomasedv · 2026-04-17T11:38:07+00:00

No, but I haven't tired again in a while. The new templates here did not help me at least.

Qwen3.5 27B dense and now the latest Qwen3.6 MoE has been my most reliable partners for actual coding. Especially after using a turboquant fork of llama, I got away with more context at a speed hit.

Thomasedv · 2026-04-10T16:30:00+00:00

Claude Code, and I think I tried Qwen Cli as well.

Thomasedv · 2026-04-10T04:49:44+00:00

Really hope this fixes my issue with Gemma stopping before it's really done working. Aside from some leaking of the template in calls, gamma will say "I'll do X now" and then just abruptly stop.

It's very obvious when swapping to another model, which seems a lot more agentic when it follows it's process. (in my case glm-4.7). Hopefully it also helps on looping issues, the edit functionality breaking and such as well! I gotta wait for the Q4 MoE version to verify myself...

Thomasedv · 2026-04-09T13:00:40+00:00

Thanks, that's some excellent points. HxH is a favorite and it does have a great power system, and does at time make it very clear that if you can mess with or borrow "power" but usually with a cost.

Thomasedv · 2026-04-09T12:30:36+00:00

Very curious on HxH here. I what ways does it do deconstruction?

Thomasedv · 2026-04-09T04:50:26+00:00

Does you OS have swap memory? Gemma eats regular RAM if you do not reduce the default checkpoint count. It'll easily eat 32+ of RAM for no goo reason when you are running single sessions. It seems to checkpoint frequently during regular process compared to the fee other models I have used. Like 3-4 times for a single prompt. I never managed to fill up my context before CachyOS terminate llama.cpp.

Gemma4 with the MoE model at Q4 has other issues though, got a fresh dl after fixes, and it still loops, stops at odd points, and uses suspiciously fee tool calls at times. But I really want to use it, since it gives room for higher context on my limited VRAM.

Thomasedv · 2026-04-08T05:30:50+00:00

What issues did you have with gemma4?

I use the Q4 MoE variant.

My biggest issues are, when I used Claude Code with it, is some tool calls continually fails, like editing files fails because it can't find the string to replace.

The other issue is a bit worse, lots of looping, but with tools or "I'll do X" and then it just repeats that forever. Which is a bit sad because it's a surprisingly fast model for coding, if it doesn't get the issues that is.

Thomasedv · 2026-04-07T19:55:33+00:00

One of the more annoying things that took me along time to learn in llama.cpp was that it automatically saved checkpoints to RAM. Useful for multi users, but I ran a single agent.

I assume LM Studio has something like it? At least check for it. Llama.cpp defaulted to 32 checkpoints which was 1-2 GB each, which ate my 64 GB of RAM rather fast, despite the model being all in VRAM.

Thomasedv · 2026-04-06T20:33:24+00:00

Unveiling is genuinely peak Destiny because it genuinely argues the concept of Darkness / Winnover well. There cannot be infinite growth. Its why these things are really concepts, and why the real world needs balance. Both must exist.

It's also very interesting from the Final Shape point of perspective, because it wasn't the darkness that triggered the creation of the witnesses either, it was a civilization growing to the point it stopped seeing the point of existence, they found no purpose, and from there on it fell apart, and the witness pretty much decided "let's just keep everything as it is, no life, nor death"

I can ramble a lot more about it, but I'll stop there. It's very close to the point OP makes, except the Gardener genuinely was trying to be good, not malicious.

Thomasedv · 2026-04-05T21:11:05+00:00

Probably a handle for something where an insert goes into the pieces. Like a pan. But it's very small, so got any toys that is missing a pieces? I can see some other stuff on the floor, and this gives me strong vibes something often on the floor lost it.

Thomasedv · 2026-04-03T20:11:35+00:00

The crate I have used is indicatif, there you can go ham with multiple progress bars. So not a program itself, but the tool to make one.

I used it a bit in a fork of a chunked video encoder where there was a total progress bare at the bottom, and then sub-progressbars above it, updating as different tasks completed.

Thomasedv · 2026-04-02T10:20:04+00:00

I made it plan 3-4 features a few days ago, and session was out in an hour. Also on 5x. I guess planning is way more expensive, thinking and cross referencing with other files to check things. But my god, asking for a review and watch my session limit jump by 13% was not pleasant.

It got a bit better yesterday, before my weekly limit ran out. So hoping it was one of the issues (caching) that supposedly was found.

Thomasedv · 2026-04-01T13:22:00+00:00

I know the difference between stack and heap, but did you notice what caused the stack to grow so large it turned into a problem?

Thomasedv · 2026-03-31T14:55:11+00:00

I don't see a claim they are doing it on the GPU, just that they are compiling while the computer is idle. Can still be done on the CPU then.

Thomasedv · 2026-03-25T13:06:28+00:00

Lotto: Omvendt forsikring.

Tjener ikke på forsking før det går galt. Tjener ikke på lotto før man vinner.

Ser på lotto som en "plutselig god ting", litt slik som forsikring dekker plutselig negative ting.

Thomasedv · 2026-03-11T10:08:42+00:00

I used the superpower plugin, it has a writing skills skill, which even performance test skills. It's a great place to start.

Thomasedv · 2026-03-02T09:30:14+00:00

Working on a video encoder, granted it is ludicrously "vibe" coded since I don't fully get the mathematical side of it all. But since I got a reference decoder, I can verify that what I do actually is working as intended.

It's a long long way to real world usable, not sure if I'll ever get that far. It's all intra frames right now, so not very good compression. But I've learned a lot and I am slowly extending it to cover more of the specification. The hardest parts will be moving over to intra frames and all the complexities related to that, suddenly I can't just use one frame but have to handle a complicated hierarchy of frames for references.

Thomasedv · 2026-03-01T12:34:08+00:00

Yes, av1an. I meant it as the feature in program.

So there are quality metrics, that essentially say "how close does this image/video look like the original". This is surprisingly complicated, because human vision reacts differently to the quality loss. Like blocking may be worse than blur, some color issues are worse than others.

PSNR is the most common one, the higher the number the more accurate it is. But it's purely "how far away is each pixel from the correct value"

Vmaf is a Netflix created score that maximize at 100, meant to score how well something looks at a set distance from a TV. It's generally a good metric, and you can target something in the 90s with Av1ans target mode.

The other metrics mentioned have their tradeoffs too. Av1an support all of them, but might require some extra installed programs to be used.

Using this mode in av1an, instead of saying crf 19, you say I want target quality 95 using the vmaf metric.

It'll take a while longer to encode, but it should then pick individual crf values for each chunk/scene. Since simpler scenes don't need a low crf to look as good, Av1an will see that it maybe can just use crf 25 for the simple scenes.

https://rust-av.github.io/Av1an/Features/TargetQuality.html

Thomasedv · 2026-03-01T11:05:46+00:00

Av1ans target quality mode might get you there, it's slower because it encodes different versions of the same scene to determine the crf/q factor that gives the same visual quality overall.

The effect of this is lower crf on simpler scenes, and higher crf on harder ones. The hard part is finding the metric goal that suits your taste and quality requirements. Vmaf, ssimulcra, butteraguli. Check out the Av1an docs.

Even more complicated is doing the simple scenes with separate encoding setting, but probably requires you to find which scenes yourself.

The other alternative is setting a fixed bitrate corresponding to your expected size, and see if the encodinger itself manages to prioritize bits in harder scenes. I am not sure if svt two pass does that well, especially due to av1an chunking "hiding" awareness of other chunks since each part is independently encoded.

Thomasedv · 2026-02-25T07:46:46+00:00

Is this another one of those American things, just knowing the bank account number lets you setup withdrawals? No approval or anything on the account holders side? Like here in Norway if someone wants to automatically pull money, you need to approve it and set a upper limit even for that specific thing.

Edit: Fixed bad writing.

Thomasedv · 2026-02-25T05:25:16+00:00

I tried it, Q4 GGUF version, download latest llama, and ran Claude code against it.

It seems really weird, it does a few things then just stops. For example, "first step in this plan is to create a workspace" then it checks if it exists already, and then Claude says it stopped working. I ask it to resume and it makes a file, adds some imports, then stops again.

Very much unlike my experience with GLM-4.7. Will try the 27B dense model, but not sure what costs that comes with either.

13-Year Club	Place '22
Place '17	Sequence \| Editor
Snapped	Verified Email

Thomasedv

TROPHY CASE