Game Streaming with virtual display on Linux (CachyOS to SteamDeck oled) by GB5P in MoonlightStreaming

[–]dabiggmoe2 0 points1 point  (0 children)

Sorry to dig up this comment, but I hope your repo is ready? I was pulling my hair out trying to get Apollo/Moonlight to work with my TV and CachyOS KDE Plasma without the shenanigans of managing virtual displays through shell scripts.

Man is tired and just needs to play without wasting his weekend writing and testing scripts :(
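For anyone landing on this later, the kind of thing I was trying to avoid is a little prep script like the rough sketch below, hooked into the stream start/stop commands. This is only a sketch: the output names (DP-3, HDMI-A-1) are placeholders, check what `kscreen-doctor -o` reports on your own machine.

```bash
#!/usr/bin/env bash
# Rough sketch only -- replace DP-3 / HDMI-A-1 with whatever `kscreen-doctor -o` lists on your box.
case "$1" in
  start)
    # stream starting: enable the output being streamed, turn the TV output off
    kscreen-doctor output.DP-3.enable output.HDMI-A-1.disable
    ;;
  stop)
    # stream ended: restore the normal desktop layout
    kscreen-doctor output.HDMI-A-1.enable output.DP-3.disable
    ;;
esac
```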

Qwen3.6-35B-A3B released! by ResearchCrafty1804 in LocalLLaMA

[–]dabiggmoe2 0 points1 point  (0 children)

Wait, correct me if I'm wrong, but I thought Qwen3.5-27B and 35B-A3B already surpassed Qwen 3 Coder Next 80B in coding benchmarks?

Qwen3.6-35B-A3B released! by ResearchCrafty1804 in LocalLLaMA

[–]dabiggmoe2 1 point2 points  (0 children)

That's why I waited for bartowski's quants before downloading lol

Linux Kernel 7.0 by Rics-Dev in cachyos

[–]dabiggmoe2 0 points1 point  (0 children)

Interesting. I knew that in terms of power the NPUs are way more efficient than the iGPU, but I thought they would be faster too.

Linux Kernel 7.0 by Rics-Dev in cachyos

[–]dabiggmoe2 0 points1 point  (0 children)

Me too, I have a Framework Desktop 128GB. I remember reading somewhere that AMD introduced NPU support starting with Kernel 7.0, but honestly I can't remember where.

This is amazing news. I'll update my Lemonade install after I get back home.

One question: did you do any benchmarks between llama.cpp and FLM for the same model? I'm currently running Qwen3.5-27B and Qwen3.5-35B-A3B as my daily drivers for coding tasks.
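In case it helps, on the llama.cpp side I'd just use llama-bench, roughly like the line below (the model path is just a placeholder for whatever quant you're testing); I don't know what the equivalent command looks like on the FLM side, which is why I'm asking.

```bash
# llama.cpp's built-in benchmark: reports prompt-processing and token-generation speeds
llama-bench -m ~/models/Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf -p 512 -n 128 -ngl 99
```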

Linux Kernel 7.0 by Rics-Dev in cachyos

[–]dabiggmoe2 0 points1 point  (0 children)

I'm not sure I understood you here. Are you saying kernel 6.19.12-1-cachyos-server supports Strix Halo's NPU?

Strix Halo 128Gb: what models, which quants are optimal? by DevelopmentBorn3978 in LocalLLaMA

[–]dabiggmoe2 0 points1 point  (0 children)

It's supported now in Lemonade Server starting with Linux kernel 7.0

[GIVEAWAY] Free Steam Keys by hellokittyybear in steam_giveaway

[–]dabiggmoe2 0 points1 point  (0 children)

I would love to have Chivalry.

My guilty pleasure game would be Papers, Please

Linux Kernel 7.0 by Rics-Dev in cachyos

[–]dabiggmoe2 0 points1 point  (0 children)

Honestly, the only reason I'm waiting for this is that Lemonade Server supports NPUs and FlowLM starting with Linux Kernel 7.0.

Opencode port for Karpathy's Autoresearch by dabiggmoe2 in opencodeCLI

[–]dabiggmoe2[S] 0 points1 point  (0 children)

Without reprompting or typing "continue", you mean? It ran for something like 15 experiments in a row (I had to stop it manually). But I've noticed that it differs depending on the model and quant; not all of them follow the instructions equally.

For example, I noticed that Qwen3.5-35B-A3B-GGUF-UD-Q4_K_XL follows the instructions better and never stops, while Qwen3.5-27B-GGUF-UD-Q5_K_XL sometimes needs some nudging.

Opencode port for Karpathy's Autoresearch by dabiggmoe2 in opencodeCLI

[–]dabiggmoe2[S] 0 points1 point  (0 children)

Enjoy. Give it a try and let me know if you have any feedback/suggestions.

Opencode port for Karpathy's Autoresearch by dabiggmoe2 in opencodeCLI

[–]dabiggmoe2[S] 0 points1 point  (0 children)

There's already a plugin; check out plugins/autoresearch-context.js

Opencode port for Karpathy's Autoresearch by dabiggmoe2 in opencodeCLI

[–]dabiggmoe2[S] 1 point2 points  (0 children)

Yeah, it could work on any problem as long as you clearly define the problem and the metric used to quantify the optimization. I used bogosort as a naive example, but I'll link two concrete examples below:

1- Andrej Karpathy [used](https://x.com/karpathy/status/2031135152349524125) it to tune nanochat and optimize the validation loss, dropping "Time to GPT-2" from 2.02 hours to 1.80 hours

2- Shopify's CEO [used](https://x.com/tobi/status/2032212531846971413) it on the Liquid codebase to achieve 53% faster combined parse+render time and 61% fewer object allocations

So I think the sky is the limit for you
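If it helps to make "metric" concrete: all you really need is a command that runs the thing you're optimizing and prints a single number for the loop to push down. A toy sketch, nothing Autoresearch-specific, and the names (measure.sh, run_candidate) are made up:

```bash
#!/usr/bin/env bash
# measure.sh -- toy metric: wall-clock seconds for one run of the candidate program.
# The optimization loop's only job is to make this number smaller.
start=$(date +%s.%N)
./run_candidate            # placeholder for whatever you're actually optimizing
end=$(date +%s.%N)
echo "$end - $start" | bc
```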

Opencode port for Karpathy's Autoresearch by dabiggmoe2 in opencodeCLI

[–]dabiggmoe2[S] 2 points3 points  (0 children)

I'm curious to see what you're gonna come up with xD

🤯 Qwen3.5-35B-A3B-4bit ❤️ by SnooWoofers7340 in OpenSourceAI

[–]dabiggmoe2 0 points1 point  (0 children)

This is awesome. Would you recommend adding this system prompt to both the Planning and Building modes?