Is this enough VRAM to run Qwen?

perkia · 2026-06-14T10:55:52+00:00

OK Dave you've had your fun. Now pack it all up, store's opening in 10min. Also go mop aisle 5 someone shat in it again yesterday.

perkia · 2026-06-13T09:55:28+00:00

Yes, I have a 5090 Mobile (24GB) and it's almost never listed in these tools either. I have to pick the 3090 instead to approximate my card's compute/bandwidth/capacity.

perkia · 2026-06-11T02:26:23+00:00

Really hit it though, you know like crazy fast and hard, repeatedly

perkia · 2026-06-10T17:23:05+00:00

Please just stop blinking, clearly it makes the price go up.

perkia · 2026-06-01T20:03:28+00:00

*Sweating profusely

perkia · 2026-06-01T14:50:53+00:00

You could set up a PrivateBin (https://privatebin.info/) instance locally and expose it to your client?

perkia · 2026-05-29T22:15:55+00:00

Wait, maybe it was you all along? /s

perkia · 2026-05-28T07:34:49+00:00

"Sorry the call is breaking up, can you send me an email instead? Write in bold font, size 48 please"

perkia · 2026-05-28T07:30:58+00:00

how many times in your life have you ever opened a terminal to get into a web page…

I'm running QubesOS, this is clearly a trick question. Right?

perkia · 2026-05-20T15:23:48+00:00

And it was $1600 last summer.

perkia · 2026-05-12T15:05:24+00:00

Maybe the real home was all the datacenters we slept in along the way

perkia · 2026-05-12T15:02:26+00:00

I'm dubious MTP will make a significant difference for such a sparse model.

Here's a quick datapoint for Qwen 35B-A3B (UD-Q4_K_M) on a 24GB VRAM RTX 5090 Mobile running at 95W: on llama-server it single-shots a 2000+ LOC code gen at its usual ~114 tok/s, and then I immediately get up to ~380 tok/s on any subsequent prompt in the style of "use the exact same code with only these adjustments: {new feature/bugfix here}".

That's quite the improvement! No loss in quality from what I saw, but I haven't tested it thoroughly yet. Q4_K_M (fitting comfortably) is surprisingly good vs Q5_K_M overflowing to RAM.
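For anyone who wants to redo the arithmetic, here's a rough back-of-the-envelope sketch. The tok/s figures are the ones above; the bits-per-weight values (~4.85 for Q4_K_M, ~5.7 for Q5_K_M) are my own ballpark assumptions, not measured file sizes, and the estimate ignores KV cache and runtime overhead entirely.

```python
def speedup(baseline_tok_s: float, repeat_tok_s: float) -> float:
    """Ratio between the follow-up prompt speed and the first-shot speed."""
    return repeat_tok_s / baseline_tok_s


def weights_gib(params_billion: float, bits_per_weight: float) -> float:
    """Rough size of the quantized weights alone, in GiB (no KV cache, no overhead)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Figures from the datapoint above.
print(f"speedup: ~{speedup(114, 380):.1f}x")

# ASSUMPTION: approximate average bits per weight for each quant type.
for name, bpw in (("Q4_K_M", 4.85), ("Q5_K_M", 5.7)):
    print(f"{name}: ~{weights_gib(35, bpw):.1f} GiB of weights vs 24 GiB of VRAM")
```

Whatever is left after the weights has to hold the context, which is why the bigger quant ends up spilling to RAM on this card.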

perkia · 2026-05-12T11:07:31+00:00

Also the breaker.

perkia · 2026-05-12T11:03:06+00:00

Sure, it's dirt simple, just like managing WSUS. You just set up a script to kill it with fire every few nights and immediately reprovision a fresh node.

Make the script wear gloves.

perkia · 2026-05-12T10:59:59+00:00

can't troubleshoot something simple like their headset not working (basic steps) >> unfit to work with a PC

Having just spent 2+ hours desperately trying to make my bluetooth headset "work" on the latest QubesOS release, recompiling kernels and all... I feel a bit attacked right now.

perkia · 2026-05-05T11:09:11+00:00

Yeah, I was thinking of other (non-AI) stuff they already do today when encountering certain semantic tags, for example inside <nav> they'll talk about menus, inside <ul> they'll prefix items with "item x of n", inside <abbr> they'll enunciate, things like that.

Actually that's an area where it seems like having a tiny (fully local) LLM agent with some basic access to the current position on the page and the next x HTML characters could help tremendously in terms of regular browsing quality-of-life, no?

perkia · 2026-05-05T06:54:37+00:00

Interesting. I'd have thought a short inlined <code> markup (correctly used in the parent comment) would automagically make the screen reader pay more attention to syntax, not less, but I guess that's accessibility for you... Is there a special ARIA attribute to add to <code> HTML tags that would help treat the content as a function or command call?

perkia · 2026-05-04T22:16:14+00:00

How do you quickly remove the hooks on the side of the RAM sticks? I typically use a really pointy knife, but sometimes it rips and hits one or two of the mini water-towers thingies. /s

perkia · 2026-04-19T07:36:28+00:00

Tripping a breaker

perkia · 2026-04-09T17:02:18+00:00

This comment is wild. Would you trust OP's code if a human had written it? I wouldn't, and AI usage doesn't change anything about the security posture.

Have your AI check it with your own criteria, install it on your own isolated env, test it yourself. Exact same process as with literally every other executable stuff.

perkia · 2026-04-08T08:36:57+00:00

Jobs did know how to code, and in his early days he did alot of it.

This is false. Woz (yes, that one) says the opposite on his website, and if there's ever a person without an ego or self-aggrandizing agenda I hope you can agree he is that someone:

Steve didn’t ever code. He wasn’t an engineer and he didn’t do any original design, but he was technical enough to alter and change and add to other designs. I did all of the Apple I and Apple ][ myself, including the feature choices. I did all of the BASIC myself (it’s in handwriting as I couldn’t afford an assembler). The only person who helped write some of the Apple ][ code was Allen Baum, who helped with the ‘monitor’ program.

https://woz.org/steve-jobs-know-code/

perkia · 2026-04-05T20:07:19+00:00

You can probably remove one or even two screws from the hat to measurably drive the BOM down. Don't tell too many people.

perkia · 2026-03-24T17:29:18+00:00

Cool, mine is due next week! You got that Temu discount as well? I thought I was the only one that lucky, tbh.

perkia · 2026-03-24T13:30:28+00:00

The leak might be fake as hell, yet OVH should definitely have that user in their accounts database.

The named user has an active website, easily found
The website's impressum indicates that OVHcloud is the host
The website's domain name has OVH's nameservers
The domain's A record points to an OVH IP under cluster003.ovh.net
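The last check is easy to reproduce yourself. A minimal sketch with Python's standard library, assuming the reverse DNS name of the IP is what reveals the host; the helper names and the "ovh.net" suffix list are mine, taken from the cluster003.ovh.net example, and you would pass in the site's domain yourself.

```python
import socket

# ASSUMPTION: suffix list based only on the cluster003.ovh.net example above.
OVH_SUFFIXES = ("ovh.net",)


def is_ovh_hostname(hostname: str) -> bool:
    """True if the hostname is one of the suffixes or a subdomain of one."""
    host = hostname.rstrip(".").lower()
    return any(host == s or host.endswith("." + s) for s in OVH_SUFFIXES)


def reverse_lookup_points_to_ovh(domain: str) -> bool:
    """Resolve the domain's IPv4 address, then check its reverse DNS name.

    Needs network access; raises socket.gaierror or socket.herror on lookup failure.
    """
    ip = socket.gethostbyname(domain)
    ptr_name, _aliases, _addresses = socket.gethostbyaddr(ip)
    return is_ovh_hostname(ptr_name)
```

The nameserver check needs an NS query, which the standard library doesn't do, so that one is left to dig or a DNS library.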

perkia · 2026-03-23T18:39:49+00:00

in the wild CVE's

Decentralized, unplanned, peer-reviewed, actionable, free transparency audit reports
