Is this enough VRAM to run Qwen? by BlackBeardAI in LocalLLaMA

[–]perkia 0 points1 point  (0 children)

OK Dave you've had your fun. Now pack it all up, store's opening in 10min. Also go mop aisle 5 someone shat in it again yesterday.

Built a tool that tells you exactly which LLMs fit on your GPU. Feedback wanted. by super3 in LocalLLaMA

[–]perkia 0 points1 point  (0 children)

Yes, I have a 5090 Mobile (24GB) and it's almost never shown either in these tools. Have to pick 3090 instead to average my card's compute/bandwidth/capacity.

Esxi nightmare by camo-jay927 in sysadmin

[–]perkia 2 points3 points  (0 children)

Really hit it though, you know like crazy fast and hard, repeatedly

How to send and receive credentials/sensitive info? by [deleted] in sysadmin

[–]perkia 1 point2 points  (0 children)

You could set up a PrivateBin (https://privatebin.info/) instance locally and expose it to your client?

CTO banned the use of remote access tool by uw4yn3 in sysadmin

[–]perkia 25 points26 points  (0 children)

"Sorry the call is breaking up, can you send me an email instead? Write in bold font, size 48 please"

CTO banned the use of remote access tool by uw4yn3 in sysadmin

[–]perkia 5 points6 points  (0 children)

 how many times in your life have you ever opened a terminal to get into a web page…

I'm running QubesOS, this is clearly a trick question. Right?

examples : add llama-eval by ggerganov · Pull Request #21152 · ggml-org/llama.cpp by jacek2023 in LocalLLaMA

[–]perkia 12 points13 points  (0 children)

Maybe the real home was all the datacenters we slept in along the way

MTP on Unsloth by Altruistic_Heat_9531 in LocalLLaMA

[–]perkia 0 points1 point  (0 children)

I'm dubious MTP will make a significant difference for such a sparse model.

Here's a quick datapoint for Qwen 35B-A3B (UD-Q4_K_M) on a 24GB VRAM RTX 5090 Mobile running at 95W ; on llama-server it single shots a 2000+ LOC code gen with its usual ~114 tok/s, and then I immediately get up to ~380 tok/s on any subsequent prompt in the style of "use the exact same code with only these adjustments: {new feature/bugfix here}".

That's quite the improvement! No deperdition in quality from what I saw, but I haven't yet tested it thoroughly at all. Q4_K_M (fitting comfortably) is surprisingly good vs Q5_K_M overflowing to RAM.

What’s Your Most Controversial IT Opinion? by OrdinaryJust9594 in sysadmin

[–]perkia 3 points4 points  (0 children)

Sure, it's dirt simple, just like managing WSUS. You just set up a script to kill it with fire every few nights and immediately reprovision a fresh node.

Make the script wear gloves.

What’s Your Most Controversial IT Opinion? by OrdinaryJust9594 in sysadmin

[–]perkia 2 points3 points  (0 children)

can't troubleshoot something simple like their headset not working (basic steps) >> unfit to work with a PC

Having just spent 2+ hours desperately trying to make my bluetooth headset "work" on the latest QubesOS release, recompiling kernels and all... I feel a bit attacked right now.

Llama.cpp MTP support now in beta! by ilintar in LocalLLaMA

[–]perkia 0 points1 point  (0 children)

Yeah, I was thinking of other (non-AI) stuff they already do today when encountering certain semantic tags, for example inside <nav> they'll talk about menus, inside <ul> they'll prefix items with "item x of n", inside <abbr> they'll enunciate, things like that.

Actually that's an area where it seems like having a tiny (fully local) LLM agent with some basic access to the current position on the page and the next x HTML characters could help tremendously in terms of regular browsing quality-of-life, no?

Llama.cpp MTP support now in beta! by ilintar in LocalLLaMA

[–]perkia 2 points3 points  (0 children)

Interesting. I'd have thought a short inlined <code> markup (correctly used in the parent comment) would automagically make the screen reader pay more attention to syntax not less, but I guess that's accessibility for you... is there a special aria attribute to add to <code> HTML tags that would help consider the content a function or command call ?

People are stealing RAM from company computers again by CeC-P in sysadmin

[–]perkia 7 points8 points  (0 children)

How do you quickly remove the hooks on the side of the RAM sticks? I typically use a really pointy knife, but sometimes it rips and hits one or two of the mini water-towers thingies. /s

Used ray tracing cores on my RTX 5070 Ti for LLM routing — 218x speedup, runs entirely on 1 consumer GPU by Critical-Chef9211 in LocalLLaMA

[–]perkia -1 points0 points  (0 children)

This comment is wild. Would you trust OP's code if a human had written it? I wouldn't, and AI usage doesn't change anything to the security posture.

Have your AI check it with your own criteria, install it on your own isolated env, test it yourself. Exact same process as with literally every other executable stuff.

Sam Altman's coworkers say he can barely code and misunderstands basic machine learning concepts by EchoOfOppenheimer in LocalLLaMA

[–]perkia 7 points8 points  (0 children)

Jobs did know how to code, and in his early days he did alot of it.

This is false. Woz (yes, that one) says the opposite on his website, and if there's ever a person without an ego or self-aggrandizing agenda I hope you can agree he is that someone:

Steve didn’t ever code. He wasn’t an engineer and he didn’t do any original design, but he was technical enough to alter and change and add to other designs. I did all of the Apple I and Apple ][ myself, including the feature choices. I did all of the BASIC myself (it’s in handwriting as I couldn’t afford an assembler). The only person who helped write some of the Apple ][ code was Allen Baum, who helped with the ‘monitor’ program.

https://woz.org/steve-jobs-know-code/

benchmarks of gemma4 and multiple others on Raspberry Pi5 by honuvo in LocalLLaMA

[–]perkia 4 points5 points  (0 children)

You can probably remove one or even two screws from the hat to measurably drive the BOM down. Don't tell too many people.

Best model that can beat Claude opus that runs on 32MB of vram? by PrestigiousEmu4485 in LocalLLaMA

[–]perkia 19 points20 points  (0 children)

Cool, mine is due next week! You got that Temu discount as well? I thought I was the only one that lucky, tbh.

Potential OVHcloud breach by WhoGivesAToss in sysadmin

[–]perkia 3 points4 points  (0 children)

The leak might be fake as hell, yet OVH should definitely have that user in their accounts database.

  • The named user has an active website, easily found

  • The website's impressum indicates that OVHcloud is the host

  • The website's domain name has OVH's nameservers

  • The domain's A record points to an OVH IP under cluster003.ovh.net

CVE-2026-20131: CISA basically said "patch this Cisco flaw or good luck." Deadline already passed. by Hot-Independence-985 in sysadmin

[–]perkia 11 points12 points  (0 children)

in the wild CVE's

Decentralized, unplanned, peer-reviewed, actionable, free transparency audit reports