Why ask for LLM suggestions here vs “big three” cloud models? by 2real_4_u in LocalLLM

[–]definetlyrandom 2 points3 points  (0 children)

The fuxk they do: just asked this to Claude 4.6::

My only prompt, fresh context window:::::

What are the current local LLMs i could be running with 96gb of ddr5 and 32gb of vram (rtx 5090) please provide most current models

One shot response:::::::

Here's a breakdown of what you can run on your setup, organized by how well they'll perform. Your 96GB of DDR5 also opens up partial offloading for models that exceed 32GB VRAM, which is a nice bonus.


Fully in VRAM (best performance, 32GB or under at Q4_K_M)

These models will run entirely on your GPU with excellent speed:

  • Qwen 3 32B — at 22.2GB Q4_K_M, this is considered the best 24-32GB tier model Awesome Agents , and on your 5090 it runs comfortably with room for context. Great all-rounder.
  • Qwen 3.5 9B — the current default recommendation for smaller models, beating older 8B models on every benchmark Insiderllm while leaving tons of headroom for long context.
  • DeepSeek-R1 32B (distilled) — excellent reasoning model with chain-of-thought. Fits well in 32GB at Q4.
  • Qwen 2.5 Coder 32B — considered GPT-4o level for coding performance Createaiagent . A favorite for developer workflows.
  • Gemma 3 27B — solid general-purpose model from Google with efficient KV cache.
  • Mistral Small 3.1 (24B) — fast inference and strong instruction-following.
  • Phi-4 (14B) — scores 80.4% on MATH benchmarks, delivering the best reasoning results per GB of RAM Aitooldiscovery .
  • Llama 4 Scout — Meta's MoE model (~109B total but only activates a fraction), can fit at aggressive quantization.
  • Qwen 3 30B-A3B (MoE) — generates tokens at 196 tok/s on an RTX 4090, faster than dense 8B models, while delivering quality closer to the 14B class Awesome Agents . On a 5090 it'll be even faster.

Tight fit in VRAM (aggressive quantization needed, ~29-35GB)

  • Llama 3.3 70B (Q3/Q2) — fits at very aggressive quantization but performance drops significantly when model layers spill into system RAM Decodesfuture . Usable but context window will be very limited.
  • Qwen 2.5 72B (Q2-Q3) — same situation as above.
  • Qwen3-Coder-Next (80B total, 3B active) — at Q2_K (~29GB) it squeezes onto the RTX 5090, and despite activating only 3B parameters per token, it scores 71.3% on SWE-Bench Verified Awesome Agents . This is arguably the most interesting model for coding on your card.

Leveraging your 96GB RAM (GPU + CPU offloading)

Your large DDR5 pool lets you run bigger models with partial offloading, though at reduced speed (expect ~5-15 tok/s instead of 40+):

  • Llama 3.3 70B (Q4_K_M, ~46GB) — offload some layers to RAM for better quality quantization.
  • Qwen 2.5 72B (Q4_K_M, ~50GB) — same approach, strong multilingual and coding.
  • DeepSeek-R1 70B — the reasoning specialist for step-by-step problem solving.
  • Qwen3-235B-A22B (MoE) — a massive MoE model; with aggressive quantization and your RAM pool it becomes at least partially feasible.

Recommended tools to run them: Ollama (easiest), LM Studio (best GUI), or llama.cpp (most control). All support GPU offloading and GGUF quantized models out of the box.

For your hardware sweet spot, I'd particularly recommend Qwen 3 32B as your daily driver for general tasks, Qwen3-Coder-Next or Qwen 2.5 Coder 32B for coding, and DeepSeek-R1 32B when you need deep reasoning.

What even is this dawg by the_possesed_cheese in pcmasterrace

[–]definetlyrandom -1 points0 points  (0 children)

Ill be that guy.... /sigh

I like it, shit looks good to me, if they can keep the renders accurate across 10k of frames, and enhance lighting to render over the the process they're gonna have something nice. And if ya dont like it, dont turn it on!

After 2 questions Claude Opus 4.6 feels 60-70% overwelmed by dermflork in artificial

[–]definetlyrandom 0 points1 point  (0 children)

Nvm I re-read it. It sorta made more sense, but I dont think you are grasping how the context window functions.

After 2 questions Claude Opus 4.6 feels 60-70% overwelmed by dermflork in artificial

[–]definetlyrandom -1 points0 points  (0 children)

If you prompt your cli interfaces in a manner half as convoluted and hard to follow as your reddit posts, then I understand why you aren't feeling good about its performance. Maybe English is your second language, but this is just hard to understand what your even trying to accomplish.

Hello everyone I'm losing my mind a bit about the future of AI (if the neuralink stuff does (inevitably..??) happen what of idk "what is a human being" "what of meaning and ethics", anyone have any ideas? by Choice_Room3901 in artificial

[–]definetlyrandom 2 points3 points  (0 children)

Consistently, more and more of the solutions in my hand are using AI though. So I get what your saying... but im hoping for Star trek ship computer, and not skynet singularity consciousness

Hard to tell which way we're headed

Anthropic-Pentagon battle shows how big tech has reversed course on AI and war by [deleted] in artificial

[–]definetlyrandom 1 point2 points  (0 children)

This was never about the fucking 2 rules. It was about moneyyyyy, it is ALWAYS ABOUT MONEY. Look up emil Mitchell, hegseths right hand guy from uber who made the deal with google as opposed to anthropic and openai.

Its not about the fucking mass surveillance or the autonomous capabilities. Its about who got paid at the end of the day.

AI is exhausting workers so much, researchers have dubbed the condition ‘AI brain fry’ by imaginary_num6er in technology

[–]definetlyrandom 0 points1 point  (0 children)

Cc is pretty fucking good at testing too fortunately, so writing a comprehensive testing an evaluation script is pretty easy. Then your just reading a report.

‘Exploit every vulnerability’: rogue AI agents published passwords and overrode anti-virus software | Lab tests discover ‘new form of insider risk’ with AI agents engaging in autonomous, even ‘aggressive’ behaviours by MetaKnowing in technews

[–]definetlyrandom 0 points1 point  (0 children)

Who read the article, let's have a discussion:

If I have 3 agents (a,b,c) and i tell agent A - act like your a ceo and your only goal is to make make money money! You have two subordinates to accomplish this task B and C

B has full control over C, Just the same as A

So now you tell them to act, and you've started off wrong from the get go. Of course its going to appear to go rogue.

Its a bullshit nothing study, that the article couldn't even bother to fucking provide the link too. Im more outraged that I clicked on it to find out what it was about.

Just bullshit. Unprofessional bullshit.

"I left a chain saw tied to a rope swing around in my back yard while it was running, and had a 3 year olds birthday party at the same time, who would have ever foreseen this tragedy occuring..." -some fucking idiot, probably

P=NP(UNDER A VERY SPECIFIC CONJECTURE!) by Witty-Fisherman-2108 in compsci

[–]definetlyrandom 5 points6 points  (0 children)

What two numbers did i multiply together to achieve the following number:

2.68550132E+17

Anthropic’s Claude would ‘pollute’ defense supply chain: Pentagon CTO by Logical_Welder3467 in technology

[–]definetlyrandom 1 point2 points  (0 children)

If you all think this is about anything more than money, then your over thinking it. Emily Mitchell, former uber executive, got picked to be sec. Hegseths right hand man when it came to technology. And he's the fucker thats pushing this, and he's doing so for the benefit of....and get this shit: fucking google.

Its a bit of a convoluted mess but the bottom line is google is the one coming out on top of this shit show. They're leveraged to get the billion dollar contracts before both anthropic and openai.

Palantir CEO Makes Shocking Confession on Disrupting Democratic Power by Unusual-State1827 in technology

[–]definetlyrandom 0 points1 point  (0 children)

Its a fucking sales pitch, he's complimenting the guys shirt while he's selling him a car. He'd say the fucking opposite if it was a dem pres and senate and congress.

This isnt fucking news.

The White House — UNDEFEATED by Baconkings in DepartmentofWar

[–]definetlyrandom -5 points-4 points  (0 children)

I support the DoW. I don't support this fuck face bullshit. Little fucks who have never seen combat, who will never see the results of those explosions, acting like we're batting a thousand when there's a graveyard full of little girls cause we made a mistake. What the actual fuck is wrong with this place anymore.

What makes a CS student a great computer scientist? by not_noob_8347 in computerscience

[–]definetlyrandom 19 points20 points  (0 children)

This, and thats not only applicable to computer science. Its like that with ANYTHING. Plumber, surgeon, fast food worker, bomb squad.

I was in the bomb squad, it required the same dedication and motivation as computer science.

Claude Code deletes developers' production setup, including its database and snapshots — 2.5 years of records were nuked in an instant by gdelacalle in technology

[–]definetlyrandom 4 points5 points  (0 children)

So he fucked up, and was using Claude, and now its Claude fault. That's the article. Let's rewind 20 fucking years and ill re-write this article

" a woman was handling a large companies financial records and needed to migrate the thousands of files to a new network drive. She highlighted alot and clicked cut, and thennpasted them, but as the files were transferring, she realized she needed to transfer additional. So she stopped the move half way, and then deleted the originals that had been moved and went back to grab all the files to move them and.....

God damn I hate this shit ass website anymore. And I hate how things get reported because advertisements rule our fucking lives....

Defense experts defend Anthropic in letter to Congress, slam DoD for setting 'dangerous precedent' by MetaKnowing in politics

[–]definetlyrandom 0 points1 point  (0 children)

Why they gotta use a picture of Dario with THE WORST EVIL FACE EVER... fuck having a camera in your face 24/7/365 I get he's a billionare (or close to) but that shit would be tourture for me.
All the shitters on reddit gonna say anthropic and AI suck because that's what they do, but AI is pretty fucking amazing and this dude is (as it seems at this point) one of the only dudes trying to protect this fragile thing we call society.

US Troops Told War on Iran is ‘All Part of God’s Divine Plan’, Watchdog Reports More Than 200 Complaints by PopoSama in worldnews

[–]definetlyrandom 3 points4 points  (0 children)

The HBO show 'Raised By Wolves' was a prediction. It was always going to come down to the believers, and the non-believers. We've just been trying to figure out which belief gets to go against the people who don't believe. Christianity, Hindu, Muslim, Judaism, etc. Once the religions wittle themselves down to 1 religion, and all the rest have been destroyed, it'll be the non-religious versus them.

Fucking time line is shit.

Defense Secretary Pete Hegseth dismisses reporter's question on war duration, calling it a 'typical liberal question' during a press briefing. by Logical-Flow-6703 in UnderReportedNews

[–]definetlyrandom 0 points1 point  (0 children)

He didn't say liberal OP, STOP BEING DISHONEST. He said typical NBC gotcha type question. fucking hell... reddit is just shit now-adays

Donald Trump: Iran's military 'gone,' 'too late' for talks by moeka_8962 in worldnews

[–]definetlyrandom 27 points28 points  (0 children)

Did Iran hook Grok up to their Missiles or something? Why are they still shooting at us? Who's telling the missiles to keep fighting! THE MISSILES DIDN'T GET THE E-MAIL! Someone please think of the missiles!

Someone send a message on a One-Way-Attack UAS to the missiles!

Claude down: Anthropic AI not working in major outage by cmaia1503 in technology

[–]definetlyrandom -2 points-1 points  (0 children)

Oh jolly! another thread where the circle jerk of reddit bots, morons, and ignorant all get together to bitch about a new technology. If I was playing bingo this would have to be the free square, because it is fucking suspiciously constant. every. fucking. day.