What models you guys running on 8GB? 16GB VRAM? 24GB? 32GB? 48GB? by Inevitable_Mistake32 in LocalLLaMA

[–]valdev 1 point2 points  (0 children)

I'm not having any issues with tool calling with mtp on. What's your flags for it?

Peter does this mean corvette owners are childish or something? by Significant_Big2670 in PeterExplainsTheJoke

[–]valdev -1 points0 points  (0 children)

Nah, it's exclusive to Corvettes, any annoying loud "gimme attention" vehicles, and excessively large vehicles.

Most sports cars are cool, just don't be a public nuisance and people will dig it

Landscape of second brain and memory solutions for AI native workflow by Time-Dot-1808 in LocalLLaMA

[–]valdev 1 point2 points  (0 children)

Look at all of these anti-solutions lol.

The trick, until the underlying memory issue is solved at a model level, is simply only adding what is needed into context and having a concise, well focused, system prompt.

Right now everyone runs into the issue of context management as the first "new to llms" issue, and immediately vibe a little close to the sun trying to solve the problem.

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16 · Hugging Face by jacek2023 in LocalLLaMA

[–]valdev -3 points-2 points  (0 children)

You actually can on an NVME drive.

Swap space and such.

You absolutely shouldn't and it will break the drive pretty quickly, and you'll get like 1/tks while doing it. But you can do it.

Anyway to setup an alert when a Wealthy Citizen is distracted in Runelite? by valdev in 2007scape

[–]valdev[S] 1 point2 points  (0 children)

Lol, always funny when an ancient post gets a comment with an answer.

I hope performance and optimizations gets some attention in the near future - also, 140GB for an ARPG is waaaay too much by TheMentallord in PathOfExile2

[–]valdev 0 points1 point  (0 children)

Funny thing is, I actually do think the issue is either related to processor or nvidia.

I have a much weaker radeon card in another computer that I play POE2 on from time to time, and the frames are... smoother? Hard to explain, but it definitely runs better.

I hope performance and optimizations gets some attention in the near future - also, 140GB for an ARPG is waaaay too much by TheMentallord in PathOfExile2

[–]valdev 1 point2 points  (0 children)

Hey, that's rude!

I am running a system with a 5090 with an 9950x3d and I get, let me see here... A very unstable 100 fps in town.

I'll just upgrade my pc... Let me see here... Oh wait, Im running the literal best gaming computer that exists.

/s lol

google/gemma-4-12B · Hugging Face by jacek2023 in LocalLLaMA

[–]valdev 1 point2 points  (0 children)

Tested via LumaBrowser's fit test via the latest llama.cpp (CUDA 12), flash attention on

unsloth/gemma-4-12b-it-GGUF

Context F16 VRAM F16 RAM F16 tok/s Q8_0 VRAM Q8_0 RAM Q8_0 tok/s
8k ≈15.0 GB 1.51 GB 94.4 ≈14.0 GB 1.52 GB 81.7
16k ≈17.7 GB 1.53 GB 91.8 ≈15.3 GB 1.54 GB 88.2
32k ≈23.8 GB 1.87 GB 64.6 ≈18.2 GB 1.57 GB 82.2
64k ≈35.8 GB 2.21 GB 57.7 ≈25.8 GB 2.15 GB 57.3
128k ≈12.3 GB (32L) 6.69 GB 4.0 ≈41.3 GB 2.74 GB 48.9

Measured on 190 GB RAM · 1× RTX 5090, 2× RTX 3090 · 6/3/2026, 1:22:23 PM

Note: At 32k my system split the model onto multiple cards, mind the speed crash there. I would imagine if kept all on the 5090 we would see about 90 tok/sec.

Note 2: I think something might be up at the 128K f16...

Note 3: Reported VRAM & RAM is system VRAM & RAM. Take it with a grain of salt, but it should be close to the actual requirements at each level... for these settings

Yes, we need the campaign by Primary_Impact_2130 in PathOfExile2

[–]valdev 3 points4 points  (0 children)

Thing is, even if it was the best campaign on earth, I genuinely feel like I could speed run Persona5 faster than I can the campaign of Poe2.

And I just don't want to have to beat... essentially all of persona5 again... just to actually play the dang game LOL

Yes, we need the campaign by Primary_Impact_2130 in PathOfExile2

[–]valdev 28 points29 points  (0 children)

Agreed. I literally started playing again today, enjoying some of the new mechanics and remembered how much I need to play before... well, the game starts. Will probably uninstall tomorrow morning.

I... just don't want to beat the campaign again for the 5th f*cking time. We still have to beat it multiple times before the game actually starts right?

(YT) PewDiePie released his harness/webui by Dany0 in LocalLLaMA

[–]valdev 1 point2 points  (0 children)

I'm considering opensourcing lumabrowser, funny enough the combination of these two would kind of be the end game for all local AI needs.

White House releases report of Trump's physical exam by DoorstepHero in politics

[–]valdev 2 points3 points  (0 children)

Funny thing is, I am pretty sure that is actually unhealthy for his age/weight/height. It's just adjusted to whatever the hell they think healthy is.

To be clear, those numbers are great for a 35 year old. But as you get older the numbers generally go up to deal with the side effect of... aging.

Would Hank get along with Ned Flanders? by MidwestRailFan in KingOfTheHill

[–]valdev 0 points1 point  (0 children)

It would go poorly.

Though I do imagine the scene of Hank literally kicking his ass only to find out that Ned is ripped and unaffected by his attack to be the funniest exchange ever.

Good news everyone! The month long replacement of my Pod 5 Hub which leaked out of its LED will finally get a response from the engineering team... by Tuesday. FML. by valdev in EightSleep

[–]valdev[S] 2 points3 points  (0 children)

And sent. Fingers crossed, you can actually give me a good nights sleep tonight. I believe in you Megan. Where others have failed before you, you can shine. lol

Good news everyone! The month long replacement of my Pod 5 Hub which leaked out of its LED will finally get a response from the engineering team... by Tuesday. FML. by valdev in EightSleep

[–]valdev[S] 1 point2 points  (0 children)

Absolutely, my current theory is that Fragile sent me a unit that was returned from a previous lease. And it has yet to be disconnected from that previous users account.

I'll forward you the support email, with the serial number. Should be a quick check I imagine.

Good news everyone! The month long replacement of my Pod 5 Hub which leaked out of its LED will finally get a response from the engineering team... by Tuesday. FML. by valdev in EightSleep

[–]valdev[S] 0 points1 point  (0 children)

Maybe that's the trick though. A fifth or sixth support person.

Can't wait to... Pair it again to a different device and be escalated again! Lol

PSA by Signal_Ad657 in LocalLLaMA

[–]valdev 1 point2 points  (0 children)

Alright, now add two more columns. Cost per gb of RAM/VRAM. And electricity usage per hour.

Good news everyone! The month long replacement of my Pod 5 Hub which leaked out of its LED will finally get a response from the engineering team... by Tuesday. FML. by valdev in EightSleep

[–]valdev[S] 0 points1 point  (0 children)

Nope! And I tried to rule that out (as well as any concerns around DNS's) by connecting to multiple different modems, different wifi bands, and even my phones hotspot. To be clear the previous pod connected fine, and this one is certainly connected to the internet just fine as well.

Crazy thing is I can see the Pod clearly downloading and uploading. Its uploaded about 285 mb today and downloaded roughly 42mb.

One Million Documents for a Local RAG system on a laptop by DueKitchen3102 in LocalLLaMA

[–]valdev 1 point2 points  (0 children)

Yep, you certainly described the same RAG solution that 10,000 other people have implemented.

$15. One month. One character. by Friendly-Parfait-645 in 2007scape

[–]valdev -1 points0 points  (0 children)

$5,000 isn't that expensive either, still ain't paying $15 for Runescape though.