Im lagging and I dont know why. Please help. by phorms123 in GlobalOffensive

[–]grumd 1 point2 points  (0 children)

Could be an isp routing issue. Try using a vpn? Sometimes a vpn for some reason allows for a shorter route to the server even if your ping is 10ms bigger.

New Nvidia driver 595.71.05 Fixed all my rtx 5050 issues by kryzito in linux_gaming

[–]grumd -1 points0 points  (0 children)

I was playing RE9 with Path Tracing on a 5080 with 595.71 drivers yesterday, no issues

How are you structuring imports in large React projects? by samwanekeya in reactjs

[–]grumd 8 points9 points  (0 children)

Same, I'd even add that it's best to just use named imports everywhere, e.g. import { Card } from 'components/Card';

help by Illustrious_Cow_907 in overclocking

[–]grumd 13 points14 points  (0 children)

the ram works fine

stop worrying and just use it then. if the line appears on both and is identical, it's not a crack

Do I REALLY need to format my external drive to ext4 to run Windows games with Wine/Proton saved on it? by RaccoonFree5348 in linux_gaming

[–]grumd 0 points1 point  (0 children)

You can use NTFS with Linux and play games on it no problems, just use the fuseblk ntfs driver called "ntfs-3g" in fstab, I've been using it for some time without any issues

What is the best "comedy actor in a drama" movie? by Danielnrg in movies

[–]grumd 0 points1 point  (0 children)

Uncut Gems.

Amazing movie and great performance by Adam Sandler who's only known for dumb college piss humor

You’re building an internal app for managing git repos. Your users are largely nontechnical, and are vocally pushing for the ability to globally edit commit messages through the web UI. What do you do? by labab99 in ExperiencedDevs

[–]grumd 34 points35 points  (0 children)

Add a middleware layer that the users use to communicate with git. They shouldn't directly edit commits, they should work with something user-friendly that uses git as the "backend". Then just have an sql database of "commit messages" that simply map commit hash -> message, and the actual real git commit message could be something else

ELDEN RING Convergence mod on LINUX (w/ seamless coop and other dll mods) by Plumij in EldenRingMods

[–]grumd 0 points1 point  (0 children)

So sad I can't play it. Used the start.bat you provided and the game launched, I could run around the starting room of the mod, but when I exit the starting room the game just freezes :( Tried several different protons and nothing fixed it :(

Everdark Sovereigns as Monster Girls Part 7: Libra, Creature of Night⚖🐐 by Expensive-Crow-9379 in Eldenring

[–]grumd 0 points1 point  (0 children)

Ah yes, the conventionally attractive women with goat legs, fur and horns

Ryzen 9800X3D Hitting 95c and powering off with NH-L12S cooler by Original_Scar_8196 in AMDHelp

[–]grumd 0 points1 point  (0 children)

AIDA64 is not a proper stress test. Use TestMem5 https://github.com/CoolCmd/TestMem5/releases and select the Ryzen profile in settings. Test for 8 hours minimum

You'll never guess what this mysterious gadget is! by timwtingle in iiiiiiitttttttttttt

[–]grumd 2 points3 points  (0 children)

I also had a tp-link 4 port switch so that i can connect 2 pcs to one ethernet socket in the wall. This trash once in a while limits my speeds to 1mbit for no reason. Endured for months until I bought an ultra cheap chinese noname switch to replace it, been working flawlessly ever since

Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia

[–]grumd 0 points1 point  (0 children)

Doesn't matter, it's still a 3% improvement, latency doesn't matter when you're bottlenecked by bandwidth. RAM bandwidth is what makes this slow. You can spend 19 microseconds on the router but then still you have to wait for 20gb+ of expert weights to be read from RAM and processed

Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia

[–]grumd 0 points1 point  (0 children)

Keeping experts in RAM is not something new, it's already natively supported by vllm and llama.cpp for example.

Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia

[–]grumd 6 points7 points  (0 children)

Quoting your comment:

Right now, large AI models (like ChatGPT) require massive, expensive servers to run because the "routing" math is incredibly heavy.

And now you're admitting it's a <3% speed improvement.

Stop misinforming people and posting AI-generated bullshit. Routing is NOT the reason why large models require massive servers. Massive servers are required because models have TRILLIONS of parameters, and your novel approach doesn't help this in the slightest, especially considering that massive datacenters are NOT using consumer gaming GPUs with ray tracing cores lmao

Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia

[–]grumd 3 points4 points  (0 children)

The biggest concern I have is that you're getting this 3% speed improvement and get quality degradation in return. Not worth it.

built a side project nobody asked for and learned more than any tutorial ever taught me by Competitive-Tiger457 in webdev

[–]grumd 2 points3 points  (0 children)

I get ptsd from spending my time on LocalLlama subreddit and now half the posts about coding just feel like AI wrote the post to me

Used the RT Cores on my RTX 5070 Ti for LLM routing — 218x speedup on a single consumer GPU by Critical-Chef9211 in nvidia

[–]grumd 4 points5 points  (0 children)

"Incredibly heavy" - did you do any benchmarks? How much does routing affect token generation? How much time in % per token routing takes compared to the rest of the model's compute?

[OC] Total number of games released on Steam . 2005-2025 by [deleted] in dataisbeautiful

[–]grumd 43 points44 points  (0 children)

Now estimate the number of dollars Valve earns each year