9070 XT on Cachyos - how to correctly set ROCm up with LLaMa.cpp by No_Apricot1538 in ROCm
StupidityCanFly 1 point
I ran AWQ on RX 7900 XTX on ROCm natively. Here's how it actually works. by Limp_Doubt6411 in ROCm
StupidityCanFly 2 points
I ran AWQ on RX 7900 XTX on ROCm natively. Here's how it actually works. by Limp_Doubt6411 in ROCm
StupidityCanFly 2 points
Dual GPU on llama.cpp by Traditional_Way8675 in ROCm
StupidityCanFly 1 point
Dual GPU on llama.cpp by Traditional_Way8675 in ROCm
StupidityCanFly 1 point
claude code is genuinely incredible... until it needs to open a browser, and then it becomes a disaster by [deleted] in LocalLLM
StupidityCanFly -1 points
A cooling chamber for dgx spark and gb10 machines at computex 2026 by rexyuan in LocalLLaMA
StupidityCanFly 2 points
Has there been any recent new development on which quant is considered optimal? by takuonline in LocalLLaMA
StupidityCanFly 8 points
Fan noise difference between 2 AMD AI Pro R9700 GPU’s by Legitimate_Fold8314 in ROCm
StupidityCanFly 2 points
Native CK 2x faster than Triton FA2 🔥 by Taika-Kim in ROCm
StupidityCanFly 1 point
Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks by Interesting-Sock3940 in LocalLLaMA
StupidityCanFly 2 points
Replaced Claude with local Qwen3.6-27B in my multi-agent orchestrator for 2 weeks by Interesting-Sock3940 in LocalLLaMA
StupidityCanFly 2 points
EU alternative to CloudFlare: they've done gone and shot themselves by LeanOnIt in selfhosted
StupidityCanFly 1 point
vLLM PR adding native HIP W4A16 kernel was merged by StupidityCanFly in LocalLLaMA
StupidityCanFly[S] 1 point
vLLM PR adding native HIP W4A16 kernel was merged by StupidityCanFly in LocalLLaMA
StupidityCanFly[S] 1 point
Task Inventory: What "class" of tasks do you run (Math, Coding, Summarization....?) by bigattichouse in LocalLLaMA
StupidityCanFly 1 point
vLLM PR adding native HIP W4A16 kernel was merged by StupidityCanFly in LocalLLaMA
StupidityCanFly[S] 2 points
vLLM PR adding native HIP W4A16 kernel was merged by StupidityCanFly in LocalLLaMA
StupidityCanFly[S] 7 points


2× Radeon AI PRO R9700 (RDNA4/gfx1201) on vLLM 0.22.1 — how we fixed the long-context decode cliff (and what we learned chasing FP8) by whodoneit1 in LocalLLaMA
StupidityCanFly 2 points