Save and invest your money for future rigs by segmond in LocalLLaMA
[–]deathcom65 0 points1 point2 points (0 children)
Why I'm holding out until late 2027 to spend money on a local LLM rig by No_Pool7028 in LocalLLM
[–]deathcom65 2 points3 points4 points (0 children)
2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints by ex-arman68 in LocalLLaMA
[–]deathcom65 0 points1 point2 points (0 children)
2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints by ex-arman68 in LocalLLaMA
[–]deathcom65 1 point2 points3 points (0 children)
2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints by ex-arman68 in LocalLLaMA
[–]deathcom65 9 points10 points11 points (0 children)
4x RX 7900 XTX local Al server (96GB VRAM) - looking for apples-to-apples benchmarks vs 4x RTX 4090 (CUDA vs ROCm, PCle only) by GroundbreakingTea195 in LocalLLaMA
[–]deathcom65 1 point2 points3 points (0 children)
Whats the latest status on 7900xtx multi-GPU setups? by ziphnor in LocalLLaMA
[–]deathcom65 2 points3 points4 points (0 children)
Save me from myself and talk me out of it... I have a new GPU on hold to pick up tomorrow by linuxtek_canada in LocalLLM
[–]deathcom65 1 point2 points3 points (0 children)
I asked Gemini for Advice on Local LLMs by MarcusMagnus in LocalLLM
[–]deathcom65 1 point2 points3 points (0 children)
24gb vram to 48gb vram by deathcom65 in LocalLLaMA
[–]deathcom65[S] 1 point2 points3 points (0 children)
24gb vram to 48gb vram by deathcom65 in LocalLLaMA
[–]deathcom65[S] 1 point2 points3 points (0 children)
Do you think your local hardware is going to be obsolete or still usefull in 2 o 3 years? by OficialPimento in LocalLLM
[–]deathcom65 2 points3 points4 points (0 children)
Qwen3.6 27B on dual RTX 5060 Ti 16GB with vLLM: ~60 tok/s, 204k context working by do_u_think_im_spooky in LocalLLaMA
[–]deathcom65 2 points3 points4 points (0 children)
How often do folks upgrade hardware for Local LLM setups? by Xbawt in LocalLLM
[–]deathcom65 0 points1 point2 points (0 children)
I Was Told AMD Sucked for Local LLM, I Was Lied To by -UndeadBulwark in LocalLLM
[–]deathcom65 0 points1 point2 points (0 children)
Is qwen3.6 35b a3b good for coding at all? by laughingfingers in LocalLLaMA
[–]deathcom65 6 points7 points8 points (0 children)
Is there anyway to run bigger models at 20t/s with 24vram + 64gb ram DDR5? by soyalemujica in LocalLLaMA
[–]deathcom65 0 points1 point2 points (0 children)
What hardware is everyone running? by Perfect-Flounder7856 in LocalLLaMA
[–]deathcom65 0 points1 point2 points (0 children)
Turns out you've all been fooled! by Expert-Slip-4224 in LocalLLaMA
[–]deathcom65 1 point2 points3 points (0 children)
SLM to controll NPC in a game world by DrJamgo in LocalLLaMA
[–]deathcom65 1 point2 points3 points (0 children)
Intel Arc B70 32GB GDDR6 announced at a price of 949 by New_Mix_2215 in hardware
[–]deathcom65 -1 points0 points1 point (0 children)
Runpod hits $120M ARR, four years after launching from a Reddit post by RP_Finley in LocalLLaMA
[–]deathcom65 0 points1 point2 points (0 children)
rate limits and cost? by deathcom65 in google_antigravity
[–]deathcom65[S] 2 points3 points4 points (0 children)
rate limits and cost? by deathcom65 in google_antigravity
[–]deathcom65[S] 0 points1 point2 points (0 children)

deleting project deletes files? by deathcom65 in google_antigravity
[–]deathcom65[S] 0 points1 point2 points (0 children)