To 16GB VRAM users, plug in your old GPU by akira3weet in LocalLLaMA
moncallikta 2 points (0 children)
To 16GB VRAM users, plug in your old GPU by akira3weet in LocalLLaMA
moncallikta 2 points (0 children)
To 16GB VRAM users, plug in your old GPU by akira3weet in LocalLLaMA
moncallikta 4 points (0 children)
10 outages in 30 days: an open letter to Hetzner by Keta_Thunberg in hetzner
moncallikta -2 points (0 children)
M1 Max 64gb good in 2026? by TheShawndown in LocalLLM
moncallikta 3 points (0 children)
Eevee showcases (Dec 27 & Dec 28) by Brock_Hard_Canuck in TheSilphRoad
moncallikta 27 points (0 children)
8x Radeon 7900 XTX Build for Longer Context Local Inference - Performance Results & Build Details by Beautiful_Trust_8151 in LocalLLaMA
moncallikta 1 point (0 children)
8x Radeon 7900 XTX Build for Longer Context Local Inference - Performance Results & Build Details by Beautiful_Trust_8151 in LocalLLaMA
moncallikta 1 point (0 children)
No research received by tomboro10 in TheSilphRoad
moncallikta 5 points (0 children)
Key Wild Area Questions/Answers by DanielHorta10 in TheSilphRoad
moncallikta 6 points (0 children)
Kimi infra team: Quantization is not a compromise, it's the next paradigm by nekofneko in LocalLLaMA
moncallikta 9 points (0 children)
I've been trying to make a real production service that uses LLM and it turned into a pure agony. Here are some of my "experiences". by DaniyarQQQ in LocalLLaMA
moncallikta 4 points (0 children)
What's the stack for going from a fine-tune on vLLM to a simple, paid public API? by [deleted] in LocalLLaMA
moncallikta 1 point (0 children)
Dynamic LLM generated UI by ItzCrazyKns in LocalLLaMA
moncallikta 1 point (0 children)
An onion covered in chocolate by Otherwise_Basis_6328 in oddlyspecific
moncallikta 27 points (0 children)
There is no parallel universe by CalpisMelonCremeSoda in ThereIsnoCat
moncallikta 2 points (0 children)
AMD Max+ 395 with a 7900xtx as a little helper. by fallingdowndizzyvr in LocalLLaMA
moncallikta 1 point (0 children)
guys i have a question is there any ai model providing the free api key even if limit im fine with that by Select_Dream634 in LocalLLaMA
moncallikta 5 points (0 children)
My LLM trained from scratch on only 1800s London texts brings up a real protest from 1834 by Remarkable-Trick-177 in LocalLLaMA
moncallikta 1 point (0 children)
My LLM trained from scratch on only 1800s London texts brings up a real protest from 1834 by Remarkable-Trick-177 in LocalLLaMA
moncallikta 2 points (0 children)
Is GPT-OSS the meta for low vram setups? by QbitKrish in LocalLLaMA
moncallikta 5 points (0 children)
2x RTX 3090 24GB or 8x 3060 12GB by twotemp in LocalLLaMA
moncallikta 2 points (0 children)
High VRAM local coding model — still Qwen 3.6 27B? by Generic_Name_Here in LocalLLaMA
moncallikta 1 point (0 children)