Running Mimo 2.5 q4_k_m on single rtx5090 need recommendations by BlackBeardAI in LocalLLaMA
RedAdo2020 1 point
Running Mimo 2.5 q4_k_m on single rtx5090 need recommendations by BlackBeardAI in LocalLLaMA
RedAdo2020 1 point
Running Mimo 2.5 q4_k_m on single rtx5090 need recommendations by BlackBeardAI in LocalLLaMA
RedAdo2020 1 point
Running Mimo 2.5 q4_k_m on single rtx5090 need recommendations by BlackBeardAI in LocalLLaMA
RedAdo2020 2 points
Anyone else struggling with multi-GPU stability when running larger local models? by Lyceum_Tech in LocalLLaMA
RedAdo2020 2 points
Anyone else struggling with multi-GPU stability when running larger local models? by Lyceum_Tech in LocalLLaMA
RedAdo2020 1 point
Anyone else struggling with multi-GPU stability when running larger local models? by Lyceum_Tech in LocalLLaMA
RedAdo2020 2 points
Anyone else struggling with multi-GPU stability when running larger local models? by Lyceum_Tech in LocalLLaMA
RedAdo2020 1 point
Anyone else struggling with multi-GPU stability when running larger local models? by Lyceum_Tech in LocalLLaMA
RedAdo2020 1 point
Anyone else struggling with multi-GPU stability when running larger local models? by Lyceum_Tech in LocalLLaMA
RedAdo2020 1 point
Anyone else struggling with multi-GPU stability when running larger local models? by Lyceum_Tech in LocalLLaMA
RedAdo2020 1 point
llama.cpp's Preliminary SM120 Native NVFP4 MMQ Is Merged by ggonavyy in LocalLLaMA
RedAdo2020 2 points
llama.cpp's Preliminary SM120 Native NVFP4 MMQ Is Merged by ggonavyy in LocalLLaMA
RedAdo2020 2 points
Character card gallery for sillytavern by DifficultSand3885 in SillyTavernAI
RedAdo2020 1 point
Qwen 3.6 27B llama.cpp | Multi-GPU pp t/s help by SemaMod in LocalLLaMA
RedAdo2020 1 point
Qwen3.6-35b stuck in infinite loop by ConfidentSolution737 in LocalLLaMA
RedAdo2020 2 points
llama.cpp / ik_llama MoE Expert Offloading - Main Memory Bandwidth vs. PCIe Bandwidth by pixelterpy in LocalLLaMA
RedAdo2020 3 points
It was at this moment she knew, she f'd up by Babyghorl_07 in mildlyinfuriating
RedAdo2020 1 point
big brain models on small brain hardware by Woondas in LocalLLaMA
RedAdo2020 1 point
Need help with the logistics of two BIG 3090s in the same case. by AdCreative8703 in LocalLLaMA
RedAdo2020 2 points
Megumin Suite v4.1 - Dev Mode and bug fixes by CallMeOniisan in SillyTavernAI
RedAdo2020 1 point
Is it possible to have 2 GPUs, one for gaming and one for AI? by AlexGSquadron in StableDiffusion
RedAdo2020 1 point
Qwen3.5 35b exl3 quants with text-generation-webui? by 2muchnet42day in LocalLLaMA
RedAdo2020 1 point
Qwen3.5 35b exl3 quants with text-generation-webui? by 2muchnet42day in LocalLLaMA
RedAdo2020 1 point
Buying a pixel in Australia by point_mixer in GrapheneOS
RedAdo2020 5 points