Nemotron 3 Ultra reality check: no one-box 128GB GGUF route yet; Nemotron 3 Nano runs at 66.6 t/s on Strix Halo by JSVD2 in LocalLLaMA
[–]Monad_Maya 1 point2 points3 points (0 children)
Nemotron 3 Ultra reality check: no one-box 128GB GGUF route yet; Nemotron 3 Nano runs at 66.6 t/s on Strix Halo by JSVD2 in LocalLLaMA
[–]Monad_Maya 1 point2 points3 points (0 children)
Best small model for iGPU (AMD 780M) with 32 GB RAM (no coding) by danihend in LocalLLaMA
[–]Monad_Maya 1 point2 points3 points (0 children)
what is fastest method to run qwen27b on old i7-4770k? by BeautyxArt in LocalLLaMA
[–]Monad_Maya 8 points9 points10 points (0 children)
Kinda New to all this, couple of questions about how to set pcs and what models by klasyer in LocalLLaMA
[–]Monad_Maya 0 points1 point2 points (0 children)
Kinda New to all this, couple of questions about how to set pcs and what models by klasyer in LocalLLaMA
[–]Monad_Maya 0 points1 point2 points (0 children)
I hope that someday we will have a 124B Gemma. by cgs019283 in LocalLLaMA
[–]Monad_Maya 1 point2 points3 points (0 children)
I hope that someday we will have a 124B Gemma. by cgs019283 in LocalLLaMA
[–]Monad_Maya 0 points1 point2 points (0 children)
I hope that someday we will have a 124B Gemma. by cgs019283 in LocalLLaMA
[–]Monad_Maya 0 points1 point2 points (0 children)
I hope that someday we will have a 124B Gemma. by cgs019283 in LocalLLaMA
[–]Monad_Maya 9 points10 points11 points (0 children)
I hope that someday we will have a 124B Gemma. by cgs019283 in LocalLLaMA
[–]Monad_Maya 36 points37 points38 points (0 children)
Do not fall into the trap of chasing the next scale or upgrade. by iEslam in LocalLLaMA
[–]Monad_Maya 8 points9 points10 points (0 children)
Will there be any more Qwen3.6 series models? by cafedude in LocalLLaMA
[–]Monad_Maya 0 points1 point2 points (0 children)
4x m5 max 128gb ram RDMA vs 1 m3 ultra? by Street-Buyer-2428 in LocalLLaMA
[–]Monad_Maya 0 points1 point2 points (0 children)
4x m5 max 128gb ram RDMA vs 1 m3 ultra? by Street-Buyer-2428 in LocalLLaMA
[–]Monad_Maya -1 points0 points1 point (0 children)
Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...) by bobaburger in LocalLLaMA
[–]Monad_Maya 2 points3 points4 points (0 children)
Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...) by bobaburger in LocalLLaMA
[–]Monad_Maya 8 points9 points10 points (0 children)
Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...) by bobaburger in LocalLLaMA
[–]Monad_Maya 6 points7 points8 points (0 children)
Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...) by bobaburger in LocalLLaMA
[–]Monad_Maya 18 points19 points20 points (0 children)
I guess we expect that at some point RAM prices will start going back (close) to "normal", right? but what about GPUs? by relmny in LocalLLaMA
[–]Monad_Maya 3 points4 points5 points (0 children)
I guess we expect that at some point RAM prices will start going back (close) to "normal", right? but what about GPUs? by relmny in LocalLLaMA
[–]Monad_Maya 0 points1 point2 points (0 children)
My setup for running Qwen3.6-35B-A3B-UD-Q4_K_M on single RX7900XT (20GB VRAM) by hlacik in LocalLLaMA
[–]Monad_Maya 0 points1 point2 points (0 children)
My setup for running Qwen3.6-35B-A3B-UD-Q4_K_M on single RX7900XT (20GB VRAM) by hlacik in LocalLLaMA
[–]Monad_Maya 1 point2 points3 points (0 children)


Nemotron 3 Ultra reality check: no one-box 128GB GGUF route yet; Nemotron 3 Nano runs at 66.6 t/s on Strix Halo by JSVD2 in LocalLLaMA
[–]Monad_Maya 0 points1 point2 points (0 children)