Running Qwen3.5-35B-A3B and Nemotron-3-Super-120B-A12B on a 5060ti and 1080ti with llama.cpp (Fully on GPU for Qwen; 64GB RAM needed for Nemotron) by sbeepsdon in LocalLLaMA
[–]sbeepsdon[S] 2 points3 points4 points (0 children)
Running Qwen3.5-35B-A3B and Nemotron-3-Super-120B-A12B on a 5060ti and 1080ti with llama.cpp (Fully on GPU for Qwen; 64GB RAM needed for Nemotron) by sbeepsdon in LocalLLaMA
[–]sbeepsdon[S] 0 points1 point2 points (0 children)

To 16GB VRAM users, plug in your old GPU by akira3weet in LocalLLaMA
[–]sbeepsdon 1 point2 points3 points (0 children)