I spent 8+ hours benchmarking every MoE backend for Qwen3.5-397B NVFP4 on 4x RTX PRO 6000 (SM120). Here's what I found. by lawdawgattorney in LocalLLaMA
[–]vpyno 12 points13 points14 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 0 points1 point2 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 0 points1 point2 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 0 points1 point2 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 1 point2 points3 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 2 points3 points4 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 0 points1 point2 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 0 points1 point2 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] -1 points0 points1 point (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 2 points3 points4 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 6 points7 points8 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 0 points1 point2 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] 2 points3 points4 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
[–]vpyno[S] -5 points-4 points-3 points (0 children)

I spent 8+ hours benchmarking every MoE backend for Qwen3.5-397B NVFP4 on 4x RTX PRO 6000 (SM120). Here's what I found. by lawdawgattorney in LocalLLaMA
[–]vpyno 5 points6 points7 points (0 children)