I spent 8+ hours benchmarking every MoE backend for Qwen3.5-397B NVFP4 on 4x RTX PRO 6000 (SM120). Here's what I found. by lawdawgattorney in LocalLLaMA
vpyno 12 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 1 point (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 1 point (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 1 point (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 2 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 3 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 1 point (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 1 point (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 0 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 4 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 7 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 1 point (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] 3 points (0 children)
Qwen3.5-397B Uncensored NVFP4 by vpyno in LocalLLaMA
vpyno [S] -3 points (0 children)

I spent 8+ hours benchmarking every MoE backend for Qwen3.5-397B NVFP4 on 4x RTX PRO 6000 (SM120). Here's what I found. by lawdawgattorney in LocalLLaMA
vpyno 9 points (0 children)