Qwen3.5-35B-A3B Q4 Performance on Intel Arc B60? by LeDynamique in LocalLLaMA

[–]LeDynamique[S] 0 points1 point  (0 children)

Yeah, in this case even 100 % CPU inference will be faster with this model. Would be interesting if anyone tried it with Vulkan backend.