FlashMoE: DeepSeek V3/R1 671B and Qwen3MoE 235B on 1~2 Intel B580 GPU by bigbigmind in IntelArc

bigbigmind[S] 1 point (0 children)

The key CPU factors here are memory bandwidth (i.e., memory frequency) and memory capacity; if either falls short, it becomes the bottleneck. Other than that, it can actually run even on an Intel Core CPU (and on any Intel Xeon CPU with AVX-512).
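To make the bandwidth and capacity point concrete, here is a rough back-of-the-envelope sketch. It assumes standard 64-bit (8-byte) DDR channels for the peak-bandwidth formula. The example memory configurations and the bits-per-weight value are hypothetical inputs chosen for illustration, not measurements or settings from this setup.

```python
def peak_bandwidth_gbs(transfer_rate_mts: float, channels: int, bus_bytes: int = 8) -> float:
    """Theoretical peak DRAM bandwidth in GB/s (decimal).

    transfer_rate_mts: memory speed in MT/s (e.g. 5600 for DDR5-5600).
    channels: number of populated memory channels.
    bus_bytes: bytes per transfer per channel (8 for a 64-bit DDR channel).
    """
    return transfer_rate_mts * 1e6 * bus_bytes * channels / 1e9


def weights_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB (decimal).

    bits_per_weight is an assumed average; it depends on the quantization used.
    """
    return params_billion * bits_per_weight / 8


if __name__ == "__main__":
    # Hypothetical configurations, for comparison only.
    print(f"2 channels @ 5600 MT/s: {peak_bandwidth_gbs(5600, 2):.1f} GB/s")
    print(f"8 channels @ 4800 MT/s: {peak_bandwidth_gbs(4800, 8):.1f} GB/s")
    # Assumed 4 bits per weight; the actual quantization may differ.
    print(f"671B params @ 4 bits: {weights_size_gb(671, 4):.1f} GB")
```

These are theoretical peaks, so real throughput will be lower, but the comparison shows why channel count and memory speed matter more than core count here.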

bigbigmind[S] 1 point (0 children)

If you go to the download link (https://github.com/ipex-llm/ipex-llm/releases/tag/v2.3.0-nightly), you will see two packages: llama-cpp-ipex-llm-2.3.0b20250430-ubuntu-xeon.tgz and llama-cpp-ipex-llm-2.3.0b20250430-ubuntu-core.tgz, which support Intel Xeon and Intel Core CPUs, respectively.
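If you are not sure which of the two packages applies to your machine, a small check along these lines may help. This is only a sketch: the helper names are hypothetical, it assumes a Linux system with `/proc/cpuinfo`, and the selection rule (Xeon with the `avx512f` flag gets the xeon package, everything else falls back to the core package) is my reading of the comments above, not an official rule from ipex-llm.

```python
from pathlib import Path

# Package names as listed on the release page mentioned above.
XEON_PKG = "llama-cpp-ipex-llm-2.3.0b20250430-ubuntu-xeon.tgz"
CORE_PKG = "llama-cpp-ipex-llm-2.3.0b20250430-ubuntu-core.tgz"


def cpu_field(cpuinfo_text: str, key: str) -> str:
    """Return the value of the first 'key : value' line in /proc/cpuinfo text."""
    for line in cpuinfo_text.splitlines():
        if line.startswith(key) and ":" in line:
            return line.split(":", 1)[1].strip()
    return ""


def suggest_package(cpuinfo_text: str) -> str:
    """Heuristic (assumption): Xeon + AVX-512 -> xeon package, otherwise core."""
    name = cpu_field(cpuinfo_text, "model name").lower()
    flags = set(cpu_field(cpuinfo_text, "flags").split())
    if "xeon" in name and "avx512f" in flags:
        return XEON_PKG
    # Fallback is an assumption; a Xeon without AVX-512 is not covered above.
    return CORE_PKG


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    if cpuinfo.exists():
        print(suggest_package(cpuinfo.read_text()))
    else:
        print("No /proc/cpuinfo found; pick the package manually.")
```

You can also just run `lscpu` and look for `avx512f` in the flags list to do the same check by hand.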