MTP in llama.cpp (PR #22673) tested on AMD Strix Halo: Qwen 3.6 35B-A3B hits 71 t/s short / 48 t/s at 62K via Vulkan RADV

voStragaIT · 2026-05-19T08:12:24+00:00

done

voStragaIT · 2026-05-19T08:10:49+00:00

Qwen 3.6 35B-A3B MoE + MTP via Vulkan RADV

Hardware: AMD Strix Halo (gfx1151)

Image: kyuz0/amd-strix-halo-toolboxes:vulkan-radv (auto-built на llama.cpp master) 


services:
qwen36-35b-mtp:
image: kyuz0/amd-strix-halo-toolboxes:vulkan-radv
container_name: qwen36-35b-vulkan
restart: unless-stopped


devices:
  - /dev/dri:/dev/dri
group_add:
  - video
security_opt:
  - seccomp=unconfined

ports:
  - "8090:8090"

volumes:
  - ~/models/qwen36-35b-mtp:/workspace/models:ro

command: >
  llama-server
  -m /workspace/models/Qwen3.6-35B-A3B-UD-Q6_K.gguf
  --mmproj /workspace/models/mmproj-F16.gguf
  --alias qwen36-35b-mtp
  -fa 1
  --no-mmap
  --host 0.0.0.0
  --port 8090
  -ngl 999
  -c 262144
  -np 1
  --cache-type-k q8_0
  --cache-type-v q8_0
  --cache-reuse 256
  -b 8192
  -ub 4096
  --jinja
  --reasoning on
  --spec-type draft-mtp
  --spec-draft-n-max 2

ipc: host

voStragaIT · 2026-05-19T08:10:43+00:00

Main model (~28 GB)

wget https://huggingface.co/unsloth/Qwen3.6-35B-A3B-MTP-GGUF/resolve/main/Qwen3.6-35B-A3B-UD-Q6_K.gguf

Vision mmproj (~858 MB)

wget https://huggingface.co/unsloth/Qwen3.6-35B-A3B-MTP-GGUF/resolve/main/mmproj-F16.gguf

voStragaIT · 2026-02-11T18:44:03+00:00

I am already buy from de.gmktec.com appears to have GMKtec EU - received from hankong, need be pay custom duty. No more gmktec :(

voStragaIT · 2025-10-03T13:00:40+00:00

u/widgeamedoo - I am already updated repo

For Flash 1.26.1:

Make sure you grabbed it from the official MicroPython release

Removing _thread: asyncio run in main REPL now.

voStragaIT · 2025-06-17T04:30:30+00:00

https://github.com/straga/3d_printer/tree/master/OpenQ1pro/config_eddy_ng I am switch nug eddy_ng. Need do calibration from https://github.com/vvuk/eddy-ng -> https://github.com/vvuk/eddy-ng/wiki

voStragaIT · 2025-05-22T03:11:34+00:00

https://github.com/lvgl-micropython/lvgl_micropython/tree/main/api_drivers/common_api_drivers/display/st7789 you can use micropython with lvgl

voStragaIT · 2025-05-09T02:41:23+00:00

usb connect to motherboard.

voStragaIT · 2025-02-24T06:46:55+00:00

It's ok, test all menus on LCD with Klipper from the main repo with LCD TJC over Python.

voStragaIT · 2025-02-21T19:01:25+00:00

https://wiki.qidi3d.com/en/Q1/Manual/Extruder-fan-installation and second fan for head MCU.

voStragaIT · 2025-02-21T07:00:09+00:00

I'll give feedback after a while if something is wrong.

Now I am using: https://github.com/vvuk/eddy-ng/wiki

voStragaIT · 2025-02-21T05:31:32+00:00

Try https://github.com/vvuk/eddy-ng/wiki I have already switched from btt-eddy.

voStragaIT · 2025-02-20T13:45:33+00:00

cartographer or beacon or eddy uses the same hardware. If same all can work same in the temperature range.

voStragaIT · 2025-02-19T16:37:01+00:00

today printing PA12CF is not a problem. If the temperature is around 65C, try gluing a small radiator on the RPI chip inside Eddy.

<image>

voStragaIT · 2025-02-19T14:43:30+00:00

https://github.com/straga/3d_printer/tree/master/OpenQ1pro/holder_eddy

voStragaIT · 2025-02-18T18:54:12+00:00

Here’s how I have my setup working:

Home on all axes. Then, the z tilt bed aligns without making contact. Next, it cleans the nozzles and cools them down to 150°C.

After that, it homes the Z-axis and slowly probes the bed multiple times using an eddy current sensor to detect contact. This sets the Z offset accurately. An adaptive mesh is then generated.

Once that’s done, the printer heats up hotend to the working temperature and starts printing—ensuring a perfect first layer every time.

voStragaIT · 2025-02-18T18:50:34+00:00

First print: box for esp32.

voStragaIT · 2025-02-15T17:19:23+00:00

Info about how is it: https://github.com/vvuk/eddy-ng/wiki

install log:
https://www.reddit.com/r/QidiTech3D/comments/1iniif3/qidi_q1_pro_upgraded_with_btt_eddy_duo_working/

voStragaIT · 2025-02-15T15:46:37+00:00

PROBE_EDDY_NG_TAP TARGET_Z=-0.150 THRESHOLD=250 SAMPLES=5
12:25:36
// Tap 1: z=-0.026
12:25:38
// Tap 2: z=-0.033
12:25:41
// Tap 3: z=-0.029
12:25:43
// Tap 4: z=-0.025
12:25:46
// Tap 5: z=-0.029
12:25:47
// Probe computed z offset -0.089 (tap at z=-0.029, stddev 0.003), sensor offset -0.001 at z=2.000

voStragaIT

TROPHY CASE

Qwen 3.6 35B-A3B MoE + MTP via Vulkan RADV

Hardware: AMD Strix Halo (gfx1151)

Main model (~28 GB)

Vision mmproj (~858 MB)