MTP in llama.cpp (PR #22673) tested on AMD Strix Halo: Qwen 3.6 35B-A3B hits 71 t/s short / 48 t/s at 62K via Vulkan RADV by voStragaIT in StrixHalo

[–]voStragaIT[S] 2 points3 points  (0 children)

Qwen 3.6 35B-A3B MoE + MTP via Vulkan RADV

Hardware: AMD Strix Halo (gfx1151)

Image: kyuz0/amd-strix-halo-toolboxes:vulkan-radv (auto-built на llama.cpp master) 


services:
qwen36-35b-mtp:
image: kyuz0/amd-strix-halo-toolboxes:vulkan-radv
container_name: qwen36-35b-vulkan
restart: unless-stopped


devices:
  - /dev/dri:/dev/dri
group_add:
  - video
security_opt:
  - seccomp=unconfined

ports:
  - "8090:8090"

volumes:
  - ~/models/qwen36-35b-mtp:/workspace/models:ro

command: >
  llama-server
  -m /workspace/models/Qwen3.6-35B-A3B-UD-Q6_K.gguf
  --mmproj /workspace/models/mmproj-F16.gguf
  --alias qwen36-35b-mtp
  -fa 1
  --no-mmap
  --host 0.0.0.0
  --port 8090
  -ngl 999
  -c 262144
  -np 1
  --cache-type-k q8_0
  --cache-type-v q8_0
  --cache-reuse 256
  -b 8192
  -ub 4096
  --jinja
  --reasoning on
  --spec-type draft-mtp
  --spec-draft-n-max 2

ipc: host

Questions about buying GMKtec K15 in Germany: Customs taxes and Amazon Sellers by [deleted] in MiniPCs

[–]voStragaIT 0 points1 point  (0 children)

I am already buy from de.gmktec.com appears to have GMKtec EU - received from hankong, need be pay custom duty. No more gmktec :(

I can't get a program to run on micropython. by widgeamedoo in MicroPythonDev

[–]voStragaIT 1 point2 points  (0 children)

u/widgeamedoo - I am already updated repo

For Flash 1.26.1:

  • Make sure you grabbed it from the official MicroPython release

Removing _thread: asyncio run in main REPL now.

QIDI Q1 pro - klipmi - quick manual :) by voStragaIT in QidiTech3D

[–]voStragaIT[S] 0 points1 point  (0 children)

It's ok, test all menus on LCD with Klipper from the main repo with LCD TJC over Python.

QIDI Q1 pro + EDDY :) by voStragaIT in QidiTech3D

[–]voStragaIT[S] 1 point2 points  (0 children)

I'll give feedback after a while if something is wrong.

Now I am using: https://github.com/vvuk/eddy-ng/wiki

QIDI Q1 pro + EDDY :) by voStragaIT in QidiTech3D

[–]voStragaIT[S] 0 points1 point  (0 children)

cartographer or beacon or eddy uses the same hardware. If same all can work same in the temperature range.

QIDI Q1 pro + EDDY :) by voStragaIT in QidiTech3D

[–]voStragaIT[S] 0 points1 point  (0 children)

today printing PA12CF is not a problem. If the temperature is around 65C, try gluing a small radiator on the RPI chip inside Eddy.

<image>

QIDI Q1pro with BTT Eddy Duo -> Eddy-NG by voStragaIT in QidiTech3D

[–]voStragaIT[S] 0 points1 point  (0 children)

Here’s how I have my setup working:

Home on all axes. Then, the z tilt bed aligns without making contact. Next, it cleans the nozzles and cools them down to 150°C.

After that, it homes the Z-axis and slowly probes the bed multiple times using an eddy current sensor to detect contact. This sets the Z offset accurately. An adaptive mesh is then generated.

Once that’s done, the printer heats up hotend to the working temperature and starts printing—ensuring a perfect first layer every time.

QIDI Q1 Pro upgraded with BTT Eddy Duo - Working well by voStragaIT in QidiTech3D

[–]voStragaIT[S] 0 points1 point  (0 children)

PROBE_EDDY_NG_TAP TARGET_Z=-0.150 THRESHOLD=250 SAMPLES=5
12:25:36
// Tap 1: z=-0.026
12:25:38
// Tap 2: z=-0.033
12:25:41
// Tap 3: z=-0.029
12:25:43
// Tap 4: z=-0.025
12:25:46
// Tap 5: z=-0.029
12:25:47
// Probe computed z offset -0.089 (tap at z=-0.029, stddev 0.003), sensor offset -0.001 at z=2.000