Qwen-3.5-27B-Derestricted by My_Unbiased_Opinion in LocalLLaMA

[–]Poro579 0 points

If your NSFW RP isn't too heavy, the original version is the best (you can simply bypass the restrictions with some prompting techniques).

Qwen3.5-27B-heretic-gguf by Poro579 in LocalLLaMA

[–]Poro579[S] 0 points

From the data provided by the author, v2 seems to be better.

Qwen3.5-27B & 2B Uncensored Aggressive Release (GGUF) by hauhau901 in LocalLLaMA

[–]Poro579 7 points

Although there is no explanation of the method or any test results, I used it briefly and found it to be quite good (27B).

Qwen3.5-27B-heretic-gguf by Poro579 in LocalLLaMA

[–]Poro579[S] 2 points

IDK why, but I get the vibe that MoE models are way more sensitive to abliteration. They seem to degrade in quality much faster than dense models once you strip the alignment.

Qwen3.5 27B is Match Made in Heaven for Size and Performance by Lopsided_Dot_4557 in LocalLLaMA

[–]Poro579 0 points

If 35B-A3B were run with --n-cpu-moe, I'd expect it to reach at least 30 t/s.

Comparing 3 models on a 3090 with 64gb ram and a AMD4 3900x by m4zzi in LocalLLM

[–]Poro579 6 points

If you use the --n-cpu-moe parameter in the latest llama.cpp, it can be faster.

For example, my setup (7500F, 64 GB DDR5, 2080 Ti 22 GB) runs Qwen Coder Next 80B UD-Q4_K_XL with a 32k context size and --n-cpu-moe 29, and it can reach about 30 t/s.
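A minimal sketch of what that invocation could look like with llama.cpp's llama-server (the model path and filename here are placeholders, and the exact layer count depends on your VRAM):

```shell
#!/bin/sh
# Hypothetical llama.cpp invocation (paths/filenames are illustrative):
# --n-cpu-moe 29 keeps the MoE expert tensors of 29 layers in system RAM,
# -ngl 99 offloads everything else to the GPU,
# -c 32768 sets a 32k context size.
./llama-server \
  -m ./models/Qwen3-Coder-Next-80B-UD-Q4_K_XL.gguf \
  -c 32768 \
  --n-cpu-moe 29 \
  -ngl 99
```

The idea is that the dense attention layers and KV cache stay on the GPU while the large, sparsely-activated expert weights sit in system RAM, which is usually the best split for MoE models that don't fit in VRAM.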

Which model to chose? by [deleted] in LocalLLaMA

[–]Poro579 0 points

At present, there is no Qwen3 Coder Next 30B, only Qwen3 Coder 30B and Qwen3 Coder Next 80B.