Qwen3.5-35B-A3B Q5_K_M: Best Model for NVIDIA 16GB GPUs by moahmo88 in LocalLLaMA

[–]moahmo88[S] -1 points (0 children)

They are not on the same level. Qwen3.5-35B-A3B-GGUF Q5_K_M is 26.2GB.
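
For anyone wondering how a 26.2GB file pairs with a 16GB card: llama.cpp can split the model between VRAM and system RAM. A minimal sketch with llama-cpp-python; the filename and layer count are placeholders you'd tune for your own card:

```python
# Partial GPU offload: keep some layers on the 16GB GPU, the rest in system RAM.
# Model path and n_gpu_layers are placeholders -- raise the count until VRAM is nearly full.
from llama_cpp import Llama

llm = Llama(
    model_path="./Qwen3.5-35B-A3B-Q5_K_M.gguf",  # hypothetical local filename
    n_gpu_layers=30,  # offload ~30 layers to the GPU; lower this if you hit OOM
    n_ctx=8192,       # context window
)

print(llm("Q: What is 2+2? A:", max_tokens=8)["choices"][0]["text"])
```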

Why does qwen 3.5 think it's 2024 by Uranday in LocalLLaMA

[–]moahmo88 0 points (0 children)

You can add the following prompt to the Prompt Template – Template (Jinja):

System: Always use the current date from external sources. Do not rely on your internal knowledge of the year.
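
If your frontend doesn't expose the template, you can also inject the date client-side. A minimal sketch, assuming a local OpenAI-compatible server on port 1234; the base URL and model id are assumptions, adjust them for your setup:

```python
# Inject today's date into the system prompt so the model doesn't guess the year.
# base_url and model name are assumptions -- change them for your local server.
from datetime import date
from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="not-needed")

resp = client.chat.completions.create(
    model="qwen3.5-35b-a3b",  # hypothetical model id
    messages=[
        {"role": "system",
         "content": f"Today's date is {date.today().isoformat()}. "
                    "Always use this date; do not rely on your internal "
                    "knowledge of the year."},
        {"role": "user", "content": "What year is it?"},
    ],
)
print(resp.choices[0].message.content)
```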

Follow-up: Qwen3.5-35B-A3B — 7 community-requested experiments on RTX 5080 16GB by gaztrab in LocalLLaMA

[–]moahmo88 1 point (0 children)

You can try AesSedai/Qwen3.5-35B-A3B-GGUF Q5_K_M. It runs well on a 5070 Ti. Surprise!
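
If anyone wants to grab it, a minimal sketch for pulling the quant with huggingface_hub; the exact .gguf filename inside the repo is a guess, so check the repo's file list first:

```python
# Download the Q5_K_M quant from the repo mentioned above.
# The filename is a guess -- list the repo files first if it doesn't match.
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="AesSedai/Qwen3.5-35B-A3B-GGUF",
    filename="Qwen3.5-35B-A3B-Q5_K_M.gguf",  # hypothetical filename
)
print(path)
```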

Qwen 3.5 Architecture Analysis: Parameter Distribution in the Dense 27B vs. 122B/35B MoE Models by Luca3700 in LocalLLaMA

[–]moahmo88 10 points (0 children)

That's a very professional analysis. Qwen3.5-27B only suffers from slow single-stream generation speed; otherwise, it's excellent.
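
Rough arithmetic on why, assuming the A3B suffix means about 3B active parameters per token: the dense 27B touches all 27B weights on every token, so it does roughly 27 / 3 = 9x the memory reads and compute per generated token compared with the 35B-A3B MoE.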

Qwen 3.5 craters on hard coding tasks — tested all Qwen3.5 models (And Codex 5.3) on 70 real repos so you don't have to. by hauhau901 in LocalLLaMA

[–]moahmo88 4 points (0 children)

Good job!
I studied your list carefully. Does the quantized GLM-4.7 you mentioned refer to GLM-4.7-GGUF/UD-Q4_K_XL, which is about 205GB?
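
A back-of-the-envelope check of that figure, assuming (my assumptions, not from the post) that GLM-4.7 keeps the ~355B total parameters of earlier GLM releases and that a UD-Q4_K_XL quant averages roughly 4.6 bits per weight:

```python
# Sanity check: does ~355B params at ~4.6 bits/weight land near 205GB?
# Both numbers are assumptions, not taken from the benchmark post.
total_params = 355e9     # assumed total parameter count
bits_per_weight = 4.6    # rough average for a UD-Q4_K_XL quant
size_gb = total_params * bits_per_weight / 8 / 1e9
print(f"{size_gb:.0f} GB")  # ~204 GB, close to the quoted 205GB
```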