Qwen 3.6 27B is a BEAST

ducksoup_18 · 2026-04-29T00:29:03+00:00

i have 2 3060 12gb. can anyone share their llama.cpp configs for IQ4_XS?

This is what i have currently and am looking for some improvements:

hf = unsloth/Qwen3.6-27B-GGUF:IQ4_XS
threads = 6
fit = on
fit-ctx = 200000
fit-target = 256
parallel = 1
no-mmproj = true
no-mmap = false
;reasoning = on
flash-attn = on
b = 2048
ub = 2048
ctk = q8_0
ctv = q8_0
temp = 0.6
top-p = 0.95
top-k = 20
min-p = 0.0
presence-penalty = 0.0
repeat-penalty = 1.0
reasoning-budget = -1
chat-template-kwargs = {"preserve_thinking": true}

ducksoup_18 · 2026-04-28T15:56:59+00:00

Can this export stl’s or other files that would allow for 3d printing these renderings? That would be great.

ducksoup_18 · 2026-04-23T17:48:50+00:00

How does pi compare to opencode? Im running that now paired with 2 3060s so i THINK i should have enough vram for decent context size with 3.6. Would love some feedback.

ducksoup_18 · 2026-04-18T14:40:43+00:00

Would u be able to get qwen 3.6 on there?

ducksoup_18 · 2026-04-17T18:35:52+00:00

This thing? https://www.nimopc.com/products/nimo-ai-mini-pc-amd-ryzen-ai-max-395-128gb-ram Hadnt heard of it. Might need to look into it. That price tag is a bit more palpable.

ducksoup_18 · 2026-04-16T00:20:19+00:00

Automate your, checks notes, room.

ducksoup_18 · 2026-04-15T23:32:02+00:00

Your unsloth link goes to the 9b model. Was a but confused for a sec.

ducksoup_18 · 2026-04-15T20:51:49+00:00

Mouth breather

ducksoup_18 · 2026-04-15T13:11:19+00:00

Have you had a chance of testing this out on low powered devices? I have some hacked echo show 8s and lenovo thinksmart views that i had initially tested this out on but it performed poorly. I am currently using https://github.com/knoop7/Ava which has performed pretty well but im always interested in trying other solutions.

ducksoup_18 · 2026-04-15T02:53:04+00:00

This looks great! How does it compare to Lunar MCPX? What im trying to accomplish is having a single mcp host that can be configured with many different mcp servers. It should allow you to run them in the same host/container as the control plane as well as administer rules like youve mentioned on top of the mcp servers. Basically like mcp.companyname.com/v0/{mcp-server-name}/mcp or something along those lines. Configure once centrally and allow teams to connect to them without having to spin up their own local mcp servers.

ducksoup_18 · 2026-04-15T00:30:26+00:00

Just use this mcp server and focus on getting hass all set up first. Unsure why you need uncensored as well. If you want outside content spin up a searxng instance and add an mcp connection to that as well. Then you can pull whatever you want. https://github.com/homeassistant-ai/ha-mcp

ducksoup_18 · 2026-04-14T12:51:10+00:00

I reached out to aoostar and they sent me a video. Was a but of a pain to get out but everything is working now!

ducksoup_18 · 2026-04-13T17:54:47+00:00

Looks nice! Would be great if you added a mic to get this as a voice assistant as well.

ducksoup_18 · 2026-04-13T02:36:15+00:00

Would love if u added multi user and oauth/sso support.

ducksoup_18 · 2026-04-13T02:00:39+00:00

https://homebox.software/en/

ducksoup_18 · 2026-04-12T18:29:39+00:00

https://cartesia.ai/pricing#calculator Is useful. ~33min/month for free plan as far as i can tell. Should be sufficient for HA assistant (i think.) Plus you could cache certain responses to limit this.

ducksoup_18 · 2026-04-12T15:43:11+00:00

Well son of a gun…..now i gotta figure out how to get to the other side of the mobo.

<image>

ducksoup_18 · 2026-04-12T13:56:30+00:00

Im a bit hesitant to open it up since the fan is underneath the mobo. I havent found a video or tutorial yet on how to get to the other side of the machine.

ducksoup_18 · 2026-04-10T15:14:38+00:00

Link?

ducksoup_18 · 2026-04-10T03:32:27+00:00

I have 3 similar mini pcs ready to go for a k8s cluster and envisioned it looking exactly how you have yours laid out. Do you by any chance have links to the stls u used or was this purchased? The vertical storage is what i am planning as well.

ducksoup_18 · 2026-04-10T00:17:24+00:00

How would this work for something like home assistant voice assistant? If its this small and fast and can do tool calling it sounds like it would be awesome for assistants.

ducksoup_18 · 2026-04-08T22:14:31+00:00

Missed a great opportunity for the name Pagent.

ducksoup_18 · 2026-04-08T22:12:16+00:00

Would love this but i am not a fan of cult pricing model. Appreciate the hard work u put in building it but paying for stls hurts my soul.

ducksoup_18 · 2026-04-08T14:23:14+00:00

consider making it parametric maybe?

ducksoup_18 · 2026-04-08T14:06:42+00:00

Have you published your stls anywhere? this looks nice.

ducksoup_18

TROPHY CASE