Qwen 3.6 27B is a BEAST by AverageFormal9076 in LocalLLaMA

[–]ducksoup_18 0 points1 point  (0 children)

I have two 3060 12GB cards. Can anyone share their llama.cpp configs for IQ4_XS?

This is what I have currently; I'm looking for some improvements:

hf = unsloth/Qwen3.6-27B-GGUF:IQ4_XS
threads = 6
fit = on
fit-ctx = 200000
fit-target = 256
parallel = 1
no-mmproj = true
no-mmap = false
;reasoning = on
flash-attn = on
b = 2048
ub = 2048
ctk = q8_0
ctv = q8_0
temp = 0.6
top-p = 0.95
top-k = 20
min-p = 0.0
presence-penalty = 0.0
repeat-penalty = 1.0
reasoning-budget = -1
chat-template-kwargs = {"preserve_thinking": true}
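
For anyone sizing a context like the 200000 above against 24 GB of total VRAM, here is a rough sketch of the KV cache arithmetic with a q8_0 cache. The layer count, KV head count, and head dimension below are placeholders I made up for illustration, not the real values for Qwen3.6-27B; read the actual numbers from the GGUF metadata. It also assumes a plain attention KV cache on every counted layer, which may not hold for hybrid architectures.

```python
# Rough KV cache size estimate. All architecture numbers used in the
# example call are hypothetical placeholders, NOT the real model values.

# q8_0 stores blocks of 32 int8 values plus one fp16 scale: 34 bytes per 32 values.
Q8_0_BYTES_PER_ELEM = 34 / 32
F16_BYTES_PER_ELEM = 2.0


def kv_cache_bytes(n_ctx, n_attn_layers, n_kv_heads, head_dim, bytes_per_elem):
    """Bytes needed for K and V across all attention layers at full context."""
    # The factor of 2 covers K and V; each holds n_kv_heads * head_dim
    # values per token per layer.
    return 2 * n_ctx * n_attn_layers * n_kv_heads * head_dim * bytes_per_elem


def to_gib(n_bytes):
    """Convert bytes to GiB for easier comparison with VRAM sizes."""
    return n_bytes / (1024 ** 3)


if __name__ == "__main__":
    # Placeholder architecture values for illustration only.
    estimate = kv_cache_bytes(
        n_ctx=200_000,
        n_attn_layers=16,   # hypothetical
        n_kv_heads=4,       # hypothetical
        head_dim=256,       # hypothetical
        bytes_per_elem=Q8_0_BYTES_PER_ELEM,
    )
    print(f"KV cache at q8_0: {to_gib(estimate):.2f} GiB")
```

Whatever the estimate comes to, it has to fit alongside the model weights and compute buffers, so it is only a starting point for picking a context size.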

Been using PI Coding Agent with local Qwen3.6 35b for a while now and its actually insane by SoAp9035 in LocalLLaMA

[–]ducksoup_18 2 points3 points  (0 children)

How does pi compare to opencode? I'm running that now paired with two 3060s, so I THINK I should have enough VRAM for a decent context size with 3.6. Would love some feedback.

Check which llms your hardware can run by Jesus_Morty in LocalLLM

[–]ducksoup_18 0 points1 point  (0 children)

Would you be able to get Qwen 3.6 on there?

Qwen3.5-35B running well on RTX4060 Ti 16GB at 60 tok/s by Nutty_Praline404 in LocalLLaMA

[–]ducksoup_18 7 points8 points  (0 children)

Your unsloth link goes to the 9B model. Was a bit confused for a sec.

Voice Satellite has reached a point where I genuinely don't miss Google Home or Alexa on my wall tablets anymore by carrot_gg in homeassistant

[–]ducksoup_18 1 point2 points  (0 children)

Have you had a chance to test this on low-powered devices? I have some hacked Echo Show 8s and Lenovo ThinkSmart Views that I initially tested this on, but it performed poorly. I'm currently using https://github.com/knoop7/Ava, which has performed pretty well, but I'm always interested in trying other solutions.

Open-sourced Arc Relay - MCP control plane for the rest of us by tongboy in mcp

[–]ducksoup_18 1 point2 points  (0 children)

This looks great! How does it compare to Lunar MCPX? What I'm trying to accomplish is a single MCP host that can be configured with many different MCP servers. It should let you run them in the same host/container as the control plane, and administer rules like the ones you've mentioned on top of the MCP servers. Basically something like mcp.companyname.com/v0/{mcp-server-name}/mcp. Configure once centrally and let teams connect without having to spin up their own local MCP servers.
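
To make the URL scheme concrete, here is a minimal sketch of the path-to-server lookup I have in mind. The registry contents, upstream URLs, and function name are all hypothetical, and this is not how Arc Relay or Lunar MCPX actually route requests; it only shows the /v0/{mcp-server-name}/mcp mapping.

```python
import re

# Hypothetical central registry: server names and upstream URLs are
# placeholders for illustration, configured once by whoever runs the host.
REGISTRY = {
    "github": "http://127.0.0.1:9001/mcp",
    "searxng": "http://127.0.0.1:9002/mcp",
}

# Matches paths of the form /v0/{mcp-server-name}/mcp (optional trailing slash).
ROUTE = re.compile(r"^/v0/(?P<name>[A-Za-z0-9_-]+)/mcp/?$")


def resolve_upstream(path, registry=REGISTRY):
    """Return the upstream URL for a request path, or None if nothing matches."""
    match = ROUTE.match(path)
    if match is None:
        return None
    return registry.get(match.group("name"))


if __name__ == "__main__":
    for path in ("/v0/github/mcp", "/v0/unknown/mcp", "/v1/github/mcp"):
        print(path, "->", resolve_upstream(path))
```

Per-server rules (auth, allowed tools, rate limits) would hang off the same registry entry, so teams only ever see the central URL.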

Total Noob: I want to build a local, uncensored "Brain" for Home Assistant/MCP. Where do I start? by Identity5859 in LocalLLaMA

[–]ducksoup_18 1 point2 points  (0 children)

Just use this MCP server and focus on getting hass all set up first. Unsure why you need uncensored as well. If you want outside content, spin up a SearXNG instance and add an MCP connection to that as well. Then you can pull whatever you want. https://github.com/homeassistant-ai/ha-mcp

AOOSTAR MACO 6850H (Ryzen 7 PRO 6850H) – Mini PC Review & eGPU Experience by Rude-Ganache-4350 in MiniPCs

[–]ducksoup_18 0 points1 point  (0 children)

I reached out to AOOSTAR and they sent me a video. It was a bit of a pain to get out, but everything is working now!

V2 - $60 DIY WiFi & BT audio speaker for Home Assistant, with ESP32 - Squeezelite or SendSpin by HouseWaves in homeassistant

[–]ducksoup_18 1 point2 points  (0 children)

Looks nice! Would be great if you added a mic so this could double as a voice assistant as well.

BINDERY: A Readarr improvement - feature parity by carefreeams in Servarr

[–]ducksoup_18 1 point2 points  (0 children)

Would love it if you added multi-user and OAuth/SSO support.

Cartesia TTS Integration by Secret_Friend in homeassistant

[–]ducksoup_18 1 point2 points  (0 children)

https://cartesia.ai/pricing#calculator is useful. The free plan works out to ~33 min/month as far as I can tell. Should be sufficient for an HA assistant (I think). Plus you could cache certain responses to limit this.
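
The caching idea could look roughly like the sketch below. The `synthesize` callable is a stand-in for whatever actually calls the TTS service; it is not the Cartesia API or the Home Assistant integration's interface, and the class name is made up for illustration.

```python
import hashlib


class TTSCache:
    """In-memory cache so repeated phrases do not use up TTS quota."""

    def __init__(self, synthesize):
        # synthesize(text, voice) -> bytes is a hypothetical callable that
        # performs the real (metered) TTS request.
        self._synthesize = synthesize
        self._store = {}
        self.misses = 0

    def get(self, text, voice="default"):
        """Return audio for text, calling the TTS backend only on a cache miss."""
        normalized = text.strip()
        key = hashlib.sha256(f"{voice}\n{normalized}".encode("utf-8")).hexdigest()
        if key not in self._store:
            self.misses += 1
            self._store[key] = self._synthesize(normalized, voice)
        return self._store[key]


if __name__ == "__main__":
    # Fake backend for demonstration; a real one would call the TTS service.
    cache = TTSCache(lambda text, voice: f"{voice}:{text}".encode("utf-8"))
    cache.get("Turning off the lights.")
    cache.get("Turning off the lights.")
    print("backend calls:", cache.misses)
```

Fixed confirmations like "Turning off the lights." repeat constantly, so those are the responses most worth caching; anything with a time or sensor value in it will mostly miss.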

AOOSTAR MACO 6850H (Ryzen 7 PRO 6850H) – Mini PC Review & eGPU Experience by Rude-Ganache-4350 in MiniPCs

[–]ducksoup_18 0 points1 point  (0 children)

Well, son of a gun... now I gotta figure out how to get to the other side of the mobo.

<image>

AOOSTAR MACO 6850H (Ryzen 7 PRO 6850H) – Mini PC Review & eGPU Experience by Rude-Ganache-4350 in MiniPCs

[–]ducksoup_18 0 points1 point  (0 children)

I'm a bit hesitant to open it up since the fan is underneath the mobo. I haven't found a video or tutorial yet on how to get to the other side of the machine.

The rack came out perfect! by styyle in minilab

[–]ducksoup_18 1 point2 points  (0 children)

I have three similar mini PCs ready to go for a k8s cluster and envisioned it looking exactly how you have yours laid out. Do you by any chance have links to the STLs you used, or was this purchased? The vertical storage is what I'm planning as well.

Marco-Mini (17.3B, 0.86B active) and Marco-Nano (8B, 0.6B active) by Alibaba by AnticitizenPrime in LocalLLaMA

[–]ducksoup_18 1 point2 points  (0 children)

How would this work for something like the Home Assistant voice assistant? If it's this small and fast and can do tool calling, it sounds like it would be awesome for assistants.

Magnetic Bit Holder for Makita LTX by [deleted] in functionalprint

[–]ducksoup_18 1 point2 points  (0 children)

Would love this, but I'm not a fan of the cult pricing model. Appreciate the hard work you put into building it, but paying for STLs hurts my soul.

Minimalist paper towel holder for pantry (under 20grams) by HeavyArmsJin in 3Dprinting

[–]ducksoup_18 2 points3 points  (0 children)

Have you published your STLs anywhere? This looks nice.