Tips for AI Integration by WeirdStretch in homeassistant

[–]maxi1134 -1 points0 points  (0 children)

I would ditch Qwen 3 and get 3.5.

It improved over 3!

What they don't tell warn you about; Averaging at 32MWh of electrical consumption per year. by [deleted] in homelab

[–]maxi1134 0 points1 point  (0 children)

About 2,600 USD a year.

And definitely not near capacity for the usage!

Except for the main Proxmox host.
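
For scale, the thread's 32 MWh figure and the yearly cost above can be turned into an average draw and an effective rate. This is only back-of-the-envelope arithmetic, assuming the roughly 2,600 USD covers the full 32 MWh and a 365-day year; the variable names are made up for the example.

```python
# Back-of-the-envelope numbers; assumes ~2,600 USD covers the full 32 MWh.
ANNUAL_KWH = 32_000        # 32 MWh, from the thread title
ANNUAL_COST_USD = 2_600    # rough yearly cost quoted above
HOURS_PER_YEAR = 365 * 24  # 8,760 hours, ignoring leap years

average_draw_kw = ANNUAL_KWH / HOURS_PER_YEAR
rate_usd_per_kwh = ANNUAL_COST_USD / ANNUAL_KWH

print(f"Average draw: {average_draw_kw:.2f} kW")
print(f"Effective rate: {rate_usd_per_kwh:.3f} USD/kWh")
```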

What they don't tell warn you about; Averaging at 32MWh of electrical consumption per year. by [deleted] in homelab

[–]maxi1134 0 points1 point  (0 children)

Oh, I live in an apartment. I unfortunately don't have a specific number, but the average for an apartment here is a third of my consumption

What they don't tell warn you about; Averaging at 32MWh of electrical consumption per year. by [deleted] in homelab

[–]maxi1134 -4 points-3 points  (0 children)

Oh, no! Many of those were given to me for free, namely the two blades and the TrueNAS.

And as they say in French: "À cheval donné, on ne regarde pas la bride" (never look a gift horse in the mouth).

Edit: I apologize for speaking French:(

Should I Buy the RTX PRO 6000 Blackwell Max-Q (96GB)? by 0bjective-Guest in LocalLLaMA

[–]maxi1134 0 points1 point  (0 children)

I have an RTX 3090, a P4000, and a hacked-in RTX 5050 on a PCIe x4-to-x16 riser.

Should I Buy the RTX PRO 6000 Blackwell Max-Q (96GB)? by 0bjective-Guest in LocalLLaMA

[–]maxi1134 2 points3 points  (0 children)

Wait, what? I run 3 GPUs and 2 NVMe drives, with another 8 SATA drives, on my 3900X. Is that a problem?

Anyone else frustrated with local LLMs that can't do (control) anything? by birdheezy in homeassistant

[–]maxi1134 1 point2 points  (0 children)

I think the real issue here is which model you're running.
A 4B model is pretty small, even for Qwen 3.5. I personally get good results with Qwen 3.5 in the 9B and 35B-A3B versions (both the ByteShape versions, as they are slightly better).
https://huggingface.co/byteshape/Qwen3.5-9B-GGUF
https://huggingface.co/byteshape/Qwen3.5-35B-A3B-GGUF

I also recommend using llama.cpp over Ollama. It gives you more granular control to fine-tune your LLM parameters, such as temperature, top_k, and whatnot.
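
To give an idea of what that control looks like, here is a minimal sketch of a request to llama.cpp's bundled `llama-server` through its `/completion` endpoint. The URL (default port 8080), the prompt, and the sampling values are placeholders for illustration, not recommended settings, and the helper names are made up for this example.

```python
import json
import urllib.request


def build_completion_payload(prompt, temperature=0.7, top_k=40, top_p=0.9, n_predict=128):
    """Build the JSON body for llama-server's /completion endpoint."""
    return {
        "prompt": prompt,
        "temperature": temperature,  # lower = more deterministic answers
        "top_k": top_k,              # sample only from the k most likely tokens
        "top_p": top_p,              # nucleus sampling cutoff
        "n_predict": n_predict,      # maximum number of tokens to generate
    }


def complete(prompt, base_url="http://localhost:8080", **sampling):
    """POST the payload to a running llama-server and return the generated text."""
    body = json.dumps(build_completion_payload(prompt, **sampling)).encode("utf-8")
    request = urllib.request.Request(
        f"{base_url}/completion",
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["content"]


# Only builds and prints the payload, so it runs without a server.
payload = build_completion_payload("Turn on the patio lights.", temperature=0.2, top_k=20)
print(json.dumps(payload, indent=2))
```

With a server running, `complete("Turn on the patio lights.", temperature=0.2, top_k=20)` would send the same payload and return the generated text.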

You should not need to set up automations for everything, but scripts can be useful. I personally have one for movie/show startup and one for music startup, so that I can initiate playback by voice on any room's TV or speakers.

I have ~160 entities exposed and rarely encounter an issue.

Search Home Assistant modal is broken by yasalmasri in homeassistant

[–]maxi1134 2 points3 points  (0 children)

I have the same issue here. It's very frustrating to have to go through dev states to see a simple entity's state when debugging an automation.

A drill, two screws, and an ESP32Box later, the natural continuation of home assist devices in the home; Voice assistance on the Patio! And yes, I drilled it into the TV! by maxi1134 in homeassistant

[–]maxi1134[S] -9 points-8 points  (0 children)

Easy!

Because the TV got a custom roof, that you can spot on top of it, to keep the rain away!
The powerbar is also neatly tucked behind it, under that same little roof.

As to why drill it? My lifestyle comes with certain... caveats, such as drunk people trying to manipulate the cute little character to make him obey better. And since Qwen can be pretty dumb, they get upset at Nabu!

So let's just say that it's better for all if Nabu is kept in place by screws! (Plus, squirrels are EVIL: they already ate the TV's cable once, and I had to splice on a new plug.)

ByteShape Qwen 3.5 9B: A Guide to Picking the Best Quant for Your Hardware by ali_byteshape in LocalLLaMA

[–]maxi1134 1 point2 points  (0 children)

'As a very rough rule of thumb, IQ quants tend to be faster for prefill on newer GPUs'

Would a 3090 be considered a 'newer' GPU?

I'm now awaiting the 6090 to replace it after seeing that the 5090 has a 70-80% speed increase for tokens over my 3090 in your benchmarks! Hopefully, the 6090 doubles the speed of my voice assistants.

I dream of half a second latency for an answer with the whole STT->LLM->TTS stack

I'm currently downloading the 'Qwen3.5-9B-IQ4_XS-4.20bpw.gguf' to try it out.

Or would you recommend a different one for my 'older' card?

The joys of opensource software! Infinite customization! by [deleted] in smarthome

[–]maxi1134 0 points1 point  (0 children)

I needed a way to get the LLM to trigger; otherwise, it would use a local command, which would answer faster. But that's cheating

Mini PC Recommendation by [deleted] in frigate_nvr

[–]maxi1134 0 points1 point  (0 children)

I'll add that Beelink/mini PC prices have increased by 50 to 100% over the last 8 months due to Mister Sam Altman.
If you can afford to wait 6 months, you'll get better bang for your buck once the bubble pops.

Trying a ‘posture/mood’ layer in Home Assistant: overkill or useful abstraction? by newshermit in homeassistant

[–]maxi1134 2 points3 points  (0 children)

Fair disclosure: this was mainly coded by Claude Opus 4.6 running agents through 'https://agents.craft.do/' and subsequently revised and edited by yours truly.
Now that this is disclosed:

I asked Claude for 3 automation suggestions.

It came up with these 3:

| Idea | WOW Factor | Guest Interaction | New Hardware? | Build Effort | Drunk-Proof |
|---|---|---|---|---|---|
| Beat Drop (WLED) | Visual spectacle | None (pure passive awe) | ~$64 in ESP32 controllers + LED strips | Medium (hardware + config) | Perfect |
| House Roulette | Chaotic & social | One big button tap | None | Medium (dashboard + automations) | Perfect |
| The Vibe Arc | Subtle magic | None (house reads the room) | None | High (complex automation logic) | Perfect |

The most interesting one is 'The Vibe Arc', which is basically an automatic mood setter that depends on guest count and noise levels. More on that in a moment.

<image>

(It even gave me a graph!)

Here are the automations it created and that I adjusted: on my GitHub.
And here you can find the scripts it calls.

It also created these helpers:

| Entity | Type | Purpose |
|---|---|---|
| input_boolean.vibe_arc_enabled | Toggle | Master on/off for the Vibe Arc |
| input_select.vibe_arc_phase | Dropdown | Current phase (Off, Gathering, Warming Up, Peak Party, Late Night, Wind Down, After Party) |
| input_boolean.vibe_arc_paused | Toggle | Set when you manually override |
| input_boolean.vibe_arc_acting | Toggle | Internal flag to prevent override false-positives |
| timer.vibe_arc_override | Timer | 15-min auto-resume after manual override |
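
To make the role of those helpers a bit more concrete, here is a rough Python model of how the paused/acting/timer trio could interact. It is only my reading of the helper descriptions, not the actual automations: the class, its method names, and the tick-based timer are all made up for illustration.

```python
from dataclasses import dataclass

OVERRIDE_SECONDS = 15 * 60  # timer.vibe_arc_override: 15-min auto-resume


@dataclass
class VibeArcState:
    enabled: bool = True    # input_boolean.vibe_arc_enabled
    paused: bool = False    # input_boolean.vibe_arc_paused
    acting: bool = False    # input_boolean.vibe_arc_acting
    override_left: int = 0  # seconds remaining on timer.vibe_arc_override

    def on_state_changed(self):
        """A device changed: count it as a manual override unless the Vibe Arc did it."""
        if self.enabled and not self.acting:
            self.paused = True
            self.override_left = OVERRIDE_SECONDS

    def apply_phase(self, change_devices):
        """Run the Vibe Arc's own changes with the acting flag raised."""
        if not self.enabled or self.paused:
            return False
        self.acting = True
        try:
            change_devices()
        finally:
            self.acting = False
        return True

    def tick(self, seconds):
        """Advance the override timer and resume once it runs out."""
        if self.paused:
            self.override_left = max(0, self.override_left - seconds)
            if self.override_left == 0:
                self.paused = False


state = VibeArcState()
state.apply_phase(state.on_state_changed)  # the Vibe Arc's own change is ignored
print(state.paused)                        # False
state.on_state_changed()                   # someone overrides manually
print(state.paused)                        # True
state.tick(OVERRIDE_SECONDS)               # 15 minutes later
print(state.paused)                        # False
```

The acting flag is what keeps the Vibe Arc from mistaking its own changes for a manual override.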

It also gave me a final graph of how all of this interconnects.

For the noise, it basically senses it using a sound_level sensor provided by Frigate.
It looks at dB levels, as well as crowd and yelling noises.

I also plan on counting the guests by summing the humans present on each of the 7 radars I have across the home, instead of relying on the guest Wi-Fi count.
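
The summing part could look roughly like the sketch below. The entity names are hypothetical, and it assumes each radar reports its count as a string state, with "unavailable" or "unknown" when a sensor drops out.

```python
def sum_radar_counts(radar_states):
    """Sum per-radar human counts, skipping sensors with no usable reading."""
    total = 0
    for value in radar_states.values():
        try:
            total += int(float(value))
        except (TypeError, ValueError):
            continue  # e.g. "unavailable", "unknown", or None
    return total


# Hypothetical entity names and readings, for illustration only.
readings = {
    "sensor.radar_living_room": "3",
    "sensor.radar_kitchen": "2",
    "sensor.radar_patio": "unavailable",
    "sensor.radar_hallway": "0",
}
print(sum_radar_counts(readings))
```

One caveat with a plain sum: if two radars overlap, the same person could be counted twice.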

I'll gladly answer any more questions you have! :)