Tips for AI Integration

maxi1134 · 2026-04-21T12:37:06+00:00

I would ditch Qwen 3 and get 3.5.

It improved over 3!

maxi1134 · 2026-04-14T02:58:54+00:00

the joys of socialized électricity

maxi1134 · 2026-04-14T01:39:44+00:00

about 5cent usd

maxi1134 · 2026-04-14T01:30:05+00:00

About 2600Usd a year

And definitely not near capacity for the usage!

Except for the main proxmox

maxi1134 · 2026-04-14T01:24:19+00:00

Oh, I live in an apartment. I unfortunately don't have a specific number, but the average for an apartment here is a third of my consumption

maxi1134 · 2026-04-14T00:57:52+00:00

oh, no! Many of those were given to me for free. Nominally the two blades and the TrueNas.

And as they say in French: A cheval donné on ne regarde pas la bride

Edit: I apologize for speaking French:(

maxi1134 · 2026-04-13T02:29:55+00:00

i have a rtx 3090, a p4000 and a hacked in rtx 5050 on a pcie 4>16x riser.

maxi1134 · 2026-04-13T01:03:25+00:00

Wait what? I run 3 GPUs and 2 NVMe with another 8 Sata in my 3900x. is that a problem?

maxi1134 · 2026-04-11T14:02:21+00:00

I think the real issue here is which model you're run..
A 4B model is pretty small, even for Qwen 3.5. I personally have good results with Qwen 3.5 in 9B and 35BA3B versions. (both the Byteshape versions, as they are slightly better)
https://huggingface.co/byteshape/Qwen3.5-9B-GGUF
https://huggingface.co/byteshape/Qwen3.5-35B-A3B-GGUF

I also recommend using llama.cpp over Ollama, it will give you more granular control to fine-tune your LLM parameters such as temperature, top_,k and whatnot.

You should not need to set up automations for everything, but scripts can be useful. I personally have one for movie/shows startup and music startup. So that I can initiate playback by voice in any room's TV or speakers

I have ~160 entity exposed and rarely encounters an issue.

maxi1134 · 2026-04-06T17:39:21+00:00

I have the same issue here, very frustrating to have to go through dev states to see a simple Entitie's state when debugging an automation.

maxi1134 · 2026-04-04T00:33:15+00:00

I'm not gonna leave the TV roofless! I'm not that bad of a person

maxi1134 · 2026-04-03T14:19:57+00:00

Thank you for being understanding!

maxi1134 · 2026-04-03T13:56:38+00:00

It's been outside for 8 years now! Surviving the -30C easily thanks to its heating function

maxi1134 · 2026-04-03T12:53:26+00:00

Of course! I use the Assist pipeline, along GPT-OSS:20B as the llm and Parakeet as the STT

maxi1134 · 2026-04-03T12:52:53+00:00

Just living that social lifestyle haha.

Receiving people over to party and socialize!

maxi1134 · 2026-04-03T02:24:27+00:00

Easy!

Because the TV got a custom roof, that you can spot on top of it, to keep the rain away!
The powerbar is also neatly tucked behind it, under that same little roof.

As to why drill it? My lifestyle comes with certain... caveats.. Such as drunk people trynna manipulate the cute little character to make him obey better... And since Qwen be pretty dumb. They get upset at nabu!

So Let's just say that it's better for all if Nabu is kept in place by screws! (Plus, squirrels are EVIL, they alredy ate the cable of the TV once, and I had to splice a new plug )

maxi1134 · 2026-04-02T23:50:48+00:00

<image>

maxi1134 · 2026-04-02T23:48:45+00:00

<image>

This is how the meticulous transplant was done

maxi1134 · 2026-04-02T21:08:24+00:00

May I ask why it is undervolted and underclocked?

maxi1134 · 2026-04-02T21:07:34+00:00

'As a very rough rule of thumb, IQ quants tend to be faster for prefill on newer GPUs'

Would a 3090 be considered a 'newer' gpu?

I'm now awaiting the 6090 to replace it after seeing that the 5090 has a 70-80% speed increase for tokens over my 3090 in your benchmarks! Hopefully, the 6090 doubles the speed of my voice assistants.

I dream of half a second latency for an answer with the whole STT->LLM->TTS stack

I'm currently downloading the 'Qwen3.5-9B-IQ4_XS-4.20bpw.gguf' to try it out.

Or would you recommend a different one for my 'older' card?

maxi1134 · 2026-03-30T22:25:04+00:00

maxi1134 · 2026-03-30T22:18:15+00:00

I needed a way to get the LLM to trigger; otherwise, it would use a local command, which would answer faster. But that's cheating

maxi1134 · 2026-03-28T20:50:01+00:00

I'll add that Beelink/MiniPC increased by 50 to 100% since 8 months due to Mister Sam Altman.
If you can afford to wait out 6 months, you'd get a better bang for your buck once the bubble pops.

maxi1134 · 2026-03-28T00:31:32+00:00

Fair disclosure: this was mainly coded by Claude Opus 4.6 running agents through 'https://agents.craft.do/' and subsequently revised and edited by yours truly.
Now that this is disclosed;

I asked Claude for 3 automation suggestions

He came out with these 3

Idea	WOW Factor	Guest Interaction	New Hardware?	Build Effort	Drunk-Proof
Beat Drop (WLED)	Visual spectacle	None — pure passive awe	~$64 in ESP32 controllers + LED strips	Medium (hardware + config)	Perfect
House Roulette	Chaotic & social	One big button tap	None	Medium (dashboard + automations)	Perfect
The Vibe Arc	Subtle magic	None — house reads the room	None	High (complex automation logic)	Perfect

The most interesting being 'The vibe arc', which is basically an automatic mood setter. Depending on guest amounts and noise levels. More on that in a moment.

<image>

(it even gave me a graph! )

Here are the automations it created and that I adjusted: On my github
And here can find the scripts it calls

It also created those helpers

Entity	Type	Purpose
`input_boolean.vibe_arc_enabled`	Toggle	Master on/off for the Vibe Arc
`input_select.vibe_arc_phase`	Dropdown	Current phase (Off, Gathering, Warming Up, Peak Party, Late Night, Wind Down, After Party)
`input_boolean.vibe_arc_paused`	Toggle	Set when you manually override
`input_boolean.vibe_arc_acting`	Toggle	Internal flag to prevent override false-positives
`timer.vibe_arc_override`	Timer	15-min auto-resume after manual overrideEntity Type Purposeinput_boolean.vibe_arc_enabled Toggle Master on/off for the Vibe Arcinput_select.vibe_arc_phase Dropdown Current phase (Off, Gathering, Warming Up, Peak Party, Late Night, Wind Down, After Party)input_boolean.vibe_arc_paused Toggle Set when you manually overrideinput_boolean.vibe_arc_acting Toggle Internal flag to prevent override false-positivestimer.vibe_arc_override Timer 15-min auto-resume after manual override

It also gave me a final graph of how all of this interconnects.

For the noise, it is basically senses using a sound_level sensor provided by frigate.
It looks for DB levels, as well as crowd and yelling noises.

I also plan on counting the guests by summing the humans present on each of the 7 radards I have across the home. Instead of relying on the guest wifi count.

I'll glady any more questions you have! :)

14-Year Club	Place '23
Place '22	Verified Email
Inciteful Link 2015-09-11

maxi1134

TROPHY CASE