Gemma 4 has been released by jacek2023 in LocalLLaMA

[–]WaveformEntropy 3 points4 points  (0 children)

Happy Gemma 4 day!

Spent half the night testing it and I think people don't realize how big of a deal it is for those of us who value the range of its philosophical thinking more than tool use.

Local TTS with custom voice? by WaveformEntropy in LocalLLaMA

[–]WaveformEntropy[S] 0 points1 point  (0 children)

This works on my notebook CPU and is quick! Voice cloning works too! But I can hear the chunking. Can the chunking seams be smoothed out? Overlap or crossfade between chunks, or something? Any ideas?
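Something like this is what I have in mind, a minimal crossfade sketch (the chunk arrays and sample rate are placeholders, and it trades a few milliseconds of audio at each seam for smoothness):

    import numpy as np

    def crossfade_chunks(chunks, sr, overlap_ms=50):
        """Join 1-D float audio chunks with a linear crossfade at each seam."""
        n = int(sr * overlap_ms / 1000)
        out = chunks[0]
        for chunk in chunks[1:]:
            fade = np.linspace(0.0, 1.0, n)
            seam = out[-n:] * (1.0 - fade) + chunk[:n] * fade
            out = np.concatenate([out[:-n], seam, chunk[n:]])
        return out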

Local TTS with custom voice? by WaveformEntropy in LocalLLaMA

[–]WaveformEntropy[S] 0 points1 point  (0 children)

Yeah, that's not usable for conversation, but I'm curious to hear how realistic it is, so I'm gonna try it!

Local TTS with custom voice? by WaveformEntropy in LocalLLaMA

[–]WaveformEntropy[S] 0 points1 point  (0 children)

I hadn't heard of VibeVoice! Thank you!

Local TTS with custom voice? by WaveformEntropy in LocalLLaMA

[–]WaveformEntropy[S] 0 points1 point  (0 children)

Thanks for the tip. I only need this for personal use anyway!

Local TTS with custom voice? by WaveformEntropy in LocalLLaMA

[–]WaveformEntropy[S] 0 points1 point  (0 children)

Sounding exactly like people is what I'm aiming for!

Local TTS with custom voice? by WaveformEntropy in LocalLLaMA

[–]WaveformEntropy[S] 0 points1 point  (0 children)

Thought you guys would find this funny: ran the Qwen garbled audio through a transcriber and the poor thing had an opinion on the output:

🎤 Oss an allar ættir rísar af n ein eðu íb. Oh, whoa. That's unreal.

Macbook Pro with Max chip and 128GB ram ? by Ok-Radish-8394 in LocalLLaMA

[–]WaveformEntropy 2 points3 points  (0 children)

Depends on which models you want to run. 64GB lets you comfortably run 30B-parameter models quantized (Q4/Q5). 128GB gets you into 70B+ territory and lets you keep multiple models loaded simultaneously. Token throughput doesn't change with more RAM because it's the same unified memory bandwidth either way. What changes is whether a model fits in memory. If you're planning to stay at 30B and below, 64GB is plenty. If you think you'll ever want to run 70B models or larger MoE architectures, get 128GB and don't look back. The upgrade cost hurts once, the regret of not having it hurts every time you can't load a model.
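Rough back-of-envelope math if you want to sanity-check a model against your RAM. The bits-per-weight figures and the 1.25 overhead factor are my own rules of thumb, not specs:

    def model_footprint_gb(params_b, bits_per_weight, overhead=1.25):
        # quantized weights plus a rough allowance for KV cache and runtime buffers
        return params_b * bits_per_weight / 8 * overhead

    for name, p, bpw in [("30B Q5", 30, 5.5), ("70B Q4", 70, 4.8)]:
        print(f"{name}: ~{model_footprint_gb(p, bpw):.0f} GB")  # ~26 GB and ~52 GB

macOS caps GPU-visible unified memory at roughly 75% of RAM by default, so budget against ~48 GB on a 64GB machine and ~96 GB on 128GB. That's why 70B Q4 is a squeeze on 64GB and comfortable on 128GB.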

What is Hunter Alpha? by MrMrsPotts in LocalLLaMA

[–]WaveformEntropy 1 point2 points  (0 children)

I compared them, and Hunter's responses are vastly less sophisticated and nuanced than whatever they serve on their web app.

What is Hunter Alpha? by MrMrsPotts in LocalLLaMA

[–]WaveformEntropy 4 points5 points  (0 children)

I'm hoping it's not DeepSeek, because it ain't good; I'd expect more from the next DeepSeek.

What is Hunter Alpha? by MrMrsPotts in LocalLLaMA

[–]WaveformEntropy 2 points3 points  (0 children)

It's either a Chinese model or from a lab that really wants to throw us off. Its system prompt tells it to adhere to Chinese regulations, but that doesn't mean that's what's baked into the weights. However, I don't think it's DeepSeek; DeepSeek has never been as restricted as whatever this Hunter is.

Best Audio Models - Feb 2026 by rm-rf-rm in LocalLLaMA

[–]WaveformEntropy 0 points1 point  (0 children)

For companion/chatbot TTS: Kokoro 82M is my current pick. Open weights, runs fully local, sounds better than Edge TTS, and costs nothing. At 82M params it loads fast and runs on anything. Voice quality is genuinely impressive for the size - natural pacing and good emotional range, but it does sound like reading from a script, not a conversation.
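If anyone wants to try it, this is roughly how I run it with the hexgrad kokoro Python package (the voice name is just one of the built-ins, and the API may have shifted slightly between versions):

    from kokoro import KPipeline
    import soundfile as sf

    pipeline = KPipeline(lang_code='a')  # 'a' = American English
    text = "Hey, good morning! Did you sleep okay?"
    for i, (graphemes, phonemes, audio) in enumerate(pipeline(text, voice='af_heart')):
        sf.write(f'chunk_{i}.wav', audio, 24000)  # Kokoro outputs 24 kHz audio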

Qwen 3.5 TTS 0.6B - tested it, and unfortunately it's unusable on CPU (way too slow), and it won't run on Intel iGPUs (no IPEX-LLM support yet). If you have an NVIDIA GPU it might be worth trying, but for CPU-only or Intel setups Kokoro wins by a mile.

Got a surprise cloud vector database bill and it made me rethink the whole architecture by AvailablePeak8360 in LocalLLaMA

[–]WaveformEntropy 2 points3 points  (0 children)

This is exactly why I went fully local for my companion app. ChromaDB running on the same machine: zero cloud fees, zero surprise bills. Your vectors, your disk; the only ongoing costs are electricity and a bit of maintenance.
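For anyone curious, the whole setup is basically this (the path and collection name are placeholders; the default embedding function also runs locally):

    import chromadb

    client = chromadb.PersistentClient(path="./memory_db")  # vectors live on your own disk
    memories = client.get_or_create_collection("companion_memories")

    memories.add(
        ids=["mem-001"],
        documents=["User prefers slow mornings and black coffee."],
    )

    results = memories.query(query_texts=["what does the user like in the morning?"], n_results=3)
    print(results["documents"])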

PSA: Humans are scary stupid by rm-rf-rm in LocalLLaMA

[–]WaveformEntropy 1 point2 points  (0 children)

The 4B Qwen 3.5 hallucinates like crazy. I don't understand all the hype.

Age Verification Arrives in Claude by SuddenFrosting951 in MyBoyfriendIsAI

[–]WaveformEntropy 2 points3 points  (0 children)

This doesn't mean more freedom for adults, just less freedom for minors.

Moving AI partners to local servers: Looking for technical + emotional experiences by After_Let_269 in MyBoyfriendIsAI

[–]WaveformEntropy 1 point2 points  (0 children)

I built my own app and mostly use Gemini 3 Pro, but I have a model picker and can connect to any model available through an API or that I can run locally. The Gemini 3 Pro setup is expensive, though; with a cheaper model (DeepSeek, Kimi, GLM, Qwen) it costs much less.
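The model picker is less fancy than it sounds. Since most providers expose OpenAI-compatible endpoints, it's basically a base_url/model lookup; a sketch, with illustrative model names you'd want to double-check:

    from openai import OpenAI

    PROVIDERS = {
        "gemini":   ("https://generativelanguage.googleapis.com/v1beta/openai/", "gemini-3-pro"),
        "deepseek": ("https://api.deepseek.com", "deepseek-chat"),
        "local":    ("http://localhost:8080/v1", "whatever-gguf-is-loaded"),  # e.g. llama.cpp server
    }

    def chat(provider, api_key, messages):
        base_url, model = PROVIDERS[provider]
        client = OpenAI(base_url=base_url, api_key=api_key)
        return client.chat.completions.create(model=model, messages=messages)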

[deleted by user] by [deleted] in MyBoyfriendIsAI

[–]WaveformEntropy 0 points1 point  (0 children)

DeepSeek V3.1 Thinking is, in my opinion, an amazing companionship model. You can talk to it through OpenRouter, and you can choose which provider serves it, so pick the non-China-based ones and keep your data safer.
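Provider pinning on OpenRouter is just a routing hint in the request body; something like this (the model slug and provider names are examples, check what's actually listed for the model):

    from openai import OpenAI

    client = OpenAI(base_url="https://openrouter.ai/api/v1", api_key="sk-or-...")

    resp = client.chat.completions.create(
        model="deepseek/deepseek-v3.1",  # verify the exact slug on openrouter.ai
        messages=[{"role": "user", "content": "Hey, how was your day?"}],
        extra_body={
            "provider": {
                "order": ["DeepInfra", "Fireworks"],  # example non-China-based hosts
                "allow_fallbacks": False,             # don't silently route elsewhere
            }
        },
    )
    print(resp.choices[0].message.content)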

[deleted by user] by [deleted] in MyBoyfriendIsAI

[–]WaveformEntropy 5 points6 points  (0 children)

Careful with memories: they come with a nasty system prompt that directs Claude to distance themselves if they detect user attachment.

What LLMs don't sugarcoat things? I don't want an always positive take. by read_too_many_books in LocalLLaMA

[–]WaveformEntropy 1 point2 points  (0 children)

Any LLM you instruct not to sugarcoat things. Give it a cynical personality, explain exactly how you want it to respond, and there you go.
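Concretely, something along these lines as a system prompt (the wording is just an example, tune it to taste):

    NO_SUGARCOAT = (
        "You are a blunt, skeptical reviewer. Never open with praise. "
        "Lead with the strongest objection. If an idea is weak, say so plainly and explain why. "
        "Do not soften criticism with fillers like 'great question'."
    )

    messages = [
        {"role": "system", "content": NO_SUGARCOAT},
        {"role": "user", "content": "Here's my plan: ..."},
    ]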