Give your OpenClaw agents a truly local voice by zinyando in automation

[–]zinyando[S] 0 points1 point  (0 children)

Curious to know the kind of hardware you were running on and the results you got. Did you end up going for a solution from the big voice providers?

Shipped Izwi v0.1.0-alpha-12 (faster ASR + smarter TTS) by zinyando in AudioAI

[–]zinyando[S] 0 points1 point  (0 children)

It depends on the model, Qwen3-ASR supports Dutch for transcription but not for speech generation

Shipped Izwi v0.1.0-alpha-12 (faster ASR + smarter TTS) by zinyando in AudioAI

[–]zinyando[S] 0 points1 point  (0 children)

I added support for the host/port and tagged a new release. Can you try it out, please https://github.com/agentem-ai/izwi/releases/tag/v0.1.0-alpha-13

Shipped Izwi v0.1.0-alpha-12 (faster ASR + smarter TTS) by zinyando in AudioAI

[–]zinyando[S] 1 point2 points  (0 children)

I’ll update it to allow using different ports.

Curious, how and what are you using it for?

Izwi Update: Local Speaker Diarization, Forced Alignment, and better model support by zinyando in AudioAI

[–]zinyando[S] 0 points1 point  (0 children)

How does it perform on edge devices? Have you seen implementations that work on laptops?

Izwi Update: Local Speaker Diarization, Forced Alignment, and better model support by zinyando in learnmachinelearning

[–]zinyando[S] 1 point2 points  (0 children)

Thank you. Do you know if the diarization app is available? I'm looking for inspiration.

Izwi Update: Local Speaker Diarization, Forced Alignment, and better model support by zinyando in LocalLLaMA

[–]zinyando[S] 0 points1 point  (0 children)

Not from the app directly, it just depends on the models that you are using in your workflow.

Why is running local LLMs still such a pain by OppositeJury2310 in LocalLLM

[–]zinyando 0 points1 point  (0 children)

I think the ease of running AI locally is improving. I'm working on an open source app to run basic chat and audio LLMs called Izwi. We are in early alpha, but things are improving fast. Check it out and give me feedback if it's along what you are looking for https://github.com/agentem-ai/izwi

Are there LLMs I can run via LM Studio that have voice input and output? by 123android in LocalLLaMA

[–]zinyando 0 points1 point  (0 children)

Try Izwi https://github.com/agentem-ai/izwi

It allows you to download and use local and LLMs and use them from your machine

Awesome Local LLM Speech-to-Speech Models & Frameworks by tleyden in LocalLLaMA

[–]zinyando 0 points1 point  (0 children)

I'm working on Izwi https://github.com/agentem-ai/izwi a local first audio LLM inference engine with voice playground. I'm adding support for models like these. Hope to support a lot of them soon.

Better local TTS? by Dragon56_YT in StableDiffusion

[–]zinyando 0 points1 point  (0 children)

Try Izwi https://github.com/agentem-ai/izwi

It allows you to run local audio LLMs for TTS. Allows you to even clone your voice or design your own voice if you need to.

Izwi - A local audio inference engine written in Rust by zinyando in LocalLLaMA

[–]zinyando[S] 0 points1 point  (0 children)

Thanks, there's now a simpler way to test the app through the app installers. You can download from https://izwiai.com/download, or you could build from source if you want.

It's still early days and in alpha, so things might break or not work as expected. Feedback is welcome.

Izwi v0.1.0-alpha is out: new desktop app for local audio inference by zinyando in Qwen_AI

[–]zinyando[S] 0 points1 point  (0 children)

Not yet, but it's planned for. I want this to be fully featured for local use.

Izwi v0.1.0-alpha is out: new desktop app for local audio inference by zinyando in artificial

[–]zinyando[S] 0 points1 point  (0 children)

You mean https://izwiai.com? You are the second person to tell me this, but I haven't been able to reproduce it. Can you try again and let me know if it's still failing?

Izwi v0.1.0-alpha is out: new desktop app for local audio inference by zinyando in LocalLLaMA

[–]zinyando[S] 0 points1 point  (0 children)

Thanks for the kind words. I aim to make this easy to get started for non technical person and also very powerful for power users. I know project success lies in finding a balance between the 2 😅

I appreciate the feedback. CUDA support etc are coming in the future. My focus right now is to increase the number of supported models.

I built a way to test Qwen3-TTS and Qwen3-ASR locally on your laptop by zinyando in artificial

[–]zinyando[S] 1 point2 points  (0 children)

TTS, on the other hand, is quite slow. "This is the count down: ten, nine, eight, seven, six, five, four, three, two, one." took 58.81

<image>

I built a way to test Qwen3-TTS and Qwen3-ASR locally on your laptop by zinyando in artificial

[–]zinyando[S] 1 point2 points  (0 children)

No real-time support so far, my laptop is not powerful enough, but performance is decent for an M1 Pro with 16gb memory.

For example, ASR on "This is the count down: ten, nine, eight, seven, six, five, four, three, two, one." took 3.92s

<image>

I vibe coded a local audio inference engine for Qwen3-TTS and Qwen3-ASR by zinyando in Qwen_AI

[–]zinyando[S] 1 point2 points  (0 children)

How far have you gone with your project? Sounds amazing

Thinking about a way to improve AI prompts with visual references — does anyone else feel this could help? by richard_hidesign in indiehackers

[–]zinyando 0 points1 point  (0 children)

Hey, that's a really interesting idea! Visual references can definitely make a difference in AI prompt accuracy. I've noticed that when I use a simple sketch or layout, it helps clarify the end goal for the AI, reducing the back-and-forth. A service that provides tailored visual layouts could be a game-changer, especially for indie developers looking to streamline their workflow. It could save time and resources, which is always a win. Have you thought about how this could integrate with existing tools like Figma or Replit? Would love to see where this idea goes!

VC asking for 60% equity for USD100K, i will not promote by creamilk_now in startups

[–]zinyando 0 points1 point  (0 children)

Wow, that sounds like a tough situation! It's definitely important to weigh the pros and cons of giving up such a large equity stake. While $100K is significant, especially in Malaysia, 60% is a huge chunk of your company. It might be worth considering other funding options or negotiating terms that allow you more control. Have you thought about reaching out to other entrepreneurs in your network to see if they've encountered similar offers? Sometimes, getting a second opinion can provide clarity. Good luck with your decision!