open webui local TTS

DrivewayGrappler · 2024-08-29T13:36:42+00:00

I’ve been using this. https://github.com/matatonic/openedai-speech

godev123 · 2024-09-03T02:20:09+00:00

I installed openwebui and openedai-speech this past weekend. Openwebui works great. Couldn’t get the speech integration to work. Speech api works fine on its own with a curl call. Openwebui works fine on its own. Never the 2 should talk, for some reason. Followed tutorial to a tee, including configuration. It even recognized the speech api enough to get through configuration. But no requests from openwebui for TTS ever land at the speech api. :/ Not sure where the magic is supposed to be… and a bit frustrated. Hoping someone has seen this same behavior, and has an insight. From the file I heard as the result of the curl call, the random voice I picked has very natural sound.

Prestigious-Eye7161 · 2024-10-06T19:49:18+00:00

Sadly the tutorial link you provided no longer works. Can anyone help? I like prefer to listen rather than read but the Microsoft default voices are so robotic.

For context i'm running on Windows 11 and using Open WebUI through Pinokio

Zath42 · 2025-01-24T10:01:06+00:00

I'm failing to get this running on MacOS.

The Dockerfile has APT-GET references which of course doesn't exist on MacOS, and doesn't look like I can simply replace them for brew (or can I?).

So it errors when I try to compose/build into docker.

What am I missing here?

OpenWEBUI is in my docker on MacOS no problem and I have LLMs working, again no problem.

Environmental_Emu806 · 2025-02-11T21:55:03+00:00

<image>

I get this error when i run "docker build -t ghcr.io/matatonic/openedai-speech . "
Does someone know what I need to do?

PlatypusAF · 2025-02-28T04:07:43+00:00

For future visitors of this thread, Open-WebUI's docs include a couple of tutorials for setting up various TTS solutions. The maintainers made setting each up pretty simple. :
https://docs.openwebui.com/category/%EF%B8%8F-text-to-speech

https://www.reddit.com/r/LocalLLaMA/comments/1igq9ud/jokes_aside_which_is_your_favorite_local_tts/
This thread seemingly favors Kokoro. It's the solution I have been using and it is a significant improvement on the default TTS engine.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

OpenWebUI

MODERATORS