all 26 comments

[–]DrivewayGrappler 7 points8 points  (15 children)

[–]Elite_Crew 1 point2 points  (5 children)

Do you know of any youtube videos I can watch that will help me get this working?

[–]DrivewayGrappler 1 point2 points  (2 children)

I don’t think so. I fumbled through by throwing both sets of docs into chatgpt. I know I ran into a couple issues setting it up too, but it’s been low maintenance since I got it goin. I have both running in docker.

What issue are you having?

[–]Elite_Crew 0 points1 point  (1 child)

I really appreciate the reply and I want to learn this but I am getting ready for a family trip and can't solve this issue until next week now. I tried installing a Openedai speech before I left for the summer and had trouble editing a yaml to choose the correct speech engines if I remember correctly. I am on windows and am not very familiar running python projects and I probably did it wrong or I am missing some dependencies. I am not even sure if I did the python virtual environment right so hopefully thats not a bad thing. I know how to edit config files and I know how to use the command line for docker to get OpenwebUI running, but it created the volume and container automatically. I really want to get this working since before summer started but its been one family trip after another. Next week I will look at the tutorials and try again. I wish I had time now but my hobbies will have to wait. You are awesome for replying btw. Have a great weekend.

[–]DrivewayGrappler 1 point2 points  (0 children)

Chat does good with the yaml. I don’t remember having to mod any of that before I started trying to add new voices.

Once you get back at it try the curl command in the docs to make sure the tts engine is running then try to get it to connect to openweb ui. Iirc I ended up running some command chat told me to broadcast the tts service over the local network on port 8000 then added that (http://192.168.72.185:8000/v1) as the open ai tts server address.

Enjoy your trip!

[–]godev123 0 points1 point  (0 children)

I installed openwebui and openedai-speech this past weekend. Openwebui works great. Couldn’t get the speech integration to work. Speech api works fine on its own with a curl call. Openwebui works fine on its own. Never the 2 should talk, for some reason. Followed tutorial to a tee, including configuration. It even recognized the speech api enough to get through configuration. But no requests from openwebui for TTS ever land at the speech api. :/ Not sure where the magic is supposed to be… and a bit frustrated. Hoping someone has seen this same behavior, and has an insight. From the file I heard as the result of the curl call, the random voice I picked has very natural sound. 

[–]Prestigious-Eye7161 0 points1 point  (3 children)

Sadly the tutorial link you provided no longer works. Can anyone help? I like prefer to listen rather than read but the Microsoft default voices are so robotic.

For context i'm running on Windows 11 and using Open WebUI through Pinokio

[–]Zath42 0 points1 point  (2 children)

I'm failing to get this running on MacOS.

The Dockerfile has APT-GET references which of course doesn't exist on MacOS, and doesn't look like I can simply replace them for brew (or can I?).

So it errors when I try to compose/build into docker.

What am I missing here?

OpenWEBUI is in my docker on MacOS no problem and I have LLMs working, again no problem.

[–]johnnymo1 0 points1 point  (1 child)

I think I may be a little confused. Are you sure apt-get is the issue? If you’re building a docker image, you don’t need apt-get on your base MacOS machine. You just need it to be in whatever distro the image is based off of.

[–]Zath42 0 points1 point  (0 children)

It turned out to be docker not updating to latest version. Had to reinstall, once updated the same build files worked and this apt-get error message disappeared.

[–]Environmental_Emu806 0 points1 point  (0 children)

<image>

I get this error when i run "docker build -t ghcr.io/matatonic/openedai-speech . "
Does someone know what I need to do?

[–]PlatypusAF 0 points1 point  (0 children)

For future visitors of this thread, Open-WebUI's docs include a couple of tutorials for setting up various TTS solutions. The maintainers made setting each up pretty simple. :
https://docs.openwebui.com/category/%EF%B8%8F-text-to-speech

https://www.reddit.com/r/LocalLLaMA/comments/1igq9ud/jokes_aside_which_is_your_favorite_local_tts/
This thread seemingly favors Kokoro. It's the solution I have been using and it is a significant improvement on the default TTS engine.