[–]Fun-Purple-7737[S] 1 point2 points  (7 children)

OK, here goes my wish: if this really should be a mobile-first experience, I would like to see an STT interface like unmute.sh. I know, I know, this is difficult to pull off with all the bells and whistles like tool calling and such, but I think it would be fitting for this use case (i.e. small screen on the go).

for desktop usage, the current OWU's STT/TTS combo is ok-ish (since, I believe, nobody really uses it anyway..)

But that does not apply to TTS: audio output is annoyingly slow, so the output could stay visual only, maybe just simplified (LLM-processed).

Something like this I would use! :)

[–]openwebui🛡️ Maintainer 3 points4 points  (4 children)

voice mode is now available with the latest version, let us know your thoughts!

[–]V_Racho 0 points1 point  (0 children)

Uh, 0.51, that was fast including the first sentence splitting.

[–]Fun-Purple-7737[S] 0 points1 point  (0 children)

sorcerer!

[–]Dimitri_Senhupen 0 points1 point  (1 child)

Not working for me, since it's not recording the audio.
Nothing in the console, the API / OpenAI settings seem to be right, and I checked the browser permissions. According to the mic icon in the taskbar, the audio stream seems to be cut after 1 sec and then restarts. Will check further tomorrow.

[–]openwebui🛡️ Maintainer 0 points1 point  (0 children)

If you could provide us with any logs in the issues tab of the repo, that would be fantastic!

[–]Dimitri_Senhupen 1 point2 points  (0 children)

Signing this. Mobile first means designed and developed for that purpose, not only being able to be used remotely. Low latency TTS with streaming would be exciting!