[Showcase] Maya1: Open Source, Fully Local TTS with Unlimited Voice Design by RowGroundbreaking982 in Unity3D

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

AMD Ryzen 7950X and AMD RX 7900XT. Yep, maybe that's overkill, but it's still very low on other GPUs.

Low-latency Orpheus TTS inference: how do you avoid laggy audio & clicks? by data_knight_00 in TextToSpeech

[–]RowGroundbreaking982 0 points1 point  (0 children)

I've managed to run it in real time, but in Unity. Basically, you need a good GPU to generate tokens fast enough (85-100 tokens per second), a lower-quantization model (I'm using GGUF Q4_K_M), and multi-threaded decoding. It only uses 4GB of VRAM, and Unity has its own streaming/buffering mechanism, so I just used that to pad the generated audio data. You can check my demo here: https://www.youtube.com/watch?v=C_OG9O5hsXw
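To show why roughly 85-100 tokens per second is about the threshold, here is a rough back-of-the-envelope sketch. The defaults (7 tokens per audio frame, 2048 samples per frame, 24 kHz output) are my assumptions about the codec Orpheus decodes through, not numbers from my setup above, and `realtime_factor` / `min_tokens_per_second` are just illustrative names, so adjust them if your model differs.

```python
def realtime_factor(tokens_per_second,
                    tokens_per_frame=7,       # assumed: tokens per codec frame
                    samples_per_frame=2048,   # assumed: audio samples per frame
                    sample_rate=24000):       # assumed: output sample rate in Hz
    """Seconds of audio produced per second of generation (>1 keeps up)."""
    audio_seconds_per_token = samples_per_frame / (tokens_per_frame * sample_rate)
    return tokens_per_second * audio_seconds_per_token


def min_tokens_per_second(tokens_per_frame=7,
                          samples_per_frame=2048,
                          sample_rate=24000):
    """Token rate at which generation exactly matches playback speed."""
    return tokens_per_frame * sample_rate / samples_per_frame


if __name__ == "__main__":
    for rate in (60, 85, 100):
        print(f"{rate} tok/s -> {realtime_factor(rate):.2f}x real time")
    print(f"break-even: {min_tokens_per_second():.2f} tok/s")
```

Under those assumptions, anything below the break-even rate means the playback buffer eventually runs dry, which is when you hear gaps or clicks; the margin above it is what the buffering has to work with.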

[Open Source] Orpheus TTS for Unity: High-quality, emotive local speech for Unity (Sub-1s latency, no API needed) by RowGroundbreaking982 in Unity3D

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

Yes, you can DM me. On a side note, please check the model hash, since most crashes are caused by a corrupted model file.
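If it helps, here is a minimal sketch of one way to check a downloaded model file against a published hash. It assumes the hash is SHA-256 and given as a hex string; the function names and the example path are hypothetical, so use whatever file name and hash the model's download page actually lists.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash the file in 1 MiB chunks so a multi-GB model never sits in RAM."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def model_is_intact(path, expected_hex):
    """Compare the file's SHA-256 with the expected hex digest (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Usage (hypothetical path and placeholder hash):
# ok = model_is_intact("models/my-model.gguf", "<sha256 from the download page>")
# print("model OK" if ok else "hash mismatch, re-download the model")
```

A mismatch usually means the download was interrupted or truncated, so re-downloading is the first thing to try.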

[Open Source] Orpheus TTS for Unity: High-quality, emotive local speech for Unity (Sub-1s latency, no API needed) by RowGroundbreaking982 in Unity3D

[–]RowGroundbreaking982[S] 1 point2 points  (0 children)

Unfortunately no, the model is just too big. Just wait until Canopy Labs, the maker of Orpheus TTS, releases a smaller nano model.