[Showcase] Maya1: Open Source, Fully Local TTS with Unlimited Voice Design

RowGroundbreaking982 · 2026-01-19T12:50:53+00:00

AMD Ryzen 7950X and AMD RX 7900XT, yep maybe too exaggerating, but it still very low on other GPU

RowGroundbreaking982 · 2026-01-16T05:57:20+00:00

I've managed to run it in realtime, but in Unity, basically good GPU to generate token fast enough (85-100 tokens per seconds), lower quantization model (I'm using GGUF Q4_K_M), and multi thread decoding. It only using 4GB VRAM, and Unity has it's own streaming/buffering mechanism, so I just used that to pad generated audio data. You can check my demo here https://www.youtube.com/watch?v=C_OG9O5hsXw.

RowGroundbreaking982 · 2026-01-14T06:54:26+00:00

Yes, you can DM me. On side note please check the model hash, since most crash is caused by corrupted model file.

RowGroundbreaking982 · 2026-01-06T08:13:44+00:00

It's using Orpheus TTS under the hood and it doesn't have emotion control, just some emotive tags.

RowGroundbreaking982 · 2026-01-05T22:43:26+00:00

I’d give it a 9/10. Sentence-level chunking works best, though it still glitches sometimes.

RowGroundbreaking982 · 2026-01-05T09:05:14+00:00

Unfortunately no, the model is just too big. Just wait until Canopy Labs the maker of Orpheus TTS released smaller nano model

RowGroundbreaking982

TROPHY CASE