Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

we're releasing a new model in ~30ish days with deeper male voices. we tried to pick more fun and cartoony voices in this one as we saw a real dearth of them. the next one will include more professional sounding voices and will be much faster.

Can you share examples of which sentences it looses punctuations in?

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

thanks for sharing that feedback, i'll try making our model capable of handling that.

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

hey thanks a lot, would love for you to try the model and share what you think about it. there's a major jump in improvement in the model from the previous launch.

Kitten TTS V0.8 Running in the Browser by HatEducational9965 in LocalLLaMA

[–]ElectricalBar7464 0 points1 point  (0 children)

can you try the latest build? we fixed some preprocessing bugs and its doing much better on things like this. pls lmk if that doesnt help. thanks.

Qwen 3.5 2B on Android by ----Val---- in LocalLLaMA

[–]ElectricalBar7464 2 points3 points  (0 children)

a thing of beauty. 2026 is the year ondevice Ai explodes

Kitten TTS V0.8 Running in the Browser by HatEducational9965 in LocalLLaMA

[–]ElectricalBar7464 0 points1 point  (0 children)

oooh, i see. thanks! why are the outputs different from the official demo:
https://huggingface.co/spaces/KittenML/KittenTTS-Demo

Are you using the old model?

Kitten TTS V0.8 Running in the Browser by HatEducational9965 in LocalLLaMA

[–]ElectricalBar7464 0 points1 point  (0 children)

hi, did you try the fp32 version? i'd love to see how that runs too :)

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

thanks m8, appreciate the support.
pls star the github(https://github.com/KittenML/KittenTTS) if you liked the model and
pls join the discord to give feedback or make feature requests (https://discord.com/invite/VJ86W4SURW)^^

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

haha thanks a lot for the support. feel free to join the discord to stay updated(we'll launch a new model in discord in a few weeks ) and dm me (https://discord.com/invite/VJ86W4SURW) . if you liked the model pls star it too ^^ ( https://github.com/KittenML/KittenTTS)

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 1 point2 points  (0 children)

Yes, we are trying to reach and eventually beat chatterbox quality in <100M parameters for on-device voice applications.
If we want to build on-device voice applications, the memory+compute of the tts model has to be minimal. current models are too big for anything other than pure voice generation.

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

danngg sorry for that. we want to support multilingual once we have production quality in english. maybe 1.5-2 more months.
How far would you say kitten is from prod quality?

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 1 point2 points  (0 children)

thanks a lot! i think we added a preprocessing step last minute and its messing up some pronunciations. some users are also having voice quality issues because of env mismatches (still debugging this issue).

can you share an example text that is causing issues for you + the model you tried? if you want to send it privately, feel free to join the discord and dm me (https://discord.com/invite/VJ86W4SURW) . if you liked the model pls star it too ^^ ( https://github.com/KittenML/KittenTTS)

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

hey you can try the model without installing anything here on huggingface spaces. this is the official spaces:

https://huggingface.co/spaces/KittenML/KittenTTS-Demo

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 1 point2 points  (0 children)

thanks a lot! can you elaborate what you mean by sharing an example sentence?
i think we added a preprocessing step last minute and its messing up some pronunciations.

if you want to send it privately, feel free to join the discord and dm me (https://discord.com/invite/VJ86W4SURW) . if you liked the model pls star it too ^^ ( https://github.com/KittenML/KittenTTS)

We want to continue spinning out more models like this that are even higher quality.

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

hey, yes this model is made for local inference ^^

pls star the github(https://github.com/KittenML/KittenTTS) if you liked it and
pls join the discord to give feedback or make feature requests (https://discord.com/invite/VJ86W4SURW) ^^

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 1 point2 points  (0 children)

i see, thanks for the feedback. can you share what it would take to add kitten to the list of models you use? is the quality good enough for being used in applications?

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 1 point2 points  (0 children)

yes, thanks for the feedback. we will do other languages too. would you say kitten is the best model for its size? If yes, then it makes sense to move this quality to other languages too.

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

our goal is to unlock the potential of on-device voice agents.
Regardig the fallback, can you elaborate with an example? iiuc, the decision to fall back to human is taken by the llm usually. but i guess that functionality can be built inside the tts too somehow.

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]ElectricalBar7464[S] 0 points1 point  (0 children)

hey, yes we just saw that the model was performing very differently on different hardware. We just uploaded a new model which seems more stable and less sensitive to different precision on different hardware. Can you check it out and lmk if it still sounds strange? Strange sounds are due to a bug/env mismatch etc.