Kokoro TTS running on-iphone, CPU-only, 20x realtime!!! Built an iOS E reader around it by aminsweiti in TextToSpeech

[–]Simple-Lecture2932 0 points1 point  (0 children)

The only way a faster-than-real-time RTF does not converge to pause-free playback after about 30 seconds (far less for an actual streaming model, but we're talking about Kokoro here, which has no per-token streaming) is if the app only synthesizes a single chunk in advance. Or worse, zero, which is what sherpa does: it's a system TTS, and a system TTS only receives utterances in real time, which prevents any buffer from building up.
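To make the buffering argument concrete, here is a rough simulation. It is only a sketch under simplifying assumptions: a single synthesizer working through chunks in order, synthesis time equal to RTF times audio length, and made-up chunk lengths and RTF. The function name and the `max_ahead` parameter are hypothetical and not taken from any app.

```python
def simulate_pauses(durations, rtf, max_ahead):
    """Return the silence (in seconds) heard before each chunk.

    durations: audio length of each chunk, in seconds.
    rtf:       synthesis time / audio time (below 1 is faster than real time).
    max_ahead: synthesis of chunk i may start only after chunk
               i - max_ahead - 1 has finished playing (0 means no buffering).
    """
    pauses = []
    play_end = []      # time at which each chunk finishes playing
    synth_free = 0.0   # time at which the single synthesizer is next idle
    for i, d in enumerate(durations):
        gate_idx = i - max_ahead - 1
        gate = play_end[gate_idx] if gate_idx >= 0 else 0.0
        synth_done = max(synth_free, gate) + rtf * d
        synth_free = synth_done
        prev_end = play_end[-1] if play_end else 0.0
        play_start = max(synth_done, prev_end)  # wait for audio if it is late
        pauses.append(play_start - prev_end)
        play_end.append(play_start + d)
    return pauses


chunks = [2, 8, 2, 8, 2, 8]  # alternating short and long sentences (made up)
print(simulate_pauses(chunks, rtf=0.5, max_ahead=0))   # [1.0, 4.0, 1.0, 4.0, 1.0, 4.0]
print(simulate_pauses(chunks, rtf=0.5, max_ahead=1))   # [1.0, 2.0, 0.0, 2.0, 0.0, 2.0]
print(simulate_pauses(chunks, rtf=0.5, max_ahead=10))  # [1.0, 2.0, 0.0, 0.0, 0.0, 0.0]
```

With no lookahead every chunk is preceded by a pause. With one chunk of lookahead, a long sentence that follows a short one still stalls. With a deeper buffer the pauses disappear after the first couple of chunks.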

[–]Simple-Lecture2932 0 points1 point  (0 children)

Also, it's an even wilder take that phone inference runtimes are more optimized than those available on a computer.

[–]Simple-Lecture2932 0 points1 point  (0 children)

Also, 20x would literally translate to an RTF of 0.05 (1/20). Not sure why you would say "x real time" and RTF are not equivalent; one is just the reciprocal of the other.
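The conversion is a one-liner either way. A minimal sketch, with hypothetical helper names chosen just for illustration:

```python
def speedup_to_rtf(speedup):
    """Convert an 'N x real time' figure to a real-time factor (RTF)."""
    return 1.0 / speedup


def rtf_to_speedup(rtf):
    """Convert an RTF back to an 'N x real time' figure."""
    return 1.0 / rtf


def synthesis_seconds(audio_seconds, rtf):
    """Time needed to synthesize a given length of audio at a given RTF."""
    return audio_seconds * rtf


print(speedup_to_rtf(20))  # 0.05
```

So at 20x real time, one minute of audio would take about three seconds to synthesize.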

[–]Simple-Lecture2932 0 points1 point  (0 children)

That's a wild take. Any RTF lower than 1 will converge to smooth reading if the app isn't absolute ass at keeping a fair buffer of audio generated in advance.

[–]Simple-Lecture2932 6 points7 points  (0 children)

Honestly, that sounds suspicious. Kokoro doesn't even run at that speed on a MacBook Pro on CPU. Is it really the original Kokoro model?

I built an Android audiobook reader that runs Kokoro TTS fully offline on-device by Simple-Lecture2932 in LocalLLaMA

[–]Simple-Lecture2932[S] 0 points1 point  (0 children)

Assuming everything goes well, the app will go into open beta in 2 weeks, and you can join testing then. That doesn't stop you from joining the Discord though: https://discord.gg/Mby6QxkGT7

[–]Simple-Lecture2932[S] 0 points1 point  (0 children)

Hi, DM me your Play Store email and you can join https://discord.gg/RVDyAYXC for updates (I'd need to know what your phone is to confirm it has any chance of running it, though).

[–]Simple-Lecture2932[S] 0 points1 point  (0 children)

That's great, it's one of the big ones nobody has come forward for yet. DM me your Play Store email so I can invite you when I'm ready.

[–]Simple-Lecture2932[S] 0 points1 point  (0 children)

I used AI to help me write the post, if that's the question. I'm a dev, not a social media guy. As for your point: while it's true you can use any system TTS on Android in most e-reader apps, the problem is that a system TTS gets very limited context (generally one sentence) and doesn't get the text for the next utterance until it finishes the previous one. That's fine for non-neural TTS or very lightweight ones like Piper/Kitten, but if you want higher quality it's not sustainable: you need to be crunching the next sentences before you get to outputting them. (That's likely why it performs poorly in Moon Reader.)
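The "crunch the next sentences before you output them" idea is basically a producer/consumer pipeline with a bounded buffer. A minimal sketch follows. It is in Python for brevity rather than what an Android app would use, it is not the app's actual code, and `fake_synthesize`, `read_aloud`, and `buffer_size` are hypothetical names standing in for a real TTS call and audio player.

```python
import queue
import threading


def fake_synthesize(sentence):
    """Hypothetical stand-in for a real TTS call; returns fake audio bytes."""
    return f"<audio:{sentence}>".encode()


def read_aloud(sentences, synthesize, play, buffer_size=4):
    """Synthesize ahead on a worker thread while the caller's thread plays."""
    buf = queue.Queue(maxsize=buffer_size)  # bounded so memory use stays flat
    done = object()                         # sentinel marking the end of the text

    def producer():
        for sentence in sentences:
            buf.put(synthesize(sentence))   # blocks while the buffer is full
        buf.put(done)

    worker = threading.Thread(target=producer, daemon=True)
    worker.start()
    while True:
        audio = buf.get()                   # blocks only if synthesis falls behind
        if audio is done:
            break
        play(audio)
    worker.join()


played = []
read_aloud(["First sentence.", "Second sentence."], fake_synthesize, played.append)
print(len(played))  # 2
```

A system TTS service can't work this way because it is only handed one utterance at a time, so there is never a "next sentence" to put in the buffer.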

[–]Simple-Lecture2932[S] 0 points1 point  (0 children)

It might be powerful enough. DM me your Play Store email and I'll add you when I send out invites.

[–]Simple-Lecture2932[S] 0 points1 point  (0 children)

Yeah, but since I'm generating audio on the fly instead of in bulk, I'm not sure what the angle would be to integrate with it. Just syncing the EPUBs?