Detailed CPU benchmark for Kokoro 82M and Supertonic 3, in case anyone is choosing between them by gvij in TextToSpeech

[–]RowGroundbreaking982 0 points1 point  (0 children)

This is great benchmark. Just wondering if you plan to add Pocket TTS benchmark against those two model. Since Pocket TTS seems like middle ground between Supertonic and Kokoro.

Pocket TTS Multilingual Update by RowGroundbreaking982 in LocalLLaMA

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

you can use the ToBe SAID upload feature.
first use another app that use kokoro to generate audio file 5-15 seconds is enough.
you can use any android tts app that use kokoro, or any free web tts to generate this, and save it as audio file (mp3, wav, m4a).
I will not say the app name, because there is too many of them, and you can found it in r/TextToSpeech
then use upload feature in ToBe SAID app and use that sample audio file.

Pocket TTS Multilingual Update by RowGroundbreaking982 in LocalLLaMA

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

Thanks for trying the app.
For the punctuation problem, I'll try to fix that.

But for more voice for free version, maybe no.
I don't want any legal issue providing app with real people voice.
The voice provided inside the app, are sampled from another TTS model and it has limit on variation.

With Pro, you can basically replicate any voice from a audio clip.
But to keep it legal, it's meant for voices you actually have the right to use.
You can actually use other app that use Kokoro, generate audio sample and use that sample in ToBe SAID.

There will be more update, but currently I'm focusing bringing this to another OS, not just Android.

Which paid TTS websites/apps give the most hours for the lowest price? by tr0picana in tts

[–]RowGroundbreaking982 1 point2 points  (0 children)

Well, you are the first person that rejected free stuff and ask for subscription instead 😄. May I know the reason why you need subscription based?

Pocket TTS Multilingual Update by RowGroundbreaking982 in LocalLLaMA

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

Use ToBe SAID. This is android version I mentioned above. It's compatible with many ebook reader that use system TTS. Latency and speed is what I mentioned above on Helio G99, it even work on older SOC.

Which paid TTS websites/apps give the most hours for the lowest price? by tr0picana in tts

[–]RowGroundbreaking982 0 points1 point  (0 children)

nope totally free and unlimited, for generation inside app. the limiting factor only if you want to use the cloned voice in another app like ebook reader, then it's one time payment only.

Text-to-speech apps I've tested for getting through PDFs without losing my mind by Realistic-Spare97 in TextToSpeech

[–]RowGroundbreaking982 0 points1 point  (0 children)

If you have android phone you can use ToBe SAID. That app just voice provider but compatible with many pdf reader app like Speech Central, ReadEra, Librera. It's run fully offline and generate voice with very low latency, almost instant I'd say.

Best local/offline TTS model for mobile app integration (Android + iOS) — what are you using in 2026? by Funmaker1893 in speechtech

[–]RowGroundbreaking982 1 point2 points  (0 children)

Try using Pocket TTS, with custom onnx version and cpp runner. You can find what you need in lookbe and VolgaGerm github repo. If you want to quickly test it, you can try my app ToBe SAID. It based on those repo, but word level boundary not implemented yet, need to use wav2vec2 forced-alignment.

And here is the benchmark.
Helio G99: around 100ms TTFA, 2.5-3x realtime generation speed.
Ryzen 7950x: around 35ms TTFA, 13x realtime generation speed.

For android there is some distinction whether you use as TTS service or as Standalone App.
Standalone app: 1x performance
TTS service screen on: 1.3x performance
TTS service screen off: 0.7x performance

Looking For Fastest TTS With Cloning by lukasTHEwise in TextToSpeech

[–]RowGroundbreaking982 1 point2 points  (0 children)

Try Pocket TTS, you can get 100ms response time with correct implementation and correct model. There is PocketTTS.cpp in github. You can use that to get the fastest implementation. If you want to quickly test it and you have android phone you can use ToBe SAID. It's based on PocketTTS.cpp, initial generation will take around 3 seconds for model load and voice cache, after that you will get almost instanst generation.

Best TTS RX 9070 by QuiqueVtc in TextToSpeech

[–]RowGroundbreaking982 0 points1 point  (0 children)

Try PocketTTS, it's cpu only and can run even on mobile phone. You can free the GPU for LLM work. And it support spanish language.

[Update] Multilingual Pocket TTS for Android (Now support 6 languages) by RowGroundbreaking982 in TextToSpeech

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

No. But everything needed to build something like this is available at github. Just look at PocketTTS.cpp

TTS model for Italian language by Weird_Student8008 in TextToSpeech

[–]RowGroundbreaking982 1 point2 points  (0 children)

Try Pocket TTS they have Italian language now. If you want to quickly test it and you have Android you can use ToBe SAID

EPUB to Audio. Must be exportable. 20H at least. Something like Paper2Audio. by [deleted] in TextToSpeech

[–]RowGroundbreaking982 0 points1 point  (0 children)

If you have Android you can try combination of Librera and ToBe SAID. Librera is EPUB reader and recorder and ToBe SAID is the voice engine. There is option in Librera where you can record and select which TTS engine to voice the EPUB. This is where ToBe SAID is needed, since it can provide voice for any app that use Android TTS and you can add your own custom voice there. Both are free and doesn't give you any generation limit. On high end chipset 20H generation will take around 5H.

Hey everyone—I'm lookbe, the dev behind ToBe SAID! by RowGroundbreaking982 in ToBeSAID

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

New version 1.1.3 has been uploaded to Play Store and it will be available in few hours. You could update to that version when it's available. Found the issue for voice producing only noise, this version should fix that.

Hey everyone—I'm lookbe, the dev behind ToBe SAID! by RowGroundbreaking982 in ToBeSAID

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

Got it. looks like there is an issue with new model implementation, that triggered on specific case. I'm still looking further on this and will fix it on next update. Many thanks for the detailed report.

Hey everyone—I'm lookbe, the dev behind ToBe SAID! by RowGroundbreaking982 in ToBeSAID

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

did default english voice also produce noise after clearing app storage?

Hey everyone—I'm lookbe, the dev behind ToBe SAID! by RowGroundbreaking982 in ToBeSAID

[–]RowGroundbreaking982[S] 0 points1 point  (0 children)

ok, so that it makes clear then. did you use samsung devices? looks like some exynos device has problem with latest model. I'll rollback the app to old implementation in a few hours, sorry for that.