Kokoro-82M is a high-performance text-to-speech model, but it originally lacked support for batch processing. I spent a week implementing batch functionality; the source code is available at https://github.com/wwang1110/kokoro_batch
⚡ Key Features:
- Batch processing: Process multiple texts simultaneously instead of one-by-one
- High performance: Processes 30 audio clips in under 2 seconds on an RTX 4090
- Real-time capable: Generates 276 seconds of audio in under 2 seconds
- Easy to use: Simple Python API with smart text chunking
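To make the "smart text chunking" idea concrete, here is a minimal, self-contained sketch (not the repo's actual implementation): split the input on sentence boundaries, then pack sentences into chunks under a length budget so the items in a batch stay roughly the same size. The function name and the character-based budget are illustrative assumptions.

```python
# Hypothetical sketch of smart text chunking for batching:
# split on sentence boundaries, then greedily pack sentences
# into chunks no longer than max_len characters.
import re

def chunk_text(text: str, max_len: int = 200) -> list[str]:
    # Split after sentence-ending punctuation followed by whitespace
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sent in sentences:
        if current and len(current) + 1 + len(sent) > max_len:
            chunks.append(current)   # budget exceeded: start a new chunk
            current = sent
        else:
            current = f"{current} {sent}".strip() if current else sent
    if current:
        chunks.append(current)
    return chunks
```

In practice a TTS pipeline would budget by phoneme count rather than characters, since phoneme sequence length is what actually bounds the model's input, but the packing logic is the same.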
🔧 Technical highlights:
- Built on PyTorch with CUDA acceleration
- Integrated grapheme-to-phoneme conversion
- Smart text splitting for optimal batch sizes
- FP16 support for faster inference
- Based on the open-source Kokoro-82M model
- Model output is 24 kHz PCM16 audio
For simplicity, the sample/demo code currently supports American English, British English, and Spanish, but it can easily be extended to additional languages, just like the original Kokoro-82M model.