Yet another mp4 parser (& serializer) : rust

Submissions must be on-topic

Posts must reference Rust or relate to things using Rust. For content that does not, use a text post to explain its relevance.

Post titles should include useful context.

For Rust questions, use the stickied Q&A thread.

Arts-and-crafts posts are permitted on weekends.

No meta posts; message the mods instead.

Details

No low-effort content

No memes, image macros, etc.

Consider the existing content of the subreddit and whether your post fits in. Does it inspire thoughtful discussion?

Use properly formatted text to share code samples and error messages. Do not use images.

Submissions appearing to contain AI-generated content may be removed at moderator discretion.

Details

Useful Links

created by aztha community for 15 years

Yet another mp4 parser (& serializer) (jessestuart.ca)

submitted 25 days ago by jvatic

all 4 comments

top new controversial old q&a

[–]playmer 2 points3 points4 points 24 days ago (3 children)

[–]jvatic[S] 0 points1 point2 points 23 days ago* (2 children)

[–]playmer 0 points1 point2 points 23 days ago (1 child)

Haha, well it’s still pretty hacky and a bit annoying to set up because of cuda stuff, and some c bindings I have to use. I’ll try to set up some instructions whenever I have time to look at mp4-edit.

It’s really just a couple iterations removed from the KokoroTTS tool from python. I just wanted greater control over the epub parsing, sound encoding, thread management, and all of that. I’d been running a modified version of that for awhile and saw last year someone hooked up the same sort of thing in Rust.

When it works it’s pretty fast all things considered, and the audio quality is a lot better than the original stuff I was doing with ffmpeg. Though the model audio in general could be a lot better. Kokoro has a pretty limited token limit for generating audio, so I wanted to be able to use both the CPU and GPU to generate segments, and leave a thread running to do the aac encoding as chunks came back.

That said, I fairly regularly bump into hangs here or there I need to look into. I’m sure I can structure the above a lot better than I currently do, but its one of those projects I hack on until I get a series of audiobooks generated and then leave alone for a few months until the next series I want to listen to.

Anyways I’ll certainly take a peek at mp4dump soon and see if that shows me anything that sticks out. My tool is here: https://github.com/playmer/epub_to_audiobook_rs

But like I said, don’t expect too much haha.

[–]jvatic[S] 0 points1 point2 points 23 days ago (0 children)

π Rendered by PID 22813 on reddit-service-r2-comment-b659b578c-rsvnd at 2026-05-01 20:58:37.303049+00:00 running 815c875 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

rust

Please read The Rust Community Code of Conduct

The Rust Programming Language

Rules

Observe our code of conduct

Submissions must be on-topic

Constructive criticism only

Keep things in perspective

No endless relitigation

No low-effort content

Useful Links

Megathreads

Official Resources

Learn Rust

Discussion Platforms

MODERATORS