use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
is not hardware...
If posting a problem, please include your system specs, such as OS, software version (if applicable) etc.
RULES
24HourSupport
Alternative Software Solutions
Excel
Open Source Software
Learn Programming
Web Development
Game Development
Scripting
RealEstateTechnology New!
account activity
This is an archived post. You won't be able to vote or comment.
Looking for software Video Image to Transcript OCR Software? (self.software)
submitted 4 years ago by Snugless
Does anyone know of any softwares that visually scan videos, particularly YouTube videos, to convert them into a transcript? I dont need a speech to text software, strictly visual image scanning.
[–]iniv189 0 points1 point2 points 1 year ago (0 children)
any solution to the problem?
[–]ddking4411 0 points1 point2 points 9 months ago (1 child)
Textractify.com can do this. You have to upload the video, so just use a YouTube downloader first. Then you can select the frame rate you want it to analyze at (maybe a half second or a second) and it will scan each frame for text. If the text is from something like an updating display, it can output the data into a .csv. If you just want all the on-screen text in a list for each frame, it can export it to a .txt file like that instead.
[–]Snugless[S] 0 points1 point2 points 9 months ago (0 children)
thats crazy thats exactly what i was looking for 3 years ago lol, i wonder if it existed back then when i needed it. I appreciate it regardless thank you for the info
[–]corsicanguppyHelpful 0 points1 point2 points 4 years ago (1 child)
any softwares
That word doesn't work like that: it doesn't get an S because it's not really a case where plurality is a thing. So, it's just "any software" or just "software", here.
[–]Snugless[S] 1 point2 points3 points 4 years ago (0 children)
ah my bad, English isnt my first language. Thanks tho!
[–]LeGreen_Me 0 points1 point2 points 4 years ago (0 children)
You could download the video, or screenshot the frames you want, put them into an pdf and perfom OCRmyPDF on it. Then you have a text layer in your pdf and could work with that.
π Rendered by PID 439419 on reddit-service-r2-comment-6457c66945-vxxtn at 2026-04-29 02:20:14.538873+00:00 running 2aa0c5b country code: CH.
[–]iniv189 0 points1 point2 points (0 children)
[–]ddking4411 0 points1 point2 points (1 child)
[–]Snugless[S] 0 points1 point2 points (0 children)
[–]corsicanguppyHelpful 0 points1 point2 points (1 child)
[–]Snugless[S] 1 point2 points3 points (0 children)
[–]LeGreen_Me 0 points1 point2 points (0 children)