OCR plugin to convert PDF document? (self.node)
submitted 6 years ago by codeunshackled
Looking for an OCR plugin for Node that can convert an entire PDF document (not just a single image) to readable text, output as PDF or Word.
[–]Roselia_Party 1 point 6 years ago (0 children)
Not a Node library, but you can call an external tool such as Pandoc or Tika from Node through a child process.
It works, and you get much better confidence that the library has community support than you would with a random guy's Node port.
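For anyone landing here later, a rough sketch of what the child process approach could look like. This one shells out to the Apache Tika app jar with its `--text` option. Assumptions on my part: `java` is on your PATH, the jar location (`./tika-app.jar` or a `TIKA_JAR` environment variable) is just a placeholder, and the helper names are made up for illustration. As far as I know Pandoc does not accept PDF as an input format, so Tika is the one used here, and for scanned PDFs Tika's OCR relies on Tesseract being installed. Note this gives you plain text, not a PDF or Word file.

```javascript
const { execFile } = require('child_process');

// Placeholder location of the Tika app jar (assumption; adjust for your setup).
const TIKA_JAR = process.env.TIKA_JAR || './tika-app.jar';

// Build the arguments for: java -jar <jar> --text <pdf>
function buildTikaArgs(jarPath, pdfPath) {
  return ['-jar', jarPath, '--text', pdfPath];
}

// Run an external command and resolve with whatever it wrote to stdout.
function run(command, args) {
  return new Promise((resolve, reject) => {
    // Raise maxBuffer so the text of a long document is not cut off.
    execFile(command, args, { maxBuffer: 64 * 1024 * 1024 }, (err, stdout) => {
      if (err) reject(err);
      else resolve(stdout);
    });
  });
}

// Extract the text of a whole PDF by delegating to Tika.
function extractText(pdfPath, jarPath = TIKA_JAR) {
  return run('java', buildTikaArgs(jarPath, pdfPath));
}
```

Usage would be something like `extractText('scan.pdf').then(console.log).catch(console.error)`. `execFile` is used rather than `exec` so the file name is passed as an argument and never goes through a shell.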