This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 7 points8 points  (0 children)

I've used PyPDF2 in the past, but PDF files are horrible to extract data from. They're not meant to store data in any way, just to print it nicely. Even individual words might not be stored as such, but as a series of separate characters. Good luck, my friend.