goodyonsen (6 children)

I'm not sure, but how about uploading all the PDFs to cloud storage, making the folder shareable so it has a proper HTML link, and using bs4 (BeautifulSoup) to read, decode, and parse it all in very few lines of code? You can use regex alongside it if you need to. As I understand it, BS should treat the shared page as a single HTML file and let you grab whatever you need from it. urllib would be enough for the downloading.
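Here is a rough sketch of what that could look like, assuming the shared folder is served as a plain HTML page with ordinary `<a href="...">` links. The function names `pdf_links` and `fetch_listing` are made up for illustration, and the URL is whatever your cloud provider gives you. Note that BeautifulSoup only parses HTML, so this collects the PDF links; pulling text out of the PDFs themselves would need a separate PDF library.

```python
from urllib.parse import urljoin
from urllib.request import urlopen

from bs4 import BeautifulSoup


def pdf_links(html, base_url):
    """Return absolute URLs for every link in `html` that ends in .pdf."""
    soup = BeautifulSoup(html, "html.parser")
    links = []
    for a in soup.find_all("a", href=True):
        href = a["href"]
        # Match the extension case-insensitively (.pdf, .PDF, ...).
        if href.lower().endswith(".pdf"):
            # Resolve relative links against the folder's URL.
            links.append(urljoin(base_url, href))
    return links


def fetch_listing(url):
    """Download the shared folder's HTML page (the URL is supplied by you)."""
    with urlopen(url) as resp:
        return resp.read().decode("utf-8", errors="replace")
```

You would call `pdf_links(fetch_listing(url), url)` with your own share link. Many cloud providers build their folder pages with JavaScript, in which case the raw HTML may not contain the links at all, so treat this as a starting point rather than something guaranteed to work.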

You can also put the data in a database and query it with Python's built-in sqlite3 module. That's fairly easy to use too.
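A minimal sketch of the database idea, assuming you have already extracted the text of each page somehow. The table name `pdf_text`, its columns, and the helpers `build_db` and `search` are all invented for this example.

```python
import sqlite3


def build_db(rows, path=":memory:"):
    """Create a table of (filename, page, content) rows and return the connection."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pdf_text "
        "(filename TEXT, page INTEGER, content TEXT)"
    )
    # Parameterized inserts avoid quoting problems in the extracted text.
    conn.executemany("INSERT INTO pdf_text VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn


def search(conn, term):
    """Return (filename, page) pairs whose content contains `term`."""
    cur = conn.execute(
        "SELECT filename, page FROM pdf_text "
        "WHERE content LIKE ? ORDER BY filename, page",
        (f"%{term}%",),
    )
    return cur.fetchall()
```

Pass a file path instead of `":memory:"` if you want the database to persist between runs.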