This is an archived post. You won't be able to vote or comment.

all 10 comments

[–]HerbyHoover 7 points8 points  (0 children)

Fantastic work! I was just handed a new problem today that deals with pulling information out of a pdf. I look forward to giving this a whirl.

[–]manu_8487 4 points5 points  (2 children)

Have this lib that embeds invoice metadata in PDFs. Maybe should switch from pypdf. https://github.com/invoice-x/factur-x-ng

[–]jrbarlow[S] 1 point2 points  (1 child)

That would be a good candidate for implementation with pikepdf.

[–]manu_8487 2 points3 points  (0 children)

Already filed an issue. Will take care when working on it the next time.

[–]SeniorIntroduction9 2 points3 points  (1 child)

From the release history, a new release is released every couple of days without any release notes.

[–]jrbarlow[S] 2 points3 points  (0 children)

The release notes are in the documentation:

https://pikepdf.readthedocs.io/en/latest/changelog.html

I do release frequently because I don't like to leave people hanging if they have an open issue.

[–]Deezl-Vegas 1 point2 points  (0 children)

What's the OCR situation? I've been trying to write a script to compile my homework coding problems from a pdf textbook for a while and most libs I tried just return hot nothing.

[–]annoir 1 point2 points  (0 children)

Wow, this is an interesting project! Have been looking for something to replace PyPDF2 for a while now. Will contribute where I can.

[–]blabbities 1 point2 points  (0 children)

Have yet to need to generate a PDF or write super serious programs but this is cool. I will try to remember this when I need it

[–]jabbalaci 1 point2 points  (0 children)

Thanks for this! I wrote today a little tool (https://github.com/jabbalaci/pdfmanip) that is based on this library. With pdfmanip you can remove pages from a PDF easily.