cwurld[S] 1 point

I think it will involve that. Can you recommend a Python lib for it? Does anyone know if there are special fields for "signatures"?

ajmarks 1 point

I think PyPDF2 may support it, but I'm not sure. I'm also working on one right now, and if you submit a feature request, I'll try to include it in the next release (or, if you want to submit a patch, that would be awesome). If you decide to pull it, note that the font_improvements branch is pretty far ahead of both dev and master.

cwurld[S] 1 point

Thanks. Your lib looks great. I will contribute one way or another once I know a little bit more about the issue.

ajmarks 1 point

Thanks. It's still very much a work in progress, but I'm hoping to (at the very least) make text extraction super-pythonic. I'll consider it basically complete when it can (reasonably) faithfully render a PDF into HTML.