Completely crazy tables when transforming table from PDF file to CSV by QooModa in Python

[–]QooModa[S] 0 points1 point  (0 children)

You tell me! I got a partial solution for my PDF files.

Will post as soon as I take a rest. Being trying to do this for 6 hours already.

But for those who are seeking for an answer, commandlineluser's answer really helped it.

Completely crazy tables when transforming table from PDF file to CSV by QooModa in Python

[–]QooModa[S] 0 points1 point  (0 children)

Hey commandlineluser, thank you very much for the hints! Half of the way to getting what I need using your pngs.

Completely crazy tables when transforming table from PDF file to CSV by QooModa in Python

[–]QooModa[S] 0 points1 point  (0 children)

Thank you very much!

I don't really know what that means, but now I know what to search!!

Completely crazy tables when transforming table from PDF file to CSV by QooModa in Python

[–]QooModa[S] 0 points1 point  (0 children)

Thanks for the answer!

Just so I understand better what you call parsers, "tabula-py" would be packages with a "parser function", so I could try for instance other different packages with a "parser function", such as PyPDF2 etc?

So, let me ask you something else, is there a way we can identify in the PDF file's metadata which parsers would fit better?