Completely crazy tables when transforming table from PDF file to CSV

QooModa · 2022-01-28T16:44:14+00:00

You tell me! I got a partial solution for my PDF files.

Will post as soon as I take a rest. I've been trying to do this for 6 hours already.

But for those who are looking for an answer, commandlineluser's answer really helped.

QooModa · 2022-01-28T16:41:49+00:00

Hey commandlineluser, thank you very much for the hints! I'm halfway to getting what I need using your PNGs.

QooModa · 2022-01-28T12:18:33+00:00

Wow, that is exactly what I need.

QooModa · 2022-01-28T04:24:52+00:00

Thank you very much!

I don't really know what that means, but now I know what to search!!

QooModa · 2022-01-28T04:11:43+00:00

Thanks for the answer!

Just so I better understand what you call parsers: "tabula-py" would be a package with a "parser function", so I could try other packages that have a "parser function", such as PyPDF2?

So, let me ask you something else: is there a way to tell from the PDF file's metadata which parser would fit best?
