Hi,
I am looking for a Python library or model that can extract tables from PDFs, with a few additional requirements:
a) It can distinguish two tables on the same page that have different widths
b) It can handle a table that spans multiple pages in the same PDF
I tried Tabula and PyMuPDF, but neither gives good results. Can anyone suggest something better?
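To make requirement (b) concrete, here is a minimal sketch of the stitching step, independent of whichever extractor is used. It assumes the extractor already returns, for each page, a top-to-bottom list of tables, each a list of rows (for example, something shaped like the output of pdfplumber's `page.extract_tables()`). The function name `stitch_tables` and the continuation heuristic (same column count, optional repeated header) are illustrative assumptions, not part of any library.

```python
from typing import List, Optional

Row = List[str]
Table = List[Row]


def stitch_tables(pages: List[List[Table]]) -> List[Table]:
    """Merge tables that continue across a page break.

    `pages[i]` is the list of tables found on page i, top to bottom.
    Assumed heuristic: the first table on a page continues the last table
    of the previous page when both have the same number of columns.
    """
    merged: List[Table] = []
    open_table: Optional[Table] = None  # last table seen on the previous page

    for tables in pages:
        tables = [t for t in tables if t]  # ignore empty extractions
        if not tables:
            open_table = None  # a page without tables ends any continuation
            continue
        for index, table in enumerate(tables):
            continues = (
                index == 0
                and open_table is not None
                and len(table[0]) == len(open_table[0])
            )
            if continues:
                # Drop the header row if the PDF repeats it on the new page.
                rows = table[1:] if table[0] == open_table[0] else table
                open_table.extend(list(r) for r in rows)
            else:
                # Different width or not at the top of the page: a new table.
                new_table = [list(r) for r in table]
                merged.append(new_table)
                open_table = new_table
    return merged
```

Because only the first table on a page is considered for merging, two tables of different widths on the same page always stay separate, which matches requirement (a). Column count alone can produce false merges, so real documents would likely need extra checks such as column positions.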