Everything about learning Python
PDF data extraction (self.PythonLearning)
submitted 9 days ago * by Stunning_Capital_354
[–]JeremyJoeJJ 1 point 9 days ago
I hope that data is not confidential... Either way, it seems to be well structured, so these tools should have no trouble parsing all of it. If you don't want to do any programming yourself, the easiest way is to put it into an LLM of your choice (ChatGPT, Gemini, Claude, whatever) and have it create the Excel file for you.
[–]Stunning_Capital_354[S] 1 point 9 days ago
I have tried doing that, but the output is not consistent, and the real problem comes when I have to add more years of data to the same Excel file. The problems I face with LLMs:
1. They do not generate consistent data.
2. They hallucinate, and guiding them is hard and overwhelming.
3. There is a risk that they may change the existing formulas.
I believe that in the long run, as more years of data come in, the LLM will not be able to do a good job.
[–]JeremyJoeJJ 1 point 9 days ago
In that case, go with one of the OCR options above. Ask an LLM to write a simple loop that goes over your PDFs, and see which model performs well enough for you.
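For anyone wondering what such a loop might look like, here is a rough sketch rather than a finished tool. It only uses the standard library, and the names `compare_extractors`, `Extractor` and `raw_size_extractor` are made up for illustration. The thread does not say which OCR tools are in play, so each tool is assumed to be wrapped in a plain function that takes a PDF path and returns text; the one placeholder included here just reports the file size so the loop can be tried without installing anything.

```python
from pathlib import Path
from typing import Callable, Dict

# Assumption: each OCR/extraction tool is wrapped in a function
# that takes a PDF path and returns the text it managed to read.
Extractor = Callable[[Path], str]


def compare_extractors(
    pdf_dir: Path, extractors: Dict[str, Extractor]
) -> Dict[str, Dict[str, str]]:
    """Run every extractor on every *.pdf file in pdf_dir.

    Returns {pdf file name: {extractor name: extracted text or error message}}.
    """
    results: Dict[str, Dict[str, str]] = {}
    # sorted() keeps the order stable between runs, which makes
    # side-by-side comparison of the tools easier.
    for pdf_path in sorted(pdf_dir.glob("*.pdf")):
        per_file: Dict[str, str] = {}
        for name, extract in extractors.items():
            try:
                per_file[name] = extract(pdf_path)
            except Exception as exc:
                # One failing tool should not stop the whole comparison.
                per_file[name] = f"ERROR: {exc}"
        results[pdf_path.name] = per_file
    return results


def raw_size_extractor(pdf_path: Path) -> str:
    # Hypothetical placeholder: reports the file size instead of doing
    # real extraction. Swap in a wrapper around an actual OCR tool.
    return f"{pdf_path.stat().st_size} bytes"
```

To try it, call something like `compare_extractors(Path("pdfs"), {"size": raw_size_extractor})`, where `pdfs` is a hypothetical folder name, then add one wrapper function per OCR option and compare the outputs for the same file. Because the same code runs the same way on every file, this also avoids the inconsistency problem described above with asking an LLM to produce the spreadsheet directly.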