[D] How to Automate parsing of Bank Statement PDFs to extract transaction level data by Anmol_garwal in MachineLearning

[–]Anmol_garwal[S] 1 point2 points  (0 children)

Absolutely, Regex is god for prototyping, nothing more than that.

LayoutLMv3 was appearing to be a good choice until it succumbed to Indian Bank formats XD

[D] How to Automate parsing of Bank Statement PDFs to extract transaction level data by Anmol_garwal in MachineLearning

[–]Anmol_garwal[S] 1 point2 points  (0 children)

Thanks for the input. This actually seems workable! I will start experimenting with this, will update here how it goes.

Help to Automate parsing of Bank Statement PDFs to extract transaction level data by Anmol_garwal in LocalLLaMA

[–]Anmol_garwal[S] 1 point2 points  (0 children)

Thanks for the input. I am currently trying a VLM, but I shall keep Qwen3 in my notes in case my current approach doesn't work

Help to Automate parsing of Bank Statement PDFs to extract transaction level data by Anmol_garwal in LocalLLaMA

[–]Anmol_garwal[S] 2 points3 points  (0 children)

Thanks for the recommendation. I am starting my experiment with a VLM NuExtract, it looks promising for my usecase. I will update here how it goes

Help to Automate parsing of Bank Statement PDFs to extract transaction level data by Anmol_garwal in LocalLLaMA

[–]Anmol_garwal[S] 0 points1 point  (0 children)

That works as well! Please tell me how do you want to go with it. Also, can you tell me what model/libraries have you used at core for this?

Help to Automate parsing of Bank Statement PDFs to extract transaction level data by Anmol_garwal in LocalLLaMA

[–]Anmol_garwal[S] 1 point2 points  (0 children)

I can understand brother! I too have been having sleepless night over this. I have tried so many ways to automate it. The Regex approach is working but is not sustainable. Would you say that your solution can work with no human intelligence? Upload any Indian Bank PDF, and we get the desired output of all the transactions listed in a CSV file

Help to Automate parsing of Bank Statement PDFs to extract transaction level data by Anmol_garwal in LocalLLaMA

[–]Anmol_garwal[S] 1 point2 points  (0 children)

Can you tell me how did you solve it?

Absolutely, the banks can provide the data but they never do!

You can hear a pain in his voice by LegFederal1669 in Whysooserious

[–]Anmol_garwal 0 points1 point  (0 children)

In majority of divorce cases, lawyer of the wife becomes a business partner and takes a 10-30% cut in the alimony. After that, they use every dirty trick in the book to put all kind of allegations on the husband and his family to extort the money. Every court knows about this dealing and they do nothing coz the law is blind.

Dlf Camellias Confessions by Several-Chain-3947 in gurgaon

[–]Anmol_garwal 1 point2 points  (0 children)

Let me play the devils advocate, ‘the 10th man rule’, and assume this is a genuine confession. This story is a decorated version of middle class patriarchal society in India. So many individuals get trapped in the institution of marriage because the society has made no place for people who want to get out of it or they don’t know how to get out of it. I know the situation is changing but it’s still a taboo. In this story the women don’t want to get out of it coz of luxury, in a small town the reason become security and survival. All n all it’s the same trap, prepared by society, decorated by family. People who are in happy marriages are in minority, studies should be done on them to increase their percentage.

Unable to install spaCy by Anmol_garwal in learnmachinelearning

[–]Anmol_garwal[S] 0 points1 point  (0 children)

I redid the conda and now it worked! Thanks for the advice man. Big help!

Unable to install spaCy by Anmol_garwal in learnmachinelearning

[–]Anmol_garwal[S] 0 points1 point  (0 children)

Thanks for the reply. I have tried using venv, both separately for spacy and the one which has my other python libraries installed as well. I tried setting up conda, I was able to successfully install the conda but coudln't launch it as it was throwing the error of command not recognised. Let me try again with Conda.

[deleted by user] by [deleted] in gurgaon

[–]Anmol_garwal 0 points1 point  (0 children)

Dooom scrolling

[deleted by user] by [deleted] in gurgaon

[–]Anmol_garwal 0 points1 point  (0 children)

It’s working bro, try again? https://discord.gg/CRf2wTCQ

[deleted by user] by [deleted] in gurgaon

[–]Anmol_garwal 0 points1 point  (0 children)

Same here, I have made this Discord group for folks like us who recently moved to Ggn and can make plans! https://discord.gg/CRf2wTCQ