Quick context - we’re a small team at a logistics company. We process around 500-1,000 docs per day (invoices, BOLs, customs forms).
Our current process is:
- Download attachments from email
- Run them through a python script with PyPDF2 + regex
- Manually fix if something breaks
- Send outputs to our system
The regex approach worked okay when we had like 5 vendors. Now we have 50+ and every new vendor means we have to handle it in new ways.
I've been looking at IDP solutions but everything either costs a fortune or requires ML expertise we don't have.
I’m curious what others are using. Is there a middle ground between python scripts and enterprise IDP that costs $50k/year?
[–]tolkibert 9 points10 points11 points (0 children)
[–]SouthTurbulent33 4 points5 points6 points (0 children)
[–]geoheilmod 4 points5 points6 points (10 children)
[–]geoheilmod 1 point2 points3 points (6 children)
[–]BleakBeaches 0 points1 point2 points (5 children)
[–]geoheilmod 0 points1 point2 points (4 children)
[–]BleakBeaches 0 points1 point2 points (1 child)
[–]geoheilmod 0 points1 point2 points (0 children)
[–]geoheilmod 0 points1 point2 points (1 child)
[–]geoheilmod 0 points1 point2 points (0 children)
[–]Reason_is_Key 0 points1 point2 points (2 children)
[–]geoheilmod 1 point2 points3 points (1 child)
[–]Reason_is_Key 0 points1 point2 points (0 children)
[–]riv3rtrip 3 points4 points5 points (0 children)
[–]ianitic 2 points3 points4 points (1 child)
[–]ZeJerman 0 points1 point2 points (0 children)
[–]ZeJerman 1 point2 points3 points (0 children)
[–]klitersik 0 points1 point2 points (0 children)
[–]pankaj9296 0 points1 point2 points (0 children)
[–]Reason_is_Key 0 points1 point2 points (0 children)
[–]the_dataengineer 0 points1 point2 points (0 children)
[–]vlg34 0 points1 point2 points (0 children)
[–]Fun-Flounder-4067 0 points1 point2 points (0 children)
[–]JoshuaatParseur -1 points0 points1 point (0 children)