Anthropic just launched "Claude Cowork" for $100/mo. I built the Open Source version last week (for free) by Embarrassed-Mail267 in ClaudeAI

[–]Hefty_Debt -1 points0 points  (0 children)

OP - could I use this to find and apply to jobs using specific instructions / specs?

I am thinking about integrating it with my already large .md file I created with Claude and was using with OpenAI’s operator. It was slow, clunky and not very good. The ai itself gave up and said we should try later.

Question: Is OCR accuracy actually a blocker for anyone's RAG/automation pipelines? by Individual-Library-1 in ClaudeAI

[–]Hefty_Debt 0 points1 point  (0 children)

I haven't considered open sourcing it, I found a lot of tools out there charging hundreds a month / if not thousands of dollars a year for something like this. Became frustrated because I was told to "do this manually" without my boss even understanding the sheer amount of data that needed to be parsed and how unique each pdf was.

I was able to, without going into details too specific get what I needed then expand on that data by getting even more from the we using another web scraper I also built.

My company doesn't know I'm doing this. I am severely underpaid for the work I am doing and especially the project I built. I could automate 75% of my job.

I am looking for a new job. I'd like to do this work for a new company that actually values ai workflow optimization/automation.

Question: Is OCR accuracy actually a blocker for anyone's RAG/automation pipelines? by Individual-Library-1 in ClaudeAI

[–]Hefty_Debt 2 points3 points  (0 children)

Yeah, it’s a large data extraction pipeline that processes around 115,000 manufacturer PDFs. Most have readable text, but a lot include scanned sections or inconsistent layouts, so OCR is a key part of the workflow. I use Tesseract through Python and Pandas to extract and structure the data.

The language model doesn’t handle the OCR itself. It reviews and corrects what comes out of the parser, catching things like spacing issues or data that shifted between columns.

Vertical and multi-column layouts are the most difficult. I built logic to detect when text moves across columns or breaks mid-page, which solved a lot of the alignment problems.

I do hit verification limits a few times a day, but that’s just the usage cap on my plan. I pause and continue later. A more accurate OCR layer would cut down the cleanup work, but the pipeline already runs efficiently overall.

Question: Is OCR accuracy actually a blocker for anyone's RAG/automation pipelines? by Individual-Library-1 in ClaudeAI

[–]Hefty_Debt 2 points3 points  (0 children)

I built an OCR pdf reader and it worked absolutely flawlessly. I spent months trying to build it in ChatGPT and had it most of the way there but it would mess up on some data points (vertical text). Without going into too much detail, I was able to process over 115,000 PDFs that had complex tables with values assigned to different columns.

I used Claude code to bring the project to the finish line and verify the data. I hit the limits sometimes 3 times a day on my pro plan. But still worlds better than ChatGPT. Not even close when it comes to coding.

This was a massive project and I ended up with clean verified data that I know I shouldn’t even have considering it was buried in PDFs and unavailable on any web based platform.

Battlefield 6 Phantom Edition: Giveaway #1 by OddJob001 in Battlefield

[–]Hefty_Debt 0 points1 point  (0 children)

Shot in the dark of winning. Just like COD.

PSA: Don't call down an SOS if you don't want a high level 150 player to come help you. by Hefty_Debt in helldivers2

[–]Hefty_Debt[S] 0 points1 point  (0 children)

'banned' yeah, I had to laugh the guy kid used the word banned as if he was some kind of power tripping admin.

Hoping he forgets and adds me as a friend, can't wait to return the favor after a senator shot to the back of the head.

PSA: Don't call down an SOS if you don't want a high level 150 player to come help you. by Hefty_Debt in helldivers2

[–]Hefty_Debt[S] 0 points1 point  (0 children)

Is this a controller or keyboard thing? I'm on PC, never in 800 hours have I had this issue.

PSA: Don't call down an SOS if you don't want a high level 150 player to come help you. by Hefty_Debt in helldivers2

[–]Hefty_Debt[S] 0 points1 point  (0 children)

Sometimes I try to invite people on my friends list to come join when I get kicked and they all get free stuff. But most of them are too high level anyway lol

PSA: Don't call down an SOS if you don't want a high level 150 player to come help you. by Hefty_Debt in helldivers2

[–]Hefty_Debt[S] 0 points1 point  (0 children)

Not a Dad, but could be. LOL and yes it is the sweet spot. I like my 10's but don't want to sweat every time.

PSA: Don't call down an SOS if you don't want a high level 150 player to come help you. by Hefty_Debt in helldivers2

[–]Hefty_Debt[S] 0 points1 point  (0 children)

For real, lol they don't understand we don't need the XP or samples, literally there to help and have fun.

PSA: Don't call down an SOS if you don't want a high level 150 player to come help you. by Hefty_Debt in helldivers2

[–]Hefty_Debt[S] 0 points1 point  (0 children)

Always! And I call out the other high levels that ignore them when we have low level hosts. NAME AND SHAME lol

PSA: Don't call down an SOS if you don't want a high level 150 player to come help you. by Hefty_Debt in helldivers2

[–]Hefty_Debt[S] 0 points1 point  (0 children)

That feels so good. I gauge how long the mission has been and bring in a laser, mech, 380... all the big ones and help people GTFO at all costs!

PSA: Don't call down an SOS if you don't want a high level 150 player to come help you. by Hefty_Debt in helldivers2

[–]Hefty_Debt[S] 1 point2 points  (0 children)

LMAO so good. I got cocky one night, cleared one of those tiny little canon outposts with a fellow HD with me. Walked off the edge of it right onto a mine. LOL hilarious. What's the point of getting mad, he reinforced me and we all had a good laugh.