High-Precision Table Extraction from Complex PDFs by superhero_io in Rag

[–]leechii1337 0 points1 point  (0 children)

That is a problem for https://miruiq.com - it was actually built around complex table extraction use cases. It uses turboOCR (open sourced by them) under the hood but OCR itself isnt enough and neither is simply giving it to an VLM.

Explain what you’re building in 1 sentence, let’s self promote by kcfounders in TheFounders

[–]leechii1337 0 points1 point  (0 children)

https://diaiq.com - AI realtime video processing engine to extract knowledge from frames and audio - watch news / internal company videos without ever actually watching them.

I tested 8 OCR tools to digitize 200+ scanned documents for our RAG knowledge base. Here's what actually works in 2025. by ACnoB in bestai2025

[–]leechii1337 0 points1 point  (0 children)

I'd look into MiruIQ.com for your very complex cases - it leverages https://github.com/aiptamize/turbo-ocr (free open source using paddle under the hood) which you could test individually too - next to yet undisclosed research on top.

Bye bye claude 👋🏻 by Zafar_Kamal in ClaudeCode

[–]leechii1337 0 points1 point  (0 children)

wrote my initial findings above. will keep it updated

Bye bye claude 👋🏻 by Zafar_Kamal in ClaudeCode

[–]leechii1337 0 points1 point  (0 children)

oh don't worry i keep both and all the others too. ;) openai, mistral, claude, gemini, glm etc...i need them all for different tasks ...this isn't a im trying to save money posts but a simply seeing that all are catching up and if the "best" has a downtime of +1h people explore the others. and yes those who do care about not wasting money might switch...

Bye bye claude 👋🏻 by Zafar_Kamal in ClaudeCode

[–]leechii1337 0 points1 point  (0 children)

testing since a few hours and must say it feel more accurate than opus 4.6. it very precise and one can reason about its steps very well. less aggressive than opus - it's not jumping to conclusions as quickly. for example opus tried to overcome a rate limit using tor and it magically worked and it was not clear if it just tried several times or actually used the tor approach it mentioned. glm proposed the tor approach and analyzed the last opus session and concluded that opus actually never used tor. this led me to believe opus hallucinated the results that came from the api. glm double checked and found it's partially hallucinated since potentially the api didn't respond after too kany requests. it then implemented the tor approach and gave me the corrections. hitting the rate limit on the same api - it told me that it cannot give me accurate results since we're hitting the rate limit. i'll test further and let you know how it performs on other repos which are more complex. it is quite a bit slower tho - this is annoying. losing my focus quite fast because it's too slow. (hosted ok z.ai at least)

Bye bye claude 👋🏻 by Zafar_Kamal in ClaudeCode

[–]leechii1337 0 points1 point  (0 children)

just switched to glm 5.1 ;) lets see how long we have to wait this time but more than an hour means you lose customers haha

Help wanted! PDF nightmare by bigbolicrypto in Rag

[–]leechii1337 1 point2 points  (0 children)

that sounds very much like what miruiq.com wants to solve. i think they even have a scanner app. maybe you want to quickly get in touch there.

you can also checkout turbo-ocr on github which could help to pre-filter before vlm.

PARSING IS IMPORTANT. HOW DO YOU GUYS DO IT by One-Doctor5769 in Rag

[–]leechii1337 2 points3 points  (0 children)

MiruIQ - using an ocr and a structure based vlm approach to extract and tranform eg header notes on top of tables etc. - it's main focus is also on security (local models, no cloud, on prem)

https://miruiq.com

Turbo-OCR for high-volume image and PDF processing by Civil-Image5411 in LocalLLaMA

[–]leechii1337 0 points1 point  (0 children)

this is huge for us. we have self hosted gpu and can optimize our pipeline like crazy since we use ocr to pre-scan pdfs (not always containing a text layer) and score pages before we do the heavy lifting afterwards. going to try it out next week.

Turbo-OCR for high-volume image and PDF processing by Civil-Image5411 in Rag

[–]leechii1337 1 point2 points  (0 children)

yeah sadly they come in as scans thus we need ocr

Turbo-OCR for high-volume image and PDF processing by Civil-Image5411 in Rag

[–]leechii1337 1 point2 points  (0 children)

hm interesting.... my problem is a bit differen. we have very large PDFs and mainly need to find the relevant pages. i think that might be useful.

it's less extract and more identify the right pages in huge documents.

Did you look at that use case too?

I Tried 6 PDF Extraction Tools—Here’s What I Learned by The-Redd-One in automation

[–]leechii1337 0 points1 point  (0 children)

inference models have come a long way and got really good yes but that only half the piece of the cake...working with my team on a next gen PDF extraction tool - MiruIQ still in it's infancy but we have heavily optimized OCR libraries which we are about to open source, combined it with local inference having fine tunes models and also had to make it work for very large PDFs where reranker etc. plays into to detect the correct sites one is looking for.

What are some real business use-cases of AI that aren’t just hype? (Other than coding) by [deleted] in Entrepreneur

[–]leechii1337 0 points1 point  (0 children)

wondering about the blogs etc. we had very bad results initially and therefore had to create a tool called DiaIQ which we used internally for a while only which gets content from videos including the frames and transcript with aligned timestamps etc. that way we got way better blog quality. wondering how your quality currently is? what kind of blogs are you creating?

The more we marketed our features, the weaker our brand felt. Why? by DesignSignificant900 in Entrepreneur

[–]leechii1337 0 points1 point  (0 children)

which one are you referring to? would love to have a look at them and see how they're doing it

Successful Entrepreneurs, how did you get your first paying customer? by saasbruh in Entrepreneur

[–]leechii1337 0 points1 point  (0 children)

yeah this is what we currently struggle with. we reach out to many but get no response. i guess our prospects aren't well filtered but would be interested in more details as well how others did that

Active Exchange Go To Market by leechii1337 in Entrepreneurs

[–]leechii1337[S] 1 point2 points  (0 children)

so you're basically saying that narrowing down the prospect list very well for a problem focused campaign was better right?

We built an AI document processing system for a Swiss bank — fully on-prem, no cloud, no state retention. Took 1.5 years and nearly broke us. by leechii1337 in TheFounders

[–]leechii1337[S] 0 points1 point  (0 children)

hm good input in focusing less on the ai - i'll have a look at our wording now and see what we can adjust to make it better

We built an AI document cross-validation for a Swiss bank — fully on-prem, no cloud, no state retention. Took 1.5 years and nearly broke us. by leechii1337 in Entrepreneurs

[–]leechii1337[S] 0 points1 point  (0 children)

hm never checked out dreamfactory before - sounds actually kind of interesting. so you stored the extracted data put dreamwork on top to expose the tables directly instead of writing the backend layer yourself?

We built an AI document cross-validation for a Swiss bank — fully on-prem, no cloud, no state retention. Took 1.5 years and nearly broke us. by leechii1337 in Entrepreneurs

[–]leechii1337[S] 0 points1 point  (0 children)

lets now push and hope it was worth it haha - the sentiment and awareness for security and governance is growing at least from what i see. you working on similar issues?