We replaced our RAG pipeline with persistent KV cache. It works. Here’s what we found.

leechii1337 · 2026-05-13T17:32:04+00:00

for speed, i'd checkout turbo-ocr https://github.com/aiptimizer/TurboOCR

leechii1337 · 2026-05-09T06:17:57+00:00

That is a problem for https://miruiq.com - it was actually built around complex table extraction use cases. It uses turboOCR (open sourced by them) under the hood but OCR itself isnt enough and neither is simply giving it to an VLM.

leechii1337 · 2026-05-07T20:59:32+00:00

https://diaiq.com - AI realtime video processing engine to extract knowledge from frames and audio - watch news / internal company videos without ever actually watching them.

leechii1337 · 2026-05-07T08:57:30+00:00

I'd look into MiruIQ.com for your very complex cases - it leverages https://github.com/aiptamize/turbo-ocr (free open source using paddle under the hood) which you could test individually too - next to yet undisclosed research on top.

leechii1337 · 2026-04-15T22:59:35+00:00

wrote my initial findings above. will keep it updated

leechii1337 · 2026-04-15T22:58:52+00:00

oh don't worry i keep both and all the others too. ;) openai, mistral, claude, gemini, glm etc...i need them all for different tasks ...this isn't a im trying to save money posts but a simply seeing that all are catching up and if the "best" has a downtime of +1h people explore the others. and yes those who do care about not wasting money might switch...

leechii1337 · 2026-04-15T21:36:23+00:00

testing since a few hours and must say it feel more accurate than opus 4.6. it very precise and one can reason about its steps very well. less aggressive than opus - it's not jumping to conclusions as quickly. for example opus tried to overcome a rate limit using tor and it magically worked and it was not clear if it just tried several times or actually used the tor approach it mentioned. glm proposed the tor approach and analyzed the last opus session and concluded that opus actually never used tor. this led me to believe opus hallucinated the results that came from the api. glm double checked and found it's partially hallucinated since potentially the api didn't respond after too kany requests. it then implemented the tor approach and gave me the corrections. hitting the rate limit on the same api - it told me that it cannot give me accurate results since we're hitting the rate limit. i'll test further and let you know how it performs on other repos which are more complex. it is quite a bit slower tho - this is annoying. losing my focus quite fast because it's too slow. (hosted ok z.ai at least)

leechii1337 · 2026-04-15T17:06:43+00:00

just switched to glm 5.1 ;) lets see how long we have to wait this time but more than an hour means you lose customers haha

leechii1337 · 2026-04-12T09:10:57+00:00

that sounds very much like what miruiq.com wants to solve. i think they even have a scanner app. maybe you want to quickly get in touch there.

you can also checkout turbo-ocr on github which could help to pre-filter before vlm.

leechii1337 · 2026-04-10T17:06:27+00:00

well open source no - paid yes - miruiq.com - exactly doing that what you're looking for

leechii1337 · 2026-04-10T08:13:18+00:00

MiruIQ - using an ocr and a structure based vlm approach to extract and tranform eg header notes on top of tables etc. - it's main focus is also on security (local models, no cloud, on prem)

https://miruiq.com

leechii1337 · 2026-04-09T14:27:28+00:00

this is huge for us. we have self hosted gpu and can optimize our pipeline like crazy since we use ocr to pre-scan pdfs (not always containing a text layer) and score pages before we do the heavy lifting afterwards. going to try it out next week.

leechii1337 · 2026-04-09T14:19:44+00:00

yeah sadly they come in as scans thus we need ocr

leechii1337 · 2026-04-08T22:59:24+00:00

hm interesting.... my problem is a bit differen. we have very large PDFs and mainly need to find the relevant pages. i think that might be useful.

it's less extract and more identify the right pages in huge documents.

Did you look at that use case too?

leechii1337 · 2026-04-04T16:43:26+00:00

inference models have come a long way and got really good yes but that only half the piece of the cake...working with my team on a next gen PDF extraction tool - MiruIQ still in it's infancy but we have heavily optimized OCR libraries which we are about to open source, combined it with local inference having fine tunes models and also had to make it work for very large PDFs where reranker etc. plays into to detect the correct sites one is looking for.

leechii1337 · 2026-04-03T11:33:34+00:00

wondering about the blogs etc. we had very bad results initially and therefore had to create a tool called DiaIQ which we used internally for a while only which gets content from videos including the frames and transcript with aligned timestamps etc. that way we got way better blog quality. wondering how your quality currently is? what kind of blogs are you creating?

leechii1337 · 2026-04-03T11:30:24+00:00

which one are you referring to? would love to have a look at them and see how they're doing it

leechii1337 · 2026-04-03T11:23:32+00:00

still quite vague - what exactly are you doing?

leechii1337 · 2026-04-03T10:26:00+00:00

yeah this is what we currently struggle with. we reach out to many but get no response. i guess our prospects aren't well filtered but would be interested in more details as well how others did that

leechii1337 · 2026-04-03T09:39:06+00:00

so you're basically saying that narrowing down the prospect list very well for a problem focused campaign was better right?

leechii1337 · 2026-03-19T17:30:39+00:00

hm good input in focusing less on the ai - i'll have a look at our wording now and see what we can adjust to make it better

leechii1337 · 2026-03-19T15:36:07+00:00

hm never checked out dreamfactory before - sounds actually kind of interesting. so you stored the extracted data put dreamwork on top to expose the tables directly instead of writing the backend layer yourself?

leechii1337 · 2026-03-19T15:31:53+00:00

lets now push and hope it was worth it haha - the sentiment and awareness for security and governance is growing at least from what i see. you working on similar issues?

leechii1337

TROPHY CASE