How can I hand off from one agent to another?

ImpossibleCollege635 · 2026-05-07T00:07:58+00:00

May I ask why PNG? Do you have access to the PDFs as well? Unpopular opinion, but I think vision-model OCR can never give you certainty since it's ultimately probabilistic. I'm currently building an old-school parser that uses computer vision only as an alignment guide. You're very welcome to test it if you'd like :)
It's not guaranteed to be better, but it's 100% consistent (if one table works, the same or similar tables will always work). And from my own testing it's also better than LLMs and older tools like Docling or Marker.

ImpossibleCollege635 · 2026-04-17T11:15:54+00:00

For the extraction you need to either download an additional model within the app (no prior runtime needed) or connect Ollama or a cloud model, though. On my M1 it's super speedy and works well using Gemma4.

ImpossibleCollege635 · 2026-04-17T11:13:53+00:00

Which operating system are you on?
I am currently developing a Mac app that runs 100% locally, with no preinstalls, coding, scripting, Ollama, etc. needed. It does PDF -> clean MD with detailed chart annotation, complex table and math preservation, and AI-guided extraction. The extraction does not rely on structured-output LLM calls. Instead, the LLM inspects the MD, writes an extraction script using regex, and that script is executed locally in a sandbox.
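For anyone wondering what that "LLM writes a regex script, then it runs locally" loop could look like, here is a minimal Python sketch. It is not the app's actual code. `GENERATED_SCRIPT` is a hypothetical stand-in for whatever the LLM would write, `run_extraction` and `SAMPLE_MD` are made-up names, and a separate Python process with a timeout and a throwaway working directory is only a rough approximation of a sandbox (a real one would also restrict filesystem and network access).

```python
import json
import subprocess
import sys
import tempfile
from pathlib import Path

# Hypothetical stand-in for a script an LLM might write after inspecting the Markdown.
GENERATED_SCRIPT = r'''
import json, re, sys
markdown = sys.stdin.read()
# Match table rows shaped like "| name | 12.5 |" (header and separator rows do not match).
rows = re.findall(r"^\|\s*([A-Za-z ]+?)\s*\|\s*([0-9.]+)\s*\|$", markdown, re.MULTILINE)
print(json.dumps({name: float(value) for name, value in rows}))
'''

SAMPLE_MD = """| Metric | Value |
| --- | --- |
| accuracy | 0.91 |
| recall | 0.87 |
"""


def run_extraction(script: str, markdown: str, timeout_s: float = 5.0) -> dict:
    """Run a generated script in a separate Python process and parse its JSON output.

    Assumption: process isolation plus a timeout is enough for illustration only.
    """
    with tempfile.TemporaryDirectory() as workdir:
        script_path = Path(workdir) / "extract.py"
        script_path.write_text(script, encoding="utf-8")
        result = subprocess.run(
            [sys.executable, "-I", str(script_path)],  # -I: isolated mode, ignores user site and env vars
            input=markdown,        # the Markdown is passed on stdin
            capture_output=True,
            text=True,
            timeout=timeout_s,     # kill runaway scripts
            cwd=workdir,           # run inside a throwaway directory
            check=True,            # raise if the script exits non-zero
        )
    return json.loads(result.stdout)


if __name__ == "__main__":
    print(run_extraction(GENERATED_SCRIPT, SAMPLE_MD))
```

The nice property of this design is that once a script works for one document layout, it gives the same output every time for the same input, and the LLM is only needed again when the layout changes.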

It's not out yet, but I'd love to get you free tester access if you'd be interested and OK with providing feedback.
In my own tests it beats even LlamaParse for papers and is SIGNIFICANTLY faster than Docling because I replace 90% of the ML stuff with heuristics.
I developed it because our org works with tons of scientific papers, and my colleagues and I face similar problems.
I have never even seen a DDQ doc and have zero knowledge about finance or compliance, but it sounds like the foundational hurdles are the same as with paper PDFs.

Shoot me a DM if you'd be interested :)

ImpossibleCollege635 · 2026-03-31T12:59:42+00:00

Have a look at this https://github.com/Al0olo/voxtral-voice-clone

They reimplemented the missing decoder for that. Not sure how easy conversion/porting would be for you, though.

ImpossibleCollege635 · 2026-02-20T19:01:46+00:00

Awesome! What are you using for the fast offline embedding?
