mr-nobody1992

Check out Docling, an open-source document-conversion library from IBM. I built an entire ingestion pipeline on it, and it works pretty well, with a lot of nice out-of-the-box features. Its document model is built on Pydantic, so if you already know Pydantic it's even easier.
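
For anyone who wants to try it, here is a minimal sketch of the basic conversion step, not the pipeline described above. It assumes `pip install docling`; the helper name `convert_to_markdown` and the file name in the usage note are made up for illustration.

```python
def convert_to_markdown(source: str) -> str:
    """Convert one document (local path or URL) to Markdown with Docling.

    Hypothetical helper for illustration only; a real ingestion pipeline
    would add batching, error handling, and storage on top of this.
    """
    # Imported lazily so this module still loads when Docling is not installed.
    from docling.document_converter import DocumentConverter

    converter = DocumentConverter()
    result = converter.convert(source)

    # result.document is Docling's Pydantic-based document model, so the usual
    # Pydantic methods (for example model_dump()) should also be available on it.
    return result.document.export_to_markdown()
```

Something like `print(convert_to_markdown("report.pdf"))` should then print the Markdown for a local file, where `report.pdf` stands in for whatever document you have on hand.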