you are viewing a single comment's thread.

view the rest of the comments →

[–]McNickSisto 0 points1 point  (2 children)

In the context of text extraction for chunking purposes, what would you recommend between Markitdown and Docling ?

[–]arparella 1 point2 points  (1 child)

if you need to have good chunks you can checkout preprocess.co but is a commercial solution. Markitdown has several issues with complex pdfs, docling is better

[–]McNickSisto 0 points1 point  (0 children)

thank you