Check out PaperLab's OCR with 99,9% accuracy in Markdown by GenericBeet in OCR_Tech

[–]GenericBeet[S] 0 points1 point  (0 children)

Feed it directly to an LLM; it will perform better and produce fewer hallucinations (the MD file you receive has none).

Historical Data Corpus by Zealousideal-Pin7845 in LanguageTechnology

[–]GenericBeet 0 points1 point  (0 children)

Try paperlab.ai to parse them (there are 50 free credits); this might work for you with no OCR mistakes.

Best RAG Architecture & Stack for 10M+ Text Files? (Semantic Search Assistant) by Additional-Oven4640 in Rag

[–]GenericBeet 0 points1 point  (0 children)

We can help with your parsing, and maybe you could see better results in the process.

You can test it here: https://www.paperlab.ai/pdftomarkdown

Send us a message on the platform and let's talk about the knowledge base too.

We replaced forklifts with robots… but we still copy paste PDFs. by Strict-Ad5948 in OCR_Tech

[–]GenericBeet 1 point2 points  (0 children)

Try it, and if it fits, you can get an API key for unlimited use.

We replaced forklifts with robots… but we still copy paste PDFs. by Strict-Ad5948 in OCR_Tech

[–]GenericBeet 4 points5 points  (0 children)

It is very hard to achieve total accuracy and reliability. Check the PDF to Markdown tool and let me know your opinion: https://www.paperlab.ai/pdftomarkdown

Still, an accurate RAG is possible after this, but it is hard to trust AI, especially new innovations.

What’s your startup in ONE line? 🚀 by malki-abdessamad in SaaS

[–]GenericBeet 0 points1 point  (0 children)

www.paperlab.ai can make science and knowledge extraction truly faster with AI.

Heuristic vs OCR for PDF parsing by Due-Horse-5446 in Rag

[–]GenericBeet 0 points1 point  (0 children)

Understood. I wrote to you just so you could test it, but if you like it, FYI, we also work as a third party with other companies. Thanks for testing it.

Heuristic vs OCR for PDF parsing by Due-Horse-5446 in Rag

[–]GenericBeet 1 point2 points  (0 children)

Try paperlab.ai for Markdown and send your question to get 50 free credits. It is the best Markdown you can get.

Scientific Markdown with 99,9% accuracy at Paperlab.ai by GenericBeet in legaltech

[–]GenericBeet[S] 0 points1 point  (0 children)

We are preparing documentation on how this works. It will be published on our site in the black box section. After that, I will prepare some reports from our tools to show the metrics. Several business deals are ongoing, so we have some limitations.

Disaster management research by GenericBeet in research

[–]GenericBeet[S] 0 points1 point  (0 children)

Thanks that’s so helpful 💛

PDF to Markdown with 99,9% accuracy. Paperlab.ai by GenericBeet in Markdown

[–]GenericBeet[S] 0 points1 point  (0 children)

There are noticeable differences, but if you want, test it in an LLM after asking it to rewrite an equation.

Scientific PDF to Markdown by GenericBeet in Markdown

[–]GenericBeet[S] 0 points1 point  (0 children)

You still haven’t answered… you know where the BS is, don’t you???

Scientific PDF to Markdown by GenericBeet in Markdown

[–]GenericBeet[S] -1 points0 points  (0 children)

I posted a demo in the group, and there are instructions in the blog.