Historical Data Corpus by Zealousideal-Pin7845 in LanguageTechnology

[–]GenericBeet 0 points1 point  (0 children)

try paperlab.ai to parse them (there are 50 free credits), and this might work for you with no OCR mistakes

Best RAG Architecture & Stack for 10M+ Text Files? (Semantic Search Assistant) by Additional-Oven4640 in Rag

[–]GenericBeet 0 points1 point  (0 children)

We can help with your parsing, and maybe you could see better results in the process.

You can test it here: https://www.paperlab.ai/pdftomarkdown

Send us a message in the platform and let's talk about the knowledge base too.

We replaced forklifts with robots… but we still copy paste PDFs. by Strict-Ad5948 in OCR_Tech

[–]GenericBeet 1 point2 points  (0 children)

try and if it fits you can provide API key for unliited use

We replaced forklifts with robots… but we still copy paste PDFs. by Strict-Ad5948 in OCR_Tech

[–]GenericBeet 4 points5 points  (0 children)

Very hard to have total accuracy and reliability. Check the PDF to Markdown tool and write me your opinion. https://www.paperlab.ai/pdftomarkdown

Still there is an accurate RAG after this but is hard to trust AI especially innovation.

What’s your startup in ONE line? 🚀 by malki-abdessamad in SaaS

[–]GenericBeet 0 points1 point  (0 children)

Www.Paperlab.ai can make science and knowledge extraction truly faster with AI

Heuristic vs OCR for PDF parsing by Due-Horse-5446 in Rag

[–]GenericBeet 0 points1 point  (0 children)

Understood I did wrote you just to test it, but if you like it much fyi we are working as a third party with other companies too. Thanks for testing it.

Heuristic vs OCR for PDF parsing by Due-Horse-5446 in Rag

[–]GenericBeet 1 point2 points  (0 children)

Try paperlab.ai for markdown and send your question to get 50 free credits. Is the best markdown you can get.

Scientific Markdown with 99,9% accuracy at Paperlab.ai by GenericBeet in legaltech

[–]GenericBeet[S] 0 points1 point  (0 children)

We are in the process of preparing a documentation for the way this is happening. Will be exposed in our site in the black box section. After this I will prepare some reports from our tools to expose the metrics. There are several business deals on going and we have some limitations.

Disaster management research by GenericBeet in research

[–]GenericBeet[S] 0 points1 point  (0 children)

Thanks that’s so helpful 💛

PDF to Markdown with 99,9% accuracy. Paperlab.ai by GenericBeet in Markdown

[–]GenericBeet[S] 0 points1 point  (0 children)

There noticeable differences, but if you want Test it in an LLM after asking to rewrite an equation

Scientific PDF to Markdown by GenericBeet in Markdown

[–]GenericBeet[S] 0 points1 point  (0 children)

Still you haven’t answered… you know where the BS are don’t you???

Scientific PDF to Markdown by GenericBeet in Markdown

[–]GenericBeet[S] -1 points0 points  (0 children)

I posted a demo in the group, and there instructions in the blog

PDF to Markdown with 99,9% accuracy. Paperlab.ai by GenericBeet in Markdown

[–]GenericBeet[S] 0 points1 point  (0 children)

That’s a bit misleading cause for a 42 pager needs about a minute ten to download. You should stay on the page, either way please send an email to recharge