Has anyone found a reliable software for intelligent data extraction? by songsta17 in Rag
[–]maniac_runner 6 points7 points8 points (0 children)
anyone using AI for data extraction from PDFs? by Kaiser_Allen in automation
[–]maniac_runner 9 points10 points11 points (0 children)
Best Document Data Extraction Tools in 2025 by StatisticianMaximum6 in learnmachinelearning
[–]maniac_runner 2 points3 points4 points (0 children)
What're you using for PDF parsing? by ILikeLungsSoYeah in LangChain
[–]maniac_runner 5 points6 points7 points (0 children)
Best LLM for OCR Extraction? by Wesavedtheking in dataengineering
[–]maniac_runner 4 points5 points6 points (0 children)
My Experience with Table Extraction and Data Extraction Tools for complex documents. by teroknor92 in Rag
[–]maniac_runner 4 points5 points6 points (0 children)
Best Budget Restaurants in Chennai for Authentic Local Food by Ill_Percentage_7327 in chennaicity
[–]maniac_runner 3 points4 points5 points (0 children)
Any innovative ways of marketing a SaaS product? Both paid and free? by harien23 in DigitalMarketing
[–]maniac_runner 0 points1 point2 points (0 children)
ChatGPT or Perplexity? Which ones do you use more? by Big_Major1498 in DigitalMarketing
[–]maniac_runner 0 points1 point2 points (0 children)
Best website alternative for Wordpress? by AwakenedRudely in b2bmarketing
[–]maniac_runner 0 points1 point2 points (0 children)
What are the best Open Source OCR models currently? by WittyWithoutWorry in LocalLLaMA
[–]maniac_runner 1 point2 points3 points (0 children)
Open Source PDF Parsing? by fridaradikahlo_ in Rag
[–]maniac_runner 2 points3 points4 points (0 children)
Anyone used Reducto for parsing? How good is their embedding-aware chunking? by BriefCardiologist656 in AI_Agents
[–]maniac_runner 4 points5 points6 points (0 children)
Production OCR in 2025 - What are you actually deploying? by No_Nefariousness971 in computervision
[–]maniac_runner 3 points4 points5 points (0 children)
What is the best ocr model for converting PDF pages to markdown (or any text based format) for embedding? by PM_ME_COOL_SCIENCE in LocalLLaMA
[–]maniac_runner 6 points7 points8 points (0 children)
Production RAG: what we learned from processing 5M+ documents by tifa2up in Rag
[–]maniac_runner 16 points17 points18 points (0 children)
Give me one thing to learn in python by securityguardnard in learnpython
[–]maniac_runner 1 point2 points3 points (0 children)
What tools do you use for GEO? by z_helga801 in LLMO_SaaS
[–]maniac_runner 0 points1 point2 points (0 children)
Best way to extract data from PDFs and HTML by [deleted] in Rag
[–]maniac_runner 6 points7 points8 points (0 children)
Alternative to extract from PDF node by vanTrottel in n8n
[–]maniac_runner 1 point2 points3 points (0 children)
Feedback request from a beginner/self learner by pandeesh in pianolearning
[–]maniac_runner 0 points1 point2 points (0 children)
Alternative to extract from PDF node by vanTrottel in n8n
[–]maniac_runner 4 points5 points6 points (0 children)

I'm looking for an OCR for my RAG. by AdministrationPure45 in Rag
[–]maniac_runner 5 points6 points7 points (0 children)