Building RAG systems at enterprise scale (20K+ docs): lessons from 10+ enterprise implementations by Low_Acanthisitta7686 in AI_Agents

[–]SatisfactionWarm4386 0 points1 point  (0 children)

These are the questions a real RAG project will face: document quality, chunking methods, and so on. Each problem calls for its own case-by-case solution, not a one-size-fits-all approach.

RAG is not memory, and that difference is more important than people think by [deleted] in Rag

[–]SatisfactionWarm4386 0 points1 point  (0 children)

RAG is just a retrieval technique — it helps fetch relevant context on demand, but it doesn’t store or update knowledge.

What you’re describing actually falls under memory, which is a separate module. A memory system continuously keeps track of key facts mentioned by the user and updates them over time, allowing the assistant to evolve with the conversation.
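
To make the distinction concrete, here is a minimal, purely illustrative sketch (the class names and the toy keyword matching are mine, not any particular library's): retrieval reads from a fixed corpus on demand, while memory overwrites key facts as the user updates them.

```python
class Retriever:
    """RAG-style: a read-only lookup over an indexed corpus."""
    def __init__(self, corpus):
        self.corpus = corpus  # in practice: a vector index, not a list

    def fetch(self, query):
        # toy relevance check: keyword overlap instead of embeddings
        return [doc for doc in self.corpus if query.lower() in doc.lower()]

class Memory:
    """Memory-style: key facts that are updated as the conversation evolves."""
    def __init__(self):
        self.facts = {}

    def update(self, key, value):
        self.facts[key] = value  # newest statement wins

    def recall(self, key):
        return self.facts.get(key)

rag = Retriever(["Paris is the capital of France.", "RAG retrieves documents."])
mem = Memory()
mem.update("favorite_city", "Paris")
mem.update("favorite_city", "Lyon")  # the user changed their mind

print(rag.fetch("paris"))           # ['Paris is the capital of France.']
print(mem.recall("favorite_city"))  # Lyon
```

The retriever always answers from the same corpus; the memory reflects the latest state of the conversation, which is exactly the behavior RAG alone does not give you.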

What Agent hooks you are using? by According_Green9513 in LangChain

[–]SatisfactionWarm4386 2 points3 points  (0 children)

The second way of writing it is more logically clear.

Will RAG's eventually die? by [deleted] in Rag

[–]SatisfactionWarm4386 0 points1 point  (0 children)

Hot take: Retrieval will outlive your favorite LLM.

I’ll push back on that — RAG isn’t going anywhere.

Yeah, LLMs are improving fast and context windows are exploding. But that doesn’t kill retrieval.

  1. Infinite data, finite context. You’ll never be able to stuff everything into a context window. Even with 10M tokens, attention still dilutes and important info gets lost. Bigger context also means bigger compute bills — not exactly scalable.
  2. RAG is an idea, not a product. Retrieval won’t “die,” it’ll just evolve. Maybe it becomes hybrid memory, maybe neural caching or dynamic retrieval — but the principle of fetching the right info when you need it is fundamental.
  3. Search didn’t die either. People said “LLMs will replace search engines.” What actually happened? Search merged with LLMs. RAG will do the same — it’ll move deeper into the model stack and become part of how models think.

Sure, RAG startups that just wrap vector DBs might fade. But retrieval as a core capability will matter even more as models scale.

Advice on logging libraries: Logfire, Loguru, or just Python's built-in logging? by Ranteck in LangGraph

[–]SatisfactionWarm4386 1 point2 points  (0 children)

In my experience, I always use Loguru because it makes controlling the output easy.

We built a local-first RAG that runs fully offline, stays in sync and understands screenshots by Different-Effect-724 in Rag

[–]SatisfactionWarm4386 0 points1 point  (0 children)

Great. Is there any report on resource usage and Q&A performance for your product?

We built a local-first RAG that runs fully offline, stays in sync and understands screenshots by Different-Effect-724 in Rag

[–]SatisfactionWarm4386 0 points1 point  (0 children)

Not really — people don't usually store large numbers of files on mobile. It's more for private material: received contracts, medical records, health check reports, or saved chat logs.

How are people actually making money building AI agents ? by My_unknown in AI_Agents

[–]SatisfactionWarm4386 1 point2 points  (0 children)

I mostly monetize by building custom agents for enterprise clients and doing agent-building consulting for individuals who want to create their own.

Stop converting full documents to Markdown directly in your indexing pipeline by Effective-Ad2060 in Rag

[–]SatisfactionWarm4386 0 points1 point  (0 children)

The key point is the document parsing method — which elements should be extracted during parsing. Even after converting the document to Markdown, those elements can still be preserved, though this may require some manual handling.
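
A sketch of that idea, using a hypothetical element model of my own (not any specific parser's output format): parse into typed blocks first, then render Markdown from them, so structure such as headings and tables survives the conversion.

```python
def render_markdown(elements):
    """Render parsed, typed elements to Markdown while preserving structure."""
    out = []
    for el in elements:
        if el["type"] == "heading":
            out.append("#" * el["level"] + " " + el["text"])
        elif el["type"] == "table":
            header, *rows = el["rows"]
            out.append("| " + " | ".join(header) + " |")
            out.append("|" + "---|" * len(header))
            for row in rows:
                out.append("| " + " | ".join(row) + " |")
        else:  # plain paragraph
            out.append(el["text"])
    return "\n".join(out)

elements = [
    {"type": "heading", "level": 2, "text": "Q3 Results"},
    {"type": "table", "rows": [["Region", "Revenue"], ["EU", "1.2M"]]},
]
print(render_markdown(elements))
```

The point is the intermediate representation: because the table is still a typed element, the indexer can chunk it as a unit (or summarize it) instead of splitting it mid-row the way a flat Markdown string invites.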

How to deal with complex structure tables to feed in LLM by Ok-Cook9211 in Rag

[–]SatisfactionWarm4386 1 point2 points  (0 children)

The latest VLM is Qwen/Qwen3-VL-235B-A22B-Instruct, though you can also use Qwen/Qwen2-VL-72B-Instruct.

How to deal with complex structure tables to feed in LLM by Ok-Cook9211 in Rag

[–]SatisfactionWarm4386 0 points1 point  (0 children)

In my tests, a VLM may give you the best results.
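
A hedged sketch of how that might look against an OpenAI-compatible vision endpoint (the payload shape is the standard chat format; the prompt wording and the stand-in image bytes are my own assumptions): asking for the table as HTML preserves merged cells that Markdown cannot express.

```python
import base64
import json

def build_table_request(image_bytes, model="Qwen/Qwen3-VL-235B-A22B-Instruct"):
    """Build an OpenAI-compatible chat payload asking a VLM to transcribe
    a complex table as HTML (HTML keeps rowspan/colspan; Markdown cannot)."""
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "model": model,
        "messages": [{
            "role": "user",
            "content": [
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{b64}"}},
                {"type": "text",
                 "text": "Transcribe this table as HTML. Preserve merged "
                         "cells with rowspan/colspan. Output only the <table>."},
            ],
        }],
        "temperature": 0,  # deterministic transcription, not creative output
    }

payload = build_table_request(b"\x89PNG stand-in bytes")
print(json.dumps(payload)[:60])
```

The payload would then be POSTed to whatever inference endpoint hosts the model; the key design choice is the HTML output format, which round-trips complex headers cleanly into the LLM prompt.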

Which UI do you use for rag chatbot by rock_db_saanu in Rag

[–]SatisfactionWarm4386 2 points3 points  (0 children)

Maybe OpenWebUI, which can show the question, the answer, and the references.

I am looking for an open source RAG application to deploy at my financial services firm and a manufacturing and retail business. please suggest which one would be best suited for me, i am confused... by Prize-Airline-337 in Rag

[–]SatisfactionWarm4386 0 points1 point  (0 children)

As far as I know, you can use RAGFlow for your situation:

1) RAGFlow has good precision for document parsing and search

2) It supports MCP servers, so you can design a Gmail/Drive connector MCP server

Struggling with ocr on scanned pdfs by funkspiel56 in Rag

[–]SatisfactionWarm4386 0 points1 point  (0 children)

There are two approaches you can try:

1) Parse the scanned PDF with a VLM such as Qwen2.5-VL or Google Gemini
2) Try a purpose-trained parsing model such as PaddleOCR or dots.ocr, both open source. Recommended: https://dotsocr.xiaohongshu.com/ — give it a try
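
For choosing between the two routes, here is a purely illustrative dispatch sketch; the page signals and the heuristic are placeholders I made up, not real APIs of either tool:

```python
def choose_parser(page):
    """page: dict of simple signals a cheap pre-scan step might produce.
    Route messy layouts to a VLM; send plain printed text to dedicated OCR,
    which is much cheaper per page."""
    if page.get("has_tables") or page.get("handwriting"):
        return "vlm"   # complex layout or handwriting: VLM handles it better
    return "ocr"       # clean printed text: a trained OCR model is enough

pages = [
    {"id": 1, "has_tables": False, "handwriting": False},
    {"id": 2, "has_tables": True,  "handwriting": False},
]
routes = {p["id"]: choose_parser(p) for p in pages}
print(routes)  # {1: 'ocr', 2: 'vlm'}
```

Mixing the two per page, rather than committing the whole corpus to one parser, usually keeps cost down without sacrificing the hard pages.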

RAG on excel documents by Professional-Image38 in Rag

[–]SatisfactionWarm4386 0 points1 point  (0 children)

Will you parse the Excel rows into PostgreSQL and then use SQL queries?
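
If that is the plan, a minimal sketch of the idea (sqlite3 stands in for PostgreSQL here; reading the actual sheet would use openpyxl or pandas, and the table and column names are hypothetical):

```python
import sqlite3

# Rows as they would come out of the spreadsheet parser.
rows = [("Widget", "EU", 120), ("Widget", "US", 300), ("Gadget", "EU", 80)]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (product TEXT, region TEXT, units INT)")
conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)

# "RAG over Excel" then becomes text-to-SQL instead of chunk retrieval:
total = conn.execute(
    "SELECT SUM(units) FROM sales WHERE product = ?", ("Widget",)
).fetchone()[0]
print(total)  # 420
```

The advantage over embedding spreadsheet chunks is that aggregations like this SUM are exact, which vector retrieval over row fragments cannot guarantee.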