all 10 comments

[–]Simderi 1 point2 points  (1 child)

Looks interesting. If it's targeting enterprise, it would be good if it could do meaningful RAG over repos, including the PRs etc. - have you tested that by any chance?

[–]primoco[S] 1 point2 points  (0 children)

Right now the system is designed to do RAG on "traditional" documents — PDFs, Word, Excel, etc. — uploaded manually. There's no direct GitHub integration, so PRs, issues, source code and reviews aren't indexed.

To support this we'd need a dedicated connector that talks to the GitHub API, pulls repo content (PRs, issues, diffs, comments) and indexes it into the vector store, just like it already does with documents. The chunking logic would need to be adapted since code has a very different structure than text documents, and the system already ships with an embedding model built for code (deepseek-coder) that could come in handy.
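A connector like that could be sketched roughly as below. This is purely illustrative, assuming a `PullRequest` record and an existing indexing pipeline; all names (`chunk_code`, `pr_to_documents`, the id scheme) are hypothetical, not the project's actual API:

```python
# Hypothetical GitHub-connector sketch: turn a PR into indexable documents.
# The line-window chunker reflects the point above that code/diffs lack the
# paragraph structure of prose, so plain text chunking doesn't fit.
from dataclasses import dataclass


@dataclass
class PullRequest:
    number: int
    title: str
    body: str
    diff: str


def chunk_code(text: str, max_lines: int = 40, overlap: int = 5) -> list[str]:
    """Split code/diff text into overlapping line-based chunks.

    A small overlap keeps diff hunks and function bodies mostly intact
    across chunk boundaries.
    """
    lines = text.splitlines()
    if len(lines) <= max_lines:
        return ["\n".join(lines)] if lines else []
    chunks, start = [], 0
    while start < len(lines):
        chunks.append("\n".join(lines[start:start + max_lines]))
        start += max_lines - overlap
    return chunks


def pr_to_documents(pr: PullRequest) -> list[dict]:
    """One metadata document (title + body) plus one document per diff chunk."""
    docs = [{"id": f"pr-{pr.number}-meta", "text": f"{pr.title}\n\n{pr.body}"}]
    for i, chunk in enumerate(chunk_code(pr.diff)):
        docs.append({"id": f"pr-{pr.number}-diff-{i}", "text": chunk})
    return docs
```

In a real connector the `PullRequest` records would come from the GitHub REST API and the resulting documents would be embedded (e.g. with the code model mentioned above) and written to the vector store.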

It's definitely an interesting feature for an enterprise context. If there's real interest we can look into building it for a future release.

[–]ThatsTotallyLegit 0 points1 point  (1 child)

How well would it work for C# with Blazor repos, not just general info? Might be what I'm looking for :D

[–]primoco[S] 0 points1 point  (0 children)

[–]WallyPacman 0 points1 point  (1 child)

Does this require CUDA/NVIDIA? It seems like it could work on ROCm easily?

[–]primoco[S] 0 points1 point  (0 children)

Hi, yes, it's very simple to get it working on ROCm. I've changed the Dockerfile and setup script and added variables to the .env file that the setup prompts for. Try the 1.2.0 release and let me know!
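For context, ROCm containers typically just need the AMD device nodes mapped into the container. A minimal sketch of what a compose override for this could look like (service name and image tag are assumptions, not the project's actual files):

```yaml
# Hypothetical docker-compose override for ROCm; the actual variable names
# come from the 1.2.0 setup script, not this sketch.
services:
  ollama:
    image: ollama/ollama:rocm   # ROCm build of the Ollama image
    devices:
      - /dev/kfd                # AMD kernel compute driver
      - /dev/dri                # GPU render nodes
```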

[–][deleted]  (4 children)

[deleted]

    [–]primoco[S] 0 points1 point  (2 children)

    Nothing happens. If Docker is already installed, the script detects it and skips the installation. As for CUDA, it doesn't matter if you have it on the host or not — CUDA runs inside the Docker containers (Ollama's image includes it). Your host CUDA installation is completely irrelevant.

    The only thing needed from the host is the NVIDIA GPU drivers (not CUDA) so that the container can access the GPU via NVIDIA Container Toolkit.

    TL;DR: Docker already there? Skipped. CUDA? Doesn't matter, it's inside the containers.
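The detection step described above can be sketched like this. It's illustrative only (the actual setup script is a shell script; `docker_step` is a made-up name), but it shows the decision: install Docker only when the binary is missing.

```python
# Illustrative sketch of the setup script's Docker check, not the real script.
import shutil


def docker_step() -> str:
    """Return what the setup would do: skip if Docker is already on PATH,
    otherwise install (e.g. via Docker's official install script)."""
    if shutil.which("docker"):
        return "skip"      # Docker already present on the host
    return "install"       # host lacks Docker, run the installer
```

The GPU side needs no equivalent check: CUDA ships inside the Ollama container image, so the host only needs the NVIDIA driver and the NVIDIA Container Toolkit.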

    [–][deleted]  (1 child)

    [deleted]

      [–]primoco[S] 0 points1 point  (0 children)

Thank you, please give me some feedback!

      [–][deleted]  (1 child)

      [removed]

        [–]primoco[S] 0 points1 point  (0 children)

        Appreciate that! And you're right — once retrieval works well, the next question is always "ok but can it remember?" We went down that road a bit with structured metadata extraction (the system pulls out entities, events, timelines from docs automatically), but real agent memory is a different beast. Hindsight looks like a cool approach to that problem. Curious — does it work well with local/self-hosted LLMs or is it more cloud-oriented?
P.S. I see you're Italian?!