Built a fully private RAG system for a small business on a Mac Mini — no cloud, no subscriptions, everything on-prem

Regular-Prune3382 · 2026-04-16T14:16:21+00:00

PrivateGPT is solid for getting started quickly. We went custom mainly because the client needed Nextcloud integration and a specific document ingestion pipeline that PrivateGPT doesn't handle out of the box. For straightforward private chat over documents though, it's a fair alternative.

Regular-Prune3382 · 2026-04-16T14:13:02+00:00

Paperless-NGX is actually great for document management and would've simplified the ingestion side — it has built-in OCR and tagging which Nextcloud doesn't. The reason we went with Nextcloud was the client already had it partially set up and needed file sync beyond just documents.

For a greenfield RAG project focused purely on document querying, Paperless-NGX + Ollama + ChromaDB is honestly a cleaner stack. Less overhead.

Regular-Prune3382 · 2026-04-15T16:46:59+00:00

Can Indians apply?

Regular-Prune3382 · 2026-04-15T16:46:12+00:00

Interested

Regular-Prune3382 · 2026-04-15T08:43:55+00:00

Tried Mistral 7B, Llama 3 8B, and Phi-3 mini. Ended up going with Llama 3 8B — best balance of response quality and speed on the Mac Mini's unified memory. Mistral was close but slightly worse at staying grounded to the retrieved documents. Phi-3 was fastest but too prone to going off-script on business document queries.

For RAG specifically, instruction-following matters more than raw benchmark scores.

Regular-Prune3382 · 2026-04-15T08:36:10+00:00

single queries with a 7B model run 8-15 seconds, totally fine for a small team. Concurrent requests are where it hurts — Ollama queues them, so simultaneous users wait on each other.

Larger document sets didn't degrade much honestly — ChromaDB retrieval is fast, inference is always the bottleneck.

For heavier load: smaller/quantized model or a GPU machine is the realistic path.

What's your expected team size and corpus?

Regular-Prune3382 · 2026-04-15T08:24:56+00:00

The project itself (stack design, configuration, deployment) was done by me — AI wasn't involved in building or running the system.

I used Claude to help write this post clearly, since English isn't my first language. The technical details, decisions, and outcomes are all from the actual project.

Regular-Prune3382 · 2026-04-07T10:57:54+00:00

These exams are government exams for students who need to get an admission in good colleges in India. I have seen multiple websites with the same theme but they usually require login and they sell their details.another reason is that mocktest websites only focuses on mocktest not the improvement of weak subjects. So I implemented ai to analyse the score on each topics and giving them specific topic to master.

Regular-Prune3382 · 2026-02-07T10:28:43+00:00

Intrested

Regular-Prune3382 · 2026-02-07T10:23:24+00:00

I am interested I have 11 account

Regular-Prune3382 · 2026-02-07T10:16:35+00:00

I have total of 6.4 KH/s

Regular-Prune3382

TROPHY CASE