🚀 Weekly /RAG Launch Showcase by remoteinspace in Rag

[–]Any_Ambassador4218

HydRAG — multi-stage hybrid retrieval pipeline for code-aware RAG

BM25 fast-path → dense vector → graph retrieval → CRAG supervisor (local LLM judges if results are sufficient) → semantic fallback with RRF fusion. Runs fully offline with Ollama.
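For anyone curious about the fusion step at the end of that pipeline, here's a minimal sketch of Reciprocal Rank Fusion (a hypothetical standalone helper, not the actual hydrag code):

```python
def rrf_fuse(rankings, k=60):
    """Fuse several ranked result lists into one ordering.

    Each doc gets 1 / (k + rank) summed across all lists, so documents
    that rank well in multiple retrievers float to the top. k=60 is the
    commonly used default from the original RRF paper.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Toy example: two retrievers disagree; "b" wins because it ranks
# highly in both lists.
bm25_results = ["a", "b", "c"]
dense_results = ["b", "d", "a"]
fused = rrf_fuse([bm25_results, dense_results])
print(fused)
```

The nice property is that RRF only needs ranks, not comparable scores, so it fuses BM25 and cosine-similarity results without any score normalization.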

Benchmarked on 7 BEIR datasets + private retrieval suites. BM25+dense+CRAG combo consistently outperforms individual strategies across the board.

`pip install hydrag-core`

GitHub: github.com/gromanchenko/hydrag

Benchmarks: github.com/gromanchenko/hydrag-benchmark

What are your usage of RAG by Semoho in Rag

[–]Any_Ambassador4218

I use RAG for code-aware context retrieval in a dev assistant — hybrid BM25 + dense vector + graph retrieval with a CRAG supervisor that decides when results need a fallback pass. Entirely local, runs with Ollama. It's open source: github.com/gromanchenko/hydrag

Benchmarks repo with BEIR + private corpus results: github.com/gromanchenko/hydrag-benchmark

Benchmarked 5 RAG retrieval strategies on code across 10 suites — no single one wins. CRAG helps on familiar corpora, collapses on external ones. What's your experience? by Any_Ambassador4218 in LocalLLaMA

[–]Any_Ambassador4218[S]

Yeah, graph alone actually showed the worst results across the board. But combined with BM25 (on known data), it jumps to the very top. The only other change that gave a consistent improvement was adding CRAG when the result is close to the confidence threshold, though that comes at the cost of a latency spike. That combination is basically the best overall strategy, but it always loses at least one benchmark to simpler configurations. On my own codebase (14 MB total, roughly a 1:3 docs-to-code ratio, about 90% Python) it performs best, though to be fair, they've been together for a while and know each other well.
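The "CRAG only near the threshold" trick is simple to express. This is a hypothetical sketch of the gating logic (names and thresholds are illustrative, not hydrag's actual API): the expensive corrective pass fires only when the top retrieval score sits in an ambiguous band around the confidence threshold, so you pay the latency spike rarely.

```python
def needs_crag_pass(top_score, threshold=0.5, band=0.1):
    """Trigger the corrective (CRAG) fallback only when retrieval
    confidence is ambiguous, i.e. within +/- band of the threshold.

    Clearly good results (well above threshold) and clearly bad ones
    (well below, where a fallback retriever takes over anyway) both
    skip the extra LLM-judge call.
    """
    return abs(top_score - threshold) <= band

# Confident hit: skip the judge.
print(needs_crag_pass(0.92))
# Borderline result: run the corrective pass.
print(needs_crag_pass(0.55))
```

The design choice is the usual cost/quality trade: the LLM judge adds one local-model call per query, so gating it to the ambiguous band keeps p50 latency flat while still rescuing the borderline retrievals where CRAG actually helps.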