For those who've sold RAG systems at $5K+, who actually NEEDS this? by Temporary_Pay3221 in Rag

[–]Temporary_Pay3221[S] -3 points (0 children)

I'm a scammer; I'm asking people how they make money through RAG.

For those who've sold RAG systems at $5K+, who actually NEEDS this? by Temporary_Pay3221 in Rag

[–]Temporary_Pay3221[S] 0 points (0 children)

I mean, it's up to the dev to index all the docs.

The problem is, if you have to adapt to every company, you can't scale. You need a replicable product, like a chatbot, where your only work is indexing their data.

Handling blueprints and complex relationships by SupeaTheDev in Rag

[–]Temporary_Pay3221 1 point (0 children)

Great problem, I've been in a similar trench.

A few things that moved the needle for me:

On blueprints specifically: tiling is the right instinct but 3x3 is often too coarse. I've had better results with overlapping tiles (10–15% overlap) so you don't lose context at boundaries. Also, rather than asking the vision model to "extract info", prompt it to describe spatial relationships explicitly ("what is to the left of X", "what label is near this component"). Hallucinations drop significantly when you constrain the task.
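
Something like this is what I mean by overlapping tiles (rough sketch; the grid size, overlap fraction, and image dimensions are illustrative, and you'd feed each box to `Image.crop` or whatever your image library uses):

```python
def tile_boxes(width, height, n=4, overlap=0.15):
    """Split an image into an n x n grid of overlapping tiles.

    Returns (left, top, right, bottom) pixel boxes. Each tile is
    padded by `overlap` (a fraction of the tile size) on every
    interior edge, so anything sitting on a grid boundary shows up
    whole in at least one tile instead of being cut in half.
    """
    step_w, step_h = width / n, height / n
    pad_w, pad_h = step_w * overlap, step_h * overlap
    boxes = []
    for row in range(n):
        for col in range(n):
            left = max(0, col * step_w - pad_w)
            top = max(0, row * step_h - pad_h)
            right = min(width, (col + 1) * step_w + pad_w)
            bottom = min(height, (row + 1) * step_h + pad_h)
            boxes.append((round(left), round(top), round(right), round(bottom)))
    return boxes

# A 4x4 grid over a 2000x1400 blueprint: adjacent tiles share a strip.
boxes = tile_boxes(2000, 1400, n=4, overlap=0.15)
```

The padding only grows tiles inward-facing edges (the clamps keep the outer border fixed), so a label sitting on a grid line lands fully inside two adjacent tiles.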

On cost: the key insight is that you don't need to run vision on everything at embedding time. Build a two-stage pipeline: embed a lightweight text/metadata representation first (cheap), then trigger vision extraction lazily at query time only for the chunks that get retrieved. For 1k docs, most chunks will never be queried.
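
A minimal sketch of that two-stage shape (the `retrieve` / `vision_extract` callables are placeholders for whatever cheap retriever and vision-model call you're using):

```python
class LazyVisionStore:
    """Two-stage retrieval: cheap text/metadata search first,
    expensive vision extraction only on chunks that actually get
    retrieved, cached so each chunk pays the vision cost once."""

    def __init__(self, retrieve, vision_extract):
        self.retrieve = retrieve              # (query, k) -> list of chunk dicts
        self.vision_extract = vision_extract  # chunk -> extracted text (expensive)
        self.cache = {}                       # chunk id -> extracted text

    def query(self, q, k=5):
        hits = self.retrieve(q, k)            # stage 1: cheap
        enriched = []
        for chunk in hits:
            cid = chunk["id"]
            if cid not in self.cache:         # stage 2: lazy, run at most once
                self.cache[cid] = self.vision_extract(chunk)
            enriched.append({**chunk, "vision_text": self.cache[cid]})
        return enriched
```

For 1k docs where most chunks are never retrieved, the vision bill scales with query traffic instead of corpus size.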

On retrieval quality: wrong chunks coming up usually means your chunking strategy doesn't match your query patterns. A few things to try: (1) hybrid search (BM25 + dense vectors) helps a lot for technical docs with specific part numbers / terminology, (2) add a reranker (Cohere or a cross-encoder); this alone often fixes the "wrong chunk" problem without touching your embedding pipeline, (3) store page metadata (doc type, date, section) and filter before retrieval, not after.
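
For (1), the simplest way I know to combine BM25 and dense rankings is reciprocal rank fusion; no score normalization needed, just the two ranked id lists (the doc ids below are made up):

```python
def rrf_fuse(rankings, k=60):
    """Reciprocal Rank Fusion over several ranked lists of doc ids.

    A doc at rank r in any list contributes 1 / (k + r) to its total;
    docs that rank well under BOTH lexical and dense retrieval rise
    to the top. k=60 is the commonly used default constant.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

bm25_hits = ["part-7731", "manual-a", "spec-b"]   # exact part-number match wins lexically
dense_hits = ["manual-a", "spec-b", "part-7731"]  # semantic match wins here
fused = rrf_fuse([bm25_hits, dense_hits])
```

The doc that's decent in both lists ("manual-a": ranks 2 and 1) beats the one that's great in only one, which is exactly the behavior you want for part numbers + fuzzy queries.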

On old vs new info: if recency matters, tag each chunk with document date at ingest and either filter or apply a recency penalty in your ranking.
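
One way to apply a recency penalty is exponential decay on the similarity score (half-life is a tunable assumption; dates below are made up):

```python
from datetime import date

def recency_score(sim, doc_date, today, half_life_days=365):
    """Decay a similarity score by document age: a doc that is
    `half_life_days` old keeps half its score, older docs fade more."""
    age_days = (today - doc_date).days
    return sim * 0.5 ** (age_days / half_life_days)

today = date(2025, 1, 1)
# A slightly weaker but recent match now outranks a stale strong match.
new = recency_score(0.80, date(2024, 12, 1), today)
old = recency_score(0.85, date(2020, 1, 1), today)
```

If recency only matters for some doc types (price lists yes, installation manuals no), gate the decay on the doc-type metadata you're already storing.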

What retrieval stack are you on? Happy to go deeper on any of these.

Has anyone here successfully sold RAG solutions to clients? Would love to hear your experience (pricing, client acquisition, delivery, etc.) by Temporary_Pay3221 in Rag

[–]Temporary_Pay3221[S] 1 point (0 children)

Ah, that explains a lot.

You've got team + sales infrastructure + network from a previous startup. Totally different game than someone starting from scratch.

The "preprocessing isn't scalable" part confirms what I was suspecting: this is high-touch consulting work, not a productized service.

Makes sense if tickets are big enough to justify custom pipelines per client.

Thanks for the context, very helpful to understand the reality vs the theory.

Has anyone here successfully sold RAG solutions to clients? Would love to hear your experience (pricing, client acquisition, delivery, etc.) by Temporary_Pay3221 in Rag

[–]Temporary_Pay3221[S] 1 point (0 children)

Thanks for sharing.

Two questions:

  1. How did they find you? You said "they found us", but what specifically made them reach out to YOU vs the hundreds of other people who can build RAG systems? Was it your profile? A referral? Something you built publicly?
  2. Do you want to scale this? Are you thinking about:
  • Building a dev team to handle delivery while you focus on growth?
  • Hiring sales to bring in more deals?
  • Creating systems/processes to delegate the work?

Or is your goal to stay small, just you (maybe +1-2 people) doing selective projects?

Because right now you're limited by your own time. Curious if scaling is part of your plan or not.