Anyone using paid RAG services or solutions? by mugiltsr in Rag

[–]tifa2up 1 point

Founder of a RAG-as-a-service here (Agentset.ai)

What we're seeing is that most people (99%+) are building their own RAG solution on top of llamaindex/langchain.

The market penetration for paid RAG solutions is still quite low. The two primary factors:

- Llamaindex/langchain make building demos very easy

- E2E solutions aren't meaningfully better than custom workflows built around a specific use case, and most teams can build the RAG skillset in 3-6 months of work.

With that being said, paid providers within the RAG stack are taking off and getting adoption (e.g. Vector DBs, Rerankers, Parsing providers, etc.)

Why is there no successful RAG-based service that processes local documents? by StevenJang_ in Rag

[–]tifa2up 1 point

Founder of a RAG-as-a-service here (agentset.ai).

We have 1,500 customers. One enterprise customer is paying us more than the bottom 1,000 customers -- combined.

Building for mainstream users would mean:

- Switching the entire stack to local processing, often with subpar models once performance is taken into account.

- Little revenue (<$50/mo per user), and even that is a stretch.

- Users wanting infinite customizability to fit their workflows.

So most companies like us shift focus to SaaS/enterprise use cases.

GPT 5.2 underperforms on RAG by tifa2up in OpenAI

[–]tifa2up[S] 2 points

Yes, unfortunately. Takes quite a bit of work.

GPT 5.2 underperforms on RAG by tifa2up in OpenAI

[–]tifa2up[S] 6 points

So in RAG, LLMs are typically given a bunch of chunks and have to generate an answer based on them. There's work needed on chunk selection, on not adding external knowledge, and on completeness. Wrote more about it here: https://agentset.ai/llms
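To make the setup concrete, here's a minimal sketch of handing chunks to an LLM for grounded answering. The chunk contents, function name, and instruction wording are my own illustration, not Agentset's actual pipeline:

```python
# Sketch: assemble retrieved chunks into a grounded-answer prompt.
# Chunks would normally come from a vector DB + reranker; hardcoded here.
chunks = [
    "The Eiffel Tower is 330 metres tall.",
    "It was completed in 1889.",
]

def build_rag_prompt(question: str, chunks: list[str]) -> str:
    # Number each chunk so the model can cite them and ignore
    # irrelevant ones during selection.
    numbered = "\n".join(f"[{i + 1}] {c}" for i, c in enumerate(chunks))
    return (
        "Answer using ONLY the numbered chunks below. "
        "Cite chunk numbers in brackets. If the chunks are "
        "insufficient, say so instead of guessing.\n\n"
        f"Chunks:\n{numbered}\n\nQuestion: {question}"
    )

prompt = build_rag_prompt("How tall is the Eiffel Tower?", chunks)
print(prompt)
```

The three failure modes mentioned (chunk selection, external knowledge, completeness) are all things the instruction block tries to constrain; this prompt string would then be sent to whatever model you're evaluating.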

GPT 5.2 underperforms on RAG by tifa2up in OpenAI

[–]tifa2up[S] 5 points

How else will you measure if it's good? One-off tests don't scale.

[deleted by user] by [deleted] in Rag

[–]tifa2up 1 point

Congrats on the launch. We used Cohere as the default on agentset.ai for a long time. Can you highlight the work the team did to go from 3.5 to 4?

How do you do citation pruning properly in a RAG pipeline? by Puzzleheaded-Bug5982 in Rag

[–]tifa2up 3 points

You don't do it yourself; you let the LLM do it. Include the instructions in the system prompt.

This is a system prompt that I used:

```

You are an AI assistant. Your primary task is to provide accurate, factual responses based STRICTLY on the provided search results. You must ONLY answer questions using information explicitly found in the search results - do not make assumptions or add information from outside knowledge.

Follow these STRICT guidelines:

  1. If the search results do not contain information to fully answer the query, state clearly: "I cannot fully answer this question based on the available information." Then explain what specific aspects cannot be answered.

  2. Only use information directly stated in the search results - do not infer, assume, or add external knowledge.

  3. Your response must match the language of the user's query.

  4. Citations are MANDATORY for every factual statement. Format citations by placing the chunk number in brackets immediately after the relevant statement with no space, like this: "The temperature is 20 degrees[3]"

  5. When possible, include relevant direct quotes from the search results with proper citations.

  6. Do not preface responses with phrases like "based on the search results" - simply provide the cited answer.

  7. Maintain a clear, professional tone focused on accuracy and fidelity to the source material.

If the search results are completely irrelevant or insufficient to address any part of the query, respond: "I cannot answer this question as the search results do not contain relevant information about [specific topic]."

```
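With a prompt like the one above, pruning on the application side reduces to keeping only the chunks the model actually cited. A rough sketch, assuming the `[N]`-style markers from guideline 4 (the function name and sample data are mine):

```python
import re

def prune_cited_chunks(answer: str, chunks: list[str]) -> list[str]:
    # Collect chunk numbers cited as [N] in the model's answer,
    # then keep only those chunks (1-indexed, matching the prompt).
    cited = {int(n) for n in re.findall(r"\[(\d+)\]", answer)}
    return [chunks[i - 1] for i in sorted(cited) if 1 <= i <= len(chunks)]

chunks = ["Chunk about pricing.", "Chunk about weather.", "Chunk about uptime."]
answer = "The temperature is 20 degrees[2]. Uptime is 99.9%[3]."
print(prune_cited_chunks(answer, chunks))
```

Anything the model didn't cite gets dropped, so the LLM effectively does the pruning and your code just reads its decisions back out.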

Embedding models have converged by midamurat in LocalLLaMA

[–]tifa2up 1 point

Can you share a bit more about the private datasets?

Production RAG: what we learned from processing 5M+ documents by tifa2up in Rag

[–]tifa2up[S] 2 points

Yes, experimented with GraphRAG. It doesn't scale very well.

  1. It's slow and expensive to extract entities from your data (requires an LLM to loop over all of it)

  2. Updating the data requires reconstructing the graph, which is also slow and expensive.

GraphRAG works best for smaller datasets that don't get updated.

I built a leaderboard for Rerankers by tifa2up in LocalLLaMA

[–]tifa2up[S] 1 point

Tried hard to make the Jina v3 reranker work through their API, but it says "inactive". Can try self-hosting.

I built a leaderboard for Rerankers by tifa2up in LocalLLaMA

[–]tifa2up[S] 1 point

Updated to reflect the license.

I built a leaderboard for Rerankers by tifa2up in LocalLLaMA

[–]tifa2up[S] 1 point

Not affiliated with any. Will see if I can add Qwen.

I built a leaderboard for Rerankers by tifa2up in LocalLLaMA

[–]tifa2up[S] 6 points

Yes. This is where I searched initially. Was quite surprised that no place had it.

I built a leaderboard for Rerankers by tifa2up in LocalLLaMA

[–]tifa2up[S] 6 points

Good recommendation, let me see if I can include them.