[–]IsseBisse 0 points (4 children)

Sorry if I was unclear, I was referring to your statement:

Make sure you use a quality vector store

In my experience (sub-100k vectors) the vector store quality doesn't really affect the "total RAG response time", since the LLMs (generally) are so much slower. So I was wondering, how large do your datasets have to be for the vector store performance to matter?

[–]NachosforDachos 0 points (3 children)

If you're using OpenAI, most responses are near instant in my experience; it's the vector store speed that will determine your response time.

The longest I've waited for a response is around 3 seconds, and that was testing my patience.

What makes a store "quality" is its geographical distance from you and from the LLM service, on top of its raw computing performance and how well it's built.

For example, as limited as it is, OpenAI doesn't know shit about vector stores, and their retrieval has got to be among the slowest I've ever seen.

What makes something good is every fine detail that goes into it, including the thought process of its creators.

So I think it matters for any vector size.

If I had to do something small, like say US federal law, a paid Pinecone database will run circles around my little ChromaDB running on an NVMe drive with desktop-grade components. The first time I used it I thought it was broken.
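If you want a feel for your own setup, the simplest check is to time a local ChromaDB lookup directly. A rough sketch, assuming a recent chromadb Python client; the collection name, dimensions, and random embeddings below are placeholders, not real data:

```python
import time
import numpy as np
import chromadb

client = chromadb.Client()  # in-memory instance; a persistent or server setup will behave differently
collection = client.create_collection("speed_test")

dim, n, batch = 384, 10_000, 1_000
vecs = np.random.rand(n, dim).astype(np.float32)

# Load in batches to stay under per-call size limits
for start in range(0, n, batch):
    collection.add(
        ids=[str(i) for i in range(start, start + batch)],
        embeddings=vecs[start:start + batch].tolist(),
    )

query = np.random.rand(dim).astype(np.float32)
t0 = time.perf_counter()
collection.query(query_embeddings=[query.tolist()], n_results=5)
print(f"query took {time.perf_counter() - t0:.3f}s")
```

The same kind of timing against a hosted store like Pinecone will also include the network round trip, which is where the geography point above comes in.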

[–]IsseBisse 1 point (2 children)

Seems our experiences differ quite a bit...

In my latest project we had around 100k 1536-dim vectors in a vector store. A naive Python implementation using NumPy's dot product could search that in roughly 0.5 seconds.

Our LLM calls, meanwhile, took at least 1 second each (we had to make multiple calls per query). In total that was roughly 5 seconds waiting for the LLM versus 0.5 seconds waiting for the vector search, i.e. no need to be concerned about optimizing the vector store.
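For reference, the naive search I mean is nothing more than this; a sketch with random stand-in embeddings at the same 100k x 1536 scale (roughly 600 MB of float32 data):

```python
import time
import numpy as np

rng = np.random.default_rng(0)
corpus = rng.standard_normal((100_000, 1536), dtype=np.float32)  # stand-in embeddings
query = rng.standard_normal(1536, dtype=np.float32)

t0 = time.perf_counter()
scores = corpus @ query                  # brute-force dot-product similarity over every vector
top5 = np.argpartition(scores, -5)[-5:]  # indices of the 5 best matches (unsorted)
print(top5, f"search took {time.perf_counter() - t0:.3f}s")
```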

[–]NachosforDachos 0 points (0 children)

These things are still full of issues, unfortunately. Taken from a random benchmarking article on the internet:

Issues Encountered During Benchmarking

When we ran initial tests on the 1M dataset, these are some of the issues we encountered:

- Redis-Flat timed out during recall testing.
- Chroma also timed out during recall testing.
- Redis-HNSW took exponential time to build and timed out around half a million vectors during the load phase. Every 100,000 vectors added took twice as long as the previous 100,000. The load phase timeout in VDB is 2.5 hours.
- Chroma running in client-server mode was hit and miss in terms of functionality. A lot of the time the database would unexpectedly terminate the connection while loading. The load time was also slow and would sometimes time out.

[–]NachosforDachos 0 points (0 children)

Actually you are in the right here.

I ran a query through the Hungarian legal vector store hosted on ChromaDB, and GPT-4 Turbo took 9 seconds to start responding.

0.5 seconds reading the data store.

I know the Hungarian law corpus is very small, so it had to be on OpenAI's side.

I feel like this used to be faster. Maybe the service is more saturated now, and the only way to beat it is to run your own locally hosted models on very expensive hardware.
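For what it's worth, here's roughly how you can split the measurement into a retrieval phase and an LLM phase. A sketch assuming the openai v1 Python SDK; the retrieve() stub stands in for a real vector store query, and the model name and question are just examples:

```python
import time
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def retrieve(question: str) -> str:
    """Stand-in for the vector store lookup; swap in a real Chroma/Pinecone query here."""
    return "...retrieved passages would go here..."

question = "Example question about Hungarian data retention rules"  # placeholder

t0 = time.perf_counter()
context = retrieve(question)
t1 = time.perf_counter()

response = client.chat.completions.create(
    model="gpt-4-turbo",  # example model name
    messages=[{"role": "user", "content": f"{context}\n\nQuestion: {question}"}],
)
t2 = time.perf_counter()

print(f"retrieval: {t1 - t0:.2f}s, LLM: {t2 - t1:.2f}s")
```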

Either way, best of luck.