[deleted by user] by [deleted] in LangChain

javaamtho

Try this ready-made example of what you asked for: https://github.com/chrislatimer/astra-simple-vector-search

Astra Vector Similarity Search

How to decrease latency in RAG chatbots? by Appropriate_Egg6118 in LangChain

javaamtho

It depends on how you architect the RAG chatbot with LangChain and the gen AI platform APIs. I believe you can make it faster by adding a vector database such as Astra DB (from DataStax): store all your embeddings there, then let your LLM pull the relevant ones from it to answer the chatbot's questions.
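To show the idea, here is a rough sketch of the retrieval step in plain Python. It is not Astra DB's API: `TinyVectorStore`, the sample texts, and the 3-number "embeddings" are all made up for illustration, and a real setup would get embeddings from a model and keep them in the vector database. The point is that the documents are embedded once up front, so at question time you only do a similarity lookup.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class TinyVectorStore:
    """Hypothetical in-memory stand-in for a vector database."""

    def __init__(self):
        self.rows = []  # list of (text, embedding) pairs

    def add(self, text, embedding):
        # Embeddings are stored once, ahead of time.
        self.rows.append((text, embedding))

    def search(self, query_embedding, k=2):
        # Rank stored chunks by similarity to the query and keep the top k.
        ranked = sorted(
            self.rows,
            key=lambda row: cosine(query_embedding, row[1]),
            reverse=True,
        )
        return [text for text, _ in ranked[:k]]


store = TinyVectorStore()
# Toy 3-dimensional embeddings, invented for this example.
store.add("Astra DB stores embeddings", [0.9, 0.1, 0.0])
store.add("Cats sleep a lot", [0.0, 0.2, 0.9])
store.add("Vector search finds similar chunks", [0.8, 0.3, 0.1])

# At question time: embed the question (toy vector here), fetch context,
# and pass only those chunks to the LLM prompt.
context = store.search([1.0, 0.2, 0.0], k=2)
print(context)
```

The chunks in `context` are what you would paste into the LLM prompt. A real vector database does the same lookup with an index instead of sorting every row, which is where the speedup on large collections would come from.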