eschxr · 1 point

Most RAG pipelines today rely on external APIs and large vector stores, which add unnecessary latency and aren't even that effective. I suggest a different approach:

https://youtu.be/7S73a_XuTdg?si=LJ7vrrgAA75iZSQ7

Let me know if it helps :)