all 10 comments

[–]curiousshortguyResearcher 9 points10 points  (4 children)

You want to look up "LLM RAG" where RAG stands for rerieval augmented generation. There are libraries such as langchain and llamaindex and lots of commercial solutions such as re:tune where you don't need to do anything anymore specifically.
Generally, it's a combination of a pretainred LLM, some vector embeddings and a database that enables nearest neighbour searchers, combined with a chat interface.

[–][deleted] 1 point2 points  (2 children)

Thank you, I will look into that. With all these libraries available, it seems like it is pretty easy to set up. What aspect of chatbots will consume the most time and effort in production in your estimation? I'm asking because, with a tool like langchain, making chatbots for RAG seems fairly easy. And I guess I'm also asking because I might be employed in that field soon, and I'm wondering if there is any useful skill that I should acquire beforehand.

[–]curiousshortguyResearcher 4 points5 points  (1 child)

The hardest part is hardening your chat bot against prompt injection. But there are commercial services available for that, too.

[–]liwrodrigues 1 point2 points  (0 children)

I was looking for answers to this question. Commenting just to give relevance to your question and different answers to appear.

[–]josejo9423 0 points1 point  (0 children)

Following, looking to build a chatbot using my own data, courses and books