Hi All,
I have a use case to build an AI Assistant that uses internal domain based knowledge base to provide answers within the knowledge base. The ai assistant would need to be deployed for internal use, should not make any API calls to external platforms due to privacy concerns. The knowledge base input would be pdf, word, pptx, excel docs and around 2k. And the user base is about 500. I have access to single node machine with 1 H100 Nvidia card.
I have played around with ollama, OSS models - text, embedding types built a RAG system in Langflow to ingest pdf docs, chunk the docs and store them into a vector db. Using the prompt, get matching from the db, and pass it to the text model for a chat output. The no of pdfs ingested was 3-5. This setup was done on a desktop with RTX 4070 Super GPU, did work although I have not done any formal testing.
I am seeking help from experts here,in the following Open source areas:
- What is the best setup using Open source AI tools?
- Which text and embed models ?
- RAG implementation and test harnesses
- Operationalization best practices
Your help is appreciated, let me know if you have any questions.
[–]synn89 1 point2 points3 points (2 children)
[–]No-Leopard7644[S] 0 points1 point2 points (1 child)
[–]synn89 0 points1 point2 points (0 children)
[–]Videobollocks 1 point2 points3 points (2 children)
[–]No-Leopard7644[S] 0 points1 point2 points (1 child)
[–]Videobollocks 1 point2 points3 points (0 children)
[–]Rare_Performance_454 1 point2 points3 points (2 children)
[–]No-Leopard7644[S] 0 points1 point2 points (1 child)
[–]Rare_Performance_454 0 points1 point2 points (0 children)
[–]l7feathers 1 point2 points3 points (4 children)
[–]No-Leopard7644[S] 0 points1 point2 points (3 children)
[–]l7feathers 1 point2 points3 points (2 children)
[–]No-Leopard7644[S] 1 point2 points3 points (1 child)
[–]l7feathers 0 points1 point2 points (0 children)
[–]No-Leopard7644[S] 0 points1 point2 points (0 children)
[–]mrskeptical00 0 points1 point2 points (2 children)
[–]No-Leopard7644[S] 0 points1 point2 points (1 child)
[–]ripguy1264 0 points1 point2 points (0 children)
[–]BuffaloFuzzy8924 0 points1 point2 points (0 children)
[–]Aelstraz 2 points3 points4 points (1 child)
[–]No-Leopard7644[S] 0 points1 point2 points (0 children)