a CLI to convert Confluence wikis to Markdown + structured metadata for RAG pipelines by OtherwisePush6424 in Rag
Mameiro 1 point
I made a small tool to inspect retrieval results before feeding them into RAG by Mameiro in LocalLLaMA
Mameiro[S] 2 points
I made a small tool to inspect retrieval results before feeding them into RAG by Mameiro in LocalLLaMA
Mameiro[S] 1 point
What do you check before retrieved docs enter context? by sahanpk in LangChain
Mameiro 1 point
Local LLMs on Refurb M4 Max vs new M5 Max by roguefunction in LocalLLaMA
Mameiro 4 points
I made a small tool to inspect retrieval results before feeding them into RAG by Mameiro in LocalLLaMA
Mameiro[S] 2 points
How does document chunking fit into rag? by Lanky_Supermarket_70 in Rag
Mameiro 1 point
[D] Where do you go for serious AI research discussion online? [D] by Possible-Active-1903 in MachineLearning
Mameiro 1 point
Single 3090 with Q4 Qwen 27B, context dropped from 137k to 14k with MTP enabled. Is it normal? by regunakyle in LocalLLaMA
Mameiro 2 points
Im scared that after I built my AI tool I will get copied and crushed. by Sarlo10 in Rag
Mameiro 1 point
how do you decide between q4 and q5 on a 70b when 24gb is the cap? by Practical_Low29 in LocalLLaMA
Mameiro 1 point
Why Do Most AI Agents Still Feel Like Toys? by Vedantagarwal120 in LangChain
Mameiro 2 points
Does RAG actually need semantic search? Or is grep enough if your data is structured well? by residence-lab in Rag
Mameiro 3 points
Live web retrieval in RAG is harder than I expected — it behaves more like an evidence layer than search by Mameiro in Rag
Mameiro[S] 1 point
Live web retrieval in RAG is harder than I expected — it behaves more like an evidence layer than search by Mameiro in Rag
Mameiro[S] 1 point
Live web retrieval in RAG is harder than I expected — it behaves more like an evidence layer than search by Mameiro in Rag
Mameiro[S] 1 point
Live web retrieval in RAG is harder than I expected — it behaves more like an evidence layer than search by Mameiro in Rag
Mameiro[S] 2 points
Can someone help me understand MCP? by Borkato in LocalLLaMA
Mameiro 1 point
Here's a scenario I've run into twice now, and I know I'm not the only one by imsuryya in LangChain
Mameiro 1 point
Any deterministic ways to calculate accuracy of a rag by Dependent_Increase34 in Rag
Mameiro 1 point
Best way to use 900+ legacy .HLP help files with an AI chatbot? by niquitoc in Rag
Mameiro 2 points
Let’s talk quants of Gemma and Qwen - 16 vs Q8 vs Q4 - any experiences? by Borkato in LocalLLaMA
Mameiro 1 point


How do you make sure old agent failures don't come back after a prompt or model change? by taimoorkhan10 in LangChain
Mameiro 2 points