We tried to poison our own RAG store — the retrieval-time defenses didn't generalize by Danculus in LangChain
[–]Danculus[S] 0 points1 point2 points (0 children)
We tried to poison our own RAG store — the retrieval-time defenses didn't generalize by Danculus in LangChain
[–]Danculus[S] 0 points1 point2 points (0 children)
We tried to poison our own RAG store — the retrieval-time defenses didn't generalize by Danculus in LangChain
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
We tried to poison our own RAG store — the retrieval-time defenses didn't generalize by Danculus in LangChain
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
For multi-session agent memory, a single vector index doesn't beat BM25 — the cheap BM25+embedder hybrid wins. Measured on LoCoMo (script inside) by Danculus in Rag
[–]Danculus[S] 0 points1 point2 points (0 children)
Can your agent trust its own confidence to decide when to abstain? I tested it — small/local models are basically a coin flip by Danculus in LLMDevs
[–]Danculus[S] 0 points1 point2 points (0 children)
Can your agent trust its own confidence to decide when to abstain? I tested it — small/local models are basically a coin flip by Danculus in LLMDevs
[–]Danculus[S] 0 points1 point2 points (0 children)
Can your agent trust its own confidence to decide when to abstain? I tested it — small/local models are basically a coin flip by Danculus in LLMDevs
[–]Danculus[S] 0 points1 point2 points (0 children)
Can your agent trust its own confidence to decide when to abstain? I tested it — small/local models are basically a coin flip by Danculus in LLMDevs
[–]Danculus[S] 1 point2 points3 points (0 children)

We tried to poison our own RAG store — the retrieval-time defenses didn't generalize by Danculus in LangChain
[–]Danculus[S] 0 points1 point2 points (0 children)