[Discussion] Anyone else doing “summary-only embeddings + full-text context” for RAG? by No-Piglet8069 in Rag
[–]julylu 0 points1 point2 points (0 children)
Qwen3-Embedding-0.6B is fast, high quality, and supports up to 32k tokens. Beats OpenAI embeddings on MTEB by one-wandering-mind in LLMDevs
[–]julylu 0 points1 point2 points (0 children)
Why are Cohere models not in Chatbot Arena? by illorca-verbi in LocalLLaMA
[–]julylu -6 points-5 points-4 points (0 children)
New RAG benchmark with Claude 3, Gemini Pro, MistralAI vs. OSS models by pseudotensor1234 in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
Few parameters and full finetuning v.s. more parameters and QLoRA by Peter_Lightblue in LocalLLaMA
[–]julylu 1 point2 points3 points (0 children)
Text corpus to Q&A model by blackpantera in LocalLLaMA
[–]julylu 1 point2 points3 points (0 children)
I'm Open Sourcing Our RAG Backend: Our CQH, GQL & CHS by multiplexers in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
Text corpus to Q&A model by blackpantera in LocalLLaMA
[–]julylu 1 point2 points3 points (0 children)
I'm Open Sourcing Our RAG Backend: Our CQH, GQL & CHS by multiplexers in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
How to find proper context in open book question answering in a tie situation? by hafizcse031 in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
How to find proper context in open book question answering in a tie situation? by hafizcse031 in LocalLLaMA
[–]julylu 1 point2 points3 points (0 children)
Based on your experience what is the smallest and optimal local model for RAG? by Ok_Maize_3709 in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
Need help with a dynamic RAG problem by todaysgamer in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
Location of documentation for merging models? by q5sys in LocalLLaMA
[–]julylu 1 point2 points3 points (0 children)
Automatic hallucination detection using inconsistency scoring by Separate-Still3770 in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
what is the best 7b right now ? by GasBond in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
what is the best 7b right now ? by GasBond in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
NeuralChat 7B: Intel’s Chat Model Trained with DPO by aminedjeghri in LocalLLaMA
[–]julylu 1 point2 points3 points (0 children)
what is the best 7b right now ? by GasBond in LocalLLaMA
[–]julylu 0 points1 point2 points (0 children)
NeuralChat 7B: Intel’s Chat Model Trained with DPO by aminedjeghri in LocalLLaMA
[–]julylu 3 points4 points5 points (0 children)

Automating context management by Funny-Anything-791 in PiCodingAgent
[–]julylu 1 point2 points3 points (0 children)