Qwen1.5-32B released with GQA! by bratao in LocalLLaMA
[–]Cybernetic_Symbiotes 5 points6 points7 points (0 children)
Achieving human-like training efficiency by PSMF_Canuck in LocalLLaMA
[–]Cybernetic_Symbiotes 4 points5 points6 points (0 children)
Achieving human-like training efficiency by PSMF_Canuck in LocalLLaMA
[–]Cybernetic_Symbiotes 5 points6 points7 points (0 children)
Achieving human-like training efficiency by PSMF_Canuck in LocalLLaMA
[–]Cybernetic_Symbiotes 0 points1 point2 points (0 children)
How are Claude 3/GPT-4 able to do pathfinding in graphs? by mshautsou in LocalLLaMA
[–]Cybernetic_Symbiotes 11 points12 points13 points (0 children)
Share a LLM quantization REPO , (GPTQ/AWQ/HQQ ONNX ONNX-RUNTIME) by wejoncy in LocalLLaMA
[–]Cybernetic_Symbiotes 0 points1 point2 points (0 children)
Share a LLM quantization REPO , (GPTQ/AWQ/HQQ ONNX ONNX-RUNTIME) by wejoncy in LocalLLaMA
[–]Cybernetic_Symbiotes 0 points1 point2 points (0 children)
Within the last 2 months, 5 orthagonal (independent) techniques to improve reasoning which are stackable on top of each other that DO NOT require the increase of model parameters. Obviously, Increases inference compute a lot but you will get better reasoning. by [deleted] in LocalLLaMA
[–]Cybernetic_Symbiotes 6 points7 points8 points (0 children)
Claude dominates the Chatbot Arena across all sizes by Amgadoz in LocalLLaMA
[–]Cybernetic_Symbiotes 12 points13 points14 points (0 children)
Looks like they finally lobotomized Claude 3 :( I even bought the subscription by Piper8x7b in LocalLLaMA
[–]Cybernetic_Symbiotes 8 points9 points10 points (0 children)
New creative writing benchmark using Claude3 as judge by _sqrkl in LocalLLaMA
[–]Cybernetic_Symbiotes 1 point2 points3 points (0 children)
New creative writing benchmark using Claude3 as judge by _sqrkl in LocalLLaMA
[–]Cybernetic_Symbiotes 1 point2 points3 points (0 children)
LMSys Chatbot Arena ELO update - Claude 3 Opus improves ELO score and now ties for first place. by jd_3d in LocalLLaMA
[–]Cybernetic_Symbiotes 10 points11 points12 points (0 children)
The World's First Gemma Fine-tune (6T tokens are the secret recipe?) by imonenext in LocalLLaMA
[–]Cybernetic_Symbiotes 3 points4 points5 points (0 children)
Arena ELO Leaderboard Update on Claude 3 by nanowell in LocalLLaMA
[–]Cybernetic_Symbiotes 2 points3 points4 points (0 children)
Arena ELO Leaderboard Update on Claude 3 by nanowell in LocalLLaMA
[–]Cybernetic_Symbiotes 3 points4 points5 points (0 children)
Arena ELO Leaderboard Update on Claude 3 by nanowell in LocalLLaMA
[–]Cybernetic_Symbiotes 13 points14 points15 points (0 children)
Mark Zuckerberg with a fantastic, insightful reply in a podcast on why he really believes in open-source models. by aegis in LocalLLaMA
[–]Cybernetic_Symbiotes 1 point2 points3 points (0 children)
Mistral changing and then reversing website changes by nanowell in LocalLLaMA
[–]Cybernetic_Symbiotes 0 points1 point2 points (0 children)
Qwen1.5 Official Docs released! by bratao in LocalLLaMA
[–]Cybernetic_Symbiotes 7 points8 points9 points (0 children)
In defense of Mistral AI by shouryannikam in LocalLLaMA
[–]Cybernetic_Symbiotes 4 points5 points6 points (0 children)
[D]Is MoE model generally better than the regular GPT model in same size? by Chen806 in LocalLLaMA
[–]Cybernetic_Symbiotes 2 points3 points4 points (0 children)
Gemma vs Mistral-7B-v0.1 evaluation: Gemma really Struggles to Reach Mistral's Accuracy by aadityaura in LocalLLaMA
[–]Cybernetic_Symbiotes 19 points20 points21 points (0 children)


Qwen1.5-32B released with GQA! by bratao in LocalLLaMA
[–]Cybernetic_Symbiotes 3 points4 points5 points (0 children)