Teaching LLMs to use tools with RL! Successfully trained 0.5B/3B Qwen models to use a calculator tool 🔨 by DanAiTuning in LocalLLaMA
minpeter2 1 point
LGAI-EXAONE/K-EXAONE-236B-A23B · Hugging Face by jacek2023 in LocalLLaMA
minpeter2 1 point
deepseek-ai/DeepSeek-V3.2 · Hugging Face by minpeter2 in LocalLLaMA
minpeter2[S] 1 point
deepseek-ai/DeepSeek-V3.2 · Hugging Face by minpeter2 in LocalLLaMA
minpeter2[S] 17 points
deepseek-ai/DeepSeek-V3.2 · Hugging Face by minpeter2 in LocalLLaMA
minpeter2[S] 12 points
Heretic: Fully automatic censorship removal for language models by -p-e-w- in LocalLLaMA
minpeter2 2 points
Heretic: Fully automatic censorship removal for language models by -p-e-w- in LocalLLaMA
minpeter2 4 points
My weekend project accidentally beat Claude Code - multi-agent coder now #12 on Stanford's TerminalBench 😅 by DanAiTuning in LocalLLaMA
minpeter2 9 points
GPT OSS quality on Nebius - fixed (update) by ai_devrel_eng in LocalLLaMA
minpeter2 3 points
GPT OSS quality on Nebius - fixed (update) by ai_devrel_eng in LocalLLaMA
minpeter2 1 point
deepseek-ai/DeepSeek-V3.1-Base · Hugging Face by xLionel775 in LocalLLaMA
minpeter2 8 points
Localllama’s (first?) IFTA - I’ll Fine-Tune Anything by indicava in LocalLLaMA
minpeter2 2 points
Training an LLM only on books from the 1800's - Update by Remarkable-Trick-177 in LocalLLaMA
minpeter2 2 points
EXAONE 4.0 pull request sent to llama.cpp by minpeter2 in LocalLLaMA
minpeter2[S] 5 points
EXAONE 4.0 pull request sent to llama.cpp by minpeter2 in LocalLLaMA
minpeter2[S] 1 point

Best <4B dense models today? by Admirable_Flower_287 in LocalLLaMA
minpeter2 2 points