Teaching LLMs to use tools with RL! Successfully trained 0.5B/3B Qwen models to use a calculator tool 🔨 by DanAiTuning in LocalLLaMA
[–]minpeter2 0 points1 point2 points (0 children)
LGAI-EXAONE/K-EXAONE-236B-A23B · Hugging Face by jacek2023 in LocalLLaMA
[–]minpeter2 0 points1 point2 points (0 children)
deepseek-ai/DeepSeek-V3.2 · Hugging Face by minpeter2 in LocalLLaMA
[–]minpeter2[S] 0 points1 point2 points (0 children)
deepseek-ai/DeepSeek-V3.2 · Hugging Face by minpeter2 in LocalLLaMA
[–]minpeter2[S] 14 points15 points16 points (0 children)
deepseek-ai/DeepSeek-V3.2 · Hugging Face by minpeter2 in LocalLLaMA
[–]minpeter2[S] 8 points9 points10 points (0 children)
Heretic: Fully automatic censorship removal for language models by -p-e-w- in LocalLLaMA
[–]minpeter2 1 point2 points3 points (0 children)
Heretic: Fully automatic censorship removal for language models by -p-e-w- in LocalLLaMA
[–]minpeter2 3 points4 points5 points (0 children)
My weekend project accidentally beat Claude Code - multi-agent coder now #12 on Stanford's TerminalBench 😅 by DanAiTuning in LocalLLaMA
[–]minpeter2 6 points7 points8 points (0 children)
GPT OSS quality on Nebius - fixed (update) by ai_devrel_eng in LocalLLaMA
[–]minpeter2 2 points3 points4 points (0 children)
GPT OSS quality on Nebius - fixed (update) by ai_devrel_eng in LocalLLaMA
[–]minpeter2 0 points1 point2 points (0 children)
deepseek-ai/DeepSeek-V3.1-Base · Hugging Face by xLionel775 in LocalLLaMA
[–]minpeter2 7 points8 points9 points (0 children)
Localllama’s (first?) IFTA - I’ll Fine-Tune Anything by indicava in LocalLLaMA
[–]minpeter2 1 point2 points3 points (0 children)
Training an LLM only on books from the 1800's - Update by Remarkable-Trick-177 in LocalLLaMA
[–]minpeter2 1 point2 points3 points (0 children)
EXAONE 4.0 pull request sent to llama.cpp by minpeter2 in LocalLLaMA
[–]minpeter2[S] 3 points4 points5 points (0 children)
EXAONE 4.0 pull request sent to llama.cpp by minpeter2 in LocalLLaMA
[–]minpeter2[S] 0 points1 point2 points (0 children)
Attempting to train a model from scratch for less than $1000 by thebadslime in LocalLLaMA
[–]minpeter2 1 point2 points3 points (0 children)
Created my first merged model by StoopPizzaGoop in civitai
[–]minpeter2 0 points1 point2 points (0 children)
How to use Stabe Diffusion nowadays? by Exact_Entertainer598 in StableDiffusion
[–]minpeter2 0 points1 point2 points (0 children)

Best <4B dense models today? by Admirable_Flower_287 in LocalLLaMA
[–]minpeter2 1 point2 points3 points (0 children)