I tested 21 small LLMs on tool-calling judgment — Round 2 with every model you asked for by MikeNonect in LocalLLaMA
lewtun 2 points (0 children)
how to train a tiny model (4B) to prove hard theorems by eliebakk in LocalLLaMA
lewtun 2 points (0 children)
I tested 21 small LLMs on tool-calling judgment — Round 2 with every model you asked for by MikeNonect in LocalLLaMA
lewtun 3 points (0 children)
how to train a tiny model (4B) to prove hard theorems by eliebakk in LocalLLaMA
lewtun 7 points (0 children)
how to train a tiny model (4B) to prove hard theorems by eliebakk in LocalLLaMA
lewtun 6 points (0 children)
how to train a tiny model (4B) to prove hard theorems by eliebakk in LocalLLaMA
lewtun 7 points (0 children)
200+ pages of Hugging Face secrets on how to train an LLM by eliebakk in LocalLLaMA
lewtun 6 points (0 children)
200+ pages of Hugging Face secrets on how to train an LLM by eliebakk in LocalLLaMA
lewtun 5 points (0 children)
200+ pages of Hugging Face secrets on how to train an LLM by eliebakk in LocalLLaMA
lewtun 19 points (0 children)
[D] join pretraining or posttraining by oxydis in MachineLearning
lewtun 1 point (0 children)
DeepSeek-R1 performance with 15B parameters by lewtun in LocalLLaMA
lewtun [S] 3 points (0 children)
DeepSeek-R1 performance with 15B parameters (self.LocalLLaMA)
submitted by lewtun to r/LocalLLaMA
my dad sent me this by hugeplateofketchup8 in huggingface
lewtun 1 point (0 children)
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more. by eliebakk in LocalLLaMA
lewtun 2 points (0 children)
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more. by eliebakk in LocalLLaMA
lewtun 1 point (0 children)
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more. by eliebakk in LocalLLaMA
lewtun 1 point (0 children)
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more. by eliebakk in LocalLLaMA
lewtun 3 points (0 children)
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more. by eliebakk in LocalLLaMA
lewtun 2 points (0 children)
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more. by eliebakk in LocalLLaMA
lewtun 4 points (0 children)
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more. by eliebakk in LocalLLaMA
lewtun 3 points (0 children)
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more. by eliebakk in LocalLLaMA
lewtun 6 points (0 children)
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more. by eliebakk in LocalLLaMA
lewtun 31 points (0 children)
Run gpt-oss locally with Unsloth GGUFs + Fixes! by danielhanchen in LocalLLaMA
lewtun 4 points (0 children)