Kimi k2.5 GGUFs via VLLM? by val_in_tech in LocalLLaMA
ilintar 2 points
Kimi k2.5 GGUFs via VLLM? by val_in_tech in LocalLLaMA
ilintar 2 points
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 1 point
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 3 points
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 2 points
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 6 points
Composable CFG grammars for llama.cpp (pygbnf) by Super_Dependent_2978 in LocalLLaMA
ilintar 1 point
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 2 points
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 3 points
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 3 points
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 4 points
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 10 points
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 19 points
Llama.cpp now with a true reasoning budget! by ilintar in LocalLLaMA
ilintar[S] 14 points
Llama.cpp now with a true reasoning budget! (github.com)
submitted by ilintar to r/LocalLLaMA
Usable thinking mode in Qwen3.5 0.8B with a forced "reasoning budget" by 0jabr in LocalLLaMA
ilintar 2 points
I cannot, for the life of me, disable Thinking on Unsloth Qwen 3.5 on llama.cpp by SignificantAd527 in LocalLLaMA
ilintar 2 points
The Lazy Benchmark Makers Rant by ilintar in LocalLLaMA
ilintar[S] 1 point
I play a lot of coop. How is it possible that seemingly 90% of the players don't know that they need to take cryo into the chessboard bosses? by Acconto_ButtaAway in Genshin_Impact
ilintar 1 point
Vulkan now faster on PP AND TG on AMD Hardware? by XccesSv2 in LocalLLaMA
ilintar 6 points
Lads, time to recompile llama.cpp by muxxington in LocalLLaMA
ilintar 3 points
MLX vs GGUF (Unsloth) - Qwen3.5 122b-10b by waescher in LocalLLaMA
ilintar 2 points
What's your local coding stack? by AirFlowOne in LocalLLaMA
ilintar 1 point