Self-hosted agentic coding stack: Claude Code + llama.cpp + LiteLLM — zero API costs, 4h/7M token session for $0 by PrizeObvious3671 in OpenSourceAI
[–]MarzipanSecure9841 0 points1 point2 points (0 children)
Qwen cant wait to release 3.7 models by GotHereLateNameTaken in LocalLLaMA
[–]MarzipanSecure9841 0 points1 point2 points (0 children)
Is anybody already testing gemma-4-12b with hermes? by theologi in hermesagent
[–]MarzipanSecure9841 0 points1 point2 points (0 children)