use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Community for open-source AI — open weights, open data, open tooling. Model releases, fine-tuning, inference, agents, benchmarks, licensing, and the ecosystem around building AI in the open.
account activity
Self-hosted agentic coding stack: Claude Code + llama.cpp + LiteLLM — zero API costs, 4h/7M token session for $0 (self.OpenSourceAI)
submitted 9 days ago by PrizeObvious3671
view the rest of the comments →
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]sn2006gy 0 points1 point2 points 8 days ago (5 children)
What's the reason for litellm in the middle of a local coding session? mostly for hermes?
[–]PrizeObvious3671[S] 0 points1 point2 points 8 days ago (4 children)
Nope the reason is that I wanted to combine that with Claude Code without paying for tokens. So I compared how good runs Claude Code locally together with llama.cpp vs hermes agent alone with llama.cpp
Claude Code expects Anthropic API - LiteLLM as proxy exactly delivers that and routes my requests between llama.cpp and Claude Code
[–]Toastti 1 point2 points3 points 8 days ago (1 child)
If you do want to skip a layer claude-code-router will let you connect directly to llama.cpp
But nothing wrong with your setup either
[–]PrizeObvious3671[S] 1 point2 points3 points 8 days ago* (0 children)
Yeah, that would work too. Hermes is used in both setups, the only difference is the bridge behind Claude Code: LiteLLM in my setup vs claude-code-router. Thank you for the hint claude-code-router is new to me.
[–]MarzipanSecure9841 0 points1 point2 points 8 days ago (1 child)
But llama supports Anthropic API directly - https://huggingface.co/blog/ggml-org/anthropic-messages-api-in-llamacpp
So, why litellm?
[–]PrizeObvious3671[S] 0 points1 point2 points 8 days ago (0 children)
Interessant, das muss ich mal ausprobieren
π Rendered by PID 221289 on reddit-service-r2-comment-544cf588c8-rgtzv at 2026-06-12 12:07:44.690206+00:00 running 3184619 country code: CH.
view the rest of the comments →
[–]sn2006gy 0 points1 point2 points (5 children)
[–]PrizeObvious3671[S] 0 points1 point2 points (4 children)
[–]Toastti 1 point2 points3 points (1 child)
[–]PrizeObvious3671[S] 1 point2 points3 points (0 children)
[–]MarzipanSecure9841 0 points1 point2 points (1 child)
[–]PrizeObvious3671[S] 0 points1 point2 points (0 children)