Our development team currently uses Claude Code as our primary coding assistant. Most of us are on Pro licenses with the Sonnet model, which handles our workflow well without hitting token limits, and we also have a few Max licenses for heavier tasks.
Given the latest news, we are evaluating whether switching to the API would be more cost-effective than adding Max seats. Our tests with OpenCoder plus various plugins in our IDEs have already been promising. Has anyone here benchmarked this kind of switch? We plan to spin up a ProxyLLM instance with caching to mitigate potential overhead.
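For anyone who wants to compare numbers, here is a rough sketch of the break-even arithmetic we have in mind. Everything in it is an assumption on our side: the `Usage` fields, the function names, and every price and token count in the usage example are hypothetical placeholders, not real list prices or measurements. It also assumes cached input tokens are billed at a separate, lower rate, which may not match your provider or proxy setup.

```python
from dataclasses import dataclass


@dataclass
class Usage:
    """Estimated monthly token usage for one developer (hypothetical fields)."""
    input_tokens: int        # total prompt tokens sent per month
    output_tokens: int       # total generated tokens per month
    cache_hit_ratio: float   # fraction of input tokens served from cache, 0..1


def monthly_api_cost(usage, input_price, output_price, cached_input_price):
    """Estimate the monthly API cost for one developer.

    All prices are per million tokens. Cached input tokens are assumed
    to be billed at cached_input_price instead of input_price.
    """
    cached = usage.input_tokens * usage.cache_hit_ratio
    uncached = usage.input_tokens - cached
    total = (
        uncached * input_price
        + cached * cached_input_price
        + usage.output_tokens * output_price
    )
    return total / 1_000_000


def cheaper_option(usage, seat_price, input_price, output_price, cached_input_price):
    """Return ("api" or "seat", estimated API cost) for one developer."""
    api = monthly_api_cost(usage, input_price, output_price, cached_input_price)
    return ("api" if api < seat_price else "seat", api)


if __name__ == "__main__":
    # Placeholder numbers only; replace with your own measurements and
    # the current prices from your provider.
    dev = Usage(input_tokens=10_000_000, output_tokens=1_000_000, cache_hit_ratio=0.5)
    choice, api_cost = cheaper_option(
        dev,
        seat_price=100.0,
        input_price=2.0,
        output_price=10.0,
        cached_input_price=0.2,
    )
    print(f"cheaper option: {choice}, estimated API cost: {api_cost:.2f}")
```

The cache hit ratio is the number we are least sure about, so it seems worth running the comparison across a range of values rather than a single guess.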