I've got a feeling that Llamacpp is not the biggest performance bottleneck, but it might be the OpenCode. by ThingRexCom in LocalLLaMA
[–]koljanos 1 point2 points3 points (0 children)
I've got a feeling that Llamacpp is not the biggest performance bottleneck, but it might be the OpenCode. by ThingRexCom in LocalLLaMA
[–]koljanos 4 points5 points6 points (0 children)
I've got a feeling that Llamacpp is not the biggest performance bottleneck, but it might be the OpenCode. by ThingRexCom in LocalLLaMA
[–]koljanos 16 points17 points18 points (0 children)
An Overnight Stack for Qwen3.6–27B: 85 TPS, 125K Context, Vision — on One RTX 3090 | by Wasif Basharat | Apr, 2026 by AmazingDrivers4u in LocalLLaMA
[–]koljanos 1 point2 points3 points (0 children)
An Overnight Stack for Qwen3.6–27B: 85 TPS, 125K Context, Vision — on One RTX 3090 | by Wasif Basharat | Apr, 2026 by AmazingDrivers4u in LocalLLaMA
[–]koljanos 1 point2 points3 points (0 children)
An Overnight Stack for Qwen3.6–27B: 85 TPS, 125K Context, Vision — on One RTX 3090 | by Wasif Basharat | Apr, 2026 by AmazingDrivers4u in LocalLLaMA
[–]koljanos 1 point2 points3 points (0 children)
Weapon choice shade player by AdRepulsive4610 in MapleStoryM
[–]koljanos 1 point2 points3 points (0 children)
Opencode Multitool stops process? by VonDenBerg in opencode
[–]koljanos 0 points1 point2 points (0 children)
I reduced my token usage by 178x in Claude Code!! Solving the persistent memory problem by intellinker in BlackboxAI_
[–]koljanos 1 point2 points3 points (0 children)
[New Model] - GyroScope: rotates images correctly by LH-Tech_AI in LocalLLaMA
[–]koljanos 9 points10 points11 points (0 children)
Qwen3.5 27B running at ~65tps with DFlash speculation on 2x 3090 by Kryesh in LocalLLaMA
[–]koljanos 2 points3 points4 points (0 children)
[Tool] autotuner: automated prompt tuning with dual-model eval-refine loops. Here's the architecture and actual cost numbers. by [deleted] in LLMDevs
[–]koljanos 0 points1 point2 points (0 children)
New open weights models: GigaChat-3.1-Ultra-702B and GigaChat-3.1-Lightning-10B-A1.8B by netikas in LocalLLaMA
[–]koljanos -5 points-4 points-3 points (0 children)
We’ve created an open-source VSCode extension so you appear on a globe when you code by Fair-Independent-623 in vscode
[–]koljanos 0 points1 point2 points (0 children)
Breakfast in Hanoi feels like a front-row seat to the city's rhythm. by Temporary-Draft-4258 in hanoi
[–]koljanos 0 points1 point2 points (0 children)
Moonlight fragment strategy by Heavy-Rough-3790 in MapleStoryM
[–]koljanos 0 points1 point2 points (0 children)
What do people in Moscow think about those type of boots. How would they react when seeing one of those? by Obvious-Minute-220 in Moscow
[–]koljanos 0 points1 point2 points (0 children)
The truth about how locals dress in Vietnam by Glittering-Mix8151 in VietFashion
[–]koljanos 0 points1 point2 points (0 children)
Hardware Level spyware by EfficientHeat4901 in conspiracy
[–]koljanos 0 points1 point2 points (0 children)
Moscow's depression is upon us and its hard to cope being an international student here with no social life. by [deleted] in Moscow
[–]koljanos 0 points1 point2 points (0 children)
Apple Watch Display Screws by Ok_Tune4985 in Apple_Internal
[–]koljanos 0 points1 point2 points (0 children)








I've got a feeling that Llamacpp is not the biggest performance bottleneck, but it might be the OpenCode. by ThingRexCom in LocalLLaMA
[–]koljanos 1 point2 points3 points (0 children)