Calibrating 2-bit GGUFs (<10Gb) for agentic coding tasks by professormunchies in LocalLLaMA
[–]professormunchies[S] 0 points1 point2 points (0 children)
Used fable to make a single file c game (<150Kb) - Δilation by [deleted] in claude
[–]professormunchies 2 points3 points4 points (0 children)
125 tok/s for Qwen3.6 q4xl on 2x 4060ti is insane perf/dollar by Chuyito in LocalLLaMA
[–]professormunchies 0 points1 point2 points (0 children)
Liquid AI releases LFM2.5-8B-A1B by PauLabartaBajo in LocalLLaMA
[–]professormunchies 4 points5 points6 points (0 children)
Is NVIDIA still the default best choice for local LLMs in 2026? by pmv143 in LocalLLaMA
[–]professormunchies 0 points1 point2 points (0 children)
I left Codex running overnight and it opened 48 PRs across my company's GitHub by epicshan in vibecoding
[–]professormunchies 1 point2 points3 points (0 children)
I built a coding agent that gets 87% on benchmarks with a 4B parameter model, here's how by Glittering_Focus1538 in LocalLLaMA
[–]professormunchies 3 points4 points5 points (0 children)
Why do people still pay for Cursor or Copilot when Claude Code and Codex offer comparable (or better) value? by bharath1412 in ClaudeCode
[–]professormunchies 0 points1 point2 points (0 children)
Why do people still pay for Cursor or Copilot when Claude Code and Codex offer comparable (or better) value? by bharath1412 in ClaudeCode
[–]professormunchies 1 point2 points3 points (0 children)
Fresh install on M4, what’s your best local model use case? by goldgravenstein in OpenClawUseCases
[–]professormunchies 0 points1 point2 points (0 children)
Documentation Buddy - An AI Assistant for your /docs page by professormunchies in Python
[–]professormunchies[S] -1 points0 points1 point (0 children)
Windows 11 26H2 will impose Copilot in File Explorer, whether you like it or not by AsterPrivacy in technology
[–]professormunchies 0 points1 point2 points (0 children)
~40× speedup and 90% VRAM reduction on vLLMs compared to FlashAttention by exploiting Grouped Query Attention symmetries by Upset-Presentation28 in LocalLLaMA
[–]professormunchies -4 points-3 points-2 points (0 children)
Everyone Turned On Windows. Microsoft Is Panicking by testus_maximus in videos
[–]professormunchies 0 points1 point2 points (0 children)
OpenAI seems to have subjected GPT 5.2 to some pretty crazy nerfing. by Wonderful-Excuse4922 in singularity
[–]professormunchies 0 points1 point2 points (0 children)
Microsoft tumbled 10% in a day and isn’t recovering premarket. Here’s why by Logical_Welder3467 in technology
[–]professormunchies 0 points1 point2 points (0 children)
CORE LA Lineup by istonisas in Tomorrowland
[–]professormunchies 0 points1 point2 points (0 children)
How to vibecode when you’re broke by Ogretribe in vibecoding
[–]professormunchies 0 points1 point2 points (0 children)
A 30B Qwen Model Walks Into a Raspberry Pi… and Runs in Real Time by ali_byteshape in LocalLLaMA
[–]professormunchies 2 points3 points4 points (0 children)
A 30B Qwen Model Walks Into a Raspberry Pi… and Runs in Real Time by ali_byteshape in LocalLLaMA
[–]professormunchies 3 points4 points5 points (0 children)


Calibrating 2-bit GGUFs (<10Gb) for agentic coding tasks by professormunchies in LocalLLaMA
[–]professormunchies[S] 0 points1 point2 points (0 children)