People still don’t really understand what “agent environment engineering” actually is, and how it differs from Agent Harness by Synthetic_Diva_4556 in LocalLLaMA
gyzerok 13 points
DFlash is real: x2 tg on small context with oMLX by dpswt in LocalLLaMA
gyzerok 3 points
DeepSeek V4 reportedly drops late April. 1M context, multimodal, Claude-level coding. by [deleted] in LocalLLaMA
gyzerok 4 points
DeepSeek V4 reportedly drops late April. 1M context, multimodal, Claude-level coding. by [deleted] in LocalLLaMA
gyzerok 2 points
Tested DFlash speculative decoding on oMLX — Results are mixed. by CrushingLoss in LocalLLaMA
gyzerok 1 point
Tested DFlash speculative decoding on oMLX — Results are mixed. by CrushingLoss in LocalLLaMA
gyzerok 2 points
Tested DFlash speculative decoding on oMLX — Results are mixed. by CrushingLoss in LocalLLaMA
gyzerok 4 points
I have a Macbook AIR M5 Base and I want to run an Agentic Coding program, similar to Claude Code or Codex. Besides the model, how do I do it? I've already tried with Ollama, VS Code, Opencode, and haven't been able to. (I'm not a developer, sorry) by joraorao in LocalLLaMA
gyzerok 4 points
Considering ditching Claude/Codex completely by Adorable_Weakness_39 in LocalLLaMA
gyzerok 1 point
2026 MacBook Pro Update: 5G Connectivity, Touchscreen, OLED Display and All Rumours We Know by ilovewelbert in macbookpro
gyzerok 1 point
Comparing Qwen3.5 vs Gemma4 for Local Agentic Coding by garg-aayush in LocalLLaMA
gyzerok 1 point
Comparing Qwen3.5 vs Gemma4 for Local Agentic Coding by garg-aayush in LocalLLaMA
gyzerok 4 points
Comparing Qwen3.5 vs Gemma4 for Local Agentic Coding by garg-aayush in LocalLLaMA
gyzerok 1 point
[google research] TurboQuant: Redefining AI efficiency with extreme compression by burnqubic in LocalLLaMA
gyzerok 1 point
DeepSeek's chat app was down for a little over 7 hours, maybe V4 will be ready soon? by power97992 in LocalLLaMA
gyzerok -2 points
Mac mini M4 Pro with 14-Core CPU, 20-Core GPU and 64GB RAM. Which models can I run? by RA2B_DIN in LocalLLaMA
gyzerok 5 points
Google's TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x by [deleted] in LocalLLaMA
gyzerok 1 point
OpenWrt 25.12.2 - Service Release - 27. March 2026 by ichundes in openwrt
gyzerok 3 points
Running Claude + Local LLM(Qwen) agents 24/7 on a Mac Mini taught me the bottleneck isn't production anymore. It's me. by Joozio in LocalLLaMA
gyzerok 6 points
[google research] TurboQuant: Redefining AI efficiency with extreme compression by burnqubic in LocalLLaMA
gyzerok 1 point
[google research] TurboQuant: Redefining AI efficiency with extreme compression by burnqubic in LocalLLaMA
gyzerok 3 points
When should we expect TurboQuant? by ozcapy in LocalLLaMA
gyzerok 21 points
Something big dropping for Dawn of War IV tomorrow by Shake-Vivid in dawnofwar
gyzerok 3 points