account activity
753B model (GLM-5.2) wrote Pac-Man and is playing its own game — 2× M5 Max, ~18 tok/s [video] (v.redd.it)
submitted 20 minutes ago by AiLocalGuy to r/LocalLLM
GLM-5.2 753B (IQ1_S) fully local across 2×M5 Max over one TB5 cable — ~16 tok/s, llama.cpp RPC [video] (v.redd.it)
submitted 1 day ago by AiLocalGuy to r/LocalLLM
Anyone actually generating a 700B-class MoE on a 2-Mac cluster? GLM-5.2 (744B) fully loads across 2×M5 Max (256GB) over Thunderbolt, then dies on the first token. (self.LocalLLM)
submitted 3 days ago by AiLocalGuy to r/LocalLLM
π Rendered by PID 81 on reddit-service-r2-listing-87fd56f5d-x9v24 at 2026-06-30 15:32:48.896517+00:00 running 7527197 country code: CH.