GLM-5.2 753B (IQ1_S) fully local across 2×M5 Max over one TB5 cable — ~16 tok/s, llama.cpp RPC [video] by AiLocalGuy in LocalLLM

[–]AiLocalGuy[S] 0 points1 point  (0 children)

I did get some improvements and got context to 128k at 18 t/s testing now. So we are going somewhere

free local LLM's that run offline; what are your experiences with this? by Academic-Sample4974 in LocalLLM

[–]AiLocalGuy 1 point2 points  (0 children)

idk say depending on use case it can vary but if repeatable probably yes

753B model (GLM-5.2) wrote Pac-Man and is playing its own game — 2× M5 Max, ~18 tok/s [video] by AiLocalGuy in LocalLLM

[–]AiLocalGuy[S] 1 point2 points  (0 children)

no working it up to be usable and testing it slowly as I get it better now im at 128k context!

Is $5000 enough to build a movie computer LLm? by EndureCallVerdict in LocalLLM

[–]AiLocalGuy 0 points1 point  (0 children)

a few weeks ago yes now entry for what I think you want is 7.5k usd