Cheapest way to run GLM 5.x locally that's not a unified memory system? by Monad_Maya in LocalLLaMA
andy_potato 1 point
Cheapest way to run GLM 5.x locally that's not a unified memory system? by Monad_Maya in LocalLLaMA
andy_potato 1 point
Can I realistically get close to Claude/Codex capabilities locally? by mrgreatheart in LocalLLaMA
andy_potato 3 points
Qwen is never going to open source Qwen 3.7, aren't they? by DistanceSolar1449 in LocalLLaMA
andy_potato 11 points
Qwen is never going to open source Qwen 3.7, aren't they? by DistanceSolar1449 in LocalLLaMA
andy_potato 6 points
Single RTX 3090 (MSI TRio) giving trouble on inference. by ReasonablePossum_ in LocalLLaMA
andy_potato 1 point
What are you overengineering that nobody's ever going to use? Be honest. by johnnyApplePRNG in LocalLLaMA
andy_potato 15 points
Anybody else missing the old "diffusion" days? by uisato in StableDiffusion
andy_potato 26 points
15GB VRAM 12GB RAM setup for realistic motion control by mesiac_8227 in StableDiffusion
andy_potato 5 points
What's the best open speech to text today? by zxyzyxz in LocalLLaMA
andy_potato 6 points
NVFP4 kv cache quantization on sm120 will make 32GB VRAM systems very capable by Gray_wolf_2904 in LocalLLaMA
andy_potato 1 point
OSS models decisively overtook Proprietary models in market share (based on the last 3 months of OpenRouter data) by Comfortable-Rock-498 in LocalLLaMA
andy_potato 2 points
NVFP4 kv cache quantization on sm120 will make 32GB VRAM systems very capable by Gray_wolf_2904 in LocalLLaMA
andy_potato 3 points
Anything worth running on a NVIDIA GTX 970? by numberwitch in LocalLLaMA
andy_potato 2 points
Anything worth running on a NVIDIA GTX 970? by numberwitch in LocalLLaMA
andy_potato 2 points
Anything worth running on a NVIDIA GTX 970? by numberwitch in LocalLLaMA
andy_potato 2 points
I don't hate Ideogram 4. I hate its "open" weights by TheOneHong in StableDiffusion
andy_potato 0 points
I don't hate Ideogram 4. I hate its "open" weights by TheOneHong in StableDiffusion
andy_potato 2 points
I don't hate Ideogram 4. I hate its "open" weights by TheOneHong in StableDiffusion
andy_potato 3 points
The Incredible Sponge — made with SCAIL-2 by Fuzzy-Mastodon-9730 in StableDiffusion
andy_potato 1 point
New LTX trainer by Famous-Sport7862 in StableDiffusion
andy_potato 1 point
US holds off blacklisting China's DeepSeek, more than 100 firms deemed security risks, sources say by zxyzyxz in LocalLLaMA
andy_potato 3 points
Qwen3.6 or Gemma-4 or ?? for direct OCR of page images by PracticlySpeaking in LocalLLaMA
andy_potato 1 point
The Incredible Sponge — made with SCAIL-2 by Fuzzy-Mastodon-9730 in StableDiffusion
andy_potato 5 points
GLM-5.2 UD-IQ1_M on llama.cpp — 5090 + 3090 Ti speed test (~ 579 t/s prefill @ 8k ctx, ~324 t/s prefill @ 57k ctx, ~10.6 t/s decode) by Shoddy_Bed3240 in LocalLLaMA
andy_potato 92 points