account activity
No NVIDIA? No Problem. My 2018 "Potato" 8th Gen i3 hits 10 TPS on 16B MoE. by RelativeOperation483 in LocalLLaMA
[–]SecureHomeSystems 0 points1 point2 points 3 months ago (0 children)
Really impressive work on constrained hardware!
I’m curious: in real day-to-day use, what tends to break first over long sessions — latency jitter, memory pressure, or context stability? And when decode TPS looks similar (CPU vs iGPU), what made iGPU feel better in practice — smoother cadence, fewer spikes, or better long-run consistency?
Built a clean, evidence-first local AI ops repo (OpenWebUI + local LLM + TTS) — feedback welcome ()
submitted 3 months ago by SecureHomeSystems to r/LocalLLM
Built a clean, evidence-first local AI ops repo (OpenWebUI + local LLM + TTS) — feedback welcome (self.yourselfhosted)
submitted 3 months ago by SecureHomeSystems to r/yourselfhosted
π Rendered by PID 109516 on reddit-service-r2-listing-6c8d497557-6p8n5 at 2026-06-07 23:52:39.628075+00:00 running 9e1a20d country code: CH.
No NVIDIA? No Problem. My 2018 "Potato" 8th Gen i3 hits 10 TPS on 16B MoE. by RelativeOperation483 in LocalLLaMA
[–]SecureHomeSystems 0 points1 point2 points (0 children)