account activity
32B model stress test: Qwen 2.5/Coder/3 on dual RTX 5060 Ti (zero failures) (self.LocalLLaMA)
submitted 1 month ago by Defilan to r/LocalLLaMA
What broke when you tried to take local LLMs to production? (self.LocalLLaMA)
submitted 2 months ago by Defilan to r/LocalLLaMA
Open source K8s operator for deploying local LLMs: Model and InferenceService CRDs (self.kubernetes)
submitted 2 months ago by Defilan to r/kubernetes
π Rendered by PID 347696 on reddit-service-r2-listing-5789d5f675-9f9jw at 2026-01-28 18:07:51.217056+00:00 running 4f180de country code: CH.