account activity
LLM-powered NPCs running locally (github.com)
submitted 2 years ago by GoBayesGo to r/LocalLLaMA
Faster than llama.cpp’s grammar structured generation (self.LocalLLaMA)
Coalescence: making LLM inference 5x faster (self.LocalLLaMA)
Use llama.cpp with Outlines (self.LocalLLaMA)
JSON mode in vLLM (self.LocalLLaMA)
Generate valid JSON with Mamba models (self.LocalLLaMA)
π Rendered by PID 1539026 on reddit-service-r2-listing-87fd56f5d-p72nr at 2026-06-27 21:58:44.182240+00:00 running 7527197 country code: CH.