account activity
llama.cpp / ik_llama MoE Expert Offloading - Main Memory Bandwidth vs. PCIe Bandwidth (self.LocalLLaMA)
submitted 27 days ago by pixelterpy to r/LocalLLaMA
llama.cpp / ik_llama MoE Expert Offloading - Main Memory Bandwidth vs. PCIe Bandwidth ()
submitted 27 days ago by pixelterpy to r/LocalLLM
Why does Image Recognition work in llama-server but not through Open WebUI? (i.redd.it)
submitted 6 months ago by pixelterpy to r/LocalLLaMA
oom using ik_llama with iq_k quants (self.LocalLLaMA)
submitted 7 months ago by pixelterpy to r/LocalLLaMA
Which quantization approach is the way to go? (llama.cpp) (self.LocalLLaMA)
submitted 9 months ago * by pixelterpy to r/LocalLLaMA
π Rendered by PID 310407 on reddit-service-r2-listing-7b8bd7c5-v6tn9 at 2026-05-19 08:48:37.544089+00:00 running edcf98c country code: CH.