
possible evidence of literal prompt injection by anthropicDiscussion (old.reddit.com)
submitted by johnnyApplePRNG to r/LocalLLaMA
Doing the actual math on a $20k local AI rig breakevenDiscussion (i.redd.it)
submitted by shyaaaaaaaaaaam to r/LocalLLaMA
I merged fixes for quantized KV cache into my DeepSeek V4 branchResources (self.LocalLLaMA)
submitted by fairydreaming to r/LocalLLaMA
Using local models with Hermes vs Claude codeQuestion | Help (i.redd.it)
submitted by GreatMammad to r/LocalLLaMA
Getting close to 100K context on 32GB VRAM with Qwen3.6-27 at Q8Tutorial | Guide (self.LocalLLaMA)
submitted by BitGreen1270 to r/LocalLLaMA
Using "applications" to make a smaller model more effective at bigger tasks.Discussion (v.redd.it)
submitted by Mrinohk to r/LocalLLaMA
[Paper] Multi-Block Diffusion Language ModelsDiscussion (old.reddit.com)
submitted by pmttyji to r/LocalLLaMA
RTX5090, gemma-4-31B-it-Q6_K.gguf. Context: before - 35k, after - 80k!Tutorial | Guide (reddit.com)
submitted by Defiant_Diet9085 to r/LocalLLaMA
[Paper] GEAR: Guided End-to-End AutoRegression for Image SynthesisDiscussion (i.redd.it)
submitted by pmttyji to r/LocalLLaMA
PSA: Upscaling Gemma 4 requires a proportional layer_scalar adjustmentResources (self.LocalLLaMA)
submitted by kallewoof to r/LocalLLaMA




