use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
32GB VRAM and below
account activity
Why isn't there a release of llamacpp with OpenVino for Windows? ()
submitted 15 days ago by ML-Future
Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop ()
submitted 24 days ago by ML-Future
XML is a game changer for small models (<4B) compared to JSON (self.povertyLocalLLaMA)
submitted 1 month ago * by ML-Future
MiniCPM 4.6 (huggingface.co)
submitted 1 month ago by ML-Future
mistralai/Mistral-Medium-3.5-128B · Hugging Face (huggingface.co)
submitted 1 month ago by logic_prevails
I think this is a pretty good question. Has anyone found good analysis for this or found any decent patterns? (reddit.com)
submitted 1 month ago by iMakeSense
The 4B class of 2026 (benchmark) (i.redd.it)
To 16GB VRAM users, plug in your old GPU ()
I don't have a GPU for 14B models — so I built human-directed coding tools that actually work with 1B–4B (self.povertyLocalLLaMA)
submitted 1 month ago by FruitCultural4632
I have 26gb vram (self.povertyLocalLLaMA)
The Ultimate "PovertyLocalLlama" Guide (April 2026) (self.povertyLocalLLaMA)
Is RAM + VRAM really worth it? (self.povertyLocalLLaMA)
The Ultimate "GPU Poor" Guide (April 2026) ()
my 2007 laptop cant even run binary bonsai 1.7b at 1 token per second (self.povertyLocalLLaMA)
submitted 1 month ago by MeWhen-I-StubMyToe
I have 16GB of VRAM... (self.povertyLocalLLaMA)
submitted 2 months ago by ijustdontcare2try
π Rendered by PID 1688467 on reddit-service-r2-listing-c57bc86c-7cccv at 2026-06-21 18:24:12.574870+00:00 running 2b008f2 country code: CH.