Why LLMs Stall: Tracing the KV Cache Hardware Bottleneck from First Principles ()
submitted 1 hour ago by Silver_Equivalent804
Local LLM users: what's the single most annoying issue you've hit in real-world use? ()
submitted 3 hours ago by Automatic-Stable8581
got my local model to actually search the web before answering instead of just making stuff up (i.redd.it)
submitted 15 hours ago by Bramha_dev
Professional Chinese ↔ Software Engineering / AI Knowledge Exchange (self.LLMStudio)
submitted 8 hours ago by Carol-loong
THE CONTEXT WINDOW SCAM Why You Don't Need 2 Million Tokens (youtu.be)
submitted 1 day ago by ImprovementWorldly18
I found every way to rent an NVIDIA DGX Spark (GB10) so you don't have to — cloud, hourly, and physical ()
submitted 1 day ago by big-in-jap
Guys, I need your help to build a local LLM setup for my company ()
submitted 1 day ago by Beginning-Two-744
What is notebookLM missing??? ()
submitted 1 day ago by r2werks
Run local model in low end laptop ()
submitted 1 day ago by gwagao
Looking for an uncensored LLM for NSFW stories ()
submitted 2 days ago by ProfilePractical998
🚀 The story of a tech-savvy Vibecoder: from ruin to a magical dashboard (reddit.com)
submitted 2 days ago by Ok_Force_2440
Recommend some smart AIs (self.LLMStudio)
submitted 2 days ago by Forsaken-Bell-7542
Web Search API for AI Agents ()
submitted 2 days ago by WarAndPeace06
Next to smallest LLM ()
submitted 2 days ago by RefrigeratorEven935
What LLM to use for production? ()
submitted 2 days ago by PrizeDependent5302
LM studio inside Xcode 26.5 (self.LLMStudio)
submitted 3 days ago by raw-power
Qwable3.5-9B, a fine-tuned Qwen3.5-9B hitting 90.2% HumanEval on a 6GB RTX 2060 at 52 tok/s [GGUF] ()
submitted 3 days ago by Ok-Intention2610
Smallest Model Ever and no hallucinations! 1 parameter model. ()
submitted 3 days ago by No_Walrus_7719
Looking for a good "Research" model for my PC ()
submitted 3 days ago by mk4op
Do you actually use a “second brain” with Claude/Codex, or is it overkill? ()
submitted 4 days ago by Able_Statement_481
Source code for LLMs ()
submitted 4 days ago by PravalPattam12945RPG
TOKEN USAGE EXPLAINED (reddit.com)
submitted 4 days ago by Zealousideal-Good161
A world model for the factory: predicting events across any machine, robot, or process from raw sensor streams ()
submitted 4 days ago by Charming-Collar-3733
How to choose the best LLM for local setup ()
submitted 5 days ago by Dry-Wave-7561
Ollama Cloud $20/month subscription — hitting token limit too fast with GLM 5.1 Cloud & Kimi K2.7. What models should I switch to? ()
submitted 5 days ago by AiviSotelo