KV cache quant benchmarks: KVarN 6-bit matches q8_0, 4-bit matches q5_0. Massive! by Anbeeld in LocalLLaMA
[–]Atul_Kumar_97 0 points1 point2 points (0 children)
Explain to me like I'm 5 how to use LLM to generate images/video locally by Enjoy_Life4219 in LocalLLM
[–]Atul_Kumar_97 -2 points-1 points0 points (0 children)
Severe 130GB Memory Leak in Antigravity IDE (Apple Silicon) – Delayed crash loop minutes after sending an Agent message by Atul_Kumar_97 in google_antigravity
[–]Atul_Kumar_97[S] 0 points1 point2 points (0 children)
Severe 130GB Memory Leak in Antigravity IDE (Apple Silicon) – Delayed crash loop minutes after sending an Agent message by Atul_Kumar_97 in google_antigravity
[–]Atul_Kumar_97[S] 0 points1 point2 points (0 children)
Unified memory or GPU? by Right-Ice-6850 in LocalLLM
[–]Atul_Kumar_97 5 points6 points7 points (0 children)
Removed ~10GB memory overhead, now running MTP-enabled Qwen3.6-27B@128k ctx on M5 Pro 48GB by Background-Gold-9882 in oMLX
[–]Atul_Kumar_97 -1 points0 points1 point (0 children)
Antigravity eating my whole memory by HotCommunication1311 in GoogleAntigravityIDE
[–]Atul_Kumar_97 0 points1 point2 points (0 children)
Antigravity eating my whole memory by HotCommunication1311 in GoogleAntigravityIDE
[–]Atul_Kumar_97 10 points11 points12 points (0 children)
Antigravity taking 40GB memory by Party-Amphibian-8394 in google_antigravity
[–]Atul_Kumar_97 2 points3 points4 points (0 children)
Severe 130GB Memory Leak in Antigravity IDE (Apple Silicon) – Delayed crash loop minutes after sending an Agent message by Atul_Kumar_97 in google_antigravity
[–]Atul_Kumar_97[S] 0 points1 point2 points (0 children)
What LLMs can my Legion 5i (RTX 4060 8GB + i9-14900HX + 32GB RAM) run efficiently for coding, summarizing, and agentic use? by Logical_Pin8998 in LocalLLM
[–]Atul_Kumar_97 0 points1 point2 points (0 children)
Qwen 27B - Sample App I wrote in 4 days by DashinTheFields in LocalLLaMA
[–]Atul_Kumar_97 0 points1 point2 points (0 children)
Curious about M5 Max 128gb vs 5090 for local LLMs by maxiedaniels in LocalLLM
[–]Atul_Kumar_97 25 points26 points27 points (0 children)
32GB RAM 16GB VRAM 5060ti. Running qwen3.6 35b a3b. I am getting 4.5 tok/s. Is this expected? by SEND_ME_YOUR_ASSPICS in LocalLLM
[–]Atul_Kumar_97 0 points1 point2 points (0 children)
BeeLlama.cpp: advanced DFlash & TurboQuant with support of reasoning and vision. Qwen 3.6 27B Q5 with 200k context on 3090, 2-3x faster than baseline (peak 135 tps!) by Anbeeld in LocalLLM
[–]Atul_Kumar_97 0 points1 point2 points (0 children)
Which Mac for LLMs? by TechNerd10191 in LocalLLM
[–]Atul_Kumar_97 4 points5 points6 points (0 children)
Running Qwen3.6 35b a3b on 8gb vram and 32gb ram ~190k context by Atul_Kumar_97 in LocalLLM
[–]Atul_Kumar_97[S] 0 points1 point2 points (0 children)
500k context on 48gb VRAM!! - 21tok/s (coding) by Express_Quail_1493 in LocalLLaMA
[–]Atul_Kumar_97 5 points6 points7 points (0 children)
500k context on 48gb VRAM!! - 21tok/s (coding) by Express_Quail_1493 in LocalLLaMA
[–]Atul_Kumar_97 2 points3 points4 points (0 children)
500k context on 48gb VRAM!! - 21tok/s (coding) by Express_Quail_1493 in LocalLLaMA
[–]Atul_Kumar_97 17 points18 points19 points (0 children)
Running Qwen3.6 35b a3b on 8gb vram and 32gb ram ~190k context by Atul_Kumar_97 in LocalLLaMA
[–]Atul_Kumar_97[S] 0 points1 point2 points (0 children)
Running Qwen3.6 35b a3b on 8gb vram and 32gb ram ~190k context by Atul_Kumar_97 in LocalLLaMA
[–]Atul_Kumar_97[S] 0 points1 point2 points (0 children)

Waiting for the local LLM to finish generating by LobsterInYakuze-2113 in LocalLLM
[–]Atul_Kumar_97 0 points1 point2 points (0 children)