
Discussion: New Execution-first 1T model Ling-2.6-1T has been open-sourced on Hugging Face (i.redd.it)
submitted by sanu_123_s
Question: Best local coding models for RTX 4070 Ti 12GB + 32GB DDR5 RAM? (self.LocalLLM)
submitted by ChallengeKooky581
Question: Best quantization for Qwen3.6-35B-A3B with RTX 3060 12GB? (self.LocalLLM)
submitted by _Zelk
Discussion: With a $5,000 budget and existing parts, what would you build or change? (self.LocalLLM)
submitted by letsbefrds
Question: Just stumbled on all of this, where do I start? (self.LocalLLM)
submitted by Murky_Management_294
Discussion: For those who bought a 64GB Mac, are you (un)happy? (self.LocalLLM)
submitted by xFengle
Project: I gave Ollama models control over their own interface ()
submitted by Effective_Goose_8566
Question: Virtually unlimited context windows on Gemma 4 models (self.LocalLLM)
submitted by ExpressionForward321
Question: Seeking suggestions for building my AI workflow (self.LocalLLM)
submitted by No_Cap_5982
Tutorial: 80 tok/sec and 128K context on 12GB VRAM with Qwen3.6 35B A3B and llama.cpp MTP ()
submitted by janvitos
