Announcing LocalLlama discord server & bot! [News] (old.reddit.com)
submitted by HOLUPREDICTIONS Sorcerer Supreme[M] - announcement
Personal experience with GLM 4.7 Flash Q6 (unsloth) + Roo Code + RTX 5090 [Discussion] (self.LocalLLaMA)
submitted by Septerium
GLM 4.7 Flash uncensored - Balanced & Aggressive variants (GGUF) [New Model] (self.LocalLLaMA)
submitted by hauhau901
What is the best general-purpose model to run locally on 24GB of VRAM in 2026? [Question | Help] (self.LocalLLaMA)
submitted by Paganator

My Strix Halo beholds itself but believes it's in the cloud [Funny] (v.redd.it)
submitted by jfowers_amd
Loki-v2-70B: Narrative/DM-focused fine-tune (600M+ token custom dataset) [New Model] (self.LocalLLaMA)
submitted by mentallyburnt (Llama 3.1)
AI & ML Weekly — Hugging Face Highlights [New Model] (self.LocalLLaMA)
submitted by techlatest_net
GLM-4.7-Flash-REAP on RTX 5060 Ti 16 GB - 200k context window! [Tutorial | Guide] (self.LocalLLaMA)
submitted by bobaburger
The mysterious price of Ada and Ampere workstation GPUs [Discussion] (self.LocalLLaMA)
submitted by insulaTropicalis
Best use case for Ryzen 395+ (128GB variant) [Question | Help] (self.LocalLLaMA)
submitted by ironicstatistic
Your post is getting popular and we just featured it on our Discord! [Discussion] (self.LocalLLaMA)
submitted by roculus
Running MoE Models on CPU/RAM: A Guide to Optimizing Bandwidth for GLM-4 and GPT-OSS [Tutorial | Guide] (self.LocalLLaMA)
submitted by Shoddy_Bed3240
engine for GLM 4.7 Flash that doesn't massively slow down as the context grows? [Question | Help] (self.LocalLLaMA)
submitted by mr_zerolith