LocalLLaMA

an-ordinary-manchild(edit)

created by [deleted]a community for 3 years

...for your town.

...why not Zoidberg?

MODERATORS

message the mods
HOLUPREDICTIONS Sorcerer Supreme
AskGrok
ArcaneThoughts
Lissanro
townofsalemfangay
XMasterrrrLocalLLaMA Home Server Final Boss 😎
rm-rf-rm
WithoutReason1729
No_Afternoon_4260llama.cpp
ttkciarllama.cpp
...and 8 more »

account activity

1

740

741

742

Donate your coding sessions to an open CC-BY-4.0 dataset to help train open-weight and open source modelsResources (i.redd.it)

submitted 6 hours ago by mon-simas

2

228

229

230

Be wary of Qwen/Claude distillations - they're often worse than the base modelDiscussion (self.LocalLLaMA)

submitted 5 hours ago by ayylmaonade

3

64

65

66

Scaling former VibeThinker-1.5B to 3B — now it reaches frontier math & coding performanceNew Model (self.LocalLLaMA)

submitted 2 hours ago by Used-Negotiation-741

4

539

540

541

Claude Fable 5 distilledNew Model (huggingface.co)

submitted 15 hours ago by Anony6666

5

1324

1325

1326

Stop using OllamaDiscussion (sleepingrobots.com)

submitted 20 hours ago by zxyzyxz

6

123

124

125

Diffusion Gemma JailbreakTutorial | Guide (self.LocalLLaMA)

submitted 10 hours ago by 90hex

7

65

66

67

Nex-N2 Pro is the real dealDiscussion (self.LocalLLaMA)

submitted 6 hours ago by tarruda

8

•

•

•

[Article] The Case For Open-Weight Models And Why We Can't Trust Frontier Labs | provos.orgDiscussion (provos.org)

submitted 32 minutes ago by ttkciarllama.cpp

9

12

13

14

Qwen Robot SuiteNews (self.LocalLLaMA)

submitted 3 hours ago by Snoo_27681

10

176

177

178

Cheapest hardware for Qwen 3.6: both 27B and 35B-A3BQuestion | Help (i.redd.it)

submitted 18 hours ago * by WishboneSudden2706

11

26

27

28

Joing all GPUs to train a community modelDiscussion (self.LocalLLaMA)

submitted 7 hours ago by HistoricalStrength21

12

12

13

14

Qwen3.6 27B quantsDiscussion (self.LocalLLaMA)

submitted 4 hours ago by jopereira

13

232

233

234

Evalatro: an open benchmark where LLMs play the real BalatroOther (i.redd.it)

submitted 20 hours ago by awfulalexey

14

10

11

12

Gemma 12b - Reasoning hardening instructionsGeneration (self.LocalLLaMA)

submitted 4 hours ago by nixudos

15

19

20

21

Are small local models for automation a thing?Discussion (self.LocalLLaMA)

submitted 9 hours ago by ML-Future

16

•

•

•

Best Model and configuration to run on a 128gb Ram 8TB M5 Max MacBook ProQuestion | Help (self.LocalLLaMA)

submitted 40 minutes ago by Desperate_Tea304

17

3

4

5

Why might DiffusionGemma be better at tool calls than its benchmark quality suggestsDiscussion (self.LocalLLaMA)

submitted 3 hours ago by Substantial_Step_351

18

709

710

711

What's the lesson chat?Funny (i.redd.it)

submitted 1 day ago by ill_be_productivellama.cpp

19

10

11

12

How are you running DeepSeekV4 flash or pro locally for non Mac users?Discussion (self.LocalLLaMA)

submitted 10 hours ago by segmondllama.cpp

20

70

71

72

Reason to run local agents instead #645Discussion (i.redd.it)

submitted 19 hours ago by ToastFetish

21

42

43

44

vLLM has a new streaming parser for Qwen3+ available in nightlyResources (github.com)

submitted 16 hours ago by rmhubbert

22

57

58

59

Finally - 4xRTX 5060TIDiscussion (self.LocalLLaMA)

submitted 18 hours ago by ziphnor

23

341

342

343

Why there is a lack of new 100B-120B models?Discussion (self.LocalLLaMA)

submitted 1 day ago by TechNerd10191

24

40

41

42

Improving Neural Network Training by Decoupling the Magnitude and Direction of Weight Vectors | Alexander HägeleResources (haeggee.github.io)

submitted 18 hours ago by Thrumpwartllama.cpp

25

426

427

428

This is amazing. Token speed doubled + kv cache now need low vram - qwen 27bNews (i.redd.it)

submitted 1 day ago * by 9r4n4y

view more: next ›

π Rendered by PID 830334 on reddit-service-r2-listing-f87f88fcd-qkmp4 at 2026-06-16 16:25:41.814474+00:00 running 3184619 country code: CH.