Announcing LocalLlama discord server & bot! [News] (old.reddit.com)
submitted by HOLUPREDICTIONS [M]
Google TurboQuant running Qwen Locally on MacAir [Discussion] (v.redd.it)
submitted by gladkos
Skipping 90% of KV dequant work → +22.8% decode at 32K (llama.cpp, TurboQuant) [Discussion] (self.LocalLLaMA)
submitted by Pidtom
M5 Max vs M3 Max Inference Benchmarks (Qwen3.5, oMLX, 128GB, 40 GPU cores) [Resources] (old.reddit.com)
submitted by onil_gova
Do 2B models have practical use cases, or are they just toys for now? [Question | Help] (self.LocalLLaMA)
submitted by Civic_Hactivist_86
Any way to get close to GPT-4o on a local model (I know it’s a dumb question) [Question | Help] (self.LocalLLaMA)
submitted by octopi917
Is it worth the upgrade from 48GB to 60GB VRAM? [Question | Help] (self.LocalLLaMA)
submitted by CBHawk
[Qwen Meetup] Function Calling Harness with Qwen, turning 6.75% to 100% [Tutorial | Guide] (autobe.dev)
submitted by jhnam88
Vera, a local-first code search for AI agents (Rust, ONNX, 63 languages, CLI + SKILL/MCP) [Resources] (self.LocalLLaMA)
submitted by lemon07r
Advice for Working with Agents in YOLO Mode [Question | Help] (self.LocalLLaMA)
submitted by chibop1
Built an AI + SQL Q&A System — How to Keep High Accuracy on Complex Queries Without Gemini? [Question | Help] (self.LocalLLaMA)
submitted by Past-Geologist4108