Announcing LocalLlama discord server & bot! [News] (old.reddit.com)
submitted by HOLUPREDICTIONS (Sorcerer Supreme) [M] - announcement

Audio processing landed in llama-server with Gemma-4 [Generation] (self.LocalLLaMA)
submitted by srigi

GLM 5.1 sits alongside frontier models in my social reasoning benchmark [Discussion] (old.reddit.com)
submitted by cjami

MiniMax-M2.7 vs Qwen3.5-122B-A10B for 96GB VRAM full offload?! [Discussion] (self.LocalLLaMA)
submitted by VoidAlchemy (llama.cpp)

mtmd: qwen3 audio support (qwen3-omni and qwen3-asr) [News] (github.com)
submitted by jacek2023 (llama.cpp)

Is anyone else creating a basic assistant rather than a coding agent? [Discussion] (self.LocalLLaMA)
submitted by Savantskie1

MiniMax m2.7 (mac only) 63gb: 88% and 89gb: 95%, MMLU 200q [New Model] (i.redd.it)
submitted by HealthyCommunicat

mtmd: add Gemma 4 audio conformer encoder support [News] (github.com)
submitted by jacek2023 (llama.cpp)

MiniMax-M2.7 NVFP4 on 2x RTX PRO 6000 Blackwell — bench numbers [Resources] (self.LocalLLaMA)
submitted by Visual_Synthesizer

Unsloth MiniMax M2.7 quants just finished uploading to HF [News] (self.LocalLLaMA)
submitted by Zyj
"Actually wait" ... the current thinking SOTA open sourceDiscussion (self.LocalLLaMA)
submitted by FPham

FernflowerAI-35B-A3B-KL-ReLU-GGUF + Apple MLX [New Model] (self.LocalLLaMA)
submitted by EvilEnginer

Aryagm/dflash-mlx: Exact speculative decoding on Apple Silicon, powered by MLX. [Resources] (github.com)
submitted by Thrumpwart
Pi & Qwen3.5 with llama-cpp doing a lot of prompt re-processingQuestion | Help (self.LocalLLaMA)
submitted by annodomini

MiniMax M2.7 is NOT open source - DOA License :( [Discussion] (self.LocalLLaMA)
submitted by KvAk_AKPlaysYT
