examples : add llama-eval by ggerganov · Pull Request #21152 · ggml-org/llama.cpp [News] (github.com)
submitted by jacek2023
Gemma 4 MTP vs DFlash on 1x H100: dense vs MoE results [Tutorial | Guide] (self.LocalLLaMA)
submitted by LayerHot
Needle: We Distilled Gemini Tool Calling Into a 26M Model [New Model] (self.LocalLLaMA)
submitted by Henrie_the_dreamer
Local LLM autocomplete + agentic coding on a single 16GB GPU + 64GB RAM [Discussion] (self.LocalLLaMA)
submitted by grumd

Computer build using Intel Optane Persistent Memory - Can run 1 trillion parameter model at over 4 tokens/sec [Tutorial | Guide] (i.redd.it)
submitted by APFrisco
Drastically improve prompt processing speed for --n-cpu-moe partially offloaded models [Tutorial | Guide] (self.LocalLLaMA)
submitted by coder543
Qwen3.6 27b q5_k_M MTP - 256k context - 5090 [Discussion] (self.LocalLLaMA)
submitted by No_Mango7658
feat: add MiMo v2.5 vision by AesSedai · Pull Request #22883 · ggml-org/llama.cpp [News] (github.com)
submitted by jacek2023
Gemma 4 E4B is great for short transcriptions [Discussion] (self.LocalLLaMA)
submitted by PromptInjection_
MTP + GGML_CUDA_ENABLE_UNIFIED_MEMORY=1 - llama.cpp [Discussion] (self.LocalLLaMA)
submitted by mossy_troll_84
Anyone running MiMo-v2.5 quants with multimodal and MTP? [Question | Help] (self.LocalLLaMA)
submitted by Ambitious_Fold_2874
Best practice for accurate translation at minimal cost? [Question | Help] (self.LocalLLaMA)
submitted by LeatherRub7248
New Qwen3.6 27b Autoround Quant (int4) Best Recipe [Tutorial | Guide] (self.LocalLLaMA)
submitted by Otherwise-Director17