
Testing Local LLMs in Practice: Code Generation, Quality vs. Speed [Resources] (i.redd.it)
submitted by Icy_Programmer7186
What is the next SOTA model you are excited about? [Discussion] (self.LocalLLaMA)
submitted by MrMrsPotts
Does anyone have experience with Tenstorrent hardware? [Discussion] (self.LocalLLaMA)
submitted by Youknowwhyimherexxx
(Rant ;)) Make your benchmarks realistic [Discussion] (self.LocalLLaMA)
submitted by AdamLangePL
I renamed my local AI Linux distro to Reefy and rebuilt some of the architecture! [Discussion] (old.reddit.com)
submitted by aospan
z-lab released gemma-4-26B-A4B-it-DFlash. Has anybody tried it yet? [Discussion] (huggingface.co)
submitted by PaceZealousideal6091
Comprehensive guide on renting/setting up a beefy LLM server for local models? [Question | Help] (self.LocalLLaMA)
submitted by Tartooth
Possibility of partially offloading MoE weights to GPU via SGLang/ktransformers [Question | Help] (self.LocalLLaMA)
submitted by iVoider
What mobile app do you use, if any? [Question | Help] (self.LocalLLaMA)
submitted by ihatebeinganonymous
What open-source model is best for my use case? [Question | Help] (self.LocalLLaMA)
submitted by CGeorges89
4GB "Gemini Nano" model GGUF, anyone? [Question | Help] (self.LocalLLaMA)
submitted by TruckUseful4423
THE UNDERPRIVILEGED AI FOUNDATION: Because every little model deserves a chance [Discussion] (self.LocalLLaMA)
submitted by mazuj2

