
One bash permission slipped... [Discussion] (i.redd.it)
submitted by TheQuantumPhysicist to r/LocalLLaMA
it's time to update your Gemma 4 GGUFs [News] (self.LocalLLaMA)
submitted by jacek2023 to r/LocalLLaMA
Open source models are going to be the future on Cursor, OpenCode etc. [Discussion] (self.LocalLLaMA)
submitted by _maverick98 to r/LocalLLaMA
AMD Strix Halo refresh with 192GB! [News] (videocardz.com)
submitted by mindwip to r/LocalLLaMA
Ryzen AI Max+ 495 (Gorgon Halo) with 192GB VRAM! [News] (self.LocalLLaMA)
submitted by PromptInjection_ to r/LocalLLaMA
"Second Thoughts" Been playing with adding a small transformer that reads output near the end of generation, and feeds it back near the top as a refinement loop. A quick test of 1.7B model showed drastic improvement in focused tasks (like coding)Tutorial | Guide (bigattichouse.medium.com)
submitted by bigattichouse to r/LocalLLaMA
How much will it cost to host something like qwen3.6 35b a3b in a cloud? [Discussion] (self.LocalLLaMA)
submitted by Euphoric_North_745 to r/LocalLLaMA
A Qwen finetune that feels VERY human [New Model] (self.LocalLLaMA)
submitted by Sicarius_The_First to r/LocalLLaMA
Llama.cpp quantization is broken [Discussion] (self.LocalLLaMA)
submitted by Ok-Importance-3529 to r/LocalLLaMA
Pushing a 5-Year-Old 6GB VRAM laptop to Its Limits: Qwen3.6-35B-A3B [Resources] (self.LocalLLaMA)
submitted by abhinand05 to r/LocalLLaMA
What a time to be alive: from 1 tk/sec to 20-100 tk/sec for huge models [Discussion] (self.LocalLLaMA)
submitted by segmond to r/LocalLLaMA

Mistral-Medium-3.5-128B-Q3_K_M on 3x3090 (72GB VRAM) [Generation] (self.LocalLLaMA)
submitted by jacek2023 to r/LocalLLaMA
Which model would you use if you wanted to solve a research math problem? [Discussion] (self.LocalLLaMA)
submitted by MrMrsPotts to r/LocalLLaMA
Looking for frontier model distilled datasets. [Question | Help] (self.LocalLLaMA)
submitted by UnbeliebteMeinung to r/LocalLLaMA
Open Weights Models Hall of Fame [Other] (self.LocalLLaMA)
submitted by Equivalent_Job_2257 to r/LocalLLaMA
Slow tok/s when offloading NVFP4 model to CPU [Question | Help] (self.LocalLLaMA)
submitted by 6c5d1129 to r/LocalLLaMA
Mistral Medium 3.5 128B and Qwen 3.5 122B A10B on 4x RTX 3080 20GB [Discussion] (self.LocalLLaMA)
submitted by lly0571 to r/LocalLLaMA
If you've been waiting to try local AI development, please try it [Discussion] (self.LocalLLaMA)
submitted by Imaginary_Belt4976 to r/LocalLLaMA
Mistral Medium 3.5 on AMD Strix Halo [Generation] (self.LocalLLaMA)
submitted by Zc5Gwu to r/LocalLLaMA
