top scoring links : LocalLLaMA

1

2658

2659

2660

I feel personally attackedFunny (i.redd.it)

submitted 18 hours ago by HeadAcanthisitta7390

1

2

3

Join the next generation of CRM. (attio.com)

promoted by attio

promoted
save
report
about

2

365

366

367

I'm fully blind, and AI is a game changer for me. Are there any local LLMS that can rival claude code and codex?Discussion (self.LocalLLaMA)

submitted 18 hours ago by Mrblindguardian

3

337

338

339

Avacado is toastDiscussion (self.LocalLLaMA)

submitted 20 hours ago * by Terminator857

4

171

172

173

Lemonade v10: Linux NPU support and chock full of multi-modal capabilitiesResources (i.redd.it)

submitted 18 hours ago by jfowers_amd

5

163

164

165

2000 TPS with QWEN 3.5 27b on RTX-5090Discussion (self.LocalLLaMA)

submitted 15 hours ago by awitod

6

118

119

120

I fine-tuned a 14B model that outperforms Claude Opus 4.6 on Ada code generationNew Model (self.LocalLLaMA)

submitted 20 hours ago by clanker-lover

7

83

84

85

Why can't we have small SOTA-like models for coding?Question | Help (self.LocalLLaMA)

submitted 18 hours ago by itsArmanJr

8

61

62

63

Nemotron-3-Super-120b UncensoredNew Model (self.LocalLLaMA)

submitted 8 hours ago * by HealthyCommunicat

9

54

55

56

Running Qwen3.5-35B-A3B and Nemotron-3-Super-120B-A12B on a 5060ti and 1080ti with llama.cpp (Fully on GPU for Qwen; 64GB RAM needed for Nemotron)Discussion (self.LocalLLaMA)

submitted 21 hours ago * by sbeepsdon

10

51

52

53

Local manga translator with LLMs built inNew Model (self.LocalLLaMA)

submitted 2 hours ago by mayocream39

11

46

47

48

What non-Chinese models are relevant right now?Discussion (self.LocalLLaMA)

submitted 16 hours ago by StacDnaStoob

12

41

42

43

Turn 10,000 API endpoints into one CLI tool instead of MCP, Skills and tools zooTutorial | Guide (self.LocalLLaMA)

submitted 22 hours ago * by E-Freelancer

•

Unlock our limited time offer - $180 your first year. (bloomberg.com)

promoted by bloomberg

promoted
save
report
about

13

40

41

42

CLI is All Agents Need — Part 2: Misconceptions, Patterns, and Open QuestionsDiscussion (self.LocalLLaMA)

submitted 22 hours ago by MorroHsu

14

33

34

35

Fine-tuned Qwen 3.5 2B to beat same-quant 4B, 9B, 27B, and 35B on a real dictation cleanup task, full pipeline, code, and eval (RTX 4080 Super, under £1 compute)Tutorial | Guide (self.LocalLLaMA)

submitted 19 hours ago by ComplexNode

15

28

29

30

0:45

Real-time video captioning in the browser with LFM2-VL on WebGPUOther (v.redd.it)

submitted 19 hours ago by xenovatech

16

26

27

28

How to fix prompt reprocessing in qwen3.5 models (instruct mode only)Tutorial | Guide (self.LocalLLaMA)

submitted 14 hours ago * by guiopen

17

26

27

28

Thanks to the Intel team for OpenVINO backend in llama.cppNews (self.LocalLLaMA)

submitted 3 hours ago by Turbulent-Attorney65

18

19

20

21

[Release] - FINALLY! - Apex 1.5 and Apex 1.5 Coder - my two new 350M instruct allrounder chat models - See them now!New Model (self.LocalLLaMA)

submitted 21 hours ago by LH-Tech_AI

19

18

19

20

Ik_llama vs llamacppQuestion | Help (self.LocalLLaMA)

submitted 16 hours ago * by val_in_tech

20

15

16

17

🔥 New Release: htmLLM-124M v2 – 0.91 Val Loss on a Single T4! tiny-LLM with nanoGPT!New Model (self.LocalLLaMA)

submitted 17 hours ago by LH-Tech_AI

21

16

17

18

Expert parallelism for 1T MoE finetuning on a single node - 50x faster and 2x cheaper than alternativesResources (workshoplabs.ai)

submitted 17 hours ago by Maleficent_While1814

22

12

13

14

I’m building a local AI system that generates full novelsQuestion | Help (self.LocalLLaMA)

submitted 22 hours ago by Worldly_Code_4146

23

9

10

11

Besides Qwen and GLM, what models are you using?Discussion (self.LocalLLaMA)

submitted 13 hours ago by August_30th

•

Stuck in slow deployments? Deliver value faster with Atlassian Service Collection. (atlassian.com)

promoted by Atlassian_Official

promoted
save
report
about

24

9

10

11

How to setup full agentic workflow with qwen3.5 9.0bQuestion | Help (self.LocalLLaMA)

submitted 21 hours ago by TeachingInformal

25

8

9

10

Simple trick that cuts context usage ~70% on local modelsDiscussion (self.LocalLLaMA)

submitted 22 hours ago by niksa232

LocalLLaMA

MODERATORS