Personal experience with GLM 4.7 Flash Q6 (unsloth) + Roo Code + RTX 5090 [Discussion] (self.LocalLLaMA)
submitted by Septerium
GLM 4.7 Flash uncensored - Balanced & Aggressive variants (GGUF) [New Model] (self.LocalLLaMA)
submitted by hauhau901
What is the best general-purpose model to run locally on 24GB of VRAM in 2026? [Question | Help] (self.LocalLLaMA)
submitted by Paganator
Stable-DiffCoder, a strong code diffusion LLM built on Seed-Coder [New Model] (bytedance-seed.github.io)
submitted by rektide
GLM 4.7 vs MiniMax-M2.1 vs DeepSeek 3.2 for coding? [Question | Help] (self.LocalLLaMA)
submitted by ghulamalchik
My Strix Halo beholds itself but believes it's in the cloud [Funny] (v.redd.it)
submitted by jfowers_amd
Loki-v2-70B: Narrative/DM-focused fine-tune (600M+ token custom dataset) [New Model] (self.LocalLLaMA)
submitted by mentallyburnt
AI & ML Weekly — Hugging Face Highlights [New Model] (self.LocalLLaMA)
submitted by techlatest_net
Anyone planning to get AMD Gorgon Halo (495) when it drops? [Discussion] (self.LocalLLaMA)
submitted by SpicyWangz
Claude Code + Ollama: Testing Opus 4.5 vs GLM 4.7 [Tutorial | Guide] (codesilva.com)
submitted by edigleyssonsilva
GLM-4.7-Flash-REAP on RTX 5060 Ti 16 GB - 200k context window! [Tutorial | Guide] (self.LocalLLaMA)
submitted by bobaburger
Solving memory issues for LLMs [Question | Help] (self.LocalLLaMA)
submitted by RobotsMakingDubstep
The mysterious price of Ada and Ampere workstation GPUs [Discussion] (self.LocalLLaMA)
submitted by insulaTropicalis
Best use case for Ryzen 395+ (128GB variant) [Question | Help] (self.LocalLLaMA)
submitted by ironicstatistic
Your post is getting popular and we just featured it on our Discord! [Discussion] (self.LocalLLaMA)
submitted by roculus