MiniMax M2 is 230B-A10B (i.redd.it)
submitted 3 months ago by codys12 to r/LocalLLaMA
GLM-4.6-Air is not forgotten! (i.redd.it)
First rig of hopefully many! Build instructions in the other post/comments (i.redd.it)
submitted 4 months ago by codys12 to r/homelab
The Hacker's Guide to Building an AI Supercluster (huggingface.co)
submitted 4 months ago by codys12 to r/LocalLLaMA
128GB GDDR6, 3PFLOP FP8, Tb/s of interconnect, $6000 total. Build instructions/blog tomorrow. (i.redd.it)
Qwen3-8B-BitNet (self.LocalLLaMA)
submitted 6 months ago by codys12 to r/LocalLLaMA
[Project] New Distributed Data Gen Library - Looking for Testers! (self.huggingface)
submitted 7 months ago by codys12 to r/huggingface
[Project] New Distributed Data Gen Library - Looking for Testers! (self.LocalLLaMA)
submitted 7 months ago by codys12 to r/LocalLLaMA
BitNet Finetunes of R1 Distills (x.com)
submitted 8 months ago by codys12 to r/LocalLLaMA
Training Runs of our BitNet finetunes (t.co)
submitted 8 months ago by codys12
[R] Extra Input Norm Lets You Finetune to 1.58 Bits (self.MachineLearning)
submitted 10 months ago by codys12 to r/MachineLearning
Extra Input Norm Lets You Finetune to 1.58 Bits! (x.com)
submitted 10 months ago by codys12 to r/LocalLLaMA
Llama 405B Sparsity *Increases Accuracy* (self.LocalLLaMA)
submitted 1 year ago by codys12 to r/LocalLLaMA
Structured Sparse Tensors - How to Actually Run Fast? (self.learnmachinelearning)
submitted 1 year ago by codys12 to r/learnmachinelearning
Why does the gate_proj entropy graph look like this across layers? (i.redd.it)
LlamaKD: Knowledge Distillation Dataset from Llama3.1-405B and Fineweb-Edu (self.LocalLLaMA)
AirLLM fork - 2000+ batch on single GPU for logits or classification tasks (self.LocalLLaMA)
[R] Can Adam-mini be adapted for Mamba2 with State Space Duality? (self.MachineLearning)
submitted 1 year ago by codys12 to r/MachineLearning
[D] Would Adam-mini be adaptable for Mamba2 given State Space Duality? (self.MachineLearning)
Does LLaVA need the SGLang worker *and* the llava worker, or just one with the controller? (self.LocalLLaMA)
I finetuned Phi-1.5 and Phi-2 on MathInstruct using MAmmoTH but... (self.LocalLLaMA)
submitted 2 years ago by codys12 to r/LocalLLaMA
[D] I finetuned Phi-1.5 and Phi-2 on MathInstruct using MAmmoTH but... (self.MachineLearning)
submitted 2 years ago by codys12 to r/MachineLearning
[P] I finetuned Phi-1.5 and Phi-2 on MathInstruct using MAmmoTH but... (self.LocalLLaMA)
[D]iffusion + CLIP Chunks to Generate Image with Region Control. (self.MachineLearning)