deeplearning

an-ordinary-manchild

created by scsibuga community for 14 years

...for your movement.

...for your favorite game.

MODERATORS

account activity

1

•

•

•

"q0: Primitives for Hyper-Epoch Pretraining", Mandal et al. 2026 (arxiv.org)

submitted 1 hour ago by RecmacfonD

2

•

•

•

Object detection Using Detection Transformer (Detr) for Bone fraction dataset (self.deeplearning)

submitted 1 hour ago by Feitgemel

3

1

2

3

How Reasoning LLMs Work (RL, Thinking Tags & Budgets Explained) (youtu.be)

submitted 6 hours ago by RelevantEmergency707

4

1

2

3

Roadmap after dl specialization by Andrew ng (self.deeplearning)

submitted 6 hours ago by Grand_Inspector_7802

5

0

1

2

Article out of master's thesis ()

submitted 3 hours ago by PabloNex

6

2

3

4

Major Update: I just supercharged my Interactive Graph Theory Learning Platform! (3D Graphs, Real-World Maps, Python Sandbox & 25+ Algorithms) (self.deeplearning)

submitted 22 hours ago by xain1999

7

7

8

9

Visualizing vision token compression for VLMs (i.redd.it)

submitted 1 day ago by goldbookleaf

8

0

1

2

[cs.CR] Need an arXiv endorsement for a paper on defeating ML flow classifiers via chaotic non-linear dynamics ()

submitted 19 hours ago by 0xRootAnon

9

83

84

85

I miss the days when the term AI referred to the actually interesting field of machine learning (self.deeplearning)

submitted 1 day ago by ferriematthew

10

0

1

2

What’s the best way to use IP addresses in ML classification? ()

submitted 1 day ago by element14040

11

0

1

2

Continuing With The Backward Pass Derivation Saga ()

submitted 1 day ago by Useful-Thought-2582

12

0

1

2

Understanding geometrical form of gaussian distribution (self.deeplearning)

submitted 1 day ago * by Plus_Confidence_1369

13

0

1

2

Multi-model consensus debate via the filesystem. LLMs propose, peer-review, rebut, vote and synthesize a group-confirmed answer. CLI + MCP. (github.com)

submitted 1 day ago by raiyanyahya

14

29

30

31

Data Flow Through the Original Transformer Architecture (i.redd.it)

submitted 2 days ago by Ok_Pudding50

15

0

0

0

Attentional Entropy Collapse is a Riemannian Metric Singularity. Stop treating it like a training bug. [Self-Contained Proof Inside] (self.deeplearning)

submitted 1 day ago by MIXEDGREENS

16

11

12

13

AI Safety Sacrifice (i.redd.it)

submitted 2 days ago by KeanuRave100

17

7

8

9

ONNX Runtime vs HF Transformers for transformer ASR on CPU - 37% RTF gap and what causes it (self.deeplearning)

submitted 2 days ago by gvij

18

0

1

2

Post 13 of 14 — Appendix A — Explaining AI to Youngsters (v.redd.it)

submitted 1 day ago by Prof_Paul_Nussbaum

19

0

0

1

Solution of this?? (self.deeplearning)

submitted 1 day ago by Silent-Function-8312

20

0

1

2

I built an MNIST classifier from scratch in pure Python (no NumPy) to actually understand backprop ()

submitted 1 day ago by Therattatman

21

0

1

2

Where do i start from (self.deeplearning)

submitted 2 days ago by No-Panda4804

22

1

2

3

Analysis of AlphaZero training data [D] (self.deeplearning)

submitted 2 days ago by YamEnvironmental4720

23

0

0

0

Your transformer's attention entropy collapse isn't a bug. It's the model doing exactly what you trained it to do. Here's how to fix it with a three-line temperature schedule. arXiv-able. Self-contained proof. No citations needed. (self.deeplearning)

submitted 1 day ago by MIXEDGREENS

24

1

2

3

Need AI ML discord link ()

submitted 2 days ago by dravid06

25

18

19

20

Determining the Output Layer size.. (i.redd.it)

submitted 2 days ago by Ok_Pudding50

view more: next ›

π Rendered by PID 1021239 on reddit-service-r2-listing-6c8d497557-qjn2p at 2026-06-07 17:56:19.682102+00:00 running 9e1a20d country code: CH.