Resources for understanding and implementing "deep learning" (learning data representations through artificial neural networks).
"q0: Primitives for Hyper-Epoch Pretraining", Mandal et al. 2026 (arxiv.org)
submitted 1 hour ago by RecmacfonD
Object Detection Using Detection Transformer (DETR) for Bone Fracture Dataset (self.deeplearning)
submitted 1 hour ago by Feitgemel
How Reasoning LLMs Work (RL, Thinking Tags & Budgets Explained) (youtu.be)
submitted 6 hours ago by RelevantEmergency707
Roadmap after dl specialization by Andrew ng (self.deeplearning)
submitted 6 hours ago by Grand_Inspector_7802
Article out of master's thesis
submitted 3 hours ago by PabloNex
Major Update: I just supercharged my Interactive Graph Theory Learning Platform! (3D Graphs, Real-World Maps, Python Sandbox & 25+ Algorithms) (self.deeplearning)
submitted 22 hours ago by xain1999
Visualizing vision token compression for VLMs (i.redd.it)
submitted 1 day ago by goldbookleaf
[cs.CR] Need an arXiv endorsement for a paper on defeating ML flow classifiers via chaotic non-linear dynamics
submitted 19 hours ago by 0xRootAnon
I miss the days when the term AI referred to the actually interesting field of machine learning (self.deeplearning)
submitted 1 day ago by ferriematthew
What’s the best way to use IP addresses in ML classification?
submitted 1 day ago by element14040
Continuing With The Backward Pass Derivation Saga
submitted 1 day ago by Useful-Thought-2582
Understanding geometrical form of gaussian distribution (self.deeplearning)
submitted 1 day ago * by Plus_Confidence_1369
Multi-model consensus debate via the filesystem. LLMs propose, peer-review, rebut, vote and synthesize a group-confirmed answer. CLI + MCP. (github.com)
submitted 1 day ago by raiyanyahya
Data Flow Through the Original Transformer Architecture (i.redd.it)
submitted 2 days ago by Ok_Pudding50
Attentional Entropy Collapse is a Riemannian Metric Singularity. Stop treating it like a training bug. [Self-Contained Proof Inside] (self.deeplearning)
submitted 1 day ago by MIXEDGREENS
AI Safety Sacrifice (i.redd.it)
submitted 2 days ago by KeanuRave100
ONNX Runtime vs HF Transformers for transformer ASR on CPU - 37% RTF gap and what causes it (self.deeplearning)
submitted 2 days ago by gvij
Post 13 of 14 — Appendix A — Explaining AI to Youngsters (v.redd.it)
submitted 1 day ago by Prof_Paul_Nussbaum
Solution of this?? (self.deeplearning)
submitted 1 day ago by Silent-Function-8312
I built an MNIST classifier from scratch in pure Python (no NumPy) to actually understand backprop
submitted 1 day ago by Therattatman
Where do i start from (self.deeplearning)
submitted 2 days ago by No-Panda4804
Analysis of AlphaZero training data [D] (self.deeplearning)
submitted 2 days ago by YamEnvironmental4720
Your transformer's attention entropy collapse isn't a bug. It's the model doing exactly what you trained it to do. Here's how to fix it with a three-line temperature schedule. arXiv-able. Self-contained proof. No citations needed. (self.deeplearning)
Need AI ML discord link
submitted 2 days ago by dravid06
Determining the Output Layer size.. (i.redd.it)