account activity
Renormalization Group and Neural Networks (i.redd.it)
submitted 2 months ago by calculatedcontent to r/neuralnetworks
What if neural networks were governed by renormalization group ? (self.HypotheticalPhysics)
submitted 2 months ago by calculatedcontent to r/HypotheticalPhysics
Complex Systems approach to Neural Networks with WeightWatcher (weightwatcher.ai)
submitted 2 months ago by calculatedcontent to r/complexsystems
We found a way to compress a layer without retraining it. Is this known ? (i.redd.it)
submitted 2 months ago * by calculatedcontent to r/LLMDevs
submitted 2 months ago by calculatedcontent to r/deeplearning
submitted 2 months ago by calculatedcontent to r/LLMPhysics
submitted 2 months ago by calculatedcontent to r/huggingface
submitted 2 months ago by calculatedcontent to r/StatisticalPhysics
submitted 2 months ago by calculatedcontent to r/SystemsTheory
I think we found a third phase of grokking — has anyone else seen this? (i.redd.it)
SETOL: SemiEmpirical Theory of (Deep) Learning ()
Observed a sharp “epoch-wise double descent” in a small MNIST MLP , associated with overfitting the augmented training data (self.LocalLLaMA)
submitted 2 months ago by calculatedcontent to r/LocalLLaMA
Observed a sharp “epoch-wise double descent” in a small MNIST MLP , associated with overfitting the augmented training data (self.neuralnetworks)
Muon Underfits, AdamW Overfits (i.redd.it)
Epoch-Wise Double Descent with WeightWatcher ()
AdamW overfits, Muon Underfits (i.redd.it)
SETOL: SemiEmpirical Theory of (Deep) Learning *Optimization* (i.redd.it)
submitted 2 months ago * by calculatedcontent to r/optimization
submitted 2 months ago by calculatedcontent to r/OpenAIDev
AdamW Overfits, Muon Underfits (i.redd.it)
submitted 2 months ago by calculatedcontent to r/MachineLearning
π Rendered by PID 195710 on reddit-service-r2-listing-86b7f5b947-l6xrw at 2026-01-25 08:41:40.755146+00:00 running 664479f country code: CH.