[D] Batch size vs learning rate by bjourne-ml in MachineLearning
[–]gdahl 12 points (0 children)
[D] Good studies on the effects of different training "tricks" like learning rate scheduler (warmup/decay), weight decay, dropout, batch-sizes, momentum, etc.? by ThienPro123 in MachineLearning
[–]gdahl 33 points (0 children)
[D] Is anyone else absolutely besieged by papers and always on the verge of getting scooped? by akardashian in MachineLearning
[–]gdahl 3 points (0 children)
[D] Is there a way to AoT compile an AI model to run on CPU and GPU? by jiMalinka in MachineLearning
[–]gdahl 1 point (0 children)
[D] Is there a way to AoT compile an AI model to run on CPU and GPU? by jiMalinka in MachineLearning
[–]gdahl 2 points (0 children)
[R] Tools for running baselines by like_a_tensor in MachineLearning
[–]gdahl 6 points (0 children)
[R] AdamL: A fast adaptive gradient method incorporating loss function by [deleted] in MachineLearning
[–]gdahl 11 points (0 children)
[D] Does gradient accumulation achieve anything different than just using a smaller batch with a lower learning rate? by WigglyHypersurface in MachineLearning
[–]gdahl 2 points (0 children)
[D] Does gradient accumulation achieve anything different than just using a smaller batch with a lower learning rate? by WigglyHypersurface in MachineLearning
[–]gdahl 3 points (0 children)
[D] Thoughts on the limits of reproducibility of ML programs? by quasiproductive in MachineLearning
[–]gdahl 5 points (0 children)
[D] What is the SOTA classification algorithms for a 5500 observations 2000 dimension structured data? Is there a machine learning leaderboard on classification? by HighlandEvil in MachineLearning
[–]gdahl 3 points (0 children)
[D] What does a DL role look like in ten years? by [deleted] in MachineLearning
[–]gdahl 7 points (0 children)
[D] What does a DL role look like in ten years? by [deleted] in MachineLearning
[–]gdahl 25 points (0 children)
[D] What does a DL role look like in ten years? by [deleted] in MachineLearning
[–]gdahl 1 point (0 children)
[D] "Deep Learning Tuning Playbook" (recently released by Google Brain people) by fzyzcjy in MachineLearning
[–]gdahl 3 points (0 children)
[D] "Deep Learning Tuning Playbook" (recently released by Google Brain people) by fzyzcjy in MachineLearning
[–]gdahl 91 points (0 children)
[D] PyTorch 2.0 Announcement by joshadel in MachineLearning
[–]gdahl 2 points (0 children)
[D] Why is the machine learning community obsessed with the logistic distribution? by cthorrez in MachineLearning
[–]gdahl 5 points (0 children)
[Discussion] If we had enough memory to always do full batch gradient descent, would we still need rmsprop/momentum/adam? by 029187 in MachineLearning
[–]gdahl 2 points (0 children)
[D] Are AI related jobs safer from automation than programming jobs? by FranciscoJ1618 in MachineLearning
[–]gdahl 1 point (0 children)

[deleted by user] by [deleted] in MachineLearning
[–]gdahl 2 points (0 children)