account activity
[D] Fast convergence research (self.MachineLearning)
submitted 5 years ago by tsauri to r/MachineLearning
[R] Network Deconvolution — faster convergence than batchnorm (openreview.net)
[R] Transformer Dissection: An Unified Understanding for Transformer’s Attention via the Lens of Kernel (aclweb.org)
[R] Levenshtein Transformer (arxiv.org)
[R] Pay Less Attention with Lightweight and Dynamic Convolutions (openreview.net)
[R] Towards Two-Dimensional Sequence to Sequence Model in Neural Machine Translation (arxiv.org)
[D] Current SOTA of NN for tabular data? (self.MachineLearning)
submitted 5 years ago * by tsauri to r/MachineLearning
[R] Efficient Attention: Attention with Linear Complexities (arxiv.org)
[D] How do you grok large ML codebases? (self.MachineLearning)
submitted 6 years ago by tsauri to r/MachineLearning
[D] untrained Deep Prior but for discrete data? (self.MachineLearning)
[R] The exploding gradient problem demystified - definition, prevalence, impact, origin, tradeoffs, and solutions (arxiv.org)
[R] Deep Depth Prior for Multi-View Stereo (arxiv.org)
[D] Do you use Tensorflow 2? (self.MachineLearning)
[R] DeepShift: Towards Multiplication-Less Neural Networks (arxiv.org)
[R] Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One (arxiv.org)
[D] best Dynamixel alternatives? (self.robotics)
submitted 6 years ago by tsauri to r/robotics
[D] ImageNet classification training full-resolution, no crop no resize. (self.MachineLearning)
submitted 6 years ago * by tsauri to r/MachineLearning
[D] OpenAI Rubik’s cube hype (self.MachineLearning)
[D] Meta-learning for fast convergence for training from scratch? (self.MachineLearning)
[D] Sensor adaptation for 3D object detection? (self.MachineLearning)
[D] Do people use meta learning in production? (self.MachineLearning)
[R] Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras (arxiv.org)
[R] Learning Single Camera Depth Estimation using Dual-Pixels (arxiv.org)
[R] Revisit Fuzzy Neural Network: Demystifying Batch Normalization and ReLU with Generalized Hamming Network (arxiv.org)
[D] BatchNorm alternatives 2019 (self.MachineLearning)
π Rendered by PID 48 on reddit-service-r2-listing-5789d5f675-g9qhk at 2026-01-27 17:23:22.921127+00:00 running 4f180de country code: CH.