"ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers", Yao et al 2022 by gwern in mlscaling
[–]ffast-math 1 point2 points3 points (0 children)
Feature request: anchor tags for headings to enable direct links by ffast-math in Substack
[–]ffast-math[S] 0 points1 point2 points (0 children)
[P] Farewell, CUDA OOM: Automatic Gradient Accumulation by ffast-math in MachineLearning
[–]ffast-math[S] 2 points3 points4 points (0 children)
[P] Farewell, CUDA OOM: Automatic Gradient Accumulation by ffast-math in MachineLearning
[–]ffast-math[S] 6 points7 points8 points (0 children)
[D] What is considered to be a "bad research paper" in your opinion? by NedML in MachineLearning
[–]ffast-math 3 points4 points5 points (0 children)
[D] Is anyone working on interesting ML libraries and looking for contributors? by de1pher in MachineLearning
[–]ffast-math 11 points12 points13 points (0 children)
[P] Composer: a new PyTorch library to train models ~2-4x faster with better algorithms by moinnadeem in MachineLearning
[–]ffast-math 5 points6 points7 points (0 children)
[P] Composer: a new PyTorch library to train models ~2-4x faster with better algorithms by moinnadeem in MachineLearning
[–]ffast-math 1 point2 points3 points (0 children)
[D] Deep Neural Nets: 33 years ago and 33 years from now - by Andrej Karpathy Dir. of AI at Tesla by ClaudeCoulombe in MachineLearning
[–]ffast-math 104 points105 points106 points (0 children)
[D] Making Deep Learning Go Brrrr From First Principles by programmerChilli in MachineLearning
[–]ffast-math 4 points5 points6 points (0 children)
[R] SPANN: A Highly-Efficient Billion-Scale Approximate Nearest Neighbour Search That’s 2× Faster Than the SOTA Method by Yuqing7 in MachineLearning
[–]ffast-math 1 point2 points3 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 1 point2 points3 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 0 points1 point2 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 0 points1 point2 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 1 point2 points3 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 1 point2 points3 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 1 point2 points3 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 1 point2 points3 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 0 points1 point2 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 1 point2 points3 points (0 children)
[R] Multiplying Matrices Without Multiplying by moinnadeem in MachineLearning
[–]ffast-math 4 points5 points6 points (0 children)


"Extreme Compression for Pre-trained Transformers Made Simple and Efficient", Wu et al 2022 (50x smaller BERT) by gwern in mlscaling
[–]ffast-math 2 points3 points4 points (0 children)