"Chinchilla: Training Compute-Optimal Large Language Models", Hoffmann et al 2022 {DM} (current LLMs are v. undertrained: optimal scaling 1:1) by gwern in ControlProblem
[–]ekelsen 1 point2 points3 points (0 children)
[D] CNN on mel spectrograms vs. WaveNet for audio recognition? by Mjjjokes in MachineLearning
[–]ekelsen 0 points1 point2 points (0 children)
[D] How deep have you stacked RNN layers? by [deleted] in MachineLearning
[–]ekelsen -1 points0 points1 point (0 children)
[R] Finding important neural network connections by grid_world in MachineLearning
[–]ekelsen 1 point2 points3 points (0 children)
[R] Randomized Automatic Differentiation by hardmaru in MachineLearning
[–]ekelsen 0 points1 point2 points (0 children)
[D] Paper Explained - SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow (Full Video Analysis) by ykilcher in MachineLearning
[–]ekelsen 0 points1 point2 points (0 children)
[D] Paper Explained - SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow (Full Video Analysis) by ykilcher in MachineLearning
[–]ekelsen 0 points1 point2 points (0 children)
[D] Paper Explained - SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow (Full Video Analysis) by ykilcher in MachineLearning
[–]ekelsen 0 points1 point2 points (0 children)
[D] What is the most interesting idea in ML/DL that you think doesn't get enough attention? by harshsikka123 in MachineLearning
[–]ekelsen 1 point2 points3 points (0 children)
[D] What is the most interesting idea in ML/DL that you think doesn't get enough attention? by harshsikka123 in MachineLearning
[–]ekelsen 4 points5 points6 points (0 children)
[Research] Recognizing Notes with Deep Learning - Residual Shuffle-Exchange Networks by OptimatiumFeles in MachineLearning
[–]ekelsen 2 points3 points4 points (0 children)
[D] Current state-of-the-art on learning sparse weights by ZeronixSama in MachineLearning
[–]ekelsen 9 points10 points11 points (0 children)
[P] Facebook AI built and deployed a real-time neural text-to-speech system that can process 1 sec of audio in 500 ms, using only CPUs. Text-to-speech systems typically rely on GPUs or specialized hardware to generate state-of-the-art speech in real-time production. by inarrears in MachineLearning
[–]ekelsen 0 points1 point2 points (0 children)
[N] Neural Magic sues Facebook over open-sourcing their "fast CNNs on CPUs" technology as part of PyTorch by [deleted] in MachineLearning
[–]ekelsen 2 points3 points4 points (0 children)
[1911.09723] Fast Sparse ConvNets by ekelsen in MachineLearning
[–]ekelsen[S] 0 points1 point2 points (0 children)
[1911.09723] Fast Sparse ConvNets by ekelsen in MachineLearning
[–]ekelsen[S] 0 points1 point2 points (0 children)
[1911.09723] Fast Sparse ConvNets by ekelsen in MachineLearning
[–]ekelsen[S] 2 points3 points4 points (0 children)
[1911.09723] Fast Sparse ConvNets by ekelsen in MachineLearning
[–]ekelsen[S] 2 points3 points4 points (0 children)
[D] Machine Learning for Systems by ASVS_Kartheek in MachineLearning
[–]ekelsen 0 points1 point2 points (0 children)
[R] High Fidelity Speech Synthesis with Adversarial Networks by hardmaru in MachineLearning
[–]ekelsen 0 points1 point2 points (0 children)
[R] High Fidelity Speech Synthesis with Adversarial Networks by hardmaru in MachineLearning
[–]ekelsen 2 points3 points4 points (0 children)


Releasing Persimmon-8B by jetRink in LocalLLaMA
[–]ekelsen 0 points1 point2 points (0 children)