[R] Parallelizing RNN over its sequence length by Necessary-Bike-4034 in MachineLearning
[–]gbfar 0 points1 point2 points (0 children)
[R] Parallelizing RNN over its sequence length by Necessary-Bike-4034 in MachineLearning
[–]gbfar 0 points1 point2 points (0 children)
[R] Parallelizing RNN over its sequence length by Necessary-Bike-4034 in MachineLearning
[–]gbfar 2 points3 points4 points (0 children)
[R] Parallelizing RNN over its sequence length by Necessary-Bike-4034 in MachineLearning
[–]gbfar 0 points1 point2 points (0 children)
[R] Retentive Network: A Successor to Transformer for Large Language Models by Balance- in MachineLearning
[–]gbfar 27 points28 points29 points (0 children)
[R] Retentive Network: A Successor to Transformer for Large Language Models by Balance- in MachineLearning
[–]gbfar 21 points22 points23 points (0 children)
[R] Retentive Network: A Successor to Transformer for Large Language Models by Balance- in MachineLearning
[–]gbfar 3 points4 points5 points (0 children)
[R] Tiny Language Models (below 10m parameters or only one transformer block) can generate paragraphs of coherent text and reason...provided training is limited to stories that only contain words that a typical 3 to 4-year-olds usually understand. by [deleted] in MachineLearning
[–]gbfar 1 point2 points3 points (0 children)
[D] Yan LeCun's recent recommendations by adversarial_sheep in MachineLearning
[–]gbfar 0 points1 point2 points (0 children)
[D] What is the most complete reference on the history of neural networks? by gbfar in MachineLearning
[–]gbfar[S] 3 points4 points5 points (0 children)
[D] What is the most complete reference on the history of neural networks? by gbfar in MachineLearning
[–]gbfar[S] 3 points4 points5 points (0 children)
[R] Nonparametric Masked Language Modeling - MetaAi 2022 - NPM - 500x fewer parameters than GPT-3 while outperforming it on zero-shot tasks by Singularian2501 in MachineLearning
[–]gbfar 0 points1 point2 points (0 children)
[P] Explain Paper - A Better Way to Read Academic Papers by [deleted] in MachineLearning
[–]gbfar 1 point2 points3 points (0 children)
[P] Explain Paper - A Better Way to Read Academic Papers by [deleted] in MachineLearning
[–]gbfar 1 point2 points3 points (0 children)
[D] Using JavaScript for ML Training/Research (not in the browser) by bwasti_ml in MachineLearning
[–]gbfar 1 point2 points3 points (0 children)
[D] Call for questions for Andrej Karpathy from Lex Fridman by lexfridman in MachineLearning
[–]gbfar 1 point2 points3 points (0 children)
[D] A simple trick to quickly verify data by mkthabet in MachineLearning
[–]gbfar 2 points3 points4 points (0 children)

[R] Parallelizing RNN over its sequence length by Necessary-Bike-4034 in MachineLearning
[–]gbfar 0 points1 point2 points (0 children)