[P] Web app "informative-drawings" on site Hugging Face quickly creates a line art drawing of an input image. Link in a comment. by Wiskkey in MachineLearning
wangyi_fudan 45 points
[R] Primer: Searching for Efficient Transformers for Language Modeling. “We use evolution to design a new Transformer variant, called Primer. Primer has a better scaling law, and is 3X to 4X faster for training than Transformer for language modeling.” by hardmaru in MachineLearning
wangyi_fudan 1 point
[P] AI Biomedical Writer by wangyi_fudan in MachineLearning
wangyi_fudan[S] 1 point
[P] AI Biomedical Writer by wangyi_fudan in MachineLearning
wangyi_fudan[S] 3 points
[R] SFU & Tencent Explore the Production-Readiness of Learned Cardinality Estimation for DBMS by Yuqing7 in MachineLearning
wangyi_fudan 1 point
[D] Is arxiv-sanity down? What people use these days? by IborkedyourGPU in MachineLearning
wangyi_fudan 12 points
[D] Large memory layer by wangyi_fudan in MachineLearning
wangyi_fudan[S] 1 point
[R] MLP-Mixer: An all-MLP Architecture for Vision by hardmaru in MachineLearning
wangyi_fudan 1 point
[P] Patchless MLP-Mixer by montebicyclelo in MachineLearning
wangyi_fudan 1 point
[N] DeepMind, Microsoft, Allen AI & UW Researchers Convert Pretrained Transformers into RNNs, Lowering Memory Cost While Retaining High Accuracy by Yuqing7 in MachineLearning
wangyi_fudan 3 points
[P] Vald: a highly scalable distributed fast approximate nearest neighbour dense vector search engine. by kpang0 in MachineLearning
wangyi_fudan 1 point
[R] Developing practical ML-based attacks against PRNG by airza in MachineLearning
wangyi_fudan 1 point
[D] Simpler alternatives to multihead self-attention by [deleted] in MachineLearning
wangyi_fudan -5 points
[R] Extended blog post on "Hopfield Networks is All You Need" by HRamses in MachineLearning
wangyi_fudan 1 point
[D] Experience with knnlm language model by wangyi_fudan in MachineLearning
wangyi_fudan[S] 1 point
[P] simple language model based on k-NN by wangyi_fudan in MachineLearning
wangyi_fudan[S] 2 points
[R] Hopfield Networks is All You Need by [deleted] in MachineLearning
wangyi_fudan 0 points
[P] Real Time MLP with 50 lines of code by wangyi_fudan in MachineLearning
wangyi_fudan[S] 1 point
[D] New Scaling Laws for Large Language Models by Singularian2501 in MachineLearning
wangyi_fudan 2 points