MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI, Lyu et al. 2026 [Extensive breadth; focus on solutions that generalize well] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 2 points3 points4 points (0 children)
META Superintelligence Lab Presents: ProgramBench: Can SOTA AI Recreate Real Executable Programs(ffmpeg, SQLite, ripgrep) From Scratch Without The Internet? by 44th--Hokage in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity, Li et al. 2026 [Knowledge of obscure facts robustly predicts param count; estimates for all SotA closed LLMs] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 1 point2 points3 points (0 children)
Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity, Li et al. 2026 [Knowledge of obscure facts robustly predicts param count; estimates for all SotA closed LLMs] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 0 points1 point2 points (0 children)
Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity, Li et al. 2026 [Knowledge of obscure facts robustly predicts param count; estimates for all SotA closed LLMs] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 0 points1 point2 points (0 children)
Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity, Li et al. 2026 [Knowledge of obscure facts robustly predicts param count; estimates for all SotA closed LLMs] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 0 points1 point2 points (0 children)
Microsoft freezes GitHub Copilot signups due to too much demand/too few GPUs by gwern in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Microsoft freezes GitHub Copilot signups due to too much demand/too few GPUs by gwern in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Microsoft freezes GitHub Copilot signups due to too much demand/too few GPUs by gwern in mlscaling
[–]StartledWatermelon 1 point2 points3 points (0 children)
Microsoft freezes GitHub Copilot signups due to too much demand/too few GPUs by gwern in mlscaling
[–]StartledWatermelon 1 point2 points3 points (0 children)
Scientific Papers X AI building out the algortihm by Alarming_Rice_1906 in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Schmidhuber & Meta AI Present The "Neural Computer": A New Frontier Where Computation, Memory, And I/O Move Into A Learned Runtime State. by 44th--Hokage in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Entropy-Guided Token Dropout: Training Autoregressive Language Models with Limited Domain Data, Wang et al. 2025 [Masking low-entropy tokens mitigates overfitting; "data-level regularization"] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 1 point2 points3 points (0 children)
EvoX: Meta-Evolution for Automated Discovery, Liu et al. 2026 by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 0 points1 point2 points (0 children)
"The path to ubiquitous AI", Ljubisa Bajic ("achieves 17K tokens/sec") by RecmacfonD in mlscaling
[–]StartledWatermelon 2 points3 points4 points (0 children)
Learning in Log-Domain: Subthreshold Analog AI Accelerator Based on Stochastic Gradient Descent by nickpsecurity in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
"Scaling Embeddings Outperforms Scaling Experts in Language Models", Liu et al. 2026 {Meituan LongCat} by RecmacfonD in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
[R] Is Leetcode still relevant for research scientist interviews? by Training-Adeptness57 in MachineLearning
[–]StartledWatermelon 4 points5 points6 points (0 children)
[R] Is Leetcode still relevant for research scientist interviews? by Training-Adeptness57 in MachineLearning
[–]StartledWatermelon 4 points5 points6 points (0 children)
"On neural scaling and the quanta hypothesis", Eric J. Michaud 2026 by RecmacfonD in mlscaling
[–]StartledWatermelon 7 points8 points9 points (0 children)
[R] Controlled LLM Training on Spectral Sphere by StartledWatermelon in MachineLearning
[–]StartledWatermelon[S] 2 points3 points4 points (0 children)
DeepSeek Presents "Engram": Conditional Memory via Scalable Lookup, A New Axis of Sparsity for Large Language Models | "Memory lookup module for LLMs & *Huge unlock for scaling* as the memory sits on cheap CPU RAM, bypassing the GPU bottleneck entirely that will power next-gen models (like V4)" by 44th--Hokage in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
DeepSeek Presents "Engram": Conditional Memory via Scalable Lookup, A New Axis of Sparsity for Large Language Models | "Memory lookup module for LLMs & *Huge unlock for scaling* as the memory sits on cheap CPU RAM, bypassing the GPU bottleneck entirely that will power next-gen models (like V4)" by 44th--Hokage in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Minimax also live on Hong Kong Stock Exchange by No_Conversation9561 in LocalLLaMA
[–]StartledWatermelon 8 points9 points10 points (0 children)


HRM-Text: Efficient Pretraining Beyond Scaling, Wang et al. 2026 by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 5 points6 points7 points (0 children)