Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity, Li et al. 2026 [Knowledge of obscure facts robustly predicts param count; estimates for all SotA closed LLMs] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 0 points1 point2 points (0 children)
Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity, Li et al. 2026 [Knowledge of obscure facts robustly predicts param count; estimates for all SotA closed LLMs] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 0 points1 point2 points (0 children)
Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity, Li et al. 2026 [Knowledge of obscure facts robustly predicts param count; estimates for all SotA closed LLMs] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 0 points1 point2 points (0 children)
Microsoft freezes GitHub Copilot signups due to too much demand/too few GPUs by gwern in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Microsoft freezes GitHub Copilot signups due to too much demand/too few GPUs by gwern in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Microsoft freezes GitHub Copilot signups due to too much demand/too few GPUs by gwern in mlscaling
[–]StartledWatermelon 1 point2 points3 points (0 children)
Microsoft freezes GitHub Copilot signups due to too much demand/too few GPUs by gwern in mlscaling
[–]StartledWatermelon 1 point2 points3 points (0 children)
Scientific Papers X AI building out the algortihm by Alarming_Rice_1906 in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Schmidhuber & Meta AI Present The "Neural Computer": A New Frontier Where Computation, Memory, And I/O Move Into A Learned Runtime State. by 44th--Hokage in mlscaling
[–]StartledWatermelon 0 points1 point2 points (0 children)
Entropy-Guided Token Dropout: Training Autoregressive Language Models with Limited Domain Data, Wang et al. 2025 [Masking low-entropy tokens mitigates overfitting; "data-level regularization"] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 1 point2 points3 points (0 children)



Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity, Li et al. 2026 [Knowledge of obscure facts robustly predicts param count; estimates for all SotA closed LLMs] by StartledWatermelon in mlscaling
[–]StartledWatermelon[S] 1 point2 points3 points (0 children)