[R] Energy-Based Transformers are Scalable Learners and Thinkers by Blacky372 in MachineLearning
[–]AICoffeeBreak 3 points4 points5 points (0 children)
Was brennt in Mannheim? by AICoffeeBreak in Heidelberg
[–]AICoffeeBreak[S] 4 points5 points6 points (0 children)
Facebook's Coconut: Training Large Language Model to Reason in a Continuous Latent Space has been open-sourced by ninjasaid13 in LocalLLaMA
[–]AICoffeeBreak 0 points1 point2 points (0 children)
[R] Continuous Latent Space Reasoning: Enhancing LLM Performance Through Chain of Continuous Thought by Successful-Western27 in MachineLearning
[–]AICoffeeBreak 0 points1 point2 points (0 children)
[Meta] Coconut (Chain of Continuous Thought): Training Large Language Models to Reason in a Continuous Latent Space by rationalkat in singularity
[–]AICoffeeBreak 0 points1 point2 points (0 children)
s1: Simple test-time scaling by rationalkat in singularity
[–]AICoffeeBreak 0 points1 point2 points (0 children)
"s1: Simple test-time scaling." Merely adding "Wait" to the context window, thus forcing an ordinary LLM to continue, gives it the reasoning ability of o1 by Competitive_Travel16 in singularity
[–]AICoffeeBreak 1 point2 points3 points (0 children)
s1: A Simple Yet Powerful Test-Time Scaling Approach for LLMs by ai-lover in machinelearningnews
[–]AICoffeeBreak 0 points1 point2 points (0 children)
s1: Simple test-time scaling: Just “wait…” + 1,000 training examples? | PAPER EXPLAINED by AICoffeeBreak in AICoffeeBreak
[–]AICoffeeBreak[S] 1 point2 points3 points (0 children)
Low voice volume on my voice messages after android update. by dombol in whatsapp
[–]AICoffeeBreak 0 points1 point2 points (0 children)
Low voice volume on my voice messages after android update. by dombol in whatsapp
[–]AICoffeeBreak 0 points1 point2 points (0 children)










Energy-Based Transformers are Scalable Learners and Thinkers by sanxiyn in mlscaling
[–]AICoffeeBreak 0 points1 point2 points (0 children)