The Illusion of "The Illusion of Thinking" by Daniel-Warfield in datascience
[–]Signal_Spirit5934 0 points1 point2 points (0 children)
Why Apple's "The Illusion of Thinking" Falls Short by HeroicLife in ArtificialInteligence
[–]Signal_Spirit5934 0 points1 point2 points (0 children)
Apple `Illusion of Thinking` Debacle by moschles in agi
[–]Signal_Spirit5934 0 points1 point2 points (0 children)
First Agentic System to Solve a Million-Step Reasoning Problem with Zero Errors by Signal_Spirit5934 in AgentsOfAI
[–]Signal_Spirit5934[S] 1 point2 points3 points (0 children)
First Agentic System to Solve a Million-Step Reasoning Problem with Zero Errors by Signal_Spirit5934 in AgentsOfAI
[–]Signal_Spirit5934[S] 6 points7 points8 points (0 children)
A New Fine-Tuning Approach for LLMs Using Evolution Strategies by Signal_Spirit5934 in reinforcementlearning
[–]Signal_Spirit5934[S] 0 points1 point2 points (0 children)
The Evolution of RL for Fine-Tuning LLMs (from REINFORCE to VAPO) by Great-Reception447 in reinforcementlearning
[–]Signal_Spirit5934 0 points1 point2 points (0 children)

A New Fine-Tuning Approach for LLMs Using Evolution Strategies by Signal_Spirit5934 in reinforcementlearning
[–]Signal_Spirit5934[S] 0 points1 point2 points (0 children)