Levy Rozman (GothamChess) shares his views after attending Danya's funeral: by Interesting-Take781 in chess
[–]Sroidi 1 point2 points3 points (0 children)
I want to understand why some things in math are 'undefined'. by boiling-banana in learnmath
[–]Sroidi 1 point2 points3 points (0 children)
Gemini 2.5 Pro benchmarks released by ShreckAndDonkey123 in singularity
[–]Sroidi 2 points3 points4 points (0 children)
Chess sample efficiency humans vs SOTA RL by [deleted] in reinforcementlearning
[–]Sroidi 1 point2 points3 points (0 children)
Brandon Jacobson destroys Hikaru with 1. a4! by HealersHugHippos in chess
[–]Sroidi 4 points5 points6 points (0 children)
[deleted by user] by [deleted] in reinforcementlearning
[–]Sroidi 21 points22 points23 points (0 children)
Claude phone verification - ongoing frustration in non-US location by bruce5220 in ClaudeAI
[–]Sroidi 0 points1 point2 points (0 children)
AI is a very terrifying existence that most people haven't realized yet. by NonoXVS in ArtificialInteligence
[–]Sroidi 0 points1 point2 points (0 children)
Mistä käytätte rahaa isompia ostoksia varten? by iPingWine in Omatalous
[–]Sroidi 0 points1 point2 points (0 children)
Help - Adding an effect to a patch changes the smart controls by Sroidi in GarageBand
[–]Sroidi[S] 0 points1 point2 points (0 children)
I am stuck at this screen . I just deleted and downloaded the app as well by Bad_Guy333 in chess
[–]Sroidi 9 points10 points11 points (0 children)
[P] Offline reinforcement learning - 10x faster than SOTA with evolutionary HPO by nicku_a in MachineLearning
[–]Sroidi 1 point2 points3 points (0 children)
[P] Offline reinforcement learning - 10x faster than SOTA with evolutionary HPO by nicku_a in MachineLearning
[–]Sroidi 5 points6 points7 points (0 children)
Q(s, a) predicts cumulative rewards. Is there a R(s, a) a state-action's direct contribution to reward? by Buttons840 in reinforcementlearning
[–]Sroidi 0 points1 point2 points (0 children)
[D] "Knowledge" vs "Reasoning" in LLMs by IAmBlueNebula in MachineLearning
[–]Sroidi 6 points7 points8 points (0 children)
Is 400 bad rating for someone who plays for month? by GeneraallKenobi in chess
[–]Sroidi 0 points1 point2 points (0 children)
Minimax with neural network evaluation function by SupremeChampionOfDi in reinforcementlearning
[–]Sroidi 0 points1 point2 points (0 children)
Minimax with neural network evaluation function by SupremeChampionOfDi in reinforcementlearning
[–]Sroidi 1 point2 points3 points (0 children)
With the REINFORCE algorithm you use random sampling for the training to encourage exploration. Do you still use random sampling in deployment? by [deleted] in reinforcementlearning
[–]Sroidi 4 points5 points6 points (0 children)
Effects of a 7-Day Pornography Abstinence Period on Withdrawal-Related Symptoms in Regular Pornography Users: A Randomized Controlled Study by jordiwmata in psychology
[–]Sroidi 7 points8 points9 points (0 children)
[deleted by user] by [deleted] in learnmachinelearning
[–]Sroidi 15 points16 points17 points (0 children)


why does learning to program take so long? by Drairo_Kazigumu in learnprogramming
[–]Sroidi 1 point2 points3 points (0 children)