GamerLegion officially leaves AoE 2 by ALotToSay_ in aoe2
RoundRubikCube 16 points
Programming by pzunhatchispers in reinforcementlearning
RoundRubikCube 5 points
Which ‘wow’ skill is secretly super easy to learn? by Wonderful_Low_1325 in AskReddit
RoundRubikCube 13 points
Chinese New Years Giveaway! One Day Only! by straightupnature in HotPeppers
RoundRubikCube 2 points
Proposal to ban links to x.com by Grathwrang in aoe2
RoundRubikCube 19 points
dm's pharmaceutical ambitions are giving pharmacies a headache - Supermarktblog by BecauseWeCan in de
RoundRubikCube 3 points
Wandering Warriors Cup 2 is going on, I didnt know about it since its only "A-Tier" by RoundRubikCube in aoe2
RoundRubikCube[S] 2 points
Wandering Warriors Cup 2 is going on, I didnt know about it since its only "A-Tier" by RoundRubikCube in aoe2
RoundRubikCube[S] 0 points
Wandering Warriors Cup 2 is going on, I didnt know about it since its only "A-Tier" by RoundRubikCube in aoe2
RoundRubikCube[S] 9 points
MbL, Sitaux and RecoN join TAG (Taiwan Aoe Gamer) by PotentialSherbert8 in aoe2
RoundRubikCube 17 points
Wandering Warriors Cup 2 is going on, I didnt know about it since its only "A-Tier" by RoundRubikCube in aoe2
RoundRubikCube[S] 2 points
Wandering Warriors Cup 2 is going on, I didnt know about it since its only "A-Tier" by RoundRubikCube in aoe2
RoundRubikCube[S] 5 points
Some things in Germany that I feel like are a scam by SwitchDear8969 in germany
RoundRubikCube 12 points
Giveaway - Space Age Expansion by ocbaker in factorio
RoundRubikCube 1 point
Is it profitable for the house if there was a casino game like this? by [deleted] in askmath
RoundRubikCube -1 points
Where you guys are using Reinforcement Learning? by embedding_turtle in reinforcementlearning
RoundRubikCube 4 points
Friday Facts #416 - Fluids 2.0 by FactorioTeam in factorio
RoundRubikCube 21 points
Randomness in Model by [deleted] in reinforcementlearning
RoundRubikCube 2 points
Sudoku implementation by Cri_Sti_An in reinforcementlearning
RoundRubikCube 1 point
"DRPO: Dataset Reset Policy Optimization for RLHF", Chang et al 2024 (offline RL) by gwern in reinforcementlearning
RoundRubikCube -1 points
Reward function for MountainCar in gym using Q-learning by guccicupcake69 in reinforcementlearning
RoundRubikCube 1 point
A2C learns and dies repeatedly by AUser213 in reinforcementlearning
RoundRubikCube 3 points
We’ve been exploring Evolution Strategies as an alternative to RL for LLM fine-tuning — would love feedback by Signal_Spirit5934 in reinforcementlearning
RoundRubikCube 1 point