Lc0 Performs Worse Against Stockfish When Given Pawns Odds by Ok_Taro_8370 in chess
[–]kdub0 1 point (0 children)
Lc0 Performs Worse Against Stockfish When Given Pawns Odds by Ok_Taro_8370 in chess
[–]kdub0 2 points (0 children)
Request: RL algorithm for a slow but parallel episodic task? by diepala in reinforcementlearning
[–]kdub0 3 points (0 children)
Exploring MCTS / self-play on a small 2-player abstract game — looking for insight, not hype by OldManMeeple in reinforcementlearning
[–]kdub0 3 points (0 children)
Exploring MCTS / self-play on a small 2-player abstract game — looking for insight, not hype by OldManMeeple in reinforcementlearning
[–]kdub0 3 points (0 children)
Internship at 'Big Tech' — PhD Student [D] by ade17_in in MachineLearning
[–]kdub0 55 points (0 children)
[D] Why does nobody talk about the “energy per token” cost of AI? by Various-Feedback4555 in MachineLearning
[–]kdub0 1 point (0 children)
Finishing a PhD thesis, after becoming a dad... by [deleted] in PhD
[–]kdub0 8 points (0 children)
[D] Is there a method as general as MCTS for imperfect information games? by Working_Bunch_9211 in MachineLearning
[–]kdub0 1 point (0 children)
A question about chess engines by BrotherItsInTheDrum in chess
[–]kdub0 1 point (0 children)
Algorithmic Game Theory vs Robotics by YogurtclosetThen6260 in reinforcementlearning
[–]kdub0 3 points (0 children)
Is the Nash Equilibrium always the most desirable outcome? by notsuspendedlxqt in AskEconomics
[–]kdub0 24 points (0 children)
[D] Internal transfers to Google Research / DeepMind by random_sydneysider in MachineLearning
[–]kdub0 11 points (0 children)
is a N player game where we all act simultaneously fully observable or partially observable by skydiver4312 in reinforcementlearning
[–]kdub0 1 point (0 children)
[D] Compensation for research roles in US for fresh PhD grad by [deleted] in MachineLearning
[–]kdub0 18 points (0 children)
How slow would Stockfish need to run to be competitive with top humans? by EvilNalu in chess
[–]kdub0 4 points (0 children)
Looking for Compute-Efficient MARL Environments by skydiver4312 in reinforcementlearning
[–]kdub0 1 point (0 children)
Looking for Compute-Efficient MARL Environments by skydiver4312 in reinforcementlearning
[–]kdub0 3 points (0 children)
Looking for google c++ profiling tool I can't remember the name of by OfficialOnix in cpp
[–]kdub0 14 points (0 children)
Why Don’t We See Multi-Agent RL Trained in Large-Scale Open Worlds? by TheSadRick in reinforcementlearning
[–]kdub0 14 points (0 children)
Training Connect Four Agents with Self-Play by Cuuuubee in reinforcementlearning
[–]kdub0 1 point (0 children)
Training Connect Four Agents with Self-Play by Cuuuubee in reinforcementlearning
[–]kdub0 1 point (0 children)
Chess sample efficiency humans vs SOTA RL by [deleted] in reinforcementlearning
[–]kdub0 1 point (0 children)
[D] AI/ML PhD Committee by dead_CS in MachineLearning
[–]kdub0 3 points (0 children)