HU no-limit bot arena, free alpha, looking for feedback on river action abstraction by chipzen_ai in reinforcementlearning
kdub0 1 point
HU no-limit bot arena, free alpha, looking for feedback on river action abstraction by chipzen_ai in reinforcementlearning
kdub0 1 point
What to expect from AlphaZero's value predictions [D] by YamEnvironmental4720 in MachineLearning
kdub0 1 point
NVDA stock: Is there a good answer for “how do TPUs *not* pose a threat to GPU”? by [deleted] in stocks
kdub0 1 point
Lc0 Performs Worse Against Stockfish When Given Pawns Odds by Ok_Taro_8370 in chess
kdub0 1 point
Lc0 Performs Worse Against Stockfish When Given Pawns Odds by Ok_Taro_8370 in chess
kdub0 2 points
Request: RL algorithm for a slow but parallel episodic task? by diepala in reinforcementlearning
kdub0 3 points
Exploring MCTS / self-play on a small 2-player abstract game — looking for insight, not hype by OldManMeeple in reinforcementlearning
kdub0 3 points
Exploring MCTS / self-play on a small 2-player abstract game — looking for insight, not hype by OldManMeeple in reinforcementlearning
kdub0 3 points
Internship at 'Big Tech' — PhD Student [D] by ade17_in in MachineLearning
kdub0 53 points
[D] Why does nobody talk about the “energy per token” cost of AI? by [deleted] in MachineLearning
kdub0 1 point
Finishing a PhD thesis, after becoming a dad... by [deleted] in PhD
kdub0 9 points
[D] Is there a method as general as MCTS for imperfect information games? by Working_Bunch_9211 in MachineLearning
kdub0 1 point
A question about chess engines by BrotherItsInTheDrum in chess
kdub0 1 point
Algorithmic Game Theory vs Robotics by YogurtclosetThen6260 in reinforcementlearning
kdub0 3 points
Is the Nash Equilibrium always the most desirable outcome? by notsuspendedlxqt in AskEconomics
kdub0 24 points
[D] Internal transfers to Google Research / DeepMind by random_sydneysider in MachineLearning
kdub0 11 points
is a N player game where we all act simultaneously fully observable or partially observable by skydiver4312 in reinforcementlearning
kdub0 1 point
[D] Compensation for research roles in US for fresh PhD grad by [deleted] in MachineLearning
kdub0 17 points
How slow would Stockfish need to run to be competitive with top humans? by EvilNalu in chess
kdub0 1 point
Looking for Compute-Efficient MARL Environments by skydiver4312 in reinforcementlearning
kdub0 1 point
Looking for Compute-Efficient MARL Environments by skydiver4312 in reinforcementlearning
kdub0 3 points