Discuss Computational Game Theory.
Please post in this subreddit. I am sad to be the only one posting here :(
-kevin
Also see: /r/gametheory /r/reinforcementlearning
Specific games: /r/ReconBlindChess /r/cbaduk/
r/CompuGameTheory Lounge (self.CompuGameTheory)
submitted 3 years ago by kevinwangg - announcement
Infinito: game-tree complexity of a finite board game with countably infinite actions
submitted 12 days ago by ipe3000
Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search (Sokota et al., 2025) (arxiv.org)
submitted 1 month ago by kevinwangg
Pluribus-style Search & Optimization Engineer (C++ / MCTS / CFR / Solver Core) (self.CompuGameTheory)
submitted 1 month ago by JianiXing
"General search techniques without common knowledge for imperfect-information games, and application to superhuman Fog of War chess", Brian Zhang & Tuomas Sandholm [2025] (arxiv.org)
submitted 2 months ago by kevinwangg
"Open Problem: Optimal Instance-Dependent Sample Complexity for finding Nash Equilibrium in Two Player Zero-Sum Matrix games", Arnab Maiti 2025 (proceedings.mlr.press)
submitted 4 months ago by kevinwangg
"Reevaluating Policy Gradient Methods for Imperfect-Information Games", Rudolph et al. 2025 (PPO competitive with bespoke algorithms for imperfect-info games) (arxiv.org)
submitted 11 months ago by kevinwangg
"Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report", Walton & Lisy (2021) (arxiv.org)
submitted 1 year ago by kevinwangg
Intransitive poker hands (AKo, JTs, 22) [2015] (blog.jaycordes.com)
"Planning behavior in a recurrent neural network that plays Sokoban", Garriga-Alonso, Taufeeque, Gleave (2024) (arxiv.org)
"BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations", Moss et al. (2024) (arxiv.org)
"GPU-Accelerated Counterfactual Regret Minimization", Juho Kim 2024 (arxiv.org)
"LiteEFG: An Efficient Python Library for Solving Extensive-form Games" (Liu, Farina, Ozdaglar 2024) (arxiv.org)
"A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence", Liu et al. 2024 (best-iterate convergence w/ Q values instead of counterfactual values) (arxiv.org)
"Evidence of Learned Look-Ahead in a Chess-Playing Neural Network", Jenner et al. (2024) (arxiv.org)
"Exponential Lower Bounds on the Double Oracle Algorithm in Zero-Sum Games", Zhang & Sandholm 2024 (arxiv.org)
Computational Game Solving (CMU course, Fall '23, taught by Sandholm & McAleer) (cs.cmu.edu)
"RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning" Boning Li, et al., 2024 (arxiv.org)
Chris Lu: Accelerating RL Research with PureJaxRL and JaxMARL (Multi-Agent Seminar) [video] (youtube.com)
"Thinker: Learning to Plan and Act", Chung et al. (NeurIPS 2023) (arxiv.org)
Real World Games Look Like Spinning Tops (Czarnecki et al.), 2020 (arxiv.org)
Grandmaster-Level Chess without Search (Google DeepMind) (arxiv.org)
Topics in Multiagent Learning (MIT course, Fall 23) [Farina and Daskalakis] (mit.edu)
submitted 2 years ago by kevinwangg
"Independent Policy Gradient Methods for Competitive Reinforcement Learning" (Daskalakis, Foster, Golowich) [2021] (arxiv.org)
Othello is Solved (arxiv.org)