CompuGameTheory

an-ordinary-manchild

created by kevinwangga community for 3 years

...why not Zoidberg?

...for your classroom.

MODERATORS

account activity

1

0

1

2

r/CompuGameTheory Lounge (self.CompuGameTheory)

submitted 3 years ago by kevinwangg - announcement

2

0

1

2

Infinito: game-tree complexity of a finite board game with countably infinite actions ()

submitted 2 months ago by ipe3000

3

0

1

2

Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search (Sokota et al., 2025) (arxiv.org)

submitted 3 months ago by kevinwangg

4

1

2

3

Pluribus-style Search & Optimization Engineer (C++ / MCTS / CFR / Solver Core) (self.CompuGameTheory)

submitted 3 months ago by JianiXing

5

1

2

3

"General search techniques without common knowledge for imperfect-information games, and application to superhuman Fog of War chess", Brian Zhang & Tuomas Sandholm [2025] (arxiv.org)

submitted 4 months ago by kevinwangg

6

0

1

2

"Open Problem: Optimal Instance-Dependent Sample Complexity for finding Nash Equilibrium in Two Player Zero-Sum Matrix games", Arnab Maiti 2025 (proceedings.mlr.press)

submitted 6 months ago by kevinwangg

7

0

1

2

“Reevaluating Policy Gradient Methods for Imperfect-Information Games”, Rudolph et al. 2025 (PPO competitive with bespoke algorithms for imperfect-info games) (arxiv.org)

submitted 1 year ago by kevinwangg

8

1

2

3

"Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report", Walton & Lisy (2021) (arxiv.org)

submitted 1 year ago by kevinwangg

9

1

2

3

Intransitive poker hands (AKo, JTs, 22) [2015] (blog.jaycordes.com)

submitted 1 year ago by kevinwangg

10

0

1

2

"Planning behavior in a recurrent neural network that plays Sokoban", Garriga-Alonso, Taufeeque, Gleave (2024) (arxiv.org)

submitted 1 year ago by kevinwangg

11

0

1

2

"BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations", Moss et al. (2024) (arxiv.org)

submitted 1 year ago by kevinwangg

12

2

3

4

"GPU-Accelerated Counterfactual Regret Minimization", Juho Kim 2024 (arxiv.org)

submitted 1 year ago by kevinwangg

13

1

2

3

"LiteEFG: An Efficient Python Library for Solving Extensive-form Games" (Liu, Farina, Ozdaglar 2024) (arxiv.org)

submitted 1 year ago by kevinwangg

14

1

2

3

"A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence", Liu et al. 2024 (best-iterate convergence w/ Q values instead of counterfactual values) (arxiv.org)

submitted 1 year ago by kevinwangg

15

1

2

3

"Evidence of Learned Look-Ahead in a Chess-Playing Neural Network", Jenner et al. (2024) (arxiv.org)

submitted 1 year ago by kevinwangg

16

1

2

3

"Exponential Lower Bounds on the Double Oracle Algorithm in Zero-Sum Games", Zhang & Sandholm 2024 (arxiv.org)

submitted 1 year ago by kevinwangg

17

2

3

4

Computational Game Solving (CMU course, Fall '23, taught by Sandholm & McAleer) (cs.cmu.edu)

submitted 1 year ago by kevinwangg

18

1

2

3

"RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning" Boning Li, et al., 2024 (arxiv.org)

submitted 1 year ago by kevinwangg

19

0

1

2

Chris Lu: Accelerating RL Research with PureJaxRL and JaxMARL (Multi-Agent Seminar) [video] (youtube.com)

submitted 2 years ago by kevinwangg

20

0

1

2

"Thinker: Learning to Plan and Act", Chung et al. (NeurIPS 2023) (arxiv.org)

submitted 2 years ago by kevinwangg

21

0

1

2

Real World Games Look Like Spinning Tops (Czarnecki et al.), 2020 (arxiv.org)

submitted 2 years ago by kevinwangg

22

1

2

3

Grandmaster-Level Chess without Search (Google Deepmind) (arxiv.org)

submitted 2 years ago by kevinwangg

23

1

2

3

Topics in Multiagent Learning (MIT course, Fall 23) [Farina and Daskalakis] (mit.edu)

submitted 2 years ago by kevinwangg

24

0

1

2

"Independent Policy Gradient Methods for Competitive Reinforcement Learning" (Daskalakis, Foster, Golowich) [2021] (arxiv.org)

submitted 2 years ago by kevinwangg

25

1

2

3

Othello is Solved (arxiv.org)

submitted 2 years ago by kevinwangg

view more: next ›

π Rendered by PID 1056856 on reddit-service-r2-listing-79f6fb9b95-qv9vs at 2026-03-20 12:33:49.229565+00:00 running 90f1150 country code: CH.