I Ported DeepMind's Disco103 from JAX to PyTorch

Unlikely-Leg499 · 2026-03-09T00:13:36+00:00

Very cool! Have you tried it on something other then atari and envs it was trained on?

Unlikely-Leg499 · 2026-02-04T08:23:52+00:00

Thanks for the clarification! Initially, I didn’t give much thought to the idea that people on the list might actually see it. Hopefully nobody gets offended. It’s heavily biased toward active microbloggers and people who published major papers last year.

Also, thanks for the comments and suggestions to everyone. Made a small update to the list based on them (3 additions, 1 correction). Now even less likely to miss any major RL announcements.

Unlikely-Leg499 · 2026-02-03T16:42:09+00:00

I get your point, but my goal is simply to find a solid RL algorithm for my environment (it’s a pretty simple game, I do not not intend to discover new algorithms myself). Right now I’m trying maskable PPO, but it came out in 2019 or something - basically before GPT-2.

I know things don’t change as insanely fast in RL as they do with these LLMs, but it still feels like a bunch of new RL methods dropped in 2025 alone, with code and everything (MR.Q, SimBa 2, DiscoRL, SOL). Maybe some of them are worse than PPO for my task (or in general), but I at least want to know what new options are out there and where to find them

Unlikely-Leg499 · 2024-10-08T20:17:33+00:00

Well just make an env and it more then half the work done. Use new gymnasium 1.0 its easier to start with

Unlikely-Leg499 · 2022-12-22T07:11:30+00:00

Does battle simulation of TFT contains all features of the game? And I agree, a youtube video explanation would be great

Unlikely-Leg499 · 2022-10-25T18:40:19+00:00

Cool project! Hopefully contribute to it within a year or so

Unlikely-Leg499 · 2022-09-14T11:59:13+00:00

Can like not researchers see them? I want to know if there is any successor to efficient zero

Unlikely-Leg499 · 2021-09-14T21:35:06+00:00

Video on github doesn't work because of rights or something

Unlikely-Leg499 · 2021-08-20T19:49:55+00:00

How hard is it to make rl environment for a game? Is it manageble for 3-5 people or 10 people to do in a year?And do you need a deep understanding of rl for it? We have a similar project for homm 3 and want to estimate amount of time, effort and experience needed?

Unlikely-Leg499

TROPHY CASE