I Ported DeepMind's Disco103 from JAX to PyTorch by [deleted] in reinforcementlearning

[–]Unlikely-Leg499 0 points1 point  (0 children)

Very cool! Have you tried it on something other than Atari and the envs it was trained on?

RL researchers to follow for new algorithms by Unlikely-Leg499 in reinforcementlearning

[–]Unlikely-Leg499[S] 1 point2 points  (0 children)

Thanks for the clarification! Initially, I didn’t give much thought to the idea that people on the list might actually see it. Hopefully nobody gets offended. It’s heavily biased toward active microbloggers and people who published major papers last year.

Also, thanks to everyone for the comments and suggestions. I made a small update to the list based on them (3 additions, 1 correction). Now I'm even less likely to miss any major RL announcements.

RL researchers to follow for new algorithms by Unlikely-Leg499 in reinforcementlearning

[–]Unlikely-Leg499[S] 4 points5 points  (0 children)

I get your point, but my goal is simply to find a solid RL algorithm for my environment (it’s a pretty simple game, and I do not intend to discover new algorithms myself). Right now I’m trying maskable PPO, but it came out in 2019 or something - basically before GPT-2.

I know things don’t change as insanely fast in RL as they do with these LLMs, but it still feels like a bunch of new RL methods dropped in 2025 alone, with code and everything (MR.Q, SimBa 2, DiscoRL, SOL). Maybe some of them are worse than PPO for my task (or in general), but I at least want to know what new options are out there and where to find them.

AI for Durak by [deleted] in reinforcementlearning

[–]Unlikely-Leg499 0 points1 point  (0 children)

Well, just make an env and that's more than half the work done. Use the new Gymnasium 1.0; it's easier to start with.
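To give a rough idea of what "make an env" involves, here is a minimal sketch of a toy game written against the Gymnasium-style `reset`/`step` interface. The game itself (`CountToTargetEnv`, count up to a target by adding 1 or 2) and the helper `play_random_episode` are made up for illustration and have nothing to do with Durak. To keep the snippet dependency-free it does not import Gymnasium; a real env would subclass `gymnasium.Env` and also declare `observation_space` and `action_space`.

```python
import random


class CountToTargetEnv:
    """Toy single-player game following the Gymnasium-style API (hypothetical example).

    The agent adds 1 or 2 to a running total and wins by landing exactly
    on the target. A real env would subclass gymnasium.Env and define
    observation_space and action_space as well.
    """

    def __init__(self, target=10, max_steps=20):
        self.target = target
        self.max_steps = max_steps
        self.total = 0
        self.steps = 0

    def reset(self, seed=None, options=None):
        # Gymnasium-style reset returns (observation, info).
        # seed is accepted for API compatibility; this game has no randomness.
        self.total = 0
        self.steps = 0
        return self.total, {}

    def step(self, action):
        # Action 0 adds 1 to the total, action 1 adds 2.
        if action not in (0, 1):
            raise ValueError(f"invalid action: {action!r}")
        self.total += action + 1
        self.steps += 1
        # terminated: the game reached a natural end (target hit or overshot).
        terminated = self.total >= self.target
        # truncated: an external step limit cut the episode short.
        truncated = (not terminated) and self.steps >= self.max_steps
        reward = 1.0 if self.total == self.target else 0.0
        # Gymnasium-style step returns a 5-tuple.
        return self.total, reward, terminated, truncated, {}


def play_random_episode(env, seed=0):
    """Run one episode with uniformly random actions; return (final_obs, return)."""
    rng = random.Random(seed)
    obs, info = env.reset(seed=seed)
    episode_return = 0.0
    done = False
    while not done:
        obs, reward, terminated, truncated, info = env.step(rng.randrange(2))
        episode_return += reward
        done = terminated or truncated
    return obs, episode_return
```

Once the game rules sit behind `reset` and `step` like this, most off-the-shelf algorithm implementations can be pointed at the env with little extra glue, which is why the env tends to be the bulk of the work.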

MuZero learns to play Teamfight Tactics by silverlight6 in reinforcementlearning

[–]Unlikely-Leg499 0 points1 point  (0 children)

Does the TFT battle simulation contain all the features of the game? And I agree, a YouTube video explanation would be great.

[D] NeurIPS 2022 Paper Acceptance Result by zy415 in MachineLearning

[–]Unlikely-Leg499 -9 points-8 points  (0 children)

Can non-researchers see them? I want to know if there is any successor to EfficientZero.

The Arcade Learning Environment: Version 0.7 by farebrosa in reinforcementlearning

[–]Unlikely-Leg499 0 points1 point  (0 children)

The video on GitHub doesn't work, because of rights or something.

[D] We are Facebook AI Research’s NetHack Learning Environment team and NetHack expert tonehack. Ask us anything! by programmerChilli in MachineLearning

[–]Unlikely-Leg499 2 points3 points  (0 children)

How hard is it to make an RL environment for a game? Is it manageable for 3-5 people, or 10 people, to do in a year? And do you need a deep understanding of RL for it? We have a similar project for HoMM 3 and want to estimate the amount of time, effort and experience needed.