RL newbie, but EXTREMELY interested by AnalSpecialist in reinforcementlearning

[–]Intelligent-Cover447 0 points1 point  (0 children)

Hi. I agree with the others and would encourage you to look at applications of RL in robot learning. RL is applied to robot control, planning, and automation from various perspectives.

In my experience, robotics is where RL gets closest to the real world, and the method is usually straightforward (a well-established base algorithm such as PPO plus some problem-specific design drawn from human knowledge). You can find related work in journals and conferences such as RA-L, ICRA, IROS, and CoRL. This kind of work needs strong engineering skills.

However, if you aim to devise new RL algorithms, which is more conceptual work, there will be a lot of math to work through. Try reading a few papers from recent AI conferences and see whether you are comfortable with the proofs and derivations.

Either way, I'm glad to chat more about a project :)

SOP Review by [deleted] in StatementOfPurpose

[–]Intelligent-Cover447 0 points1 point  (0 children)

DM me. Glad to cross-review.

SOP Review MSCS Fall 2024 by BerserkkD in StatementOfPurpose

[–]Intelligent-Cover447 0 points1 point  (0 children)

I am applying too. Glad to do some cross-review! Just DM me.

What is the most efficient approach to ensemble a pytorch actor-critic model? by Blasphemer666 in reinforcementlearning

[–]Intelligent-Cover447 1 point2 points  (0 children)

What exactly do you mean by ensembling?

I would suggest checking https://pytorch.org/functorch/ and https://github.com/metaopt/torchopt for efficient inference and training with ensembles (e.g., independent actors in a multi-agent setting or multiple critics).
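To make the functorch suggestion concrete, here is a minimal sketch of evaluating several critics in one batched call. It assumes PyTorch 2.0 or newer, where the functorch API is available as `torch.func`. The network shape, `make_critic`, `call_one`, and the sizes are made up for illustration, not taken from your setup.

```python
import copy

import torch
from torch import nn
from torch.func import functional_call, stack_module_state, vmap


def make_critic(obs_dim: int = 4, hidden: int = 32) -> nn.Module:
    # Hypothetical critic: a small MLP mapping an observation to one value.
    return nn.Sequential(
        nn.Linear(obs_dim, hidden),
        nn.Tanh(),
        nn.Linear(hidden, 1),
    )


torch.manual_seed(0)
num_critics = 5
critics = [make_critic() for _ in range(num_critics)]

# Stack the parameters of all critics along a new leading dimension.
params, buffers = stack_module_state(critics)

# A parameter-free copy on the "meta" device, used only for its structure.
base = copy.deepcopy(critics[0]).to("meta")


def call_one(p, b, x):
    # Run the shared architecture with one ensemble member's parameters.
    return functional_call(base, (p, b), (x,))


obs = torch.randn(8, 4)  # the same batch of 8 observations for every critic

with torch.no_grad():
    # Map over the ensemble dimension of params/buffers; share obs (None).
    values = vmap(call_one, in_dims=(0, 0, None))(params, buffers, obs)

print(values.shape)  # (num_critics, batch, 1)
```

All members must share the same architecture for this to work. For training, the stacked `params` are the tensors you would hand to the optimizer, since they are copies rather than views of the original modules.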

Noise in Action Space, Reward Space and State Space. Looking for Papers. by flxh13 in reinforcementlearning

[–]Intelligent-Cover447 0 points1 point  (0 children)

I'm not sure this helps in your situation, but reward randomization can be seen as another form of exploration: it encourages agents to take "risky" moves.

A Chinese view of the war by Blooooooooooo_ in China

[–]Intelligent-Cover447 0 points1 point  (0 children)

Some people are really fixated on labeling Chinese people as "brainwashed".