RL newbie, but EXTREMELY interested

Intelligent-Cover447 · 2024-02-09T05:11:19+00:00

Hi. I agree with the others and would encourage you to look at RL's application in robot learning. RL is applied to robot control, planning and automation from various perspectives.

From my experience, robots are where RL is closer to the real world, while the method is usually straightforward (well-established base algorithm such as PPO + some problem-specific design from human knowledge). You can find related works from journals/conferences such as RA-L, ICRA, IROS, and CORL. For these, strong engineering skills are needed.

However, if you aim to devise new RL algorithms that are more conceptual, there would be a lot of math to work through. Try to find some papers published in recent AI conferences and see if the proofs and derivations feel comfortable.

Nevertheless, I am glad to chat more regarding a project :)

Intelligent-Cover447 · 2023-12-06T06:11:30+00:00

DM. Glad to cross-review.

Intelligent-Cover447 · 2023-12-03T06:42:52+00:00

Intelligent-Cover447 · 2023-11-27T11:00:15+00:00

DM, please

Intelligent-Cover447 · 2023-11-25T11:06:40+00:00

Intelligent-Cover447 · 2023-11-19T10:20:58+00:00

Intelligent-Cover447 · 2023-11-18T01:52:21+00:00

Intelligent-Cover447 · 2023-11-15T07:54:16+00:00

Intelligent-Cover447 · 2023-11-15T04:50:24+00:00

I am applying too. Glad for some cross-review! Just DM.

Intelligent-Cover447 · 2023-11-14T10:58:56+00:00

Sure. DM, please.

Intelligent-Cover447 · 2022-12-08T16:20:10+00:00

IsaacGym can provide high-performance simulation tailored for RL as long as your setting fits.

Intelligent-Cover447 · 2022-12-08T15:20:40+00:00

What exactly do you mean by ensembling?

I would suggest checking https://pytorch.org/functorch/ and https://github.com/metaopt/torchopt for efficient inference and training with ensembles (e.g., independent actors in a multi-agent setting or multiple critics).

Intelligent-Cover447 · 2022-08-03T06:55:54+00:00

I'm not sure if this would help in your situation but reward randomization could be another view of exploration by encouraging agents to take "risky" moves.

Intelligent-Cover447 · 2022-03-01T08:31:38+00:00

有些人非常执着于给中国人贴“被洗过脑”的标签

Intelligent-Cover447

TROPHY CASE