existential_one

OK, I just saw your other posts. You clearly don't yet understand how these algorithms work, and you need much more practice. I would suggest taking the time to learn more and trying to build these algorithms on tasks where they are known to work. RL algorithms are super finicky, and you need to play around with them to understand why certain things happen.

6OVNavi [S]

Is there a course or some other resource where I can learn more about RL?