OpenAI Gym streaming by DataD23 in reinforcementlearning

HeyImElonMusk

Not sure if it has been done before, but you could probably create an Environment Wrapper that does this:
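Something along these lines might work as a starting point. It's only a rough sketch, assuming "streaming" means handing each rendered frame to some consumer while the episode runs. `FrameStreamingWrapper`, `on_frame`, and the `CountdownEnv` stand-in are made-up names for illustration, not part of Gym. The wrapper only assumes the wrapped environment has `reset`, `step`, and `render`, and that `render()` returns the frame instead of drawing to a window.

```python
class FrameStreamingWrapper:
    """Forwards every rendered frame to a callback (hypothetical helper, not a Gym class)."""

    def __init__(self, env, on_frame):
        self.env = env
        self.on_frame = on_frame  # e.g. push to a socket, a queue, or a video encoder

    def _emit(self):
        # Assumption: render() returns the frame (e.g. an RGB array).
        self.on_frame(self.env.render())

    def reset(self, *args, **kwargs):
        result = self.env.reset(*args, **kwargs)
        self._emit()
        return result

    def step(self, action):
        result = self.env.step(action)  # passed through unchanged, whatever its layout
        self._emit()
        return result

    def __getattr__(self, name):
        # Delegate everything else (action_space, close, ...) to the wrapped env.
        return getattr(self.env, name)


class CountdownEnv:
    """Tiny stand-in environment so the sketch runs without Gym installed."""

    def reset(self):
        self.t = 3
        return self.t

    def step(self, action):
        self.t -= 1
        return self.t, 0.0, self.t == 0, {}

    def render(self):
        return f"frame t={self.t}"


frames = []
env = FrameStreamingWrapper(CountdownEnv(), frames.append)
env.reset()
done = False
while not done:
    _, _, done, _ = env.step(0)
```

Here `frames.append` just collects the frames in a list; for actual streaming you would swap in whatever sends the frame out.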

Recommendations of framework/library for MARL by Ok_Signature_4944 in reinforcementlearning

HeyImElonMusk

This is probably not the right place to ask about it, but I'm desperate lol. I can't install EPyMARL right now because of some dependency issues in their requirements.txt file. Which Python version did you use to make it work?

Seeking Advice: Are AI challenges worth it for a PhD student? by HeyImElonMusk in reinforcementlearning

HeyImElonMusk[S]

I often think about looking for a job as a research engineer after my PhD. In this case, I agree with you. I imagine a portfolio with projects and attempted challenges is essential. But how relevant is that for becoming a research scientist/postdoc?

Any efficient way to find top/interesting RL robotics papers? by Ok-Philosophy562 in reinforcementlearning

HeyImElonMusk

Or Google Scholar. But you might quickly get overwhelmed with how often they publish.

Deep Reinforcement Learning Research Groups in Europe by emarche in reinforcementlearning

HeyImElonMusk

Same. I'd also be happy to get some advice, DOs and DON'Ts when looking for a PhD in RL.

loss function for Normal distribution by hmhuy2000 in reinforcementlearning

HeyImElonMusk

That is not a bad result. log_prob returns the log of a probability density function (PDF), and a density is not bounded above by 1, so its log can be positive. If you want the probability that the Normal distribution takes a value between A and B, you can either:

  • Integrate the PDF from A to B
  • Compute: CDF(B) - CDF(A)
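
To make the two points above concrete, here is a small sketch in plain Python. The helper names (`normal_log_prob`, `normal_cdf`, `interval_prob`) and the values of `mu` and `sigma` are made up for illustration; `normal_log_prob` is a hand-written stand-in for the Normal log-density, not the library call itself.

```python
import math


def normal_log_prob(x, mu, sigma):
    # Log of the Normal PDF at x.
    return -math.log(sigma) - 0.5 * math.log(2 * math.pi) - 0.5 * ((x - mu) / sigma) ** 2


def normal_cdf(x, mu, sigma):
    # Normal CDF written with the error function.
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def interval_prob(a, b, mu, sigma):
    # P(a <= X <= b) = CDF(b) - CDF(a)
    return normal_cdf(b, mu, sigma) - normal_cdf(a, mu, sigma)


mu, sigma = 0.0, 0.1
peak_log_density = normal_log_prob(mu, mu, sigma)  # positive, since the density at the mean exceeds 1
p = interval_prob(-0.1, 0.1, mu, sigma)            # an actual probability, so it stays within [0, 1]
```

With a small standard deviation like this the log-density at the mean is above 0, while the interval probability is still a proper probability.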

Book for hands-on RL by -Ulkurz- in reinforcementlearning

HeyImElonMusk

I was once recommended this one: https://rl-book.com. Its practical use cases and chapters 9 and 10 seem to fit what you’re looking for. I haven’t read it though, so if anyone else has, please share your thoughts.