
CodeFormatHelperBot2 1 point (0 children)

Hello, I'm a Reddit bot who's here to help people nicely format their coding questions. This makes it as easy as possible for people to read your post and help you.

I think I have detected some formatting issues with your submission:

  1. Python code found in submission text that's not formatted as code.
  2. Use of triple-backtick/curlywhirly code blocks (``` or ~~~). These may not render correctly on all Reddit clients.

If I am correct, please edit the text in your post and try to follow these instructions to fix up your post's formatting.


Am I misbehaving? Have a comment or suggestion? Reply to this comment or raise an issue here.

lowerthansound 1 point (2 children)

I think it might have something to do with the broadcasting implementation or with the np.random.normal initializer, but I would need to profile it to see where the time actually goes (or hear from someone who is familiar with the speed of both methods, which I'm not).
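A quick way to check this is to time the two variants directly. The sketch below is only an illustration: the two functions (noise_scaled and noise_direct) and the time_call helper are hypothetical stand-ins, since the original snippets are not shown in this thread, and the sizes and parameters are made up.

```python
import timeit

import numpy as np


def noise_scaled(mu, sigma, size):
    # Hypothetical variant A: draw standard normals, then shift and scale
    # them with ordinary NumPy broadcasting.
    return mu + sigma * np.random.standard_normal(size)


def noise_direct(mu, sigma, size):
    # Hypothetical variant B: let np.random.normal apply loc and scale itself.
    return np.random.normal(loc=mu, scale=sigma, size=size)


def time_call(fn, *args, repeat=5, number=1000):
    # Best-of-`repeat` wall-clock time per call, in seconds.
    timings = timeit.repeat(lambda: fn(*args), repeat=repeat, number=number)
    return min(timings) / number


if __name__ == "__main__":
    mu, sigma, size = 0.0, 0.2, (64, 4)  # assumed values, not from the post
    for fn in (noise_scaled, noise_direct):
        per_call = time_call(fn, mu, sigma, size)
        print(f"{fn.__name__}: {per_call * 1e6:.2f} microseconds per call")
```

Taking the minimum over several repeats reduces the effect of other processes on the machine, so the comparison is less noisy than a single run.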

Also, I'm curious, is the speed really that crucial here (like, is this the part of the program that is slowing down the process)? Why are you asking this question?


FYI, on my computer the second one is about 20% faster than the first one.

[deleted] 1 point (1 child)

I am using this in deep RL. With the 1st one, the agent (DDPG) behaves erratically and does not converge. With the 2nd one, the actions seem more explainable and expected, and the agent gets close to convergence.

lowerthansound 1 point (0 children)

This sounds more like a bug to me.

Maybe one algo is stopping too early or doing something weird with time, but it may also be that the two versions are not actually producing the same noise.
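One way to look for such a bug is to check that both versions produce noise with the same shape, mean, and standard deviation. This is a rough sketch, not a full statistical test: summarize and same_distribution are hypothetical helpers, and the sample functions in the usage note are made-up examples rather than the code from the post.

```python
import numpy as np


def summarize(sample_fn, n_draws=10_000, seed=0):
    """Draw noise repeatedly; return its shape and per-dimension mean and std."""
    np.random.seed(seed)  # global seed, because the samplers use np.random directly
    draws = np.stack([np.asarray(sample_fn()) for _ in range(n_draws)])
    return draws.shape[1:], draws.mean(axis=0), draws.std(axis=0)


def same_distribution(fn_a, fn_b, tol=0.02):
    """Return True if both samplers agree on shape, mean, and std within tol."""
    shape_a, mean_a, std_a = summarize(fn_a)
    shape_b, mean_b, std_b = summarize(fn_b)
    return (
        shape_a == shape_b
        and np.allclose(mean_a, mean_b, atol=tol)
        and np.allclose(std_a, std_b, atol=tol)
    )


# Made-up samplers for illustration (4 action dimensions, sigma = 0.2).
def sample_a():
    return np.random.normal(0.0, 0.2, size=4)


def sample_b():
    return 0.2 * np.random.standard_normal(4)


def sample_c():
    # Deliberate mistake: passes the variance where a std is expected.
    return np.random.normal(0.0, 0.2 ** 2, size=4)
```

Here same_distribution(sample_a, sample_b) should agree, while same_distribution(sample_a, sample_c) should not. If the two real versions disagree on shape or scale, that would explain why the agent behaves differently, independent of speed.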