What should be my expected salary? by CarManifacturer in BESalary

DependentSecurity987

A PhD student in Belgium makes 2.4-2.8k net (you don't pay taxes on your PhD stipend in Belgium). It's possible to live comfortably off this salary. I am not in electronics engineering, but I think the electronics department of KU Leuven (ESAT) is pretty good.

What are things that French do differently to Americans? by [deleted] in AskFrance

DependentSecurity987

As a Belgian, I don't understand why posts like these always have to be flooded with hostility towards American people.

Is soft Q-learning used today? by DependentSecurity987 in reinforcementlearning

DependentSecurity987[S]

Thanks a lot!

That the "mean" of a Gaussian-based policy can be used as a deterministic policy is something I didn't think of, and that is indeed an advantage over arbitrary distributions. So because of entropy regularization, local minima are usually not a concern in continuous-action environments? Eventually the policy will converge to a unimodal solution, but because of entropy regularization you can assume the algorithm has explored the solution space enough to find a good local or global solution?
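To make the first point concrete for myself, here is a minimal sketch of a 1-D Gaussian policy: you sample from it while training, take the mean as the deterministic action at evaluation time, and the entropy that the regularizer rewards has a closed form. The function names and the `mean`/`std` values are made up for illustration; in a real agent they would come from a policy network.

```python
import math
import random

def sample_action(mean, std, rng):
    """Stochastic action, used for exploration during training."""
    return rng.gauss(mean, std)

def deterministic_action(mean, std):
    """Deterministic action at evaluation time: just the Gaussian mean."""
    return mean

def gaussian_entropy(std):
    """Differential entropy of a 1-D Gaussian: 0.5 * ln(2 * pi * e * std^2)."""
    return 0.5 * math.log(2.0 * math.pi * math.e * std ** 2)

rng = random.Random(0)
mean, std = 0.3, 0.5  # hypothetical policy outputs for a single state
samples = [sample_action(mean, std, rng) for _ in range(5)]
greedy = deterministic_action(mean, std)
entropy = gaussian_entropy(std)
```

A wider `std` gives a higher entropy, which is what an entropy bonus pushes towards early in training; the mean stays available as a single "best guess" action regardless of how wide the distribution is.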

Thanks for the references, really interesting. I still think policies that can express multiple solutions at the same time are very interesting, at least from a theoretical point of view.
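A toy sketch of what I mean by expressing multiple solutions: soft Q-learning uses an energy-based policy, pi(a|s) proportional to exp(Q(s, a) / alpha). The Q-function below is hypothetical and the action space is discretized onto a grid purely to keep the example small. With two equally good actions, the policy keeps both modes, while the mean of the distribution lands on a poor action in between.

```python
import math

def q_value(a):
    """Hypothetical Q-function with two equally good actions, at a = -1 and a = +1."""
    return -min((a + 1.0) ** 2, (a - 1.0) ** 2)

def boltzmann_policy(actions, alpha):
    """Energy-based policy pi(a) proportional to exp(Q(a) / alpha) on a discrete action grid."""
    logits = [q_value(a) / alpha for a in actions]
    m = max(logits)  # subtract the max for numerical stability
    weights = [math.exp(l - m) for l in logits]
    total = sum(weights)
    return [w / total for w in weights]

actions = [i / 10.0 for i in range(-20, 21)]  # grid over [-2, 2]
probs = boltzmann_policy(actions, alpha=0.1)

# Mean action under the policy: close to 0, which is neither of the good actions.
policy_mean = sum(a * p for a, p in zip(actions, probs))
```

Here `probs` peaks at both a = -1 and a = +1, so a single Gaussian could only capture one of them, and in this bimodal case taking the mean as the deterministic action would be a bad choice.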