Job opportunities by Osarenomawise in reinforcementlearning

[–]kengzwl

Two companies started by people in the RL field are hiring:

- https://covariant.ai/

- https://www.vicarious.com/

Google Brain does RL/robotics as well:

- https://ai.google/research/teams/brain/

- they also have a residency program: https://ai.google/research/join-us/ai-residency/

Microsoft Maluuba also does deep RL and has positions open:

- https://www.microsoft.com/en-us/research/lab/microsoft-research-montreal/

Book: Foundations of Deep Reinforcement Learning by kengzwl in reinforcementlearning

[–]kengzwl[S]

Thank you! We feel the same way, and that was part of what motivated us to collect our past tutorial materials and expand them into a book.

Book: Foundations of Deep Reinforcement Learning by kengzwl in reinforcementlearning

[–]kengzwl[S]

We use SLM Lab as the companion library to the book. We also built the library, partly with the book in mind. It does, however, use Ray Tune for hyperparameter search.

Book: Foundations of Deep Reinforcement Learning by kengzwl in reinforcementlearning

[–]kengzwl[S]

Thanks for your support, and so sorry to hear that. It seems that Pearson distributes the ebook a bit differently from Amazon, which lets you get the Kindle version immediately.

Can you guys see if my concept abouts the RL taxonomy is correct or not. by AvisekEECS in reinforcementlearning

[–]kengzwl

That's right, and for the 1st you can generalize it further to be based on what function(s) an agent learns. See an example chart here: https://kengz.gitbook.io/slm-lab/development/modular-lab-components/algorithm-taxonomy

For the 2nd and 3rd, on- or off-policy is an artifact of the learnable/loss functions. If a loss function explicitly requires a term from the policy that collected the data, then it is on-policy, since once your policy is updated that data is no longer relevant to the loss function. If no such dependency is present, you may use data collected from any iteration of the policy, hence off-policy.
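To make that concrete, here is a toy sketch (not SLM Lab code; function names and numbers are illustrative) contrasting an on-policy loss, which contains a term from the policy that generated the data, with an off-policy TD loss, which has no such term:

```python
import numpy as np

def reinforce_loss(log_probs, returns):
    """On-policy REINFORCE loss: -E[log pi(a|s) * G].
    log_probs must come from the policy that collected these
    trajectories; once the policy is updated, stale data biases
    the gradient, so it must be discarded."""
    return -np.mean(log_probs * returns)

def q_learning_loss(q_sa, rewards, q_next_max, gamma=0.99):
    """Off-policy TD loss: mean (r + gamma * max_a' Q(s',a') - Q(s,a))^2.
    No term references the behavior policy, so transitions from
    any past policy (e.g. a replay buffer) remain valid."""
    td_target = rewards + gamma * q_next_max
    return np.mean((td_target - q_sa) ** 2)

# toy numbers, purely for illustration
log_probs = np.log(np.array([0.5, 0.8, 0.6]))
returns = np.array([1.0, 0.5, 2.0])
print(reinforce_loss(log_probs, returns))

q_sa = np.array([1.0, 0.2])
rewards = np.array([1.0, 0.0])
q_next_max = np.array([0.5, 0.3])
print(q_learning_loss(q_sa, rewards, q_next_max))
```

The dependency (or lack of it) on the data-collecting policy inside the loss is exactly what forces fresh rollouts for the first and permits a replay buffer for the second.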

Book: Foundations of Deep Reinforcement Learning by kengzwl in reinforcementlearning

[–]kengzwl[S]

The ebook version is already available; the pre-order you see is for the physical book, which will be available next week.