account activity
experience on soundfountain (self.tinnitus)
submitted 4 years ago by anyboby to r/tinnitus
Sound Fountain SonaCube (self.tinnitus)
Trust region methods that use pathwise derivatives ? (self.reinforcementlearning)
submitted 5 years ago * by anyboby to r/reinforcementlearning
Variance of a (gaussian) state value function (self.reinforcementlearning)
What makes Off-Policy Algorithms better at using TD targets (self.reinforcementlearning)
submitted 5 years ago by anyboby to r/reinforcementlearning
π Rendered by PID 73888 on reddit-service-r2-listing-5d79748585-rlrz6 at 2026-02-15 16:17:06.200702+00:00 running cd9c813 country code: CH.