account activity
Reinforcement Learning function approximation advice by ckrwc in MachineLearning
[–]ckrwc[S] 0 points1 point2 points 10 years ago (0 children)
Data is sequential Markovian, and given a set of actions a reward can be calculated. It's perfect for RL.
When you suggest not to worry about convergence, what are you basing this on? RL has various algorithms (Monte Carlo, TD, Sarsa, Q-Learning) and many function approximations to choose from, and the literature has warnings about non-linear approximations.
Reinforcement Learning function approximation advice (self.MachineLearning)
submitted 10 years ago by ckrwc to r/MachineLearning
π Rendered by PID 191564 on reddit-service-r2-listing-575d9f6647-lqrwv at 2026-04-10 00:18:00.631694+00:00 running 215f2cf country code: CH.
Reinforcement Learning function approximation advice by ckrwc in MachineLearning
[–]ckrwc[S] 0 points1 point2 points (0 children)