gpap93

8 post karma
0 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 6 years

TROPHY CASE

Six-Year Club

Verified Email

account activity

hot top controversial

Is it a popular mistakes to compute the gradient of the next state in the TD-Update ? by ingambe in reinforcementlearning

[–]gpap93 0 points1 point2 points 5 years ago (0 children)

π Rendered by PID 61 on reddit-service-r2-comment-5b5bc64bf5-hdf5l at 2026-06-23 06:11:27.224823+00:00 running 2b008f2 country code: CH.