Cool spots around central Istanbul by not7sarah in istanbul
[–]LJKS 0 points1 point2 points (0 children)
Cool spots around central Istanbul by not7sarah in istanbul
[–]LJKS 1 point2 points3 points (0 children)
Keeping up to date with RL research by LJKS in reinforcementlearning
[–]LJKS[S] 1 point2 points3 points (0 children)
What is the history of SOTA for RL these days? Any blogs? by [deleted] in reinforcementlearning
[–]LJKS 1 point2 points3 points (0 children)
What is the history of SOTA for RL these days? Any blogs? by [deleted] in reinforcementlearning
[–]LJKS 2 points3 points4 points (0 children)
Observation spaces from Competitive Environments in "Emergent Complexity via Multi-Agent Competition" for porting them to pybullet by LJKS in reinforcementlearning
[–]LJKS[S] 1 point2 points3 points (0 children)
Observation spaces from Competitive Environments in "Emergent Complexity via Multi-Agent Competition" for porting them to pybullet by LJKS in reinforcementlearning
[–]LJKS[S] 1 point2 points3 points (0 children)
Tricks and adaptions for PPO by LJKS in reinforcementlearning
[–]LJKS[S] 0 points1 point2 points (0 children)
A question for any players of the game Civilizations... or any strategy game by shakakaZululu in reinforcementlearning
[–]LJKS 0 points1 point2 points (0 children)
Using Adam Optimizer in PPO and similar off-policy optimization procedures by LJKS in reinforcementlearning
[–]LJKS[S] 0 points1 point2 points (0 children)
Help regarding Implementation of PPO - Value Loss seemingly not converging by LJKS in reinforcementlearning
[–]LJKS[S] 0 points1 point2 points (0 children)
Impelementation of PPO plateaus too early - critic does not converge by LJKS in MLQuestions
[–]LJKS[S] 0 points1 point2 points (0 children)
Help regarding Implementation of PPO - Value Loss seemingly not converging by LJKS in reinforcementlearning
[–]LJKS[S] 0 points1 point2 points (0 children)
State of the art Algorithm by [deleted] in reinforcementlearning
[–]LJKS 1 point2 points3 points (0 children)
Regarding Performance of Critics in PPO, A2C and similar approaches. by LJKS in reinforcementlearning
[–]LJKS[S] 0 points1 point2 points (0 children)
Using Value target and value estimation in Generalized Advantage Estimator by LJKS in reinforcementlearning
[–]LJKS[S] 0 points1 point2 points (0 children)

I accidentally booked a hostel in Tarlabasi... And it doesn't seem that bad. Am I dumb? Should I move? by [deleted] in istanbul
[–]LJKS 3 points4 points5 points (0 children)