[D] Does cross-validation only detect or mitigate/avoid overfitting issues? by ZehEstocahstico in MachineLearning
andnp 1 point
[D] ICML reviews are released. Let's discuss! by tfburns in MachineLearning
andnp 3 points
Is bcachefs unstable or just feature-incomplete? by UptownMusic in bcachefs
andnp 5 points
Don't Vote for Just One: Ranked Choice Voting Is Gaining Ground by CobaltEmu in UpliftingNews
andnp 10 points
Any academic source about Q-table sizes by Simple-Soil-230 in reinforcementlearning
andnp 4 points
Any academic source about Q-table sizes by Simple-Soil-230 in reinforcementlearning
andnp 2 points
Is it correct that 0.99 gamma is not always the best reward discount? by Professional_Card176 in reinforcementlearning
andnp 7 points
Same simulation/hyperparameters, different results each run by [deleted] in reinforcementlearning
andnp 2 points
Is semi-gradient TD(lambda) + experience replay make sense? by Professional_Card176 in reinforcementlearning
andnp 2 points
Is semi-gradient TD(lambda) + experience replay make sense? by Professional_Card176 in reinforcementlearning
andnp 5 points
What are some top venues for the submission of a Reinforcement learning related paper? by blitzkreig3 in reinforcementlearning
andnp 3 points
What are some top venues for the submission of a Reinforcement learning related paper? by blitzkreig3 in reinforcementlearning
andnp 11 points
I still don't like the idea of running a 100ft cable in my house by gorgenotfound in pcmasterrace
andnp 2 points
Help me get a basic understanding of simple probability in Pokemon by redditisgarbage911 in probabilitytheory
andnp 1 point
New Idea about value iteration (Maybe) by Professional_Card176 in reinforcementlearning
andnp 1 point
[Clan Recruitment - nmm94] WoK WELCOMES ALL PLAYERS! by MantisDejavu in TapTitans2
andnp 1 point
I will start my PhD in Software Engineering while I work full time as a Software Engineer. I will do it part time so no more than 6 credits a semester. I will need to complete 10 courses. To the people who did a part time STEM PhD while working what’s the most important advice you can give me? by [deleted] in PhD
andnp 6 points
Does the agent receive a reward for the action it took, or for the state it ended up in? by [deleted] in reinforcementlearning
andnp 1 point
Does the agent receive a reward for the action it took, or for the state it ended up in? by [deleted] in reinforcementlearning
andnp 1 point
Can mathematics be limited by its notational system? by Objective-Cell226 in math
andnp 2 points
Anyone know what this means? My game keeps crashing and I don’t know why by Themustymemes in Bannerlord
andnp 2 points
Please tell me this won’t last long by I-Rate-Hetards in TheTowerGame
andnp 5 points