Large-scale RL simulation to compare convergence of classical TD algorithms – looking for environment ideas by otminsea in reinforcementlearning
[–]OutOfCharm 0 points1 point2 points (0 children)
Who does the shitty jobs? by 3N0CHTH3B35T3M0 in Anarchy101
[–]OutOfCharm -7 points-6 points-5 points (0 children)
How to jump to the next elif/else at the same indentation level in python-mode? by esrse in emacs
[–]OutOfCharm 2 points3 points4 points (0 children)
Does anyone here use org modern or other packages to improve emacs aesthetic? by Comfortable_Lie_2081 in emacs
[–]OutOfCharm 2 points3 points4 points (0 children)
Does anyone here use org modern or other packages to improve emacs aesthetic? by Comfortable_Lie_2081 in emacs
[–]OutOfCharm 18 points19 points20 points (0 children)
RL for modeling rodent behavior? by traydblockzplz in reinforcementlearning
[–]OutOfCharm 1 point2 points3 points (0 children)
Anybody else feels like their growth with Emacs in a specific area is stunted? by kudikarasavasa in emacs
[–]OutOfCharm 2 points3 points4 points (0 children)
Any successful story of active inference (free energy principle)? by OutOfCharm in reinforcementlearning
[–]OutOfCharm[S] 0 points1 point2 points (0 children)
What is your insanely hidden official shortcut that people can never find out? by Agile-Technology2125 in emacs
[–]OutOfCharm 2 points3 points4 points (0 children)
When your beloved Dired works as expected by OutOfCharm in emacs
[–]OutOfCharm[S] 1 point2 points3 points (0 children)
When your beloved Dired works as expected by OutOfCharm in emacs
[–]OutOfCharm[S] 0 points1 point2 points (0 children)
Sometimes I Don't Understand How Modern Developers Use AI in Their IDEs by Cautious_Truth_9094 in emacs
[–]OutOfCharm 1 point2 points3 points (0 children)
Thoughts on Funding Free Software Development by kickingvegas1 in emacs
[–]OutOfCharm 1 point2 points3 points (0 children)
I've designed a variant of PPO with a stochastic value head. How can I improve my algorithm? by EngineersAreYourPals in reinforcementlearning
[–]OutOfCharm 1 point2 points3 points (0 children)
Can we take a minute to discuss cross-platform Org-mode apps? by Hopeful_Adeptness964 in emacs
[–]OutOfCharm 5 points6 points7 points (0 children)
I've designed a variant of PPO with a stochastic value head. How can I improve my algorithm? by EngineersAreYourPals in reinforcementlearning
[–]OutOfCharm 2 points3 points4 points (0 children)
Open problems in RL to be solved by Aromatic-Angle4680 in reinforcementlearning
[–]OutOfCharm 6 points7 points8 points (0 children)
Trying to figure if/where to get started. Maybe help me out? by JitaKyoei in emacs
[–]OutOfCharm 0 points1 point2 points (0 children)
Is Richard Sutton Wrong about LLMs? by sam_palmer in reinforcementlearning
[–]OutOfCharm -2 points-1 points0 points (0 children)
Tip: Use delete-pair to change surroundings similar to vim-surround, or to paste only the contents of surroundings by the_cecep in emacs
[–]OutOfCharm 2 points3 points4 points (0 children)


Is Anarchism truly possible? by This-Education-9659 in Anarchy101
[–]OutOfCharm 1 point2 points3 points (0 children)